General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model Paper • 2409.01704 • Published Sep 3, 2024 • 83
DocGraphLM: Documental Graph Language Model for Information Extraction Paper • 2401.02823 • Published Jan 5, 2024 • 35
MVD^2: Efficient Multiview 3D Reconstruction for Multiview Diffusion Paper • 2402.14253 • Published Feb 22, 2024 • 5
The Era of 1-bit LLMs: All Large Language Models are in 1.58 Bits Paper • 2402.17764 • Published Feb 27, 2024 • 606
MambaByte: Token-free Selective State Space Model Paper • 2401.13660 • Published Jan 24, 2024 • 53