olmOCR Collection olmOCR is a document recognition pipeline for efficiently converting documents into plain text. olmocr.allenai.org • 3 items • Updated 11 days ago • 89
C4AI Aya Vision Collection Aya Vision is a state-of-the-art family of vision models that brings multimodal capabilities to 23 languages. • 5 items • Updated 5 days ago • 60
Global Exams: Bangladesh (Localized MMLU) [ICLR'25] Collection Exams dataset in Bangladesh (Bengali, English) • 4 items • Updated 3 days ago • 1
Multilingual LLM Evaluation Collection Multilingual Evaluation Benchmarks • 8 items • Updated 7 days ago • 25
Global Multimodal Exams: Bangladesh (Localized MMMU) Collection Vision-language Exams dataset in Bangladesh (Bengali, English) • 7 items • Updated 17 days ago
Retrieval-Augmented Generation [EMNLP'24] Collection Artifacts for "Open-RAG: Enhanced Retrieval Augmented Reasoning with Open-Source Large Language Models" [EMNLP 2024 Findings] • 5 items • Updated 17 days ago • 2