ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling ā¢ 3 items ā¢ Updated 7 days ago ā¢ 93
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper ā¢ 2412.13663 ā¢ Published 8 days ago ā¢ 103
Falcon3 Collection Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. ā¢ 40 items ā¢ Updated 7 days ago ā¢ 72
OmniDocBench: Benchmarking Diverse PDF Document Parsing with Comprehensive Annotations Paper ā¢ 2412.07626 ā¢ Published 16 days ago ā¢ 20
Running 47 š Scaling FineWeb to 1000+ languages: Step 1: finding signal in 100s of evaluation tasks