Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated about 22 hours ago • 194
view article Article Introducing the Synthetic Data Generator - Build Datasets with Natural Language Dec 16, 2024 • 89
Search-o1: Agentic Search-Enhanced Large Reasoning Models Paper • 2501.05366 • Published 18 days ago • 80
view article Article Train 400x faster Static Embedding Models with Sentence Transformers 13 days ago • 125
InternVL2.5-MPO Collection Enhancing the Reasoning Ability of MLLMs via Mixed Preference Optimization • 16 items • Updated 18 days ago • 26
NuNER: Entity Recognition Encoder Pre-training via LLM-Annotated Data Paper • 2402.15343 • Published Feb 23, 2024 • 13
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 127
Qwen2.5-Math Collection Math-specific model series based on Qwen2.5 • 11 items • Updated 14 days ago • 65
Common Models Collection The first generation of models pretrained on Common Corpus. • 5 items • Updated Dec 5, 2024 • 28
OpenScholar: Synthesizing Scientific Literature with Retrieval-augmented LMs Paper • 2411.14199 • Published Nov 21, 2024 • 30