Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 3 items • Updated about 23 hours ago • 199
Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths • 2 items • Updated 1 day ago • 77
SmolVLM 256M & 500M Collection Collection for models & demos for even smoller SmolVLM release • 12 items • Updated 5 days ago • 55
VideoLLaMA3 Collection Frontier Multimodal Foundation Models for Video Understanding • 13 items • Updated 3 days ago • 8
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 5 days ago • 226
UI-TARS: Pioneering Automated GUI Interaction with Native Agents Paper • 2501.12326 • Published 6 days ago • 45
GameFactory: Creating New Games with Generative Interactive Videos Paper • 2501.08325 • Published 13 days ago • 59
view article Article Yay! Organizations can now publish blog Articles By huggingface • 7 days ago • 30
high-quality Chinese training datasets Collection a suite of high-quality Chinese datasets, used for pretraining, fine-tuning or preference alignment. And the models trained on these datasets. • 12 items • Updated 11 days ago • 9
view article Article MiniMax-01 is Now Open-Source: Scaling Lightning Attention for the AI Agent Era By MiniMax-AI • 13 days ago • 40
MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published 13 days ago • 268
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper • 2501.07301 • Published 15 days ago • 88
O1 Replication Journey -- Part 3: Inference-time Scaling for Medical Reasoning Paper • 2501.06458 • Published 17 days ago • 29
Magpie Reasoning Datasets Collection Reasoning datasets built by Magpie and its friends! • 8 items • Updated about 7 hours ago • 8