DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 5 days ago • 224
Article Hugging Face and FriendliAI partner to supercharge model deployment on the Hub 6 days ago • 29
Article Yay! Organizations can now publish blog Articles By huggingface • 7 days ago • 30
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models Paper • 2501.03262 • Published 24 days ago • 87
Towards System 2 Reasoning in LLMs: Learning How to Think With Meta Chain-of-Thought Paper • 2501.04682 • Published 19 days ago • 89
TACO Models Collection This collection contains the best-performing TACO models based on LLaMA-3/Qwen2 and SigLIP/CLIP. • 3 items • Updated Dec 20, 2024 • 8
OS-Genesis: Automating GUI Agent Trajectory Construction via Reverse Task Synthesis Paper • 2412.19723 • Published Dec 27, 2024 • 81
Cosmos World Foundation Model Platform for Physical AI Paper • 2501.03575 • Published 21 days ago • 66
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published 20 days ago • 249
Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference Paper • 2412.13663 • Published Dec 18, 2024 • 125
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling • 3 items • Updated Dec 19, 2024 • 127