🧠 Reasoning datasets Collection Datasets with reasoning traces for math and code released by the community • 14 items • Updated about 12 hours ago • 90
OpenR1-Math Collection Dataset and SFT model distilled from DeepSeek-R1. Check out our blog post for more details: https://huggingface.co/blog/open-r1/update-2 • 3 items • Updated 22 days ago • 7
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published Feb 4 • 199
Article Introducing smolagents: simple agents that write actions in code. Dec 31, 2024 • 839
Article Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference Jan 16 • 71
A Simple and Provable Scaling Law for the Test-Time Compute of Large Language Models Paper • 2411.19477 • Published Nov 29, 2024 • 6
ProcessBench: Identifying Process Errors in Mathematical Reasoning Paper • 2412.06559 • Published Dec 9, 2024 • 80
Diverse Beam Search: Decoding Diverse Solutions from Neural Sequence Models Paper • 1610.02424 • Published Oct 7, 2016 • 1