SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 5 days ago • 144 • 5
o3-mini vs DeepSeek-R1: Which One is Safer? Paper • 2501.18438 • Published 11 days ago • 21 • 3