ModelCloud/QwQ-32B-Preview-gptqmodel-4bit-vortex-v3 Text Generation • Updated Dec 20, 2024 • 616 • 14
The Lessons of Developing Process Reward Models in Mathematical Reasoning Paper • 2501.07301 • Published 15 days ago • 88
Agentless: Demystifying LLM-based Software Engineering Agents Paper • 2407.01489 • Published Jul 1, 2024 • 59
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published 20 days ago • 249
The GAN is dead; long live the GAN! A Modern GAN Baseline Paper • 2501.05441 • Published 18 days ago • 85
view article Article Fine-tune a SmolLM on domain-specific synthetic data from a LLM By davidberenstein1957 • 24 days ago • 32