Running 2.21k 2.21k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
yentinglin/Mistral-Small-24B-Instruct-2501-reasoning Text Generation • Updated 20 days ago • 2.15k • 51