Can 1B LLM Surpass 405B LLM? Rethinking Compute-Optimal Test-Time Scaling Paper • 2502.06703 • Published Feb 10 • 142
cognitivecomputations/Dolphin3.0-R1-Mistral-24B Text Generation • Updated 26 days ago • 6.99k • 162