BioAspire Collection Reconstruction of ASPIRE variants detailed in the original paper + fine-tuning experiments from different base models. • 5 items • Updated 3 days ago
Slamming: Training a Speech Language Model on One GPU in a Day Paper • 2502.15814 • Published 18 days ago • 66
Running on CPU Upgrade 5.01k 5.01k MTEB Leaderboard 🥇 Select benchmarks and languages for text embeddings evaluation
Running 2.15k 2.15k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters