Model Leaderboards - a davidberenstein1957 Collection

updated Jan 22

Running on CPU Upgrade

5.01k

MTEB Leaderboard

🥇

Select benchmarks and languages for text embeddings evaluation
Running

344

Reward Bench Leaderboard

📐

Explore and analyze RewardBench leaderboard data
Running on CPU Upgrade

12.7k

Open LLM Leaderboard

🏆

Track, rank and evaluate open LLMs and chatbots
Running

4.14k

Chatbot Arena Leaderboard

🏆

Display chatbot leaderboard and statistics
Running

1.18k

Big Code Models Leaderboard

📈

Submit code models for evaluation on benchmarks
Running

222

AI2 WildBench Leaderboard (V2)

🦁

Display and explore model leaderboards and chat history
Running on CPU Upgrade

655

Open VLM Leaderboard

🌎

VLMEvalKit Evaluation Results Collection
Running

183

BigCodeBench Leaderboard

🥇

Explore and analyze code evaluation data
Running

436

LLM-Perf Leaderboard

🏆

Explore LLM performance across hardware
Running

101

MTEB Arena

⚔

Teach, test, evaluate language models with MTEB Arena