Post
2324
Excited to bring our benchmarking leaderboard of >100 LLM API endpoints to HF!
Speed and price are often just as important as quality when building applications with LLMs. We bring together all the data you need to consider all three when you need to pick a model and API provider.
Coverage:
‣ Quality (Index of evals, MMLU, Chatbot Arena, HumanEval, MT-Bench)
‣ Throughput (tokens/s: median, P5, P25, P75, P95)
‣ Latency (TTFT: median, P5, P25, P75, P95)
‣ Context window
‣ OpenAI library compatibility
Link to Space: ArtificialAnalysis/LLM-Performance-Leaderboard
Blog post: https://huggingface.co./blog/leaderboard-artificial-analysis
Speed and price are often just as important as quality when building applications with LLMs. We bring together all the data you need to consider all three when you need to pick a model and API provider.
Coverage:
‣ Quality (Index of evals, MMLU, Chatbot Arena, HumanEval, MT-Bench)
‣ Throughput (tokens/s: median, P5, P25, P75, P95)
‣ Latency (TTFT: median, P5, P25, P75, P95)
‣ Context window
‣ OpenAI library compatibility
Link to Space: ArtificialAnalysis/LLM-Performance-Leaderboard
Blog post: https://huggingface.co./blog/leaderboard-artificial-analysis