@FinancialSupport
and I just released a new version of the Italian LLMs leaderboard https://huggingface.co./spaces/FinancialSupport/open_ita_llm_leaderboard
using the super useful https://huggingface.co./demo-leaderboard template from @clefourrier .
We’ve evaluated over 50 models (base, merged, fine-tuned, etc.) from:
- Major companies like Meta, Mistral, Google ...
- University groups such as https://huggingface.co./sapienzanlp or https://huggingface.co./swap-uniba
- Italian companies like https://huggingface.co./MoxoffSpA, https://huggingface.co./FairMind or https://huggingface.co./raicrits
- Various communities and individuals
All models were tested on the #Italian benchmarks #mmlu #arc-c #hellaswag, which we contributed to the open-source lm-evaluation-harness library from https://huggingface.co./EleutherAI.
Plus, you can now submit your model for automatic evaluation, thanks to computation sponsored by https://huggingface.co./seeweb.
Curious about the top Italian models? Check out the leaderboard and submit your model!
https://huggingface.co./spaces/FinancialSupport/open_ita_llm_leaderboard