Commit 542d369 (verified), committed by leaderboard-pr-bot
1 Parent(s): feb79d4

Adding Evaluation Results

This is an automated PR created with https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr

The purpose of this PR is to add evaluation results from the Open LLM Leaderboard to your model card.

If you encounter any issues, please report them to https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr/discussions

Files changed (1)
  1. README.md +14 -0
README.md CHANGED
@@ -200,3 +200,17 @@ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-le
  |Winogrande (5-shot)              |76.56|
  |GSM8k (5-shot)                   |69.45|
  
+
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_xxx777xxxASD__L3-SnowStorm-v1.15-4x8B-B)
+
+ |             Metric              |Value|
+ |---------------------------------|----:|
+ |Avg.                             |68.01|
+ |AI2 Reasoning Challenge (25-Shot)|60.67|
+ |HellaSwag (10-Shot)              |81.60|
+ |MMLU (5-Shot)                    |68.12|
+ |TruthfulQA (0-shot)              |51.69|
+ |Winogrande (5-shot)              |76.56|
+ |GSM8k (5-shot)                   |69.45|
+
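
A reviewer may want to sanity-check the `Avg.` row before merging. The sketch below assumes (the PR does not state this) that `Avg.` is the unweighted mean of the six benchmark scores; the variable names are illustrative, and the values are copied from the table added in this diff.

```python
# Scores copied from the table added in this diff.
scores = {
    "AI2 Reasoning Challenge (25-Shot)": 60.67,
    "HellaSwag (10-Shot)": 81.60,
    "MMLU (5-Shot)": 68.12,
    "TruthfulQA (0-shot)": 51.69,
    "Winogrande (5-shot)": 76.56,
    "GSM8k (5-shot)": 69.45,
}
reported_avg = 68.01  # the "Avg." row in the table

# Assumption: the average is a plain unweighted mean of the six scores.
mean = sum(scores.values()) / len(scores)
print(f"mean of {len(scores)} benchmarks: {mean:.3f} (reported: {reported_avg})")

# Allow for rounding or truncation to two decimal places in the reported value.
assert abs(mean - reported_avg) < 0.01
```

Under that assumption the reported value agrees with the six scores to within two-decimal rounding.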