FPHam
/

Karen_TheEditor_V2_STRICT_Mistral_7B

Text Generation

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

FPHam commited on Apr 21

Commit

fae6b3c

•

1 Parent(s): 8bdf7c4

Update README.md

Files changed (1) hide show

README.md +14 -1

README.md CHANGED Viewed

@@ -136,4 +136,17 @@ After probably 10 different versions with subsequent changes, I can now say that
 The goal was to create a model that wouldn't change the style of the text. Often, LLM models, when asked to edit text, will attempt to rewrite the text even if the text is already fine. This proved to be quite challenging for such a small model where the main task was to determine the right balance between fixing the text (and not changing its style) and copying it verbatim.
-The strict model assumes that you're already a good writer that doesn't need hand-holding and that every word you've written you've meant.

 The goal was to create a model that wouldn't change the style of the text. Often, LLM models, when asked to edit text, will attempt to rewrite the text even if the text is already fine. This proved to be quite challenging for such a small model where the main task was to determine the right balance between fixing the text (and not changing its style) and copying it verbatim.
+The strict model assumes that you're already a good writer that doesn't need hand-holding and that every word you've written you've meant.
+# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
+Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_FPHam__Karen_TheEditor_V2_STRICT_Mistral_7B)
+|             Metric              |Value|
+|---------------------------------|----:|
+|Avg.                             |59.13|
+|AI2 Reasoning Challenge (25-Shot)|59.56|
+|HellaSwag (10-Shot)              |81.79|
+|MMLU (5-Shot)                    |59.56|
+|TruthfulQA (0-shot)              |49.36|
+|Winogrande (5-shot)              |74.35|
+|GSM8k (5-shot)                   |30.17|