piuzha committed
Commit 64e4874
1 Parent(s): 6430c0f

Update README.md

Files changed (1)
  1. README.md +10 -1
README.md CHANGED
@@ -87,7 +87,7 @@ print(decoded[0])

## Evaluation

- We test the performance of our model with [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness). The evaluation results on common datasets are shown below. We test on AI2 Reasoning Challenge (25-shot), HellaSwag (10-shot), MMLU (5-shot), and Winogrande (5-shot).
+ We test the performance of our model with [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness). The evaluation results on common datasets are shown below. We test on AI2 Reasoning Challenge (25-shot), HellaSwag (10-shot), MMLU (5-shot), and Winogrande (5-shot). We release Moxin-7B-finetuned as our base model. We further finetune our base model on Tulu v2 to obtain our chat model.

| Models | ARC-C | HellaSwag | MMLU | Winogrande | Avg |
|:----------------------:|:-----:|:---------:|:-----:|:----------:|:-----:|
@@ -122,7 +122,16 @@ We also test the zero shot performance on AI2 Reasoning Challenge (0-shot), AI2
| Moxin-7B-finetune | 80.03 | 75.17 | 82.24 | 81.12 | 58.64 | 75.44 |


+ ## Citation

+ ```
+ @article{zhao2024fully,
+   title={Fully Open Source Moxin-7B Technical Report},
+   author={Zhao, Pu and Shen, Xuan and Kong, Zhenglun and Shen, Yixin and Chang, Sung-En and Rupprecht, Timothy and Lu, Lei and Nan, Enfu and Yang, Changdi and He, Yumei and others},
+   journal={arXiv preprint arXiv:2412.06845},
+   year={2024}
+ }
+ ```
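A note on reproducing the few-shot scores referenced above: the sketch below drives [lm-evaluation-harness](https://github.com/EleutherAI/lm-evaluation-harness) from its Python API with the same shot counts the README states. This is a minimal sketch under stated assumptions, not the authors' evaluation script; the model ID `moxin-org/moxin-llm-7b`, the dtype, and the batch size are placeholders to be swapped for the real values.

```python
# Minimal sketch (assumed setup, not the authors' script): run the README's
# few-shot settings with lm-evaluation-harness (pip install lm-eval, v0.4+).
import lm_eval

# Placeholder repo ID and dtype; substitute the model's actual HF repository.
MODEL_ARGS = "pretrained=moxin-org/moxin-llm-7b,dtype=bfloat16"

# (task, num_fewshot) pairs matching the shot counts stated in the README:
# ARC-Challenge 25-shot, HellaSwag 10-shot, MMLU 5-shot, Winogrande 5-shot.
SETTINGS = [
    ("arc_challenge", 25),
    ("hellaswag", 10),
    ("mmlu", 5),
    ("winogrande", 5),
]

for task, shots in SETTINGS:
    # simple_evaluate loads the model through the HF transformers backend
    # and evaluates one task at the requested few-shot setting.
    results = lm_eval.simple_evaluate(
        model="hf",
        model_args=MODEL_ARGS,
        tasks=[task],
        num_fewshot=shots,
        batch_size=8,  # placeholder; tune to available memory
    )
    print(task, results["results"][task])
```

Each task gets its own `simple_evaluate` call because the shot counts differ per benchmark (25/10/5/5); a single run with one shared `num_fewshot` would not match the settings quoted in the README.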