Text Generation
Transformers
PyTorch
English
Chinese
llama
text-generation-inference
Inference Endpoints
GeneZC commited on
Commit
791cafa
β€’
1 Parent(s): 98d590c

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +31 -3
README.md CHANGED
@@ -1,3 +1,31 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
1
+ ---
2
+ language:
3
+ - en
4
+ - zh
5
+ license: apache-2.0
6
+ library_name: transformers
7
+ datasets:
8
+ - EleutherAI/pile
9
+ - togethercomputer/RedPajama-Data-1T
10
+ - p208p2002/wudao
11
+ widget:
12
+ - text: <s> 4 + 3 =
13
+ ---
14
+ ## MiniLoong-3B
15
+
16
+ πŸ“‘ [arXiv](https://arxiv.org/abs/2311.07052) | πŸ‘» [GitHub](https://github.com/GeneZC/MiniMA) | πŸ€— [HuggingFace-MiniMA-3B](https://huggingface.co/GeneZC/MiniMA-3B) | πŸ€— [HuggingFace-MiniChat-3B](https://huggingface.co/GeneZC/MiniChat-3B) | πŸ€– [ModelScope-MiniMA-3B](https://modelscope.cn/models/GeneZC/MiniMA-3B) | πŸ€– [ModelScope-MiniChat-3B](https://modelscope.cn/models/GeneZC/MiniChat-3B) | πŸ€— [HuggingFace-MiniChat-1.5-3B](https://huggingface.co/GeneZC/MiniChat-1.5-3B) | πŸ€— [HuggingFace-MiniMA-2-3B](https://huggingface.co/GeneZC/MiniMA-2-3B) | πŸ€— [HuggingFace-MiniChat-2-3B](https://huggingface.co/GeneZC/MiniChat-2-3B) | πŸ€— [HuggingFace-MiniMA-2-1B](https://huggingface.co/GeneZC/MiniMA-2-1B) | πŸ€— [HuggingFace-MiniLoong-3B](https://huggingface.co/GeneZC/MiniLoong-3B) | πŸ€— [HuggingFace-MiniMix-2/4x3B](https://huggingface.co/GeneZC/MiniMix-2_4x3B)
17
+
18
+ ❗ Must comply with LICENSE of LLaMA-2 since it is derived from LLaMA-2.
19
+
20
+ <img src="./teaser_d.jpg" alt="teaser_d" width="700" />
21
+
22
+ ## Bibtex
23
+
24
+ ```bibtex
25
+ @article{zhang2023law,
26
+ title={Towards the Law of Capacity Gap in Distilling Language Models},
27
+ author={Zhang, Chen and Song, Dawei and Ye, Zheyu and Gao, Yan},
28
+ year={2023},
29
+ url={https://arxiv.org/abs/2311.07052}
30
+ }
31
+ ```