adamo1139 commited on
Commit
1c703ca
1 Parent(s): 884ee18

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -3
README.md CHANGED
@@ -1,3 +1,9 @@
1
- ---
2
- license: apache-2.0
3
- ---
 
 
 
 
 
 
 
1
+ ---
2
+ license: apache-2.0
3
+ datasets:
4
+ - adamo1139/uninstruct-v1-experimental-chatml
5
+ ---
6
+ ## Basic Model Info
7
+ 1 epoch on adamo1139/uninstruct-v1-experimental-chatml. I used [GaLore](https://arxiv.org/abs/2403.03507).\
8
+ Purpose of this model is to make the model un-learn to use chatml-specific code words such as: <|im_start|>, <|im_end|>, user, assistant.
9
+ This is a base model meant for further finetuning. I think much of OpenAI slop is still left in there, so it's probably best combined with preference optimization method like DPO, ORPO or SPO for best results.