Update README.md
README.md
CHANGED
@@ -1,3 +1,9 @@
----
-license: apache-2.0
-
+---
+license: apache-2.0
+datasets:
+- adamo1139/uninstruct-v1-experimental-chatml
+---
+## Basic Model Info
+Trained for 1 epoch on adamo1139/uninstruct-v1-experimental-chatml using [GaLore](https://arxiv.org/abs/2403.03507).\
+The purpose of this fine-tune is to make the model unlearn ChatML-specific code words such as `<|im_start|>`, `<|im_end|>`, `user` and `assistant`.
+This is a base model meant for further fine-tuning. I think much of the OpenAI slop is still left in there, so it is probably best combined with a preference optimization method such as DPO, ORPO or SPO.
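One way to check whether the unlearning described above took hold is to scan sampled completions for the ChatML code words the card lists. The following is a minimal sketch, not part of the original README: the function names `count_chatml_leaks` and `is_clean` are hypothetical, and counting a role word only when it stands alone on a line is an assumption about how leaked roles tend to show up once special tokens are stripped from decoded text.

```python
import re

# Special tokens named in the model card.
CHATML_TOKENS = ("<|im_start|>", "<|im_end|>")

# Assumption: a leaked role shows up as a line containing only the role word,
# so ordinary prose such as "the user asked" is not counted.
BARE_ROLE_LINE = re.compile(r"^(?:user|assistant)[ \t]*$", re.MULTILINE)


def count_chatml_leaks(text):
    """Count ChatML tokens and bare role lines in a generated completion."""
    counts = {token: text.count(token) for token in CHATML_TOKENS}
    counts["bare_role_lines"] = len(BARE_ROLE_LINE.findall(text))
    return counts


def is_clean(text):
    """Return True when no ChatML residue is found in the text."""
    return all(value == 0 for value in count_chatml_leaks(text).values())
```

Running `is_clean` over a batch of completions from plain-text prompts gives a rough leak rate that can be compared before and after a later preference optimization stage.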