Update README.md
Browse files
README.md
CHANGED
@@ -14,8 +14,10 @@ model-index:
|
|
14 |
---
|
15 |
# Magistrate 3.2 3B
|
16 |
|
17 |
-
Continued pretraining applied to
|
18 |
|
|
|
|
|
19 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
20 |
should probably proofread and complete it, then remove this comment. -->
|
21 |
|
@@ -196,9 +198,8 @@ special_tokens:
|
|
196 |
|
197 |
</details><br>
|
198 |
|
199 |
-
|
200 |
-
|
201 |
-
- Loss: 0.6802
|
202 |
|
203 |
## Model description
|
204 |
|
|
|
14 |
---
|
15 |
# Magistrate 3.2 3B
|
16 |
|
17 |
+
Continued pretraining applied to [meta-llama/Llama-3.2-3B](https://huggingface.co/meta-llama/Llama-3.2-3B) using no synthetic data. ~250M tokens.
|
18 |
|
19 |
+
The model achieves the following results on the evaluation set:
|
20 |
+
- Loss: 0.6802
|
21 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
22 |
should probably proofread and complete it, then remove this comment. -->
|
23 |
|
|
|
198 |
|
199 |
</details><br>
|
200 |
|
201 |
+
|
202 |
+
|
|
|
203 |
|
204 |
## Model description
|
205 |
|