macadeliccc/magistrate-3.2-3b-base

macadeliccc committed on Sep 30, 2024

Commit a69b33c · verified · 1 Parent(s): 1f345af

Update README.md

Files changed (1)

README.md +5 -4


@@ -14,8 +14,10 @@ model-index:
 ---
 # Magistrate 3.2 3B
-Continued pretraining applied to meta-llama/Llama-3.2-3B using no synthetic data.  ~250M tokens.
+Continued pretraining applied to  [meta-llama/Llama-3.2-3B](https://huggingface.co/meta-llama/Llama-3.2-3B) using no synthetic data.  ~250M tokens.
+The model achieves the following results on the evaluation set:
+- Loss: 0.6802
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
@@ -196,9 +198,8 @@ special_tokens:
 </details><br>
-This model is a fine-tuned version of [meta-llama/Llama-3.2-3B](https://huggingface.co/meta-llama/Llama-3.2-3B) on the None dataset.
-It achieves the following results on the evaluation set:
-- Loss: 0.6802
 ## Model description