End of training

Browse files

Files changed (4) hide show

README.md +18 -23
generation_config.json +1 -1
model.safetensors +1 -1
runs/Feb12_08-49-40_dcd95080f986/events.out.tfevents.1707727782.dcd95080f986.34.0 +2 -2

README.md CHANGED Viewed

@@ -8,9 +8,6 @@ metrics:
 model-index:
 - name: english-to-darija-2
   results: []
-language:
-- en
-- ary
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -20,9 +17,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [Helsinki-NLP/opus-mt-tc-big-en-ar](https://huggingface.co/Helsinki-NLP/opus-mt-tc-big-en-ar) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.8922
-- Bleu: 77.1567
-- Gen Len: 16.379
 ## Model description
@@ -47,30 +44,28 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 12
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step  | Validation Loss | Bleu    | Gen Len |
 |:-------------:|:-----:|:-----:|:---------------:|:-------:|:-------:|
-| 2.0991        | 1.0   | 1525  | 1.8210          | 20.4147 | 15.6004 |
-| 1.3476        | 2.0   | 3050  | 1.3142          | 38.8364 | 15.808  |
-| 0.9515        | 3.0   | 4575  | 1.0775          | 55.6305 | 16.1504 |
-| 0.6951        | 4.0   | 6100  | 0.9632          | 66.9767 | 16.2509 |
-| 0.5589        | 5.0   | 7625  | 0.9089          | 71.1846 | 16.4062 |
-| 0.4546        | 6.0   | 9150  | 0.8902          | 74.3547 | 16.2483 |
-| 0.3671        | 7.0   | 10675 | 0.8841          | 75.4645 | 16.3627 |
-| 0.3085        | 8.0   | 12200 | 0.8812          | 76.2993 | 16.3739 |
-| 0.2668        | 9.0   | 13725 | 0.8863          | 76.6186 | 16.347  |
-| 0.2257        | 10.0  | 15250 | 0.8876          | 76.6218 | 16.4226 |
-| 0.2077        | 11.0  | 16775 | 0.8907          | 76.8682 | 16.4054 |
-| 0.1969        | 12.0  | 18300 | 0.8922          | 77.1567 | 16.379  |
 ### Framework versions
-- Transformers 4.35.2
-- Pytorch 2.1.0+cu121
-- Datasets 2.16.1
-- Tokenizers 0.15.1

 model-index:
 - name: english-to-darija-2
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [Helsinki-NLP/opus-mt-tc-big-en-ar](https://huggingface.co/Helsinki-NLP/opus-mt-tc-big-en-ar) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.8514
+- Bleu: 70.9947
+- Gen Len: 9.092
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 10
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step  | Validation Loss | Bleu    | Gen Len |
 |:-------------:|:-----:|:-----:|:---------------:|:-------:|:-------:|
+| 1.6032        | 1.0   | 4651  | 1.4540          | 25.9364 | 8.9697  |
+| 1.1191        | 2.0   | 9302  | 1.0805          | 48.0549 | 9.0661  |
+| 0.8048        | 3.0   | 13953 | 0.9419          | 61.3646 | 9.1018  |
+| 0.5978        | 4.0   | 18604 | 0.8939          | 65.6846 | 9.1161  |
+| 0.477         | 5.0   | 23255 | 0.8623          | 68.0005 | 9.1049  |
+| 0.4228        | 6.0   | 27906 | 0.8540          | 69.1959 | 9.1276  |
+| 0.3534        | 7.0   | 32557 | 0.8479          | 69.944  | 9.0744  |
+| 0.305         | 8.0   | 37208 | 0.8473          | 70.55   | 9.0987  |
+| 0.2678        | 9.0   | 41859 | 0.8489          | 70.8065 | 9.1166  |
+| 0.243         | 10.0  | 46510 | 0.8514          | 70.9947 | 9.092   |
 ### Framework versions
+- Transformers 4.37.2
+- Pytorch 2.2.0+cu121
+- Datasets 2.17.0
+- Tokenizers 0.15.1

generation_config.json CHANGED Viewed

@@ -12,5 +12,5 @@
   "num_beams": 4,
   "pad_token_id": 61246,
   "renormalize_logits": true,
-  "transformers_version": "4.35.2"
 }

   "num_beams": 4,
   "pad_token_id": 61246,
   "renormalize_logits": true,
+  "transformers_version": "4.37.2"
 }

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2519b0dd0ee6e8b2fd2839e347fef67196f4b2fc16f9c54df6eaa7b716b2d7ab
 size 956571820

 version https://git-lfs.github.com/spec/v1
+oid sha256:26940dfd0a187e420f66c5822b3cc7e0c9d57afdb4ab6089b2269be0109ce537
 size 956571820

runs/Feb12_08-49-40_dcd95080f986/events.out.tfevents.1707727782.dcd95080f986.34.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0615375e9cbacf846b270d44a46485d22ede8c1fa5e4f31bb720618ddeb3224b
-size 23165

 version https://git-lfs.github.com/spec/v1
+oid sha256:9ab0c0e6e9dc8e9b99c3cf2e299edd749c38fb7aefa667d278936c816fdb2e0e
+size 23902