End of training

Browse files

Files changed (3) hide show

README.md +23 -23
adapter_model.safetensors +1 -1
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -17,8 +17,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [vidore/colpaligemma-3b-pt-448-base](https://huggingface.co/vidore/colpaligemma-3b-pt-448-base) on the German_docx dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.0992
-- Model Preparation Time: 0.007
 ## Model description
@@ -52,27 +52,27 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss | Model Preparation Time |
 |:-------------:|:------:|:----:|:---------------:|:----------------------:|
-| No log        | 0.0146 | 1    | 0.3248          | 0.007                  |
-| 1.8501        | 0.1460 | 10   | 0.3090          | 0.007                  |
-| 1.3107        | 0.2920 | 20   | 0.2642          | 0.007                  |
-| 0.9799        | 0.4380 | 30   | 0.2404          | 0.007                  |
-| 0.6309        | 0.5839 | 40   | 0.2107          | 0.007                  |
-| 0.9043        | 0.7299 | 50   | 0.1864          | 0.007                  |
-| 1.015         | 0.8759 | 60   | 0.1617          | 0.007                  |
-| 0.9035        | 1.0219 | 70   | 0.1585          | 0.007                  |
-| 0.6689        | 1.1679 | 80   | 0.1586          | 0.007                  |
-| 0.3336        | 1.3139 | 90   | 0.1477          | 0.007                  |
-| 0.377         | 1.4599 | 100  | 0.1408          | 0.007                  |
-| 0.5013        | 1.6058 | 110  | 0.1442          | 0.007                  |
-| 0.2791        | 1.7518 | 120  | 0.1277          | 0.007                  |
-| 0.3665        | 1.8978 | 130  | 0.1164          | 0.007                  |
-| 0.4709        | 2.0438 | 140  | 0.1106          | 0.007                  |
-| 0.2456        | 2.1898 | 150  | 0.1061          | 0.007                  |
-| 0.152         | 2.3358 | 160  | 0.1045          | 0.007                  |
-| 0.1813        | 2.4818 | 170  | 0.1007          | 0.007                  |
-| 0.1594        | 2.6277 | 180  | 0.1010          | 0.007                  |
-| 0.1856        | 2.7737 | 190  | 0.1005          | 0.007                  |
-| 0.1788        | 2.9197 | 200  | 0.0992          | 0.007                  |
 ### Framework versions

 This model is a fine-tuned version of [vidore/colpaligemma-3b-pt-448-base](https://huggingface.co/vidore/colpaligemma-3b-pt-448-base) on the German_docx dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.0815
+- Model Preparation Time: 0.0061
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss | Model Preparation Time |
 |:-------------:|:------:|:----:|:---------------:|:----------------------:|
+| No log        | 0.0146 | 1    | 0.3622          | 0.0061                 |
+| 1.6392        | 0.1460 | 10   | 0.3430          | 0.0061                 |
+| 1.1999        | 0.2920 | 20   | 0.3049          | 0.0061                 |
+| 1.2826        | 0.4380 | 30   | 0.2768          | 0.0061                 |
+| 0.7583        | 0.5839 | 40   | 0.2492          | 0.0061                 |
+| 0.603         | 0.7299 | 50   | 0.2258          | 0.0061                 |
+| 1.014         | 0.8759 | 60   | 0.1958          | 0.0061                 |
+| 0.8131        | 1.0219 | 70   | 0.1688          | 0.0061                 |
+| 0.6346        | 1.1679 | 80   | 0.1591          | 0.0061                 |
+| 0.5089        | 1.3139 | 90   | 0.1502          | 0.0061                 |
+| 0.4616        | 1.4599 | 100  | 0.1341          | 0.0061                 |
+| 0.4498        | 1.6058 | 110  | 0.1136          | 0.0061                 |
+| 0.4422        | 1.7518 | 120  | 0.1062          | 0.0061                 |
+| 0.3519        | 1.8978 | 130  | 0.0989          | 0.0061                 |
+| 0.2382        | 2.0438 | 140  | 0.0925          | 0.0061                 |
+| 0.242         | 2.1898 | 150  | 0.0894          | 0.0061                 |
+| 0.3462        | 2.3358 | 160  | 0.0907          | 0.0061                 |
+| 0.1371        | 2.4818 | 170  | 0.0862          | 0.0061                 |
+| 0.2691        | 2.6277 | 180  | 0.0838          | 0.0061                 |
+| 0.0869        | 2.7737 | 190  | 0.0833          | 0.0061                 |
+| 0.3401        | 2.9197 | 200  | 0.0815          | 0.0061                 |
 ### Framework versions

adapter_model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:4be353554a330dd46816122f19fc605554389d29a473861df0f75960705845c2
 size 157071680

 version https://git-lfs.github.com/spec/v1
+oid sha256:3ebcef182f505e54adc721326488e7508050c7df376f6debfad18f155bc49ab2
 size 157071680

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:406d37c978018a5a9dd13a26f63f4df32571eb8f47b65d85ce7a74f2ca2ef4f5
 size 5240

 version https://git-lfs.github.com/spec/v1
+oid sha256:669ba7c5243f079413201308d424089324b69c5a9c0b207e1d78881c731c29c6
 size 5240