an-atlas commited on
Commit
a7522b7
1 Parent(s): 7ef9a2d

update model card README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -5
README.md CHANGED
@@ -1,5 +1,6 @@
1
  ---
2
  license: apache-2.0
 
3
  tags:
4
  - generated_from_trainer
5
  model-index:
@@ -14,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
14
 
15
  This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on the None dataset.
16
  It achieves the following results on the evaluation set:
17
- - Loss: 3.5347
18
 
19
  ## Model description
20
 
@@ -45,14 +46,14 @@ The following hyperparameters were used during training:
45
 
46
  | Training Loss | Epoch | Step | Validation Loss |
47
  |:-------------:|:-----:|:----:|:---------------:|
48
- | No log | 1.0 | 1 | 3.9368 |
49
- | No log | 2.0 | 2 | 3.6674 |
50
- | No log | 3.0 | 3 | 3.5347 |
51
 
52
 
53
  ### Framework versions
54
 
55
- - Transformers 4.30.2
56
  - Pytorch 2.0.1+cu118
57
  - Datasets 2.13.1
58
  - Tokenizers 0.13.3
 
1
  ---
2
  license: apache-2.0
3
+ base_model: distilgpt2
4
  tags:
5
  - generated_from_trainer
6
  model-index:
 
15
 
16
  This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 4.3704
19
 
20
  ## Model description
21
 
 
46
 
47
  | Training Loss | Epoch | Step | Validation Loss |
48
  |:-------------:|:-----:|:----:|:---------------:|
49
+ | No log | 1.0 | 1 | 4.7786 |
50
+ | No log | 2.0 | 2 | 4.4947 |
51
+ | No log | 3.0 | 3 | 4.3704 |
52
 
53
 
54
  ### Framework versions
55
 
56
+ - Transformers 4.31.0
57
  - Pytorch 2.0.1+cu118
58
  - Datasets 2.13.1
59
  - Tokenizers 0.13.3