umarbutler/open-australian-legal-distilgpt2

Update README.md

umarbutler committed on Nov 28, 2023

Commit 06071d6 • 1 Parent(s): 80abc11

Files changed (1)

README.md +0 -1

@@ -57,7 +57,6 @@ The training dataset was subsequently fed to [DistilGPT2](https://huggingface.co
 | Batch size per device | 4 |
 | Weight decay | 0.01 |
 | Warmup ratio | 0.06 |
-| Gradient accumulation steps | 1 |
 After training for 3 epochs, or 465,441 steps, over a period of ~40 hours on a single GeForce RTX 2080 Ti, the model achieved a loss of 0.65.
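The figures in the context line above can be sanity-checked with a little arithmetic. The sketch below is illustrative only and is not part of the README or this commit. It assumes a single device (the README mentions one GeForce RTX 2080 Ti), one gradient accumulation step (the value in the row this commit removes), and that warmup steps are the warmup ratio times the total steps, rounded up. All function and constant names are hypothetical.

```python
import math

# Values quoted in the README hunk above.
TOTAL_STEPS = 465_441
EPOCHS = 3
BATCH_SIZE_PER_DEVICE = 4
WARMUP_RATIO = 0.06
TRAINING_HOURS = 40  # the README says "~40 hours", so this is approximate

# Assumptions, not stated in the post-commit README.
NUM_DEVICES = 1        # "a single GeForce RTX 2080 Ti"
GRAD_ACCUM_STEPS = 1   # value from the row removed by this commit


def steps_per_epoch(total_steps: int, epochs: int) -> float:
    """Optimizer steps taken in one pass over the training set."""
    return total_steps / epochs


def effective_batch_size(per_device: int, devices: int, accum: int) -> int:
    """Sequences consumed per optimizer step."""
    return per_device * devices * accum


def warmup_steps(total_steps: int, ratio: float) -> int:
    """Warmup length, assuming ratio * total steps rounded up."""
    return math.ceil(total_steps * ratio)


def steps_per_second(total_steps: int, hours: float) -> float:
    """Average throughput over the whole run."""
    return total_steps / (hours * 3600)


if __name__ == "__main__":
    per_epoch = steps_per_epoch(TOTAL_STEPS, EPOCHS)
    batch = effective_batch_size(BATCH_SIZE_PER_DEVICE, NUM_DEVICES, GRAD_ACCUM_STEPS)
    print(f"steps per epoch:          {per_epoch:,.0f}")
    print(f"effective batch size:     {batch}")
    # Upper bound, because the final batch of an epoch may be smaller.
    print(f"max sequences per epoch:  {per_epoch * batch:,.0f}")
    print(f"warmup steps (estimated): {warmup_steps(TOTAL_STEPS, WARMUP_RATIO):,}")
    print(f"average steps per second: {steps_per_second(TOTAL_STEPS, TRAINING_HOURS):.2f}")
```

Under these assumptions, the 465,441 steps divide evenly into 155,147 steps per epoch, which is consistent with the "3 epochs" claim in the README.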