LegalLMs
Collection
XLM-RoBERTa models with continued pretraining on the MultiLegalPile
•
37 items
•
Updated
•
3
This model was trained from scratch on an unknown dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
Training Loss | Epoch | Step | Validation Loss |
---|---|---|---|
0.9425 | 0.25 | 50000 | 0.7593 |
0.712 | 1.11 | 100000 | 0.6754 |
0.6889 | 1.36 | 150000 | 0.6195 |
0.6506 | 2.21 | 200000 | 0.6097 |