speech to text
Collection
Speech to text models
•
8 items
•
Updated
This model is a fine-tuned version of openai/whisper-base on the mozilla-foundation/common_voice_16_0 ml dataset. It achieves the following results on the evaluation set:
More information needed
More information needed
More information needed
The following hyperparameters were used during training:
Training Loss | Epoch | Step | Validation Loss | Wer |
---|---|---|---|---|
0.3151 | 4.02 | 200 | 0.4517 | 54.5134 |
0.0703 | 9.02 | 400 | 0.4561 | 46.7285 |
0.0144 | 14.02 | 600 | 0.5625 | 43.7627 |
0.006 | 19.02 | 800 | 0.6260 | 42.7247 |
0.0024 | 24.02 | 1000 | 0.6938 | 43.0306 |
0.0012 | 29.02 | 1200 | 0.7354 | 44.2169 |
Base model
openai/whisper-base