|
--- |
|
license: mit |
|
language: |
|
- ar |
|
--- |
|
|
|
## Checkpoints |
|
|
|
### Pre-Trained Models |
|
|
|
Model | Pre-train Dataset | Model | Tokenizer | |
|
| --- | --- | --- | --- | |
|
| ArTST v2 base | Dialects | [Hugging Face](https://huggingface.co./MBZUAI/ArTSTv2/blob/main/pretrain_checkpoint.pt) | [Hugging Face](https://huggingface.co./MBZUAI/ArTSTv2/blob/main/tokenizer_artstv2.model) |
|
|
|
### Finetuned Models |
|
Model | FInetune Dataset | Model | Tokenizer | |
|
| --- | --- | --- | --- | |
|
| ArTST v2 ASR | MGB2 | [Hugging Face](https://huggingface.co./MBZUAI/ArTSTv2/blob/main/ASR_MGB2_best.pt_hf.pt) | [Hugging Face](https://huggingface.co./MBZUAI/ArTSTv2/blob/main/tokenizer_artstv2.model) | |
|
| ArTST v2 ASR | QASR | [Hugging Face](https://huggingface.co./MBZUAI/ArTSTv2/blob/main/ASR_QASR_best.pt_hf.pt) | [Hugging Face](https://huggingface.co./MBZUAI/ArTSTv2/blob/main/tokenizer_artstv2.model) | |
|
| ArTST v2 ASR | MGB2 - Dialects | [Hugging Face](https://huggingface.co./MBZUAI/ArTSTv2/blob/main/ASR_Dialects_MGB2_best.pt_hf.pt) | [Hugging Face](https://huggingface.co./MBZUAI/ArTSTv2/blob/main/tokenizer_artstv2.model) | |
|
| ArTST v2 ASR | MGB2 - Dialects | [Hugging Face](https://huggingface.co./MBZUAI/ArTSTv2/blob/main/ASR_Dialects_QASR_best.pt_hf.pt) | [Hugging Face](https://huggingface.co./MBZUAI/ArTSTv2/blob/main/tokenizer_artstv2.model) | |
|
|
|
# Acknowledgements |
|
|
|
ArTST is built on [SpeechT5](https://arxiv.org/abs/2110.07205) Architecture. If you use any of ArTST models, please cite |
|
|
|
``` |
|
@inproceedings{toyin2023artst, |
|
title={ArTST: Arabic Text and Speech Transformer}, |
|
author={Toyin, Hawau and Djanibekov, Amirbek and Kulkarni, Ajinkya and Aldarmaki, Hanan}, |
|
booktitle={Proceedings of ArabicNLP 2023}, |
|
pages={41--51}, |
|
year={2023} |
|
} |
|
``` |