PaLe-MADLAD

The MADLAD-400 model fine-tuned to translate from Proper Karelian, Livvi, Ludian, and Veps to Russian and vice versa. We call this model Paragraph-Level as we trained it on paragraphs comprising multiple sentences. The model demonstrates the capacity to handle gender-neutral pronouns (presenting a major obstacle in translating from Finno-Ugric languages) and other discourse-level phenomena.

Please cite the following paper if you use this model in your work:

@inproceedings{
pashchenko2024paragraphlevel,
title={Paragraph-Level Machine Translation for Low-Resource Finno-Ugric Languages},
author={Dmytro Pashchenko and Lisa Yankovskaya and Mark Fishel},
booktitle={The Joint 25th Nordic Conference on Computational Linguistics and 11th Baltic Conference on Human Language Technologies},
year={2024},
url={https://openreview.net/forum?id=uTFJsQpNZk}
}
Downloads last month
17
Safetensors
Model size
2.94B params
Tensor type
F32
·
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The model has no library tag.