sem-sem

  • source group: Semitic languages

  • target group: Semitic languages

  • OPUS readme: sem-sem

  • model: transformer

  • source language(s): apc ara arq arz heb mlt

  • target language(s): apc ara arq arz heb mlt

  • pre-processing: normalization + SentencePiece (spm32k,spm32k)

  • a sentence-initial language token is required, in the form >>id<< (id = a valid target language ID, e.g. >>heb<<)

  • download original weights: opus-2020-07-27.zip

  • test set translations: opus-2020-07-27.test.txt

  • test set scores: opus-2020-07-27.eval.txt
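The language-token requirement above can be sketched as follows. This is a minimal illustration, not the authors' reference code: it assumes the model is loaded through the Hugging Face `transformers` Marian classes under the hub ID `Helsinki-NLP/opus-mt-sem-sem`, and the helper names `add_target_token` and `translate` are hypothetical. The set of valid IDs is copied from the target language list above.

```python
from typing import List

# Target language IDs as listed in this card.
VALID_TARGETS = {"apc", "ara", "arq", "arz", "heb", "mlt"}


def add_target_token(sentence: str, target_lang: str) -> str:
    """Prefix a source sentence with the >>id<< token the model expects."""
    if target_lang not in VALID_TARGETS:
        raise ValueError(f"unsupported target language ID: {target_lang!r}")
    return f">>{target_lang}<< {sentence}"


def translate(sentences: List[str], target_lang: str,
              model_name: str = "Helsinki-NLP/opus-mt-sem-sem") -> List[str]:
    """Translate a batch of sentences (assumes transformers and torch are installed)."""
    # Imported lazily so the helper above works without the heavy dependencies.
    from transformers import MarianMTModel, MarianTokenizer

    tokenizer = MarianTokenizer.from_pretrained(model_name)
    model = MarianMTModel.from_pretrained(model_name)
    prefixed = [add_target_token(s, target_lang) for s in sentences]
    batch = tokenizer(prefixed, return_tensors="pt", padding=True)
    generated = model.generate(**batch)
    return tokenizer.batch_decode(generated, skip_special_tokens=True)
```

Calling `translate(["..."], "heb")` with Arabic input would request Hebrew output; the source language is not marked, only the target.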

Benchmarks

testset                         BLEU   chr-F
Tatoeba-test.ara-ara.ara.ara     4.2   0.200
Tatoeba-test.ara-heb.ara.heb    34.0   0.542
Tatoeba-test.ara-mlt.ara.mlt    16.6   0.513
Tatoeba-test.heb-ara.heb.ara    18.8   0.477
Tatoeba-test.mlt-ara.mlt.ara    20.7   0.388
Tatoeba-test.multi.multi        27.1   0.507
