opus-mt-cpp-cpp / README.md
metadata
language:
  - id
  - cpp
tags:
  - translation
license: apache-2.0

cpp-cpp

  • source group: Creoles and pidgins, Portuguese-based

  • target group: Creoles and pidgins, Portuguese-based

  • OPUS readme: cpp-cpp

  • model: transformer

  • source language(s): ind pap

  • target language(s): ind pap

  • pre-processing: normalization + SentencePiece (spm32k,spm32k)

  • a sentence-initial language token is required in the form >>id<<, where id is a valid target language ID

  • download original weights: opus-2020-07-26.zip

  • test set translations: opus-2020-07-26.test.txt

  • test set scores: opus-2020-07-26.eval.txt
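The language-token requirement in the list above can be sketched as a small pre-processing step applied before tokenization. This is only an illustration: the helper name `add_target_token` is hypothetical, and it assumes the valid IDs are exactly the codes listed under "target language(s)" (`ind`, `pap`).

```python
# Assumption: the valid target IDs are the codes listed under
# "target language(s)" in this card. The helper name is hypothetical.
VALID_TARGET_IDS = {"ind", "pap"}


def add_target_token(sentences, target_id):
    """Prefix each source sentence with the >>id<< token for the target language."""
    if target_id not in VALID_TARGET_IDS:
        raise ValueError(f"unknown target language ID: {target_id!r}")
    token = f">>{target_id}<<"
    # Strip surrounding whitespace so the token is always sentence-initial.
    return [f"{token} {sentence.strip()}" for sentence in sentences]


if __name__ == "__main__":
    # An Indonesian source sentence marked for translation into Papiamento.
    for line in add_target_token(["Selamat pagi."], "pap"):
        print(line)
```

The prefixed strings are what would then be passed to the model's tokenizer. Rejecting unknown IDs early avoids silently sending the model a token it was never trained on.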

Benchmarks

| testset | BLEU | chr-F |
|---|---|---|
| Tatoeba-test.msa-msa.msa.msa | 0.7 | 0.149 |
| Tatoeba-test.msa-pap.msa.pap | 31.7 | 0.577 |
| Tatoeba-test.multi.multi | 21.1 | 0.369 |
| Tatoeba-test.pap-msa.pap.msa | 17.7 | 0.197 |

System Info: