gpt2-wechsel-uyghur / README.md
benjamin's picture
Create README.md
a4cee9a
metadata
language: ug
license: mit

gpt2-wechsel-uyghur

Model trained with WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models.

See the code here: https://github.com/CPJKU/wechsel

And the paper here: https://arxiv.org/abs/2112.06598

Performance

Model PPL
gpt2-wechsel-sundanese 111.72
gpt2 (retrained from scratch) 149.46
Model PPL
gpt2-wechsel-scottish-gaelic 16.43
gpt2 (retrained from scratch) 19.53
Model PPL
gpt2-wechsel-uyghur 34.33
gpt2 (retrained from scratch) 42.82
Model PPL
gpt2-wechsel-malagasy 14.01
gpt2 (retrained from scratch) 15.93

See our paper for details.

Citation

Please cite WECHSEL as

@misc{minixhofer2021wechsel,
      title={WECHSEL: Effective initialization of subword embeddings for cross-lingual transfer of monolingual language models}, 
      author={Benjamin Minixhofer and Fabian Paischer and Navid Rekabsaz},
      year={2021},
      eprint={2112.06598},
      archivePrefix={arXiv},
      primaryClass={cs.CL}
}