---
language: da
tags:
- speech
- xls_r
- xls_r_pretrained
- danish
license: apache-2.0
---

## XLS-R-300m-danish

Continued pretraining of [facebook/wav2vec2-xls-r-300m](https://huggingface.co/facebook/wav2vec2-xls-r-300m) for 120,000 steps on 141,000 hours of speech from Danish radio (DR P1 and Radio24Syv, 2005 to 2021).

The model was pretrained on 16 kHz audio using fairseq. It is a pretrained checkpoint only and must be fine-tuned on labeled data before it can perform speech recognition.

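As a rough illustration of what fine-tuning for speech recognition could look like, the sketch below prepares a character-level CTC vocabulary and loads the pretrained weights into a `Wav2Vec2ForCTC` head using the Hugging Face `transformers` library. This is not the authors' training setup. The repository id `chcaa/xls-r-300m-danish`, the helper names `build_ctc_vocab` and `load_for_ctc_finetuning`, the `vocab.json` path, and the special tokens are assumptions chosen for the example.

```python
import json
from pathlib import Path

# Assumption: repository id guessed from the card title and the fine-tuned
# checkpoint's namespace; replace it with the actual id if it differs.
PRETRAINED_ID = "chcaa/xls-r-300m-danish"


def build_ctc_vocab(transcripts):
    """Build a character-level CTC vocabulary from training transcripts."""
    # Collect every lowercase character except the space, which is
    # represented by the word delimiter token instead.
    chars = sorted(set("".join(t.lower() for t in transcripts)) - {" "})
    vocab = {c: i for i, c in enumerate(chars)}
    vocab["|"] = len(vocab)      # word delimiter
    vocab["[UNK]"] = len(vocab)  # unknown character
    vocab["[PAD]"] = len(vocab)  # padding, also used as the CTC blank
    return vocab


def load_for_ctc_finetuning(vocab, model_id=PRETRAINED_ID, vocab_path="vocab.json"):
    """Return a (processor, model) pair ready for CTC fine-tuning (hypothetical helper)."""
    # Imported lazily so build_ctc_vocab works without transformers installed.
    from transformers import (
        Wav2Vec2CTCTokenizer,
        Wav2Vec2FeatureExtractor,
        Wav2Vec2ForCTC,
        Wav2Vec2Processor,
    )

    Path(vocab_path).write_text(json.dumps(vocab, ensure_ascii=False), encoding="utf-8")
    tokenizer = Wav2Vec2CTCTokenizer(
        vocab_path, unk_token="[UNK]", pad_token="[PAD]", word_delimiter_token="|"
    )
    # The model was pretrained on 16 kHz audio, so inputs must match that rate.
    feature_extractor = Wav2Vec2FeatureExtractor(
        feature_size=1,
        sampling_rate=16_000,
        padding_value=0.0,
        do_normalize=True,
        return_attention_mask=True,
    )
    processor = Wav2Vec2Processor(feature_extractor=feature_extractor, tokenizer=tokenizer)
    model = Wav2Vec2ForCTC.from_pretrained(
        model_id,
        ctc_loss_reduction="mean",
        pad_token_id=tokenizer.pad_token_id,
        vocab_size=len(tokenizer),
    )
    # Commonly done when fine-tuning wav2vec 2.0 models: keep the
    # convolutional feature encoder fixed and train the rest.
    model.freeze_feature_encoder()
    return processor, model
```

Calling `load_for_ctc_finetuning(build_ctc_vocab(train_transcripts))` downloads the weights and returns objects that can be passed to a training loop or to `transformers.Trainer`. Audio at other sampling rates would need to be resampled to 16 kHz first.
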
A version of this model fine-tuned for ASR is available at [chcaa/xls-r-300m-danish-nst-cv9](https://huggingface.co/chcaa/xls-r-300m-danish-nst-cv9).

The model was trained by [Lasse Hansen](https://github.com/HLasse) ([CHCAA](https://chcaa.io)) and [Alvenir](https://alvenir.ai) on the [UCloud](https://cloud.sdu.dk) platform. Many thanks to the Royal Danish Library for providing access to the data.