whisper-large-v3-ft-cv-cy

This model is a version of openai/whisper-large-v3 fine-tuned with the train_all and other_with_excluded custom splits from techiaith/commonvoice_18_0_cy

It achieves the following results on the Common Voice for Welsh release 18's standard test set:

  • WER: 18.50
  • CER: 5.32

N.B. this model performs considerably worse on English language speech, but better on Welsh than a bilingual model

Usage

from transformers import pipeline

transcriber = pipeline("automatic-speech-recognition", model="techiaith/whisper-large-v3-ft-cv-cy")
result = transcriber(<path or url to soundfile>)
print (result)

{'text': 'Mae hen wlad fy nhadau yn annwyl i mi.'}

Downloads last month
21
Safetensors
Model size
1.54B params
Tensor type
F32
·
Inference Examples
Unable to determine this model's library. Check the docs .

Model tree for techiaith/whisper-large-v3-ft-cv-cy

Finetuned
(297)
this model
Finetunes
1 model

Dataset used to train techiaith/whisper-large-v3-ft-cv-cy

Collection including techiaith/whisper-large-v3-ft-cv-cy

Evaluation results

  • Wer on DewiBrynJones/commonvoice_18_0_cy default
    self-reported
    0.185