update model card
Browse files
README.md
CHANGED
@@ -26,6 +26,12 @@ model-index:
|
|
26 |
- name: Test CER (no LM)
|
27 |
type: cer
|
28 |
value: 6.53
|
|
|
|
|
|
|
|
|
|
|
|
|
29 |
---
|
30 |
|
31 |
# XLS-R-300M Uzbek CV8
|
@@ -53,7 +59,7 @@ The model is not reliable enough to use as a substitute for live captions for ac
|
|
53 |
|
54 |
## Training and evaluation data
|
55 |
|
56 |
-
The 50% of the `train` common voice official split was used as training data. The 50% of the official `dev` split was used as validation data, and the full `test` set was used for final evaluation.
|
57 |
|
58 |
The kenlm language model was compiled from the target sentences of the train + other datasets.
|
59 |
|
|
|
26 |
- name: Test CER (no LM)
|
27 |
type: cer
|
28 |
value: 6.53
|
29 |
+
- name: Test WER (with LM)
|
30 |
+
type: wer
|
31 |
+
value: 15.065
|
32 |
+
- name: Test CER (with LM)
|
33 |
+
type: cer
|
34 |
+
value: 3.077
|
35 |
---
|
36 |
|
37 |
# XLS-R-300M Uzbek CV8
|
|
|
59 |
|
60 |
## Training and evaluation data
|
61 |
|
62 |
+
The 50% of the `train` common voice official split was used as training data. The 50% of the official `dev` split was used as validation data, and the full `test` set was used for final evaluation of the model without LM, while the model with LM was evaluated only on 500 examples from the `test` set.
|
63 |
|
64 |
The kenlm language model was compiled from the target sentences of the train + other datasets.
|
65 |
|