Update README.md
Browse files
README.md
CHANGED
@@ -35,26 +35,17 @@ should probably proofread and complete it, then remove this comment. -->
|
|
35 |
|
36 |
# wav2vec2-mms-1b-CV17.0-training_set_variations
|
37 |
|
38 |
-
This model is a fine-tuned version of [facebook/mms-1b-all](https://huggingface.co/facebook/mms-1b-all) on
|
39 |
-
|
40 |
-
|
41 |
-
- Wer: 0.3849
|
42 |
-
- Cer: 0.0627
|
43 |
-
- Bleu: 0.3848
|
44 |
-
|
45 |
-
## Model description
|
46 |
-
|
47 |
-
More information needed
|
48 |
|
49 |
## Intended uses & limitations
|
50 |
|
51 |
-
|
52 |
|
53 |
## Training and evaluation data
|
54 |
|
55 |
-
|
56 |
-
|
57 |
-
## Training procedure
|
58 |
|
59 |
### Training hyperparameters
|
60 |
|
@@ -71,21 +62,6 @@ The following hyperparameters were used during training:
|
|
71 |
- training_steps: 2000
|
72 |
- mixed_precision_training: Native AMP
|
73 |
|
74 |
-
### Training results
|
75 |
-
|
76 |
-
| Training Loss | Epoch | Step | Validation Loss | Wer | Cer | Bleu |
|
77 |
-
|:-------------:|:-------:|:----:|:---------------:|:------:|:------:|:------:|
|
78 |
-
| 12.5537 | 1.5625 | 50 | 3.9513 | 1.0006 | 0.9854 | 0.0 |
|
79 |
-
| 2.2034 | 3.125 | 100 | 0.3019 | 0.4137 | 0.0683 | 0.3510 |
|
80 |
-
| 0.226 | 4.6875 | 150 | 0.2305 | 0.3794 | 0.0623 | 0.3981 |
|
81 |
-
| 0.1904 | 6.25 | 200 | 0.2262 | 0.3776 | 0.0618 | 0.3988 |
|
82 |
-
| 0.1798 | 7.8125 | 250 | 0.2275 | 0.3760 | 0.0621 | 0.4040 |
|
83 |
-
| 0.1724 | 9.375 | 300 | 0.2399 | 0.4021 | 0.0659 | 0.3610 |
|
84 |
-
| 0.1791 | 10.9375 | 350 | 0.2310 | 0.3883 | 0.0635 | 0.3797 |
|
85 |
-
| 0.1678 | 12.5 | 400 | 0.2405 | 0.3961 | 0.0666 | 0.3722 |
|
86 |
-
| 0.1527 | 14.0625 | 450 | 0.2335 | 0.3849 | 0.0627 | 0.3848 |
|
87 |
-
|
88 |
-
|
89 |
### Framework versions
|
90 |
|
91 |
- Transformers 4.44.2
|
|
|
35 |
|
36 |
# wav2vec2-mms-1b-CV17.0-training_set_variations
|
37 |
|
38 |
+
This model is a fine-tuned version of [facebook/mms-1b-all](https://huggingface.co/facebook/mms-1b-all) on common_voice_17_0's Tamil dataset.
|
39 |
+
Several adapters were trained with different training set sizes. The intention was to test the improvement in performance as the quantity of training data increased.
|
40 |
+
This model should not be used to perform STT tasks.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
41 |
|
42 |
## Intended uses & limitations
|
43 |
|
44 |
+
Testing purposes only. This is not intended as an STT solution.
|
45 |
|
46 |
## Training and evaluation data
|
47 |
|
48 |
+
common_voice_17_0 "ta"
|
|
|
|
|
49 |
|
50 |
### Training hyperparameters
|
51 |
|
|
|
62 |
- training_steps: 2000
|
63 |
- mixed_precision_training: Native AMP
|
64 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
65 |
### Framework versions
|
66 |
|
67 |
- Transformers 4.44.2
|