geninhu commited on
Commit
48f71da
1 Parent(s): 5ffaf78

update model card README.md

Browse files
Files changed (1) hide show
  1. README.md +14 -36
README.md CHANGED
@@ -1,28 +1,12 @@
1
  ---
2
  license: apache-2.0
3
- language:
4
- - vi
5
  tags:
6
  - automatic-speech-recognition
7
- - robust-speech-event
8
- - common-voice
9
  model-index:
10
  - name: xls-asr-vi-40h-1B
11
- results:
12
- - task:
13
- name: Speech Recognition
14
- type: automatic-speech-recognition
15
- dataset:
16
- name: Common Voice 7.0 vi
17
- type: common_voice
18
- args: vi
19
- metrics:
20
- - name: Test WER
21
- type: wer
22
- value: 34.21
23
- - name: Test CER
24
- type: cer
25
- value: 19.94
26
  ---
27
 
28
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -30,17 +14,18 @@ should probably proofread and complete it, then remove this comment. -->
30
 
31
  # xls-asr-vi-40h-1B
32
 
33
- This model is a fine-tuned version of [facebook/wav2vec2-xls-r-1b](https://huggingface.co/facebook/wav2vec2-xls-r-1b) on the common voice 7.0 vi & private dataset.
 
 
 
34
 
35
- ### Benchmark WER result:
36
- | | [VIVOS](https://huggingface.co/datasets/vivos) | [COMMON VOICE 7.0 VI](https://huggingface.co/datasets/mozilla-foundation/common_voice_7_0) |
37
- |---|---|---|
38
- |without LM| 25.92 | 34.21 |
39
 
40
- ### Benchmark CER result:
41
- | | [VIVOS](https://huggingface.co/datasets/vivos) | [COMMON VOICE 7.0 VI](https://huggingface.co/datasets/mozilla-foundation/common_voice_7_0) |
42
- |---|---|---|
43
- |without LM| 9.24 | 19.94 |
 
44
 
45
  ## Training and evaluation data
46
 
@@ -60,15 +45,8 @@ The following hyperparameters were used during training:
60
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
61
  - lr_scheduler_type: linear
62
  - lr_scheduler_warmup_steps: 1500
63
- - num_epochs: 50.0
64
  - mixed_precision_training: Native AMP
65
- - attention_dropout: 0.2
66
- - activation_dropout: 0.1
67
- - warmup_steps: 1500
68
- - mask_time_prob: .15
69
- - mask_time_length: 10
70
- - mask_feature_prob: 0.25
71
- - mask_feature_length: 64
72
 
73
  ### Training results
74
 
 
1
  ---
2
  license: apache-2.0
 
 
3
  tags:
4
  - automatic-speech-recognition
5
+ - geninhu/fpt-vi
6
+ - generated_from_trainer
7
  model-index:
8
  - name: xls-asr-vi-40h-1B
9
+ results: []
 
 
 
 
 
 
 
 
 
 
 
 
 
 
10
  ---
11
 
12
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
14
 
15
  # xls-asr-vi-40h-1B
16
 
17
+ This model is a fine-tuned version of [facebook/wav2vec2-xls-r-1b](https://huggingface.co/facebook/wav2vec2-xls-r-1b) on the GENINHU/FPT-VI - NA dataset.
18
+ It achieves the following results on the evaluation set:
19
+ - Loss: 4.1691
20
+ - Wer: 0.4133
21
 
22
+ ## Model description
 
 
 
23
 
24
+ More information needed
25
+
26
+ ## Intended uses & limitations
27
+
28
+ More information needed
29
 
30
  ## Training and evaluation data
31
 
 
45
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
46
  - lr_scheduler_type: linear
47
  - lr_scheduler_warmup_steps: 1500
48
+ - num_epochs: 10.0
49
  - mixed_precision_training: Native AMP
 
 
 
 
 
 
 
50
 
51
  ### Training results
52