FunPang commited on
Commit
440eaf8
1 Parent(s): 6320865

FunPang/whisper-small-Cantonese-test

Browse files
README.md CHANGED
@@ -1,19 +1,19 @@
1
  ---
 
 
2
  base_model: openai/whisper-small
 
 
3
  datasets:
4
  - common_voice_13_0
5
- library_name: transformers
6
- license: apache-2.0
7
  metrics:
8
  - wer
9
- tags:
10
- - generated_from_trainer
11
  model-index:
12
  - name: whisper-small-Cantonese-test
13
  results:
14
  - task:
15
- type: automatic-speech-recognition
16
  name: Automatic Speech Recognition
 
17
  dataset:
18
  name: common_voice_13_0
19
  type: common_voice_13_0
@@ -21,9 +21,9 @@ model-index:
21
  split: test
22
  args: zh-HK
23
  metrics:
24
- - type: wer
25
- value: 77.31885348050561
26
- name: Wer
27
  ---
28
 
29
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -33,8 +33,8 @@ should probably proofread and complete it, then remove this comment. -->
33
 
34
  This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the common_voice_13_0 dataset.
35
  It achieves the following results on the evaluation set:
36
- - Loss: 0.3920
37
- - Wer: 77.3189
38
 
39
  ## Model description
40
 
@@ -60,14 +60,15 @@ The following hyperparameters were used during training:
60
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
61
  - lr_scheduler_type: linear
62
  - lr_scheduler_warmup_steps: 10
63
- - training_steps: 100
64
  - mixed_precision_training: Native AMP
65
 
66
  ### Training results
67
 
68
  | Training Loss | Epoch | Step | Validation Loss | Wer |
69
  |:-------------:|:------:|:----:|:---------------:|:-------:|
70
- | 0.4575 | 0.1140 | 100 | 0.3920 | 77.3189 |
 
71
 
72
 
73
  ### Framework versions
 
1
  ---
2
+ library_name: transformers
3
+ license: apache-2.0
4
  base_model: openai/whisper-small
5
+ tags:
6
+ - generated_from_trainer
7
  datasets:
8
  - common_voice_13_0
 
 
9
  metrics:
10
  - wer
 
 
11
  model-index:
12
  - name: whisper-small-Cantonese-test
13
  results:
14
  - task:
 
15
  name: Automatic Speech Recognition
16
+ type: automatic-speech-recognition
17
  dataset:
18
  name: common_voice_13_0
19
  type: common_voice_13_0
 
21
  split: test
22
  args: zh-HK
23
  metrics:
24
+ - name: Wer
25
+ type: wer
26
+ value: 71.49724051985046
27
  ---
28
 
29
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 
33
 
34
  This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the common_voice_13_0 dataset.
35
  It achieves the following results on the evaluation set:
36
+ - Loss: 0.3400
37
+ - Wer: 71.4972
38
 
39
  ## Model description
40
 
 
60
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
61
  - lr_scheduler_type: linear
62
  - lr_scheduler_warmup_steps: 10
63
+ - training_steps: 200
64
  - mixed_precision_training: Native AMP
65
 
66
  ### Training results
67
 
68
  | Training Loss | Epoch | Step | Validation Loss | Wer |
69
  |:-------------:|:------:|:----:|:---------------:|:-------:|
70
+ | 0.5135 | 0.1140 | 100 | 0.4197 | 78.8143 |
71
+ | 0.4537 | 0.2281 | 200 | 0.3400 | 71.4972 |
72
 
73
 
74
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:850656614b5f1ea5d7dee9abc3523e1774f68fe7d9807168e56656a5b69b7d35
3
  size 966995080
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:5563fafa85c57cb5bef513c797fb7059f32564ab8a6be8ca9a3d265899cede02
3
  size 966995080
runs/Sep21_22-07-24_asus2/events.out.tfevents.1726981646.asus2.29844.0 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:1cbd7a78c933b9c408406d46d0cb532e28f0bd6548f82a1b9c37d117726e4ab5
3
+ size 8490
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:bc53f834a5015588e1b0f0575c111d98390bdacaa857053e4d19621e85b3c8dd
3
  size 5432
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:b4ce8fb59e5b821b40989acef5f5bf76d4c34dface0113aa20e140a413a741e3
3
  size 5432