kimbochen
/

whisper-small-zh-tw

@@ -1,41 +1,38 @@
 ---
-language:
-- zh
 license: apache-2.0
 tags:
-- whisper-event
 - generated_from_trainer
 datasets:
-- mozilla-foundation/common_voice_11_0
 metrics:
 - wer
 model-index:
-- name: Whisper Small Chinese - Kimbo Chen
   results:
   - task:
       name: Automatic Speech Recognition
       type: automatic-speech-recognition
     dataset:
-      name: Common Voice 11.0
-      type: mozilla-foundation/common_voice_11_0
       config: zh-TW
       split: test
       args: zh-TW
     metrics:
     - name: Wer
       type: wer
-      value: 40.81883316274309
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# Whisper Small Chinese - Kimbo Chen
-This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the Common Voice 11.0 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1984
-- Wer: 40.8188
 ## Model description
@@ -56,11 +53,11 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
 - train_batch_size: 64
-- eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- lr_scheduler_warmup_steps: 100
 - training_steps: 1000
 - mixed_precision_training: Native AMP
@@ -68,11 +65,11 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Wer     |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|
-| 0.1438        | 1.05  | 200  | 0.1822          | 42.4360 |
-| 0.0315        | 2.1   | 400  | 0.1869          | 42.1290 |
-| 0.0113        | 4.01  | 600  | 0.1953          | 40.6346 |
-| 0.0053        | 5.06  | 800  | 0.1950          | 40.6755 |
-| 0.0035        | 6.11  | 1000 | 0.1984          | 40.8188 |
 ### Framework versions

 ---
 license: apache-2.0
 tags:
 - generated_from_trainer
 datasets:
+- common_voice_11_0
 metrics:
 - wer
 model-index:
+- name: openai/whisper-small
   results:
   - task:
       name: Automatic Speech Recognition
       type: automatic-speech-recognition
     dataset:
+      name: common_voice_11_0
+      type: common_voice_11_0
       config: zh-TW
       split: test
       args: zh-TW
     metrics:
     - name: Wer
       type: wer
+      value: 32.594792142530835
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# openai/whisper-small
+This model is a fine-tuned version of [openai/whisper-small](https://huggingface.co/openai/whisper-small) on the common_voice_11_0 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3250
+- Wer: 32.5948
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 1e-05
 - train_batch_size: 64
+- eval_batch_size: 32
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- lr_scheduler_warmup_steps: 800
 - training_steps: 1000
 - mixed_precision_training: Native AMP
 | Training Loss | Epoch | Step | Validation Loss | Wer     |
 |:-------------:|:-----:|:----:|:---------------:|:-------:|
+| 0.3465        | 1.05  | 200  | 0.3499          | 41.9324 |
+| 0.2137        | 2.1   | 400  | 0.2953          | 36.2951 |
+| 0.1255        | 4.01  | 600  | 0.2927          | 33.7232 |
+| 0.0509        | 5.06  | 800  | 0.3149          | 34.0566 |
+| 0.0164        | 6.11  | 1000 | 0.3250          | 32.5948 |
 ### Framework versions