EzraWilliam
/

wav2vec2-xlsr-53-CV-demo-google-colab-Ezra_William_Prod14

@@ -1,8 +1,8 @@
 ---
 license: apache-2.0
 tags:
 - generated_from_trainer
-base_model: facebook/wav2vec2-large-xlsr-53
 datasets:
 - common_voice_13_0
 metrics:
@@ -11,8 +11,8 @@ model-index:
 - name: wav2vec2-xlsr-53-CV-demo-google-colab-Ezra_William_Prod14
   results:
   - task:
-      type: automatic-speech-recognition
       name: Automatic Speech Recognition
     dataset:
       name: common_voice_13_0
       type: common_voice_13_0
@@ -20,9 +20,9 @@ model-index:
       split: test
       args: id
     metrics:
-    - type: wer
-      value: 0.9999539085545722
-      name: Wer
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
@@ -32,8 +32,8 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) on the common_voice_13_0 dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.9481
-- Wer: 1.0000
 ## Model description
@@ -52,8 +52,8 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 0.003
-- train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
@@ -65,23 +65,23 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss | Wer    |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
-| 2.9078        | 1.0   | 556  | 2.9342          | 1.0    |
-| 2.875         | 2.0   | 1112 | 2.8557          | 1.0    |
-| 2.6528        | 3.0   | 1668 | 2.5665          | 1.0000 |
-| 2.386         | 4.0   | 2224 | 2.3012          | 1.0000 |
-| 2.3101        | 5.0   | 2780 | 2.1943          | 0.9999 |
-| 2.2018        | 6.0   | 3336 | 2.1332          | 1.0    |
-| 2.1752        | 7.0   | 3892 | 2.0791          | 1.0    |
-| 2.1255        | 8.0   | 4448 | 2.0347          | 1.0    |
-| 2.0975        | 9.0   | 5004 | 2.0129          | 1.0    |
-| 2.0718        | 10.0  | 5560 | 1.9677          | 1.0    |
-| 2.0771        | 11.0  | 6116 | 1.9591          | 1.0    |
-| 2.0516        | 12.0  | 6672 | 1.9481          | 1.0000 |
 ### Framework versions
-- Transformers 4.39.3
 - Pytorch 2.2.2+cu121
 - Datasets 2.18.0
-- Tokenizers 0.15.2

 ---
 license: apache-2.0
+base_model: facebook/wav2vec2-large-xlsr-53
 tags:
 - generated_from_trainer
 datasets:
 - common_voice_13_0
 metrics:
 - name: wav2vec2-xlsr-53-CV-demo-google-colab-Ezra_William_Prod14
   results:
   - task:
       name: Automatic Speech Recognition
+      type: automatic-speech-recognition
     dataset:
       name: common_voice_13_0
       type: common_voice_13_0
       split: test
       args: id
     metrics:
+    - name: Wer
+      type: wer
+      value: 0.32789454277286134
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 This model is a fine-tuned version of [facebook/wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53) on the common_voice_13_0 dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.3505
+- Wer: 0.3279
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 0.0001
+- train_batch_size: 16
 - eval_batch_size: 8
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 | Training Loss | Epoch | Step | Validation Loss | Wer    |
 |:-------------:|:-----:|:----:|:---------------:|:------:|
+| 2.9451        | 1.0   | 278  | 2.9182          | 1.0    |
+| 2.87          | 2.0   | 556  | 2.7116          | 1.0    |
+| 1.1102        | 3.0   | 834  | 0.6030          | 0.5907 |
+| 0.6952        | 4.0   | 1112 | 0.4691          | 0.4755 |
+| 0.5976        | 5.0   | 1390 | 0.4316          | 0.4263 |
+| 0.4842        | 6.0   | 1668 | 0.3887          | 0.3842 |
+| 0.4444        | 7.0   | 1946 | 0.3722          | 0.3670 |
+| 0.4221        | 8.0   | 2224 | 0.3721          | 0.3538 |
+| 0.3929        | 9.0   | 2502 | 0.3527          | 0.3463 |
+| 0.3611        | 10.0  | 2780 | 0.3538          | 0.3386 |
+| 0.3669        | 11.0  | 3058 | 0.3513          | 0.3303 |
+| 0.3517        | 12.0  | 3336 | 0.3505          | 0.3279 |
 ### Framework versions
+- Transformers 4.40.0
 - Pytorch 2.2.2+cu121
 - Datasets 2.18.0
+- Tokenizers 0.19.1

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:b470860af4b7fc4d239bd0740df84bdf356e3fece6eeef202511db9107fa3793
 size 1261991980

 version https://git-lfs.github.com/spec/v1
+oid sha256:0f98d6f88e04a591ed3ecccf2a3ce79285dbb9929743c9f0a858beb1ddb4ac41
 size 1261991980

runs/Apr18_17-39-16_549c0851f499/events.out.tfevents.1713462074.549c0851f499.577.0 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e30fd76de9332cb146b5647baf0e9281a8faa817065259cb03967cce6a61d223
-size 15971

 version https://git-lfs.github.com/spec/v1
+oid sha256:c8a1cc7a4a62a60c10635c829e9805a0b63398c9c05a5fbab504f05f9b54b97c
+size 17594