Update README.md
Browse files
README.md
CHANGED
@@ -10,7 +10,7 @@ tags:
|
|
10 |
- speech
|
11 |
license: mit
|
12 |
model-index:
|
13 |
-
- name: Wav2Vec2
|
14 |
results:
|
15 |
- task:
|
16 |
name: Speech Recognition
|
@@ -25,27 +25,11 @@ model-index:
|
|
25 |
value: 33.17
|
26 |
---
|
27 |
|
28 |
-
## Spaces Demo
|
29 |
-
Check the spaces demo [here](https://huggingface.co/spaces/Harveenchadha/wav2vec2-vakyansh-hindi/tree/main)
|
30 |
-
|
31 |
-
## Pretrained Model
|
32 |
-
|
33 |
-
Fine-tuned on Multilingual Pretrained Model [CLSRIL-23](https://arxiv.org/abs/2107.07402). The original fairseq checkpoint is present [here](https://github.com/Open-Speech-EkStep/vakyansh-models). When using this model, make sure that your speech input is sampled at 16kHz.
|
34 |
-
|
35 |
-
**Note: The result from this model is without a language model so you may witness a higher WER in some cases.**
|
36 |
-
|
37 |
## Dataset
|
38 |
|
39 |
This model was trained on 4200 hours of Hindi Labelled Data. The labelled data is not present in public domain as of now.
|
40 |
|
41 |
-
## Training Script
|
42 |
|
43 |
-
Models were trained using experimental platform setup by Vakyansh team at Ekstep. Here is the [training repository](https://github.com/Open-Speech-EkStep/vakyansh-wav2vec2-experimentation).
|
44 |
-
|
45 |
-
In case you want to explore training logs on wandb they are [here](https://wandb.ai/harveenchadha/hindi_finetuning_multilingual?workspace=user-harveenchadha).
|
46 |
-
|
47 |
-
|
48 |
-
## [Colab Demo](https://colab.research.google.com/github/harveenchadha/bol/blob/main/demos/hf/hindi/hf_hindi_him_4200_demo.ipynb)
|
49 |
|
50 |
## Usage
|
51 |
|
@@ -132,7 +116,5 @@ print("WER: {:2f}".format(100 * wer.compute(predictions=result["pred_strings"],
|
|
132 |
|
133 |
**Test Result**: 33.17 %
|
134 |
|
135 |
-
[**Colab Evaluation**](https://colab.research.google.com/github/harveenchadha/bol/blob/main/demos/hf/hindi/hf_vakyansh_hindi_him_4200_evaluation_common_voice.ipynb)
|
136 |
-
|
137 |
## Credits
|
138 |
-
Thanks to
|
|
|
10 |
- speech
|
11 |
license: mit
|
12 |
model-index:
|
13 |
+
- name: Wav2Vec2 Hindi Model by Aditi sharma
|
14 |
results:
|
15 |
- task:
|
16 |
name: Speech Recognition
|
|
|
25 |
value: 33.17
|
26 |
---
|
27 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
28 |
## Dataset
|
29 |
|
30 |
This model was trained on 4200 hours of Hindi Labelled Data. The labelled data is not present in public domain as of now.
|
31 |
|
|
|
32 |
|
|
|
|
|
|
|
|
|
|
|
|
|
33 |
|
34 |
## Usage
|
35 |
|
|
|
116 |
|
117 |
**Test Result**: 33.17 %
|
118 |
|
|
|
|
|
119 |
## Credits
|
120 |
+
Thanks to Deepmindz Innovations for making this possible.
|