Update README.md
README.md
CHANGED
@@ -25,6 +25,20 @@ pipeline_tag: automatic-speech-recognition
 - Trained for 6 epochs with a learning rate of 1e-4 and warmup steps of 100 without gradient accumulation.
 - **Evaluation Metrics:** Word Error Rate (WER)
 
+## Evaluation
+- **Testing Data**
+  - Test Data 1: LoFi Students (LEVI_LoFi_v2_TEST_punc+cased_student)
+  - Test Data 2: LoFi Tutors (LEVI_LoFi_v2_TEST_punc+cased_tutor)
+  - Test Data 3: HiFi Students (LEVI_orig11_HiFi_punc+cased_student)
+  - Test Data 4: HiFi Tutor (LEVI_orig11_HiFi_punc+cased_tutor)
+- **Metric**
+  - Word Error Rate (WER)
+- **Results**
+  - Test Data 1: 44.1%
+  - Test Data 2: 15.1%
+  - Test Data 3: 44.2%
+  - Test Data 4: 15.9%
+
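For readers unfamiliar with the metric reported in the added Evaluation section, here is a minimal sketch of how Word Error Rate is conventionally computed: the word-level edit distance (substitutions, deletions, insertions) divided by the number of reference words. The function name `word_error_rate` is hypothetical. The sketch splits on whitespace and applies no casing or punctuation normalization, which is an assumption: the README does not say how the reported figures were scored.

```python
def word_error_rate(reference: str, hypothesis: str) -> float:
    """Word-level Levenshtein distance divided by the reference length.

    Assumption: tokens are whitespace-separated and compared exactly,
    so casing and punctuation differences count as errors.
    """
    ref = reference.split()
    hyp = hypothesis.split()
    if not ref:
        raise ValueError("reference must contain at least one word")

    # prev[j] holds the edit distance between the reference prefix
    # processed so far and the first j hypothesis words.
    prev = list(range(len(hyp) + 1))
    for i, ref_word in enumerate(ref, start=1):
        curr = [i] + [0] * len(hyp)
        for j, hyp_word in enumerate(hyp, start=1):
            substitution = prev[j - 1] + (0 if ref_word == hyp_word else 1)
            deletion = prev[j] + 1       # reference word missing from hypothesis
            insertion = curr[j - 1] + 1  # extra word in hypothesis
            curr[j] = min(substitution, deletion, insertion)
        prev = curr
    return prev[-1] / len(ref)


# One substitution ("sat" -> "sit") and one deletion ("the") over 6 reference words.
print(word_error_rate("the cat sat on the mat", "the cat sit on mat"))
```

Because insertions are counted, WER can exceed 100% when the hypothesis is much longer than the reference.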
 ## Usage
 - **Usage:** The model can be used for speech recognition tasks. Inputs should be audio files, and the model outputs transcriptions.
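The Usage note above can be illustrated with a short sketch, assuming the checkpoint loads through the Hugging Face `transformers` automatic-speech-recognition pipeline (the hunk header shows `pipeline_tag: automatic-speech-recognition`, but the README does not confirm a loading recipe). The helper name `transcribe` and the `model_id` argument are hypothetical; the repository ID is not given in this diff, so it must be supplied by the caller.

```python
def transcribe(audio_path: str, model_id: str) -> str:
    """Transcribe one audio file with an ASR pipeline.

    Assumptions: `transformers` (plus an audio backend such as ffmpeg)
    is installed, and `model_id` names a checkpoint that the
    automatic-speech-recognition pipeline can load.
    """
    # Imported lazily so the module can be loaded without transformers.
    from transformers import pipeline

    asr = pipeline("automatic-speech-recognition", model=model_id)
    result = asr(audio_path)  # dict containing the transcription under "text"
    return result["text"]


# Example call (placeholders, not real values):
# text = transcribe("sample.wav", "<model-repo-id>")
```

Building the pipeline inside the function keeps the sketch short; for many files, constructing it once and reusing it avoids reloading the model on every call.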