metadata
tags:
- espnet
- audio
- speech-recognition
language: en
datasets:
- google/fleurs
license: cc-by-4.0
ESPnet2 ASR model
espnet/wanchichen_fleurs_asr_conformer_sctctc
This model was trained by William Chen using the fleurs recipe in ESPnet.
Demo: How to use in ESPnet2
cd espnet
pip install -e .
cd egs2/fleurs/asr1
./run.sh
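For inference without running the full recipe, ESPnet2 also offers a Python interface. The following is a minimal sketch and is not part of the original model card. It assumes that `espnet_model_zoo` and `soundfile` are installed alongside ESPnet, that any extra dependencies of the recipe's frontend are available, and that the input is a mono 16 kHz recording. The helper name `transcribe` and the file name `sample.wav` are hypothetical, and the default decoding options may need adjusting to match the recipe configuration.

```python
MODEL_TAG = "espnet/wanchichen_fleurs_asr_conformer_sctctc"


def transcribe(wav_path, model_tag=MODEL_TAG):
    """Return the best hypothesis text for one audio file (illustrative helper)."""
    # Imported lazily so this module loads even where ESPnet is not installed.
    import soundfile as sf
    from espnet2.bin.asr_inference import Speech2Text

    speech, sample_rate = sf.read(wav_path)
    # Assumption: the model expects 16 kHz input.
    if sample_rate != 16000:
        raise ValueError(f"expected 16 kHz audio, got {sample_rate} Hz")

    # Downloads and caches the model on first use (needs espnet_model_zoo).
    speech2text = Speech2Text.from_pretrained(model_tag)

    # Each n-best entry is (text, tokens, token_ids, hypothesis); keep the text.
    text, *_ = speech2text(speech)[0]
    return text
```

Calling `transcribe("sample.wav")` would then return the top hypothesis as a string, provided the assumptions above hold.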
RESULTS
Environments
- date: Sat Oct 22 14:55:21 EDT 2022
- python version: 3.8.6 (default, Dec 17 2020, 16:57:01) [GCC 10.2.0]
- espnet version: espnet 202207
- pytorch version: pytorch 1.8.1+cu102
- Git hash: e534106b837ff6cdd29977a52983c022ff1afb0f
- Commit date: Sun Sep 11 22:31:23 2022 -0400
asr_train_asr_xlsr_conformer_scctc_raw_all_bpe6500_sp
WER
dataset | Snt | Wrd | Corr | Sub | Del | Ins | Err | S.Err |
---|---|---|---|---|---|---|---|---|
decode_asr_lm_lm_train_lm_all_bpe6500_valid.loss.ave_asr_model_valid.acc.ave_3best/test_all | 77809 | 1592160 | 70.5 | 26.1 | 3.4 | 3.4 | 32.9 | 97.0 |
CER
dataset | Snt | Wrd | Corr | Sub | Del | Ins | Err | S.Err |
---|---|---|---|---|---|---|---|---|
decode_asr_lm_lm_train_lm_all_bpe6500_valid.loss.ave_asr_model_valid.acc.ave_3best/test_all | 77809 | 10235271 | 92.2 | 4.7 | 3.1 | 2.6 | 10.4 | 97.0 |
TER
dataset | Snt | Wrd | Corr | Sub | Del | Ins | Err | S.Err |
---|---|---|---|---|---|---|---|---|
decode_asr_lm_lm_train_lm_all_bpe6500_valid.loss.ave_asr_model_valid.acc.ave_3best/test_all | 77809 | 9622352 | 91.3 | 5.6 | 3.1 | 2.7 | 11.4 | 97.0 |
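In each table above, Err is the sum of the Sub, Del, and Ins percentages, while Corr, Sub, and Del together account for all reference tokens. The short Python sketch below re-checks the reported rows against those two identities. The names `rows` and `check_row` are illustrative, and the 0.1 tolerance is an assumption to absorb rounding in the one-decimal figures.

```python
# Reported percentages copied from the WER, CER, and TER tables.
rows = {
    "WER": {"corr": 70.5, "sub": 26.1, "del": 3.4, "ins": 3.4, "err": 32.9},
    "CER": {"corr": 92.2, "sub": 4.7, "del": 3.1, "ins": 2.6, "err": 10.4},
    "TER": {"corr": 91.3, "sub": 5.6, "del": 3.1, "ins": 2.7, "err": 11.4},
}

TOLERANCE = 0.1  # assumed slack for one-decimal rounding


def check_row(row, tol=TOLERANCE):
    """Return True if a row satisfies both identities within `tol`."""
    # Err = Sub + Del + Ins
    err_ok = abs(row["sub"] + row["del"] + row["ins"] - row["err"]) <= tol + 1e-9
    # Corr + Sub + Del = 100 (every reference token is correct, substituted, or deleted)
    ref_ok = abs(row["corr"] + row["sub"] + row["del"] - 100.0) <= tol + 1e-9
    return err_ok and ref_ok


for name, row in rows.items():
    print(name, "consistent" if check_row(row) else "inconsistent")
```

With the values listed, all three rows satisfy both identities. Insertions are counted against the reference length, which is why Err can exceed 100 minus Corr.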