---
license: apache-2.0
---

# Tunisian Arabic ASR Model with wav2vec2 and code switching

This repository provides the tools needed to perform automatic speech recognition with an end-to-end system pretrained on the Tunisian Arabic dialect. The model uses a code-switching approach and can process English, French, and Tunisian Arabic.

## Performance

The performance of the model is:

| Release Version | WER (%) | CER (%) |
|-----------------|---------|---------|
| v1.0            | 29.47   | 12.44   |

## Pipeline

The architecture comprises three components:

* French ASR pretrained with wav2vec2 on French corpora
* English ASR pretrained with wav2vec2 on English corpora
* Custom Tunisian ASR pretrained with wav2vec on a Tunisian Arabic corpus

All three models process the audio data. The resulting posteriorgrams are then combined and used as input to the Mixer, which produces the final posteriorgrams.

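The data flow described above can be illustrated with a small, dependency-free sketch. It is not the repository's implementation: the source does not say how the posteriorgrams are combined or what the Mixer looks like, so this example assumes per-frame concatenation and a hypothetical Mixer made of a single linear layer followed by a softmax. The function names, vocabulary sizes, and random weights are all illustrative, and the three models are assumed to emit the same number of frames.

```python
import math
import random


def softmax(scores):
    """Turn raw scores into a probability distribution."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def concat_posteriorgrams(*posteriorgrams):
    """Concatenate the per-frame vectors of several models (assumed fusion step)."""
    n_frames = len(posteriorgrams[0])
    if any(len(p) != n_frames for p in posteriorgrams):
        raise ValueError("all posteriorgrams must have the same number of frames")
    return [sum((list(p[t]) for p in posteriorgrams), []) for t in range(n_frames)]


def mix(frames, weights, bias):
    """Hypothetical Mixer: one linear layer plus softmax, applied frame by frame."""
    mixed = []
    for frame in frames:
        scores = [
            sum(w * x for w, x in zip(row, frame)) + b
            for row, b in zip(weights, bias)
        ]
        mixed.append(softmax(scores))
    return mixed


# Toy posteriorgrams: 2 frames each, with made-up vocabulary sizes (3, 3, 4).
french = [[0.7, 0.2, 0.1], [0.1, 0.8, 0.1]]
english = [[0.3, 0.3, 0.4], [0.6, 0.2, 0.2]]
tunisian = [[0.1, 0.1, 0.1, 0.7], [0.25, 0.25, 0.25, 0.25]]

fused = concat_posteriorgrams(french, english, tunisian)  # 2 frames x 10 values

# Random stand-in weights; a real Mixer would be trained.
rng = random.Random(0)
out_size = 5  # illustrative size of the final output vocabulary
weights = [[rng.uniform(-1, 1) for _ in range(len(fused[0]))] for _ in range(out_size)]
bias = [0.0] * out_size

final = mix(fused, weights, bias)  # 2 frames x 5 probabilities
```

Each row of `final` is a probability distribution over the toy output vocabulary, which is the role the final posteriorgrams play in the pipeline.
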
## Install

```bash
pip install speechbrain transformers
```

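To confirm that the installation succeeded, a quick check such as the following may help. It is a minimal sketch that uses only the Python standard library; `missing_packages` is a hypothetical helper written for this example, not part of SpeechBrain or Transformers.

```python
from importlib.util import find_spec


def missing_packages(names):
    """Return the top-level packages from `names` that cannot be found."""
    return [name for name in names if find_spec(name) is None]


missing = missing_packages(["speechbrain", "transformers"])
if missing:
    print("Missing packages:", ", ".join(missing))
else:
    print("speechbrain and transformers are importable")
```
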
|
|