---
license: apache-2.0
---
# Tunisian Arabic ASR Model with wav2vec2 and Code-Switching
This repository provides all the necessary tools to perform automatic speech recognition with an end-to-end system pretrained on the Tunisian Arabic dialect. The model uses a code-switching approach and can process English, French, and Tunisian Arabic.
## Performance
The model achieves the following error rates:
| Release Version | WER (%) | CER (%) |
|-----------------|---------|---------|
| v1.0            | 29.47   | 12.44   |
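
For readers unfamiliar with the two metrics, here is a minimal sketch of how WER and CER are conventionally computed: the Levenshtein edit distance between reference and hypothesis, divided by the reference length, over words and characters respectively. The example sentences are hypothetical, and this is not the evaluation script behind the numbers above; any text normalisation applied before scoring is unknown.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (substitutions, insertions, deletions)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        curr = [i]
        for j, h in enumerate(hyp, 1):
            curr.append(min(prev[j] + 1,              # deletion
                            curr[j - 1] + 1,          # insertion
                            prev[j - 1] + (r != h)))  # substitution or match
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate in percent, relative to the reference word count."""
    ref_words = reference.split()
    return 100.0 * edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate in percent, relative to the reference character count."""
    return 100.0 * edit_distance(list(reference), list(hypothesis)) / len(reference)


# Hypothetical transcripts, for illustration only.
print(wer("the cat sat", "the cat sit"))
print(cer("abcd", "abxd"))
```

Because both rates are normalised by the reference length, a hypothesis with many insertions can score above 100%.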
## Pipeline
The architecture comprises three components:
* French ASR pretrained with wav2vec2 on French corpora
* English ASR pretrained with wav2vec2 on English corpora
* Custom Tunisian ASR pretrained with wav2vec2 on a Tunisian Arabic corpus
All three models process the same audio input. Their output posteriorgrams are then combined and fed to the Mixer, which produces the final posteriorgrams.
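
The data flow described above can be sketched in plain Python. This is a toy illustration under stated assumptions, not the repository's implementation: the Mixer's real architecture is not described here, so it is stood in for by a hypothetical single linear layer followed by a softmax. The function names, vocabulary sizes, and random weights are all made up for the example.

```python
import math
import random


def softmax(scores):
    """Turn a list of raw scores into probabilities that sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def mix_posteriorgrams(french, english, tunisian, weights, bias):
    """Combine three frame-aligned posteriorgrams into one.

    Each input is a list of frames, and each frame is a list of per-token
    probabilities. `weights` and `bias` define a hypothetical linear mixer
    (an assumption; the real Mixer may be a different network).
    """
    if not (len(french) == len(english) == len(tunisian)):
        raise ValueError("all posteriorgrams must have the same number of frames")
    mixed = []
    for fr, en, tn in zip(french, english, tunisian):
        features = fr + en + tn  # concatenate the three frames along the token axis
        scores = [sum(w * x for w, x in zip(row, features)) + b
                  for row, b in zip(weights, bias)]
        mixed.append(softmax(scores))
    return mixed


def random_posteriorgram(n_frames, n_tokens):
    """Random stand-in for one model's output (illustration only)."""
    return [softmax([random.gauss(0, 1) for _ in range(n_tokens)])
            for _ in range(n_frames)]


random.seed(0)
n_frames, n_fr, n_en, n_tn, n_out = 5, 4, 3, 6, 8  # made-up sizes
french = random_posteriorgram(n_frames, n_fr)
english = random_posteriorgram(n_frames, n_en)
tunisian = random_posteriorgram(n_frames, n_tn)
weights = [[random.gauss(0, 0.1) for _ in range(n_fr + n_en + n_tn)]
           for _ in range(n_out)]
bias = [0.0] * n_out

final = mix_posteriorgrams(french, english, tunisian, weights, bias)
print(len(final), len(final[0]))  # number of frames, output vocabulary size
```

The sketch assumes the three models emit the same number of frames for a given audio clip, which holds when they share the same frame rate.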
## Install
```bash
pip install speechbrain transformers
```