SalahZa
/

Code_Switched_Tunisian_Speech_Recognition

Model card Files Files and versions Community

Code_Switched_Tunisian_Speech_Recognition / README.md

SalahZa's picture

first commit

0d1350d about 1 year ago

|

1.02 kB

	---
	license: apache-2.0
	---
	# Tunisian Arabic ASR Model with wav2vec2 and code switching
	This repository provides all the necessary tools to perform automatic speech recognition from an end-to-end system pretrained on Tunisian arabic dialect. This model utilizes a code_switching approach and can process english , french and tunisian arabic
	## Performance
	the performance of the mode is :
	\| Release Version \|WER (%) \| CER (%) \|
	\|-----------------\|---------\|---------\|
	\| v1.0 \|29.47 \| 12.44 \|
	## Pipeline
	The architecture comprises three components:
	* French ASR pretrained with wav2vec2 on french corporas
	* English ASR pretrained with wav2vec2 on english corporas
	* Custom Tunisian ASR pretrained using wav2vec on a tunisian arabic corpora
	All three models will process the audio data. Subsequently, the resulting posteriorgrams will be combined and utilized as input for the Mixer, which will produce the final posteriorgrams.
	## Install
	```python
	pip install speechbrain transformers
	```