# Tunisian Arabic ASR Model with wav2vec2
This repository provides all the tools needed to perform automatic speech recognition with an end-to-end system pretrained on the Tunisian Arabic dialect.
## Performance
The performance of the model is:
| Release Version | LM         | WER (%) | CER (%) |
|-----------------|------------|---------|---------|
| v1.0            | Without LM | 11.82   | 6.33    |
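
For readers unfamiliar with the two metrics in the table, the following is a minimal sketch of how WER and CER are commonly computed: the Levenshtein edit distance between reference and hypothesis, divided by the reference length, over words for WER and over characters for CER. The function names are hypothetical, and ignoring spaces in the CER is an assumption; this is not necessarily the scoring code that produced the numbers above.

```python
def edit_distance(ref, hyp):
    """Levenshtein distance between two sequences (substitutions, insertions, deletions)."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, 1):
        curr = [i]
        for j, h in enumerate(hyp, 1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def wer(reference, hypothesis):
    """Word error rate in percent, relative to the reference length."""
    ref_words = reference.split()
    if not ref_words:
        raise ValueError("reference must contain at least one word")
    return 100.0 * edit_distance(ref_words, hypothesis.split()) / len(ref_words)


def cer(reference, hypothesis):
    """Character error rate in percent (assumption: spaces are ignored)."""
    ref_chars = list(reference.replace(" ", ""))
    if not ref_chars:
        raise ValueError("reference must contain at least one character")
    hyp_chars = list(hypothesis.replace(" ", ""))
    return 100.0 * edit_distance(ref_chars, hyp_chars) / len(ref_chars)
```

Both functions score a single utterance. Corpus-level figures are usually obtained by summing the edit distances and the reference lengths over all utterances before dividing.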
## Dataset
This ASR model was trained on:
* TARIC: The Tunisian Arabic Railway Interaction Corpus (TARIC) is a collection of audio recordings and transcriptions of dialogues in the Tunisian Railway Transport Network. [TARIC Corpus](https://aclanthology.org/L14-1385/)
* STAC: A corpus of spoken Tunisian Arabic. [STAC Corpus](https://www.researchgate.net/publication/307583782_Spoken_Tunisian_Arabic_Corpus_STAC_Transcription_and_Annotation)
* IWSLT: A Tunisian conversational speech corpus. [IWSLT Corpus](https://iwslt.org/2022/dialect)
* Tunspeech: Our custom dataset
## Install
```bash
pip install speechbrain transformers
```