---
license: cc0-1.0
datasets:
- mah92/Musa-FA_EN-Public-Phone-Audio-Dataset
language:
- fa
- en
pipeline_tag: text-to-speech
---
# "In the name of God, the Most Gracious, the Most Merciful" - the key to the door of the Wise One's treasury
# Model Card for Musa(AS)

This is the second Persian/English text-to-speech model built on the recent Matcha-TTS model.

It is much faster than VITS and produces better-sounding speech.

You can test this model [here](https://huggingface.co/spaces/k2-fsa/text-to-speech) under the Persian+English section.

Enjoy!


## Usage with the Sherpa-onnx repo

See my other repo on Hugging Face.
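
For orientation, here is a minimal sketch of how a Matcha-TTS model is typically loaded through the sherpa-onnx Python package (`pip install sherpa-onnx`). It is not taken from this repo: the file names `model.onnx`, `vocoder.onnx`, `tokens.txt` and the `espeak-ng-data` directory are placeholders, and the assumption that this model needs an espeak-ng data directory and no lexicon should be checked against the actual files you download.

```python
import struct
import wave
from pathlib import Path


def float_to_pcm16(samples):
    """Convert float samples in [-1.0, 1.0] to little-endian 16-bit PCM bytes."""
    ints = [int(round(max(-1.0, min(1.0, s)) * 32767)) for s in samples]
    return struct.pack(f"<{len(ints)}h", *ints)


def synthesize(text, model_dir, out_wav, speed=1.0):
    """Generate speech for `text` and write it to `out_wav` as a mono WAV file."""
    # Imported lazily so the helper above works without sherpa-onnx installed.
    import sherpa_onnx

    model_dir = Path(model_dir)
    config = sherpa_onnx.OfflineTtsConfig(
        model=sherpa_onnx.OfflineTtsModelConfig(
            matcha=sherpa_onnx.OfflineTtsMatchaModelConfig(
                # All file names below are placeholders (assumptions).
                acoustic_model=str(model_dir / "model.onnx"),
                vocoder=str(model_dir / "vocoder.onnx"),
                lexicon="",
                tokens=str(model_dir / "tokens.txt"),
                data_dir=str(model_dir / "espeak-ng-data"),
            ),
            num_threads=1,
            provider="cpu",
        ),
    )
    if not config.validate():
        raise ValueError("Invalid TTS config; check the model file paths.")

    tts = sherpa_onnx.OfflineTts(config)
    audio = tts.generate(text, sid=0, speed=speed)

    # The sample rate comes from the model; audio.samples holds floats.
    with wave.open(str(out_wav), "wb") as f:
        f.setnchannels(1)
        f.setsampwidth(2)
        f.setframerate(audio.sample_rate)
        f.writeframes(float_to_pcm16(audio.samples))
```

With the model files in a local directory, a call such as `synthesize("Hello, world", "./musa", "out.wav")` would write the generated audio to `out.wav`; the directory name here is only an example.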

## Usage with the Matcha-TTS repo

See my other repo on Hugging Face.

## Training results

![Training Results](musa-22050.png)

## Credits

Trained by Ali Mahmoudi (@mah92)

Special thanks to Masoud Azizi (@Mablue), Amirreza Ramezani (@brightening-eyes), and Dr. Hamid Jafari (Khaneh Noor Iranian Basir).

Special thanks to the people from the @ttsfarsi channel.

I would also like to thank @csukuangfj from Xiaomi Corporation for the help and support in the icefall and sherpa-onnx repos.

And we are nothing except through the mercy of our Lord.