mah92's picture
Update README.md
4526d1a verified
metadata
license: cc0-1.0
datasets:
  - mah92/Khadijah-FA_EN-Public-Phone-Audio-Dataset
language:
  - fa
  - en
pipeline_tag: text-to-speech

بسم اله الرحمن الرحیم - هست کلید در گنج حکیم

Model Card for Khadijah(SA)

This is the first persian/english text-to-speech model using the brand new matcha TTS model.

Much faster and better than VITS.

Works best with the UNIVERSAL_V1_22050Hz hifigan vocoder.

You can test this model here under persian+english part.

Enjoy!

Training method

see: how_to_train_matcha_tts

Training results

Training Results

Credits

Trained by Ali Mahmoudi (@mah92)

Special thanks to Masoud Azizi (@Mablue ), Amirreza Ramezani (@brightening-eyes ), and Dr. Hamid Jafari (Khaneh Noor Iranian Basir).

Special thanks to people from @ttsfarsi channel.

I should also thank you @csukuangfj from Xiaomi corporation for your helps and cares in icefall and sherpa-onnx repos.

و ما نحن بشئ الا بما رحم ربنا