|
# Mockingbird TTS Demo |
|
This repo hosts Mockingbird, a demo of open Text-to-Speech tools. |
|
|
|
Currently, 3 synthesizers are supported: |
|
- [**Meta's Massively Multilingual Speech (MMS)**](https://ai.meta.com/blog/multilingual-model-speech-recognition/) model |
|
- [**Coqui's TTS**](https://docs.coqui.ai/en/latest/#) package and the models supplied via that |
|
- [**ESpeak-NG's**](espeak-ng) synthetic voices |
|
|
|
Voice conversion is achieved through Coqui. |
|
|
|
Notes: |
|
1. ESpeak-NG seems to have the worst performance out of the box, but it has a lot of options for controlling voice output. |
|
2. Coqui is no longer being officially developed. |
|
3. Where a synthesizer supports multiple models/voices, I manually pick the appropriate model. |
|
4. Not all synthesizers support a given language. |
|
|