Hindi TTS (Text-to-Speech, 24kHz)

Overview

Hindi TTS is a high-quality Text-to-Speech model developed using the F5 TTS architecture. Built by FuturixAI and Quantum Works, this model enables natural-sounding Hindi speech synthesis and is distributed under the MIT license. It is intended for both research and commercial applications.

Key Features

  • Language: Hindi
  • Sampling Rate: 24 kHz

Training Data

The model was trained on the IndicVoices-R_Hindi dataset provided by IIT Madras.

Usage Instructions

Prerequisites

Ensure you have installed the necessary dependencies for the f5-tts_infer-cli. Refer to the GitHub repository for installation instructions:
https://github.com/rumourscape/F5-TTS

Example Usage

f5-tts_infer-cli \
--model "Futurix-AI/Hindi-TTS" \
--ref_audio "ref_audio.wav" \
--ref_text "यह संदर्भ ऑडियो का सामग्री, उपशीर्षक या लिप्यंतरण है।" \
--gen_text "यह एक उदाहरण है जो मॉडल से बोलने के लिए उत्पन्न किया गया है।"

Parameters:

  • --model: Replace "hindi_tts_checkpoint.pth" with the actual checkpoint file name.
  • --ref_audio: Path to the reference audio file (e.g., "ref_audio.wav").
  • --ref_text: Hindi text corresponding to the reference audio.
  • --gen_text: Hindi text for the TTS model to generate speech.

license: mit

Downloads last month

-

Downloads are not tracked for this model. How to track
Inference Providers NEW
This model is not currently available via any of the supported Inference Providers.
The model cannot be deployed to the HF Inference API: The HF Inference API does not support text-to-speech models for safetensors library.

Model tree for Futurix-AI/Hindi-TTS

Base model

SWivid/F5-TTS
Finetuned
(26)
this model

Dataset used to train Futurix-AI/Hindi-TTS