metadata
title: README
emoji: 🔥
colorFrom: purple
colorTo: green
sdk: docker
pinned: false
S Labs Solutions
Active Projects
--
Dhwani - Your Kannada Speaking Voice Buddy
Overview
Dhwani is a self-hosted GenAI platform designed for voice-mode interaction in Kannada and other Indian languages.
Research Goals
- Measure and improve the Time to First Token Generation (TTFTG) for model architectures in ASR, Translation, and TTS systems.
- Develop and enhance a Kannada voice model that meets the industry standards set by OpenAI, Google, ElevenLabs, and xAI.
- Create robust voice solutions for Indian languages, with a specific emphasis on Kannada.
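As an illustration of the first goal, here is a minimal sketch of timing the first token from a streaming model. It is not taken from the Dhwani code base: `time_to_first_token` and `fake_model_stream` are hypothetical names, and the stream is a stand-in for whatever streaming ASR, translation, or TTS interface is actually used.

```python
import time
from typing import Callable, Iterable, Iterator, Tuple


def time_to_first_token(
    stream: Iterable[str],
    clock: Callable[[], float] = time.perf_counter,
) -> Tuple[float, str]:
    """Return (seconds until the first item arrived, that first item)."""
    start = clock()
    for token in stream:
        # Stop at the first item: only the latency to first output is measured.
        return clock() - start, token
    raise ValueError("stream produced no tokens")


def fake_model_stream(delay_s: float = 0.05) -> Iterator[str]:
    """Hypothetical stand-in for a streaming model (assumption, not a real API)."""
    time.sleep(delay_s)  # simulated latency before the first token
    yield "namaskara"
    yield "!"


if __name__ == "__main__":
    latency, first = time_to_first_token(fake_model_stream())
    print(f"first token {first!r} after {latency * 1000:.1f} ms")
```

The clock is injectable so the measurement can be checked deterministically; with a real model, the same function would wrap the model's own token or audio-chunk iterator.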
Project Video
Models and Tools
The project utilizes the following open-source tools:
Open-Source Tool | Source Repository |
---|---|
Automatic Speech Recognition (ASR) | ASR Indic Server |
Text to Speech (TTS) | TTS Indic Server |
Translation | Indic Translate Server |
Document Parser | Indic Document Server |
Dhwani Server | Dhwani Server |
Dhwani Android | Android |
Large Language Model | LLM Indic Server |
Features
Feature | Description | Components |
---|---|---|
Kannada Voice AI | Provides answers to voice queries using an LLM | LLM |
Text Query | Allows querying text data for specific information. | LLM |
Voice to Text Translation | Converts spoken language to text and translates it. | ASR, Translation |
PDF Translate | Translates content from PDF documents. | |
Text to Speech | Generates speech from text. | TTS |
Voice to Voice Translation | Converts spoken language to text, translates it, and then generates speech. | ASR, Translation, TTS |
Answer Engine with Translate | Provides answers to queries with translation capabilities. | ASR, LLM, Translation, TTS |
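The Components column describes each feature as a chain of stages. The sketch below shows one way such a chain could be composed, using Voice to Text Translation and Voice to Voice Translation as examples. Everything here is an assumption for illustration: `VoicePipeline`, the stub functions, and the `"kn"`/`"en"` language codes are hypothetical and do not reflect the actual Dhwani Server API.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class VoicePipeline:
    """Chains three stages; each callable is a placeholder for a real service."""
    asr: Callable[[bytes], str]                # audio -> transcript
    translate: Callable[[str, str, str], str]  # (text, src, tgt) -> translated text
    tts: Callable[[str], bytes]                # text -> audio

    def voice_to_text(self, audio: bytes, src: str, tgt: str) -> str:
        # Voice to Text Translation: ASR, then Translation.
        return self.translate(self.asr(audio), src, tgt)

    def voice_to_voice(self, audio: bytes, src: str, tgt: str) -> bytes:
        # Voice to Voice Translation: ASR, Translation, then TTS.
        return self.tts(self.voice_to_text(audio, src, tgt))


# Hypothetical stubs so the sketch runs without any model or server.
def stub_asr(audio: bytes) -> str:
    return audio.decode("utf-8")


def stub_translate(text: str, src: str, tgt: str) -> str:
    return f"[{src}->{tgt}] {text}"


def stub_tts(text: str) -> bytes:
    return text.encode("utf-8")


pipeline = VoicePipeline(asr=stub_asr, translate=stub_translate, tts=stub_tts)

if __name__ == "__main__":
    print(pipeline.voice_to_voice("namaskara".encode("utf-8"), "kn", "en"))
```

Keeping each stage behind a plain callable means a stub can be swapped for a real client without changing the chaining logic.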
Contact
- For any questions or issues, please open an issue on GitHub or contact us via email.
- For collaborations:
  - Join the Discord group - invite link
  - For business queries, email: info (at) slabstech (dot) com