title: README
emoji: 🔥
colorFrom: purple
colorTo: green
sdk: docker
pinned: false

S Labs Solutions

Active Projects

--

Dhwani - Your Kannada Speaking Voice Buddy

Overview

Dhwani is a self-hosted GenAI platform designed to provide voice mode interaction for Kannada and other Indian languages.

Research Goals

  • Measure and improve the Time to First Token Generation (TTFTG) for model architectures in ASR, Translation, and TTS systems.
  • Develop and enhance a Kannada voice model that meets the industry standards set by OpenAI, Google, ElevenLabs, and xAI.
  • Create robust voice solutions for Indian languages, with a specific emphasis on Kannada.
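As a rough illustration of the first goal, the sketch below times how long a streaming source takes to yield its first item. It is not taken from the Dhwani code base: `time_to_first_item` and `fake_stream` are hypothetical names, and `fake_stream` merely simulates a streaming ASR, Translation, or TTS backend with a fixed delay.

```python
import time
from typing import Iterable, Iterator, Tuple, TypeVar

T = TypeVar("T")


def time_to_first_item(stream: Iterable[T]) -> Tuple[float, T, Iterator[T]]:
    """Return (seconds until the first item, the first item, the remaining iterator)."""
    start = time.perf_counter()
    iterator = iter(stream)
    first = next(iterator)  # raises StopIteration if the stream is empty
    latency = time.perf_counter() - start
    return latency, first, iterator


def fake_stream(delay_s: float = 0.05) -> Iterator[str]:
    """Hypothetical stand-in for a streaming model backend (assumption, not a real API)."""
    time.sleep(delay_s)  # simulated model warm-up before the first token
    yield "ನಮಸ್ಕಾರ"
    yield "!"


if __name__ == "__main__":
    latency, first, rest = time_to_first_item(fake_stream())
    print(f"first item {first!r} after {latency * 1000:.1f} ms")
    print("remaining items:", list(rest))
```

In practice the same helper could wrap any iterator of tokens or audio chunks, so the ASR, Translation, and TTS stages can each be measured in the same way.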

Project Video

Models and Tools

The project utilizes the following open-source tools:

| Open-Source Tool | Source Repository |
| --- | --- |
| Automatic Speech Recognition (ASR) | ASR Indic Server |
| Text to Speech (TTS) | TTS Indic Server |
| Translation | Indic Translate Server |
| Document Parser | Indic Document Server |
| Dhwani Server | Dhwani Server |
| Dhwani Android | Android |
| Large Language Model | LLM Indic Server |

Features

| Feature | Description | Components |
| --- | --- | --- |
| Kannada Voice AI | Provides answers to voice queries using an LLM. | LLM |
| Text Query | Allows querying text data for specific information. | LLM |
| Voice to Text Translation | Converts spoken language to text and translates it. | ASR, Translation |
| PDF Translate | Translates content from PDF documents. | |
| Text to Speech | Generates speech from text. | TTS |
| Voice to Voice Translation | Converts spoken language to text, translates it, and then generates speech. | ASR, Translation, TTS |
| Answer Engine with Translate | Provides answers to queries with translation capabilities. | ASR, LLM, Translation, TTS |
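The way features combine components can be pictured as a chain of stages, each consuming the previous stage's output. The sketch below is only an illustration under stated assumptions: the stage functions are hypothetical stubs, not the APIs of the actual servers, and the stage order is inferred from the feature descriptions.

```python
from typing import Any, Callable, Dict, List


# Hypothetical stubs standing in for the real ASR, Translation, and TTS servers.
def asr(audio: bytes) -> str:
    return audio.decode("utf-8")  # pretend the audio bytes are already a transcript


def translate(text: str) -> str:
    return f"[translated] {text}"


def tts(text: str) -> bytes:
    return text.encode("utf-8")  # pretend the encoded text is synthesized audio


STAGES: Dict[str, Callable[[Any], Any]] = {
    "ASR": asr,
    "Translation": translate,
    "TTS": tts,
}

# Component chains for three features; the order is assumed from their descriptions.
FEATURES: Dict[str, List[str]] = {
    "Voice to Text Translation": ["ASR", "Translation"],
    "Text to Speech": ["TTS"],
    "Voice to Voice Translation": ["ASR", "Translation", "TTS"],
}


def run_feature(name: str, payload: Any) -> Any:
    """Pass the payload through each stage configured for the named feature."""
    for stage_name in FEATURES[name]:
        payload = STAGES[stage_name](payload)
    return payload


if __name__ == "__main__":
    print(run_feature("Voice to Voice Translation", "hello".encode("utf-8")))
```

Keeping the feature definitions as data rather than code means a new feature only needs a new entry in `FEATURES`, provided its stages already exist.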

Contact

  • For any questions or issues, please open an issue on GitHub or contact us via email.
  • For collaborations
  • For business queries, email info (at) slabstech (dot) com.