Spaces:

HumeAI
/

expressive-tts-arena

Running

App Files Files Community

zach commited on Feb 4

Commit

87ff28a

1 Parent(s): 6130461

Add project README

Browse files

Files changed (1) hide show

README.md +68 -0

README.md ADDED Viewed

	@@ -0,0 +1,68 @@

+<div align="center">
+    <img src="https://storage.googleapis.com/hume-public-logos/hume/hume-banner.png">
+    <h1>Expressive TTS Arena</h1>
+    <p>
+        <strong>An interactive platform for comparing and evaluating the expressiveness of different text-to-speech engines</strong>
+    </p>
+</div>
+## Overview
+Expressive TTS Arena is an open-source web application that enables users to compare text-to-speech outputs with a focus on expressiveness rather than just audio quality. Built with Gradio, it provides a seamless interface for generating and comparing speech synthesis from different providers, including Hume and ElevenLabs.
+## Features
+- Text generation using Claude AI for creating expressive content
+- Direct text input or AI-assisted text generation
+- Comparative analysis of different TTS engines
+- Simple voting mechanism for preferred outputs
+- Random voice selection from multiple providers
+- Real-time speech synthesis comparison
+## Prerequisites
+- Python >=3.11.11
+- Virtual environment capability
+- API keys for Hume AI, Anthropic, and ElevenLabs
+### Installation
+1. Create and activate the virtual environment:
+    ```sh
+    python -m venv gradio-env
+    source gradio-env/bin/activate  # On Windows, use: gradio-env\Scripts\activate
+    ```
+2. Install dependencies:
+    ```sh
+    pip install -r requirements.txt
+    ```
+3. Configure environment variables:
+    - Create a `.env` file based on `.env.example`
+    - Add your API keys:
+    ```sh
+    HUME_API_KEY=YOUR_HUME_API_KEY
+    ANTHROPIC_API_KEY=YOUR_ANTHROPIC_API_KEY
+    ELEVENLABS_API_KEY=YOUR_ELEVENLABS_API_KEY
+    ```
+4. Run the application:
+    ```sh
+    watchfiles "python -m src.app"`
+    ```
+## User Flow
+1. **Enter or Generate Text:** Type directly in the Text box, or optionally enter a Prompt, click "Generate text", and edit if needed.
+2. **Synthesize Speech:** Click "Synthesize speech" to generate two audio outputs.
+3. **Listen & Compare:** Playback both options (A & B) to hear the differences.
+4. **Vote for Your Favorite:** Click "Vote for option A" or "Vote for option B" to choose your favorite.
+## Contributing
+We welcome contributions to the Expressive TTS Arena! This project is intended to serve as example code and is open-source. Feel free to submit issues, fork the repository, and create pull requests for any improvements.
+## License
+[Add your chosen license here]