HumeAI/expressive-tts-arena


License

MIT license


title	emoji	colorFrom	colorTo	sdk	app_file	python_version	pinned	license
Expressive TTS Arena	🎤	indigo	pink	docker	src/main.py	3.11	true	mit

Expressive TTS Arena

A web application for comparing and evaluating the expressiveness of different text-to-speech models

Overview

Expressive TTS Arena is an open-source web application for evaluating the expressiveness of voice generation and speech synthesis from different text-to-speech providers.

For support or to join the conversation, visit our Discord.

Prerequisites

Python >=3.11.11
pip >=25.0
uv >=0.5.29
Postgres
API keys for Hume AI, Anthropic, OpenAI, and ElevenLabs

Project Structure

Expressive TTS Arena/
├── public/
├── src/
│   ├── common/
│   │   ├── __init__.py
│   │   ├── common_types.py         # Application-wide custom type aliases and definitions.
│   │   ├── config.py               # Manages application config (Singleton) loaded from env vars.
│   │   ├── constants.py            # Application-wide constant values.
│   │   └── utils.py                # General-purpose utility functions used across modules.
│   ├── core/
│   │   ├── __init__.py
│   │   ├── tts_service.py          # Service handling Text-to-Speech provider selection and API calls.
│   │   └── voting_service.py       # Service managing database operations for votes and leaderboards.
│   ├── database/                   # Database access layer using SQLAlchemy.
│   │   ├── __init__.py
│   │   ├── crud.py                 # Data Access Objects (DAO) / CRUD operations for database models.
│   │   ├── database.py             # Database connection setup (engine, session management).
│   │   └── models.py               # SQLAlchemy ORM models defining database tables.
│   ├── frontend/
│   │   ├── components/
│   │   │   ├── __init__.py
│   │   │   ├── arena.py            # UI definition and logic for the 'Arena' tab.
│   │   │   └── leaderboard.py      # UI definition and logic for the 'Leaderboard' tab.
│   │   ├── __init__.py
│   │   └── frontend.py             # Main Gradio application class; orchestrates UI components and layout.
│   ├── integrations/               # Modules for interacting with external third-party APIs.
│   │   ├── __init__.py
│   │   ├── anthropic_api.py        # Integration logic for the Anthropic API.
│   │   ├── elevenlabs_api.py       # Integration logic for the ElevenLabs API.
│   │   └── hume_api.py             # Integration logic for the Hume API.
│   ├── middleware/
│   │   ├── __init__.py
│   │   └── meta_tag_injection.py   # Middleware for injecting custom HTML meta tags into the Gradio page.
│   ├── scripts/
│   │   ├── __init__.py
│   │   ├── init_db.py              # Script to create database tables based on models.
│   │   └── test_db.py              # Script for testing the database connection configuration.
│   ├── __init__.py
│   └── main.py                     # Main script to configure and run the Gradio application.
├── static/
│   ├── audio/                      # Temporary storage for generated audio files served to the UI.
│   └── css/
│       └── styles.css              # Custom CSS overrides and styling for the Gradio UI.
├── .dockerignore
├── .env.example
├── .gitignore
├── .pre-commit-config.yaml
├── Dockerfile
├── LICENSE.txt
├── pyproject.toml
├── README.md
└── uv.lock

Installation

This project uses the uv package manager. Follow the uv installation instructions for your platform.

Configure environment variables:

Create a .env file based on .env.example
Add your API keys:

HUME_API_KEY=YOUR_HUME_API_KEY
ANTHROPIC_API_KEY=YOUR_ANTHROPIC_API_KEY
ELEVENLABS_API_KEY=YOUR_ELEVENLABS_API_KEY
OPENAI_API_KEY=YOUR_OPENAI_API_KEY

Run the application:

Standard

uv run python -m src.main

With hot-reloading

uv run watchfiles "python -m src.main" src

Test the application by navigating to the localhost URL in your browser (e.g. localhost:7860 or http://127.0.0.1:7860)
(Optional) If contributing, install the pre-commit hook for automatic linting, formatting, and type-checking:
```
uv run pre-commit install
```
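
Before launching, it can help to confirm that the API keys from the configuration step are actually present in the environment. The following is a minimal sketch and not part of the project: the helper name `missing_api_keys` is hypothetical, and it only checks the four variable names listed above using the standard library.

```python
import os

# Variable names taken from the configuration step above.
REQUIRED_KEYS = (
    "HUME_API_KEY",
    "ANTHROPIC_API_KEY",
    "ELEVENLABS_API_KEY",
    "OPENAI_API_KEY",
)


def missing_api_keys(env=None):
    """Return the required variable names that are unset or blank.

    Hypothetical helper for illustration; `env` defaults to os.environ.
    """
    env = os.environ if env is None else env
    return [name for name in REQUIRED_KEYS if not env.get(name, "").strip()]


if __name__ == "__main__":
    missing = missing_api_keys()
    if missing:
        print("Missing environment variables: " + ", ".join(missing))
    else:
        print("All required API keys are set.")
```

This inspects the process environment only; it does not read the .env file itself, so the variables need to be exported or loaded before running it.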

User Flow

Select a sample character, or enter a custom character description and click "Generate Text" to generate your text input.
Click the "Synthesize Speech" button to synthesize two TTS outputs based on your text and character description.
Listen to both audio samples to compare their expressiveness.
Vote for the most expressive result by clicking either "Select Option A" or "Select Option B".
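
The flow above amounts to a blind A/B comparison. As an illustration only (this is not the application's code; `assign_options`, `tally_votes`, and the placeholder provider names are assumptions), the sketch below shuffles two providers into options A and B and turns a list of recorded votes into per-provider win rates.

```python
import random
from collections import Counter


def assign_options(provider_one, provider_two, rng=None):
    """Randomly map two providers to labels 'A' and 'B' (hypothetical helper).

    Shuffling keeps the label from revealing which provider produced the audio.
    """
    if rng is None:
        rng = random.Random()
    pair = [provider_one, provider_two]
    rng.shuffle(pair)
    return {"A": pair[0], "B": pair[1]}


def tally_votes(votes):
    """Compute a win rate per provider from (options, selected_label) records."""
    wins = Counter()
    appearances = Counter()
    for options, selected in votes:
        for provider in options.values():
            appearances[provider] += 1
        wins[options[selected]] += 1
    return {name: wins[name] / appearances[name] for name in appearances}


if __name__ == "__main__":
    # Placeholder provider names; real comparisons would use actual TTS providers.
    options = assign_options("provider_x", "provider_y")
    print(options)
    print(tally_votes([(options, "A")]))
```

A win rate is only one possible ranking statistic; the source does not say how the leaderboard is computed, so this should be read as a sketch of the idea rather than the project's scoring method.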

License

This project is licensed under the MIT License - see the LICENSE.txt file for details.
