speech-synthesis
Here are 1,426 public repositories matching this topic...
Language:All
Sort:Most stars
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
- Updated
Aug 16, 2024 - Python
🧠 Leon is your open-source personal assistant.
- Updated
Sep 14, 2025 - TypeScript
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
- Updated
Oct 7, 2025 - Python
State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.
- Updated
Aug 12, 2024 - Jupyter Notebook
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
- Updated
Sep 28, 2025 - Python
End-to-End Speech Processing Toolkit
- Updated
Oct 7, 2025 - Python
Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.
- Updated
May 27, 2025 - Python
Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key
- Updated
Aug 28, 2025 - Python
so-vits-svc fork with realtime support, improved interface and more features.
- Updated
Oct 7, 2025 - Python
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
- Updated
Aug 13, 2024 - Python
VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech
- Updated
Dec 6, 2023 - Python
StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models
- Updated
Aug 10, 2024 - Python
eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.
- Updated
Sep 15, 2025 - C
Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple
- Updated
Oct 18, 2023 - Jupyter Notebook
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
- Updated
Oct 5, 2025 - Python
DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code
- Updated
Mar 19, 2025 - Python
An Open Source text-to-speech system built by inverting Whisper.
- Updated
Jun 8, 2025 - Jupyter Notebook
Speech To Speech: an effort for an open-sourced and modular GPT4-o
- Updated
Apr 15, 2025 - Python
Foundational model for human-like, expressive TTS
- Updated
Jul 30, 2024 - Python
Improve this page
Add a description, image, and links to thespeech-synthesis topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thespeech-synthesis topic, visit your repo's landing page and select "manage topics."