speech
Here are 1,948 public repositories matching this topic...
Language:All
Sort:Most stars
🐸💬 - a deep learning toolkit for Text-to-Speech, battle-tested in research and production
- Updated
Aug 16, 2024 - Python
🚀Clone a voice in 5 seconds to generate arbitrary speech in real-time
- Updated
Jan 7, 2026 - Python
SoftVC VITS Singing Voice Conversion
- Updated
Nov 11, 2023 - Python
🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools
- Updated
Feb 18, 2026 - Python
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
- Updated
Feb 19, 2026 - Python
Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect , Segment and Generate Anything
- Updated
Sep 5, 2024 - Jupyter Notebook
kaldi-asr/kaldi is the official location of the Kaldi project.
- Updated
Sep 22, 2025 - Shell
🤖 💬 Deep learning for Text to Speech (Discussion forum:https://discourse.mozilla.org/c/tts)
- Updated
Nov 9, 2023 - Jupyter Notebook
ModelScope: bring the notion of Model-as-a-Service to life.
- Updated
Feb 19, 2026 - Python
EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine
- Updated
Aug 13, 2024 - Python
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
- Updated
Feb 12, 2026 - Python
Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.
- Updated
Jan 15, 2025 - Python
💬 Speech recognition for your site
- Updated
Aug 7, 2024 - JavaScript
VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning
- Updated
Feb 11, 2026 - Python
Silero Models: pre-trained text-to-speech models made embarrassingly simple
- Updated
Feb 3, 2026 - Jupyter Notebook
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
- Updated
Nov 26, 2025 - Jupyter Notebook
Speech To Speech: an effort for an open-sourced and modular GPT4-o
- Updated
Feb 20, 2026 - Python
Low-latency AI inference engine for mobile devices & wearables
- Updated
Feb 20, 2026 - C
Improve this page
Add a description, image, and links to thespeech topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thespeech topic, visit your repo's landing page and select "manage topics."