speech-to-text
Here are 4,049 public repositories matching this topic...
Port of OpenAI's Whisper model in C/C++
- Updated Nov 1, 2025 - C++
DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.
- Updated Jun 19, 2025 - C++
Faster Whisper transcription with CTranslate2
- Updated Oct 31, 2025 - Python
WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)
- Updated Oct 21, 2025 - Python
🧠 Leon is your open-source personal assistant.
- Updated Nov 6, 2025 - TypeScript
kaldi-asr/kaldi is the official location of the Kaldi project.
- Updated Sep 22, 2025 - Shell
Translate the video from one language to another and add dubbing.
- Updated Nov 4, 2025 - Python
Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node
- Updated Oct 24, 2025 - Jupyter Notebook
A PyTorch-based Speech Toolkit
- Updated Nov 2, 2025 - Python
Speech recognition module for Python, supporting several engines and APIs, online and offline.
- Updated Oct 28, 2025 - Python
A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.
- Updated Jul 11, 2025 - Python
Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime, without an Internet connection. Supports embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, RK NPU, Ascend NPU, x86_64 servers, and a websocket server/client, with support for 12 programming languages.
- Updated Nov 5, 2025 - C++
A Deep-Learning-Based Chinese Speech Recognition System
- Updated Sep 6, 2025 - Python
Multilingual Voice Understanding Model
- Updated Aug 15, 2025 - Python
💬 Speech recognition for your site
- Updated Aug 7, 2024 - JavaScript
A free, open source, and extensible speech-to-text application that works completely offline.
- Updated Nov 6, 2025 - TypeScript
Silero Models: pre-trained text-to-speech models made embarrassingly simple
- Updated Oct 31, 2025 - Jupyter Notebook
Open-source, accurate, and easy-to-use video speech recognition and clipping tool, with LLM-based AI clipping integrated.
- Updated Jul 11, 2025 - Python
Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper
- Updated Oct 14, 2025 - Jupyter Notebook
Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.
- Updated Oct 5, 2025 - Python