speech-to-text

Star

Here are 4,049 public repositories matching this topic...

Language:All

Filter by language

All4,049 Python1,631 JavaScript487 Jupyter Notebook336 TypeScript264 Java158 HTML142 C#120 Swift73 C++71 Dart57

Sort:Most stars

Sort options

Most stars Fewest stars Most forks Fewest forks Recently updated Least recently updated

ggml-org /whisper.cpp

Star44.3k

Port of OpenAI's Whisper model in C/C++

inference transformer speech-recognition openai speech-to-text whisper

UpdatedNov 1, 2025
C++

mozilla /DeepSpeech

Star26.6k

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

machine-learning embedded deep-learning offline tensorflow speech-recognition neural-networks speech-to-text deepspeech on-device

UpdatedJun 19, 2025
C++

SYSTRAN /faster-whisper

Star18.9k

Faster Whisper transcription with CTranslate2

deep-learning inference transformer speech-recognition openai speech-to-text quantization whisper

UpdatedOct 31, 2025
Python

m-bain /whisperX

Sponsor

Star18.6k

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

speech speech-recognition speech-to-text whisper asr

UpdatedOct 21, 2025
Python

leon-ai /leon

Star16.8k

🧠 Leon is your open-source personal assistant.

nodejs python bot text-to-speech automation privacy ai offline chatbot artificial-intelligence speech-synthesis assistant speech-recognition personal-assistant speech-to-text leon flite voice-assistant virtual-assistant ai-assistant

UpdatedNov 6, 2025
TypeScript

kaldi-asr /kaldi

Star15.2k

kaldi-asr/kaldi is the official location of the Kaldi project.

shell c-plus-plus cuda speech speech-recognition speech-to-text kaldi speaker-verification speaker-id

UpdatedSep 22, 2025
Shell

jianchang512 /pyvideotrans

Star15.1k

Translate the video from one language to another and add dubbing.

text-to-speech speech-to-text video-transition

UpdatedNov 4, 2025
Python

alphacep /vosk-api

Star13.6k

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

android python raspberry-pi ios privacy deep-neural-networks deep-learning offline voice-recognition speech-recognition speech-to-text kaldi stt speaker-verification asr speech-to-text-android deepspeech speaker-identification google-speech-to-text vosk

UpdatedOct 24, 2025
Jupyter Notebook

speechbrain /speechbrain

Star10.7k

A PyTorch-based Speech Toolkit

audio deep-learning transformers pytorch voice-recognition speech-recognition speech-to-text language-model speaker-recognition speaker-verification speech-processing audio-processing asr speaker-diarization speechrecognition speech-separation speech-enhancement spoken-language-understanding huggingface speech-toolkit

UpdatedNov 2, 2025
Python

Uberi /speech_recognition

Star8.9k

Speech recognition module for Python, supporting several engines and APIs, online and offline.

audio python speech-recognition speech-to-text

UpdatedOct 28, 2025
Python

KoljaB /RealtimeSTT

Star8.9k

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

python realtime speech-to-text

UpdatedJul 11, 2025
Python

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, RK NPU, Ascend NPU, x86_64 servers, websocket server/client, support 12 programming languages

android windows macos linux lazarus raspberry-pi ios text-to-speech csharp cpp dotnet speech-to-text aarch64 mfc risc-v object-pascal asr arm32 onnx vits

UpdatedNov 5, 2025
C++

nl8590687 /ASRT_SpeechRecognition

Star8.3k

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

python tensorflow keras cnn python3 speech-recognition speech-to-text ctc chinese-speech-recognition asrt

UpdatedSep 6, 2025
Python

FunAudioLLM /SenseVoice

Star6.9k

Multilingual Voice Understanding Model

multilingual python ai pytorch speech-recognition speech-to-text asr cross-lingual speech-emotion-recognition audio-event-classification aigc llm gpt-4o

UpdatedAug 15, 2025
Python

TalAter /annyang

Star6.7k

💬 Speech recognition for your site

voice speech speech-recognition speech-to-text

UpdatedAug 7, 2024
JavaScript

cjpais /Handy

Sponsor

Star5.8k

A free, open source, and extensible speech-to-text application that works completely offline.

cross-platform accessibility speech-to-text tauri-v2

UpdatedNov 6, 2025
TypeScript

snakers4 /silero-models

Star5.5k

Silero Models: pre-trained text-to-speech models made embarrassingly simple

text-to-speech speech pytorch tts speech-synthesis colab armenian russian speech-to-text ukrainian pretrained-models georgian belarus kyrgyz uzbek kazakh azerbaijani tajik tts-models torch-hub

UpdatedOct 31, 2025
Jupyter Notebook

modelscope /FunClip

Star5.1k

Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.

speech-recognition speech-to-text gradio video-clip subtitles-generator video-subtitles llm gradio-python-llm

UpdatedJul 11, 2025
Python

MahmoudAshraf97 /whisper-diarization

Star5.1k

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

speech speech-recognition speech-to-text whisper asr speaker-diarization

UpdatedOct 14, 2025
Jupyter Notebook

abus-aikorea /voice-pro

Star5k

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.

text-to-speech translator audiobook podcasts tts speech-synthesis subtitles speech-recognition webui speech-to-text karaoke transcription gradio whisper voice-conversion voice-cloning yt-dlp faster-whisper whisperx

UpdatedOct 5, 2025
Python

Improve this page

Add a description, image, and links to thespeech-to-text topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with thespeech-to-text topic, visit your repo's landing page and select "manage topics."

Learn more

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly