asr

Here are 1,442 public repositories matching this topic...

m-bain/whisperX

Stars: 19.2k

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

speech speech-recognition speech-to-text whisper asr

Updated Oct 21, 2025
Python

NVIDIA-NeMo/NeMo

Stars: 16.3k

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

machine-translation tts speech-synthesis neural-networks deeplearning speaker-recognition asr speech-translation speaker-diariazation generative-ai

Updated Dec 17, 2025
Python

alphacep/vosk-api

Stars: 13.9k

Offline speech recognition API for Android, iOS, Raspberry Pi and servers with Python, Java, C# and Node

android python raspberry-pi ios privacy deep-neural-networks deep-learning offline voice-recognition speech-recognition speech-to-text kaldi stt speaker-verification asr speech-to-text-android deepspeech speaker-identification google-speech-to-text vosk

Updated Dec 8, 2025
Jupyter Notebook

PaddlePaddle/PaddleSpeech

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

tts speech-synthesis transformer voice-recognition speech-recognition whisper asr vocoder conformer sound-classification kws self-supervised-learning code-switch voice-cloning speech-translation punctuation-restoration wav2vec2 streaming-asr speech-alignment streaming-tts

Updated Oct 20, 2025
Python

speechbrain/speechbrain

Stars: 10.9k

A PyTorch-based Speech Toolkit

audio deep-learning transformers pytorch voice-recognition speech-recognition speech-to-text language-model speaker-recognition speaker-verification speech-processing audio-processing asr speaker-diarization speechrecognition speech-separation speech-enhancement spoken-language-understanding huggingface speech-toolkit

Updated Dec 15, 2025
Python

k2-fsa/sherpa-onnx

Stars: 9.3k

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime, without an Internet connection. Supports embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, RK NPU, Ascend NPU, x86_64 servers, a websocket server/client, and 12 programming languages.

android windows macos linux lazarus raspberry-pi ios text-to-speech csharp cpp dotnet speech-to-text aarch64 mfc risc-v object-pascal asr arm32 onnx vits

Updated Dec 17, 2025
C++

FunAudioLLM/SenseVoice

Stars: 7.2k

Multilingual Voice Understanding Model

multilingual python ai pytorch speech-recognition speech-to-text asr cross-lingual speech-emotion-recognition audio-event-classification aigc llm gpt-4o

Updated Aug 15, 2025
Python

wzpan/wukong-robot

Stars: 7.1k

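🤖 wukong-robot is a simple, flexible, and elegant Chinese voice dialogue robot / smart speaker project. It supports multi-turn ChatGPT conversations and may be the first open-source smart speaker project to support brain-computer interaction.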

alexa ai amazon-echo muse tts openai google-home unit bci speaker homeassistant snowboy asr anyq raspeberry-pi gpt3 chatgpt

Updated Oct 25, 2024
Python

jdepoix/youtube-transcript-api

Stars: 6.6k

This is a Python API that lets you get the transcript/subtitles for a given YouTube video. It also works for automatically generated subtitles, and it requires neither an API key nor a headless browser, unlike Selenium-based solutions.

python cli youtube youtube-video youtube-api captions subtitles transcript subtitle transcripts asr youtube-subtitles youtube-transcripts youtube-captions youtube-transcript translating-transcripts youtube-asr

Updated Oct 13, 2025
Python

xiangyuecn/Recorder

Stars: 5.5k

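HTML5 JS audio recording in mp3, wav, ogg, webm, amr, g711a, and g711u formats. Supports PC and some Android and iOS browsers, Hybrid Apps (Android and iOS app source code provided), and WeChat. Provides ASR speech-to-text, an H5 voice call/chat example, and DTMF encoding and decoding.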

audio javascript html html5 dtmf webrtc webm mp3 wav recording recorder amr ogg record h5 asr sound-record luyin g711a g711u

Updated Mar 31, 2025
JavaScript

MahmoudAshraf97/whisper-diarization

Stars: 5.3k

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

speech speech-recognition speech-to-text whisper asr speaker-diarization

Updated Nov 26, 2025
Jupyter Notebook

wenet-e2e/wenet

Stars: 5k

Production First and Production Ready End-to-End Speech Recognition Toolkit

pytorch transformer speech-recognition automatic-speech-recognition production-ready whisper asr conformer e2e-models

Updated Dec 16, 2025
Python

PeterH0323/Streamer-Sales

Stars: 3.6k

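Streamer-Sales (Sales Champion): a live-stream sales host LLM 🛒🎁 that presents products based on their given features, with the aim of stimulating viewers' desire to buy. 🚀⭐ Includes a detailed data generation pipeline❗ 📦 It also integrates LMDeploy accelerated inference 🚀, RAG (retrieval-augmented generation) 📚, TTS (text-to-speech) 🔊, digital human generation 🦸, an Agent that queries the web for real-time information 🌐, ASR (speech-to-text) 🎙️, a Vue-based front end 🍍, a FastAPI back end 🗝️, and Docker-compose packaging and deployment 🐋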

chat chatbot text-generation tts gpt chat-application asr rag digital-human llm chatgpt internlm-chat-7b internlm2 meta-human

Updated Mar 8, 2025
Python

ahmetoner/whisper-asr-webservice

Stars: 3.1k

OpenAI Whisper ASR Webservice API

docker speech speech-recognition automatic-speech-recognition speech-to-text asr openai-whisper

Updated Nov 23, 2025
Python

tensorflow/lingvo

Stars: 2.9k

Lingvo

nlp research translation tensorflow machine-translation speech distributed tts speech-synthesis mnist speech-recognition lm seq2seq speech-to-text gpu-computing language-model asr

Updated Dec 5, 2025
Python

CheshireCC/faster-whisper-GUI

Stars: 2.8k

faster_whisper GUI with PySide6

openai vad whisper asr transcribe voice-transcription faster-whisper whisperx

Updated Dec 8, 2024
Python

Purfview/whisper-standalone-win

Stars: 2.7k

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

subtitles speech-recognition openai speech-to-text whisper asr speaker-diarization uvr transcriber diarization faster-whisper ctranslate2 whisperx whisper-faster vocal-extractor

Updated Nov 7, 2025

linto-ai/whisper-timestamped

Stars: 2.7k

Multilingual Automatic Speech Recognition with word-level timestamps and confidence

python machine-learning deep-learning speech transformers python3 pytorch speech-recognition speech-to-text attention-mechanism whisper speech-processing asr speaker-diarization attention-model attention-is-all-you-need attention-seq2seq attention-visualization attention-network multilingual-models

Updated Sep 9, 2025
Python

coqui-ai/STT

Stars: 2.5k

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

deep-learning tensorflow voice-recognition speech-recognition automatic-speech-recognition speech-to-text stt asr speech-recognizer speech-recognition-api

Updated Mar 11, 2024
C++

mravanelli/pytorch-kaldi

Stars: 2.4k

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by PyTorch, while feature extraction, label computation, and decoding are performed with the Kaldi toolkit.

deep-neural-networks deep-learning speech dnn pytorch recurrent-neural-networks lstm gru speech-recognition rnn kaldi rnn-model asr lstm-neural-networks multilayer-perceptron-network timit dnn-hmm

Updated Mar 14, 2022
Python
