voice-activity-detection

Star

Here are 190 public repositories matching this topic...

Language:All

Filter by language

All190 Python88 Jupyter Notebook25 C++11 TypeScript9 C8 MATLAB7 JavaScript4 Rust4 C#3 Go3

Sort:Most stars

Sort options

Most stars Fewest stars Most forks Fewest forks Recently updated Least recently updated

modelscope /FunASR

Star12.9k

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

pytorch speech-recognition vad punctuation whisper audio-visual-speech-recognition speaker-diarization voice-activity-detection conformer pretrained-model rnnt dfsmn paraformer speechgpt speechllm

UpdatedOct 1, 2025
Python

noisetorch /NoiseTorch

Star9.9k

Real-time microphone noise suppression on Linux.

linux voice pulseaudio hacktoberfest noise-reduction voice-activity-detection voice-activated noise-suppression hacktoberfest2023

UpdatedJan 13, 2025
Go

pyannote /pyannote-audio

Star8.4k

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

pytorch pretrained-models speaker-recognition speaker-verification speech-processing speaker-diarization voice-activity-detection speech-activity-detection speaker-change-detection speaker-embedding overlapped-speech-detection

UpdatedOct 6, 2025
Jupyter Notebook

smacke /ffsubsync

Sponsor

Star7.4k

Automagically synchronize subtitles with video.

audio sync synchronization video ffmpeg captions subtitles caption alignment fast-fourier-transform subtitle vad vlc srt fft vlc-media-player srt-subtitles voice-activity-detection speech-detection string-alignment

UpdatedSep 1, 2025
Python

snakers4 /silero-vad

Star7k

Silero VAD: pre-trained enterprise-grade Voice Activity Detector

voice-commands speech pytorch voice-recognition vad voice-control speech-processing voice-detection voice-activity-detection onnx onnxruntime onnx-runtime

UpdatedAug 26, 2025
Python

jim-schwoebel /voice_datasets

Star2k

🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).

data voice voice-commands dataset voice-recognition noise voice-chat datasets voice-control voice-conversion voice-assistant voice-activity-detection voice-synthesis audio-datasets voice-computing voice-dataset voice-datasets audio-dataset

UpdatedJun 6, 2024

BingLingGroup /autosub

Star2k

Command-line utility to transcribe/translate from video/audio/subtitles to subtitles

subtitles substation-alpha audio-segmentation xfyun cloud-speech-api voice-activity-detection baidu-api xunfei-api

UpdatedDec 21, 2023
Python

ricky0123 /vad

Sponsor

Star1.6k

Voice activity detector (VAD) for the browser with a simple API

typescript web speech-to-text web-audio-api voice-activity-detection onnxruntime silero-vad

UpdatedSep 24, 2025
TypeScript

Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, LicheePi4A etc.

kotlin python c go csharp cpp speech-recognition vad asr voice-activity-detection

UpdatedSep 17, 2025
C++

TEN-framework /ten-vad

Star1.5k

Voice Activity Detector (VAD) : low-latency, high-performance and lightweight

audio real-time voice-commands speech voice-recognition vad automatic-speech-recognition speech-processing conversational-ai voice-activity-detection voice-agent silero-vad

UpdatedSep 15, 2025
C

juanmc2005 /diart

Star1.5k

A python package to build AI-powered real-time audio applications

real-time deep-learning transcription speaker-diarization streaming-audio voice-activity-detection speaker-embedding

UpdatedFeb 12, 2025
Python

coqui-ai /open-speech-corpora

Star1.4k

💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies

text-to-speech tts speech-synthesis voice-recognition speech-recognition speech-to-text stt speech-processing voice-activity-detection speech-separation speech-emotion-recognition voice-cloning

UpdatedJun 6, 2024

ggeop /Python-ai-assistant

Star991

Python AI assistant 🧠

python nlp ai mongodb sklearn pymongo voice-commands voice-recognition nltk voice-chat voice-control python35 nlp-machine-learning wolfram-language voice-assistant google-speech-recognition voice-activity-detection voice-recognition-experiment google-speech-to-text linux-assistant

UpdatedNov 17, 2024
Python

iamsrikanthnani /pluely

Sponsor

Star968

The Open Source Alternative to Cluely - A lightning-fast, privacy-first AI assistant that works seamlessly during meetings, interviews, and conversations without anyone knowing. Built with Tauri for native performance, just 10MB. Completely undetectable in video calls, screen shares, and recordings.

react desktop-app rust typescript gemini openai speech-to-text stealth grok claude voice-activity-detection undetectable tauri tailwindcss ai-assistant llm shadcn cluely-alternative grok-4

UpdatedOct 5, 2025
TypeScript

jtkim-kaist /VAD

Star865

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

data speech dnn lstm speech-recognition attention vad voice-detection voice-activity-detection bdnn acam speech-activity-detection

UpdatedJun 9, 2021
MATLAB

ina-foss /inaSpeechSegmenter

Star838

CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.

music speech audio-analysis noise gender-equality segmentation gender praat gender-classification male female transgender voice-activity-detection music-detection mirex speech-activity-detection speech-segmentation speech-music speaker-gender speech-detection

UpdatedSep 19, 2025
Python

amsehili /auditok

Star817

An audio/acoustic activity detection and audio segmentation tool

vad audio-data audio-activities audio-segmentation voice-detection voice-activity-detection

UpdatedDec 11, 2024
Python

FluidInference /FluidAudio

Star730

Native Swift and CoreML SDK for local speaker diarization, VAD, and speech-to-text for real-time workloads. Works on iOS and macOS.

audio macos swift ios real-time avfoundation nvidia vad automatic-speech-recognition speech-to-text ane speaker-recognition asr speaker-diarization voice-activity-detection coreml speaker-identification speaker-embedding parakeet

UpdatedOct 7, 2025
Swift

baxtree /subaligner

Star485

Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers.https://subaligner.readthedocs.io/

scc captions subtitles alignment webvtt substation-alpha subrip tmp transcription sbv mpl2 sami ttml voice-activity-detection subtitle-conversion microdvd subtitle-translation advanced-substation-alpha subtitle-synchronization ebu-stl

UpdatedAug 6, 2025
Python

shashikg /WhisperS2T

Star471

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine

deep-learning speech-recognition vad speech-to-text whisper asr tensorrt voice-activity-detection tensorrt-llm

UpdatedAug 27, 2024
Jupyter Notebook

Improve this page

Add a description, image, and links to thevoice-activity-detection topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with thevoice-activity-detection topic, visit your repo's landing page and select "manage topics."

Learn more

Movatterモバイル変換

Navigation Menu

Search code, repositories, users, issues, pull requests...

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

voice-activity-detection

Here are 190 public repositories matching this topic...

modelscope /FunASR

noisetorch /NoiseTorch

pyannote /pyannote-audio

smacke /ffsubsync

snakers4 /silero-vad

jim-schwoebel /voice_datasets

BingLingGroup /autosub

ricky0123 /vad

k2-fsa /sherpa-ncnn

TEN-framework /ten-vad

juanmc2005 /diart

coqui-ai /open-speech-corpora

ggeop /Python-ai-assistant

iamsrikanthnani /pluely

jtkim-kaist /VAD

ina-foss /inaSpeechSegmenter

amsehili /auditok

FluidInference /FluidAudio

baxtree /subaligner

shashikg /WhisperS2T

Improve this page

Add this topic to your repo