voice-activity-detection
Here are 190 public repositories matching this topic...
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
Updated Oct 1, 2025 - Python
Real-time microphone noise suppression on Linux.
Updated Jan 13, 2025 - Go
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
Updated Oct 6, 2025 - Jupyter Notebook
Automagically synchronize subtitles with video.
Updated Sep 1, 2025 - Python
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
Updated Aug 26, 2025 - Python
🔊 A comprehensive list of open-source datasets for voice and sound computing (95+ datasets).
Updated Jun 6, 2024
Command-line utility to transcribe/translate from video/audio/subtitles to subtitles
Updated Dec 21, 2023 - Python
Voice activity detector (VAD) for the browser with a simple API
Updated Sep 24, 2025 - TypeScript
Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Supports iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, LicheePi4A, etc.
Updated Sep 17, 2025 - C++
Voice Activity Detector (VAD): low-latency, high-performance and lightweight
Updated Sep 15, 2025 - C
A Python package to build AI-powered real-time audio applications
Updated Feb 12, 2025 - Python
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
Updated Jun 6, 2024
Python AI assistant 🧠
Updated Nov 17, 2024 - Python
The Open Source Alternative to Cluely - A lightning-fast, privacy-first AI assistant that works seamlessly during meetings, interviews, and conversations without anyone knowing. Built with Tauri for native performance, just 10MB. Completely undetectable in video calls, screen shares, and recordings.
Updated Oct 5, 2025 - TypeScript
Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.
Updated Jun 9, 2021 - MATLAB
CNN-based audio segmentation toolkit. Detects speech, music, noise and speaker gender. Designed for large-scale gender equality studies based on speech time per gender.
Updated Sep 19, 2025 - Python
An audio/acoustic activity detection and audio segmentation tool
Updated Dec 11, 2024 - Python
Native Swift and CoreML SDK for local speaker diarization, VAD, and speech-to-text for real-time workloads. Works on iOS and macOS.
Updated Oct 7, 2025 - Swift
Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers. https://subaligner.readthedocs.io/
Updated Aug 6, 2025 - Python
An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engines
Updated Aug 27, 2024 - Jupyter Notebook