Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
#

voice-activity-detection

Here are 190 public repositories matching this topic...

A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.

  • UpdatedOct 1, 2025
  • Python
NoiseTorch

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

  • UpdatedOct 6, 2025
  • Jupyter Notebook
autosub

Command-line utility to transcribe/translate from video/audio/subtitles to subtitles

  • UpdatedDec 21, 2023
  • Python

Voice activity detector (VAD) for the browser with a simple API

  • UpdatedSep 24, 2025
  • TypeScript

Real-time speech recognition and voice activity detection (VAD) using next-gen Kaldi with ncnn without Internet connection. Support iOS, Android, Linux, macOS, Windows, Raspberry Pi, VisionFive2, LicheePi4A etc.

  • UpdatedSep 17, 2025
  • C++
diart

A python package to build AI-powered real-time audio applications

  • UpdatedFeb 12, 2025
  • Python
Python-ai-assistant

The Open Source Alternative to Cluely - A lightning-fast, privacy-first AI assistant that works seamlessly during meetings, interviews, and conversations without anyone knowing. Built with Tauri for native performance, just 10MB. Completely undetectable in video calls, screen shares, and recordings.

  • UpdatedOct 5, 2025
  • TypeScript

Voice activity detection (VAD) toolkit including DNN, bDNN, LSTM and ACAM based VAD. We also provide our directly recorded dataset.

  • UpdatedJun 9, 2021
  • MATLAB

CNN-based audio segmentation toolkit. Allows to detect speech, music, noise and speaker gender. Has been designed for large scale gender equality studies based on speech time per gender.

  • UpdatedSep 19, 2025
  • Python

An audio/acoustic activity detection and audio segmentation tool

  • UpdatedDec 11, 2024
  • Python
FluidAudio

Native Swift and CoreML SDK for local speaker diarization, VAD, and speech-to-text for real-time workloads. Works on iOS and macOS.

  • UpdatedOct 7, 2025
  • Swift

Automatically synchronize and translate subtitles, or create new ones by transcribing, using pre-trained DNNs, Forced Alignments and Transformers.https://subaligner.readthedocs.io/

  • UpdatedAug 6, 2025
  • Python

An Optimized Speech-to-Text Pipeline for the Whisper Model Supporting Multiple Inference Engine

  • UpdatedAug 27, 2024
  • Jupyter Notebook

Improve this page

Add a description, image, and links to thevoice-activity-detection topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with thevoice-activity-detection topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2025 Movatter.jp