speech-processing
Here are 728 public repositories matching this topic...
Language:All
Sort:Most stars
A PyTorch-based Speech Toolkit
- Updated
Dec 15, 2025 - Python
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
- Updated
Dec 13, 2025 - Jupyter Notebook
Silero VAD: pre-trained enterprise-grade Voice Activity Detector
- Updated
Dec 10, 2025 - Python
Reading list for research topics in multimodal machine learning
- Updated
Aug 20, 2024
Foundation Architecture for (M)LLMs
- Updated
Apr 11, 2024 - Python
Multilingual Automatic Speech Recognition with word-level timestamps and confidence
- Updated
Sep 9, 2025 - Python
WaveNet vocoder
- Updated
Jul 29, 2023 - Python
AI powered speech denoising and enhancement
- Updated
Dec 3, 2024 - Python
Controllable and fast Text-to-Speech for over 7000 languages!
- Updated
Jun 30, 2025 - Python
PyTorch implementation of convolutional neural networks-based text-to-speech synthesis models
- Updated
Dec 19, 2023 - Python
A curated list of awesome Speaker Diarization papers, libraries, datasets, and other resources.
- Updated
Jul 22, 2025
Voice Activity Detector (VAD) : low-latency, high-performance and lightweight
- Updated
Dec 15, 2025 - C
💎 A list of accessible speech corpora for ASR, TTS, and other Speech Technologies
- Updated
Jun 6, 2024
General Speech Restoration
- Updated
Feb 17, 2025 - Python
SincNet is a neural architecture for efficiently processing raw audio samples.
- Updated
Apr 28, 2021 - Python
StreamSpeech is an “All in One” seamless model for offline and simultaneous speech recognition, speech translation and speech synthesis.
- Updated
Jun 29, 2025 - Python
Open source audio annotation tool for humans
- Updated
Dec 14, 2025 - TypeScript
A Framework for Speech, Language, Audio, Music Processing with Large Language Model
- Updated
Oct 24, 2025 - Python
Verbatim Automatic Speech Recognition with improved word-level timestamps and filler detection
- Updated
Jun 3, 2025 - Python
Improve this page
Add a description, image, and links to thespeech-processing topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thespeech-processing topic, visit your repo's landing page and select "manage topics."