speaker-embedding
Here are 41 public repositories matching this topic...
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
- Updated
Feb 20, 2026 - Jupyter Notebook
A Python package to build AI-powered real-time audio applications
- Updated
Feb 12, 2025 - Python
Frontier CoreML audio models in your apps — text-to-speech, speech-to-text, voice activity detection, and speaker diarization. In Swift, powered by SOTA open source.
- Updated
Feb 20, 2026 - Swift
Speaker embedding (d-vector) trained with GE2E loss
- Updated
Jan 8, 2024 - Python
Keras implementation of "Deep Speaker: an End-to-End Neural Speaker Embedding System" (speaker recognition)
- Updated
Apr 27, 2020 - Python
A pipeline to read lips and generate speech for the read content, i.e., lip-to-speech synthesis.
- Updated
Jul 23, 2025 - Python
PyTorch implementation of Densely Connected Time Delay Neural Network
- Updated
May 4, 2023 - Python
Fine-tuning toolkit for Chatterbox TTS & Chatterbox TURBO models. Supports 23 languages with smart vocabulary extension. Features offline preprocessing, automatic VAD trimming, and voice cloning capabilities. Train custom TTS models with your own dataset in LJSpeech and file-based format.
- Updated
Feb 20, 2026 - Python
Companion repository for the paper "A Comparison of Metric Learning Loss Functions for End-to-End Speaker Verification" published at SLSP 2020
- Updated
Oct 7, 2020 - Jupyter Notebook
A curated list of speaker-embedding, speaker-verification, and speaker-identification resources.
- Updated
Aug 12, 2021
Luigi pipeline to download VoxCeleb(2) audio from YouTube and extract speaker segments
- Updated
Mar 29, 2021 - Python
VoxCeleb1 i-vector-based speaker recognition system
- Updated
May 22, 2018 - Perl
On-device speaker recognition engine powered by deep learning
- Updated
Feb 13, 2026 - Python
Trained speaker embedding deep learning models and evaluation pipelines in PyTorch and TensorFlow for speaker recognition.
- Updated
Oct 4, 2019 - Jupyter Notebook
PyTorch implementation of the 1D-Triplet-CNN neural network model described in "Fusing MFCC and LPC Features using 1D Triplet CNN for Speaker Recognition in Severely Degraded Audio Signals" by A. Chowdhury and A. Ross.
- Updated
Jan 13, 2020 - Python
Speaker embedding for VI-SVC and VI-SVS, also for VITS; use this to replace the ID to implement voice cloning.
- Updated
Sep 16, 2022 - Python
Awesome Speech Dataset, including download links and a brief explanation for each resource. These datasets provide diverse and high-quality speech data covering various domains such as conversational, academic, political, and more.
- Updated
Jul 4, 2025
DropClass and DropAdapt - repository for the paper accepted to Speaker Odyssey 2020
- Updated
Oct 29, 2020 - Python
[ICASSP'24] Emphasized Non-Target Speaker Knowledge in Knowledge Distillation for Speaker Verification
- Updated
Mar 20, 2024 - Python
A curated list of awesome speaker recognition/verification papers, projects, datasets, and competitions.
- Updated
Aug 29, 2021