speaker-recognition
Here are 326 public repositories matching this topic...
Language:All
Sort:Most stars
A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)
- Updated
Dec 17, 2025 - Python
A PyTorch-based Speech Toolkit
- Updated
Dec 15, 2025 - Python
Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding
- Updated
Dec 13, 2025 - Jupyter Notebook
This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.
- Updated
Sep 25, 2024 - Python
SincNet is a neural architecture for efficiently processing raw audio samples.
- Updated
Apr 28, 2021 - Python
This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not excluded that more models will be supported in the future. At the same time, this project also supports MelSpectrogram, Spectrogram data preprocessing methods
- Updated
Dec 17, 2025 - Python
In defence of metric learning for speaker recognition
- Updated
Mar 26, 2024 - Python
Frontier CoreML audio models in your apps — text-to-speech, speech-to-text, voice activity detection, and speaker diarization. In Swift, powered by SOTA open source.
- Updated
Dec 17, 2025 - Swift
Research and Production Oriented Speaker Verification, Recognition and Diarization Toolkit
- Updated
Dec 17, 2025 - Python
an open-source implementation of sequence-to-sequence based speech processing engine
- Updated
Dec 2, 2022 - C++
🔈 Deep Learning & 3D Convolutional Neural Networks for Speaker Verification
- Updated
Mar 3, 2020 - Python
Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)
- Updated
Apr 11, 2024 - Python
Angular penalty loss functions in Pytorch (ArcFace, SphereFace, Additive Margin, CosFace)
- Updated
Dec 13, 2023 - Python
speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition
- Updated
Jul 1, 2021 - Python
This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.
- Updated
Aug 12, 2025 - Python
Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.
- Updated
Apr 16, 2024 - Python
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
- Updated
Jun 18, 2025 - HTML
A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.
- Updated
Jan 22, 2024 - Tcl
Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work:https://arxiv.org/abs/2007.16196
- Updated
Nov 11, 2020 - Python
Improve this page
Add a description, image, and links to thespeaker-recognition topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thespeaker-recognition topic, visit your repo's landing page and select "manage topics."