Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
#

speaker-recognition

Here are 326 public repositories matching this topic...

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

  • UpdatedDec 17, 2025
  • Python

Neural building blocks for speaker diarization: speech activity detection, speaker change detection, overlapped speech detection, speaker embedding

  • UpdatedDec 13, 2025
  • Jupyter Notebook
uis-rnn

This is the library for the Unbounded Interleaved-State Recurrent Neural Network (UIS-RNN) algorithm, corresponding to the paper Fully Supervised Speaker Diarization.

  • UpdatedSep 25, 2024
  • Python

This project uses a variety of advanced voiceprint recognition models such as EcapaTdnn, ResNetSE, ERes2Net, CAM++, etc. It is not excluded that more models will be supported in the future. At the same time, this project also supports MelSpectrogram, Spectrogram data preprocessing methods

  • UpdatedDec 17, 2025
  • Python

In defence of metric learning for speaker recognition

  • UpdatedMar 26, 2024
  • Python
FluidAudio

Frontier CoreML audio models in your apps — text-to-speech, speech-to-text, voice activity detection, and speaker diarization. In Swift, powered by SOTA open source.

  • UpdatedDec 17, 2025
  • Swift

an open-source implementation of sequence-to-sequence based speech processing engine

  • UpdatedDec 2, 2022
  • C++

🔈 Deep Learning & 3D Convolutional Neural Networks for Speaker Verification

  • UpdatedMar 3, 2020
  • Python

Unofficial reimplementation of ECAPA-TDNN for speaker recognition (EER=0.86 for Vox1_O when train only in Vox2)

  • UpdatedApr 11, 2024
  • Python

Angular penalty loss functions in Pytorch (ArcFace, SphereFace, Additive Margin, CosFace)

  • UpdatedDec 13, 2023
  • Python

speaker diarization by uis-rnn and speaker embedding by vgg-speaker-recognition

  • UpdatedJul 1, 2021
  • Python

This repository contains audio samples and supplementary materials accompanying publications by the "Speaker, Voice and Language" team at Google.

  • UpdatedAug 12, 2025
  • Python

Aims to create a comprehensive voice toolkit for training, testing, and deploying speaker verification systems.

  • UpdatedApr 16, 2024
  • Python

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

  • UpdatedJun 18, 2025
  • HTML

使用Tensorflow实现声纹识别

  • UpdatedJun 16, 2024
  • Python

Deep speaker embeddings in PyTorch, including x-vectors. Code used in this work:https://arxiv.org/abs/2007.16196

  • UpdatedNov 11, 2020
  • Python

Improve this page

Add a description, image, and links to thespeaker-recognition topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with thespeaker-recognition topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2025 Movatter.jp