Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
#

wav2vec2

Here are 172 public repositories matching this topic...

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

  • UpdatedFeb 11, 2026
  • Python
s3prl

How to use our public wav2vec2 dimensional emotion model

  • UpdatedMay 22, 2023
  • Jupyter Notebook

A live speech recognition using Facebooks wav2vec 2.0 model.

  • UpdatedFeb 4, 2024
  • Python
vid2cleantxt

🎤📄 An innovative tool that transforms audio or video files into text transcripts and generates concise meeting minutes. Stay organized and efficient in your meetings, and get ready for Phase 2 where we'll be open for contributions to enable real-time meeting transcription! 🚀

  • UpdatedDec 24, 2025
  • Python

⚡ Finetune Wa2vec 2.0 For Speech Recognition

  • UpdatedFeb 6, 2025
  • Python

Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'

  • UpdatedJan 6, 2025
  • Jupyter Notebook
ASR

End-to-End Vietnamese Speech Recognition using wav2vec 2.0

  • UpdatedSep 3, 2021

Pytorch implementation of Noisy Student Training for Automatic Speech Recognition and Automatic Pronunciation Error Detection problem

  • UpdatedMay 30, 2025
  • Python

GSoC'2021 | TensorFlow implementation of Wav2Vec2

  • UpdatedJan 11, 2022
  • Jupyter Notebook

Solution for Zalo AI Challenge 2022 - Lyrics Alignment

  • UpdatedDec 5, 2022
  • Python

A deep learning lyrics-to-audio alignment system, generating synchronized lyrics from a song and its lyrics

  • UpdatedJan 12, 2026
  • Jupyter Notebook

Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in Pytorch.

  • UpdatedMay 19, 2023
  • Python
MiniASR

A mini, simple, and fast end-to-end automatic speech recognition toolkit.

  • UpdatedDec 6, 2022
  • Jupyter Notebook

Scripts used in the research described in the paper "Multimodal Emotion Recognition with High-level Speech and Text Features" accepted in the ASRU 2021 conference.

  • UpdatedSep 14, 2021
  • Python

In this project, several approaches for training/finetuning an audio gender recognition is provided. The code can simply be used for any other audio classification task by simply changing the number of classes and the input dataset.

  • UpdatedJan 11, 2025
  • Jupyter Notebook

SHAS: Approaching optimal Segmentation for End-to-End Speech Translation

  • UpdatedFeb 9, 2023
  • Python

[ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations

  • UpdatedDec 18, 2023
  • Python

Improve this page

Add a description, image, and links to thewav2vec2 topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with thewav2vec2 topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2026 Movatter.jp