wav2vec2
Here are 172 public repositories matching this topic...
Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.
Updated Feb 11, 2026 - Python
Self-Supervised Speech Pre-training and Representation Learning Toolkit
Updated Jun 13, 2025 - Python
How to use our public wav2vec2 dimensional emotion model
Updated May 22, 2023 - Jupyter Notebook
Live speech recognition using Facebook's wav2vec 2.0 model.
Updated Feb 4, 2024 - Python
Python API & command-line tool to easily transcribe speech-based video files into clean text
Updated Oct 29, 2024 - Jupyter Notebook
🎤📄 An innovative tool that transforms audio or video files into text transcripts and generates concise meeting minutes. Stay organized and efficient in your meetings, and get ready for Phase 2 where we'll be open for contributions to enable real-time meeting transcription! 🚀
Updated Dec 24, 2025 - Python
⚡ Finetune Wav2vec 2.0 For Speech Recognition
Updated Feb 6, 2025 - Python
Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'
Updated Jan 6, 2025 - Jupyter Notebook
End-to-End Vietnamese Speech Recognition using wav2vec 2.0
Updated Sep 3, 2021
PyTorch implementation of Noisy Student Training for the Automatic Speech Recognition and Automatic Pronunciation Error Detection problems
Updated May 30, 2025 - Python
GSoC'2021 | TensorFlow implementation of Wav2Vec2
Updated Jan 11, 2022 - Jupyter Notebook
Solution for Zalo AI Challenge 2022 - Lyrics Alignment
Updated Dec 5, 2022 - Python
A deep learning lyrics-to-audio alignment system, generating synchronized lyrics from a song and its lyrics
Updated Jan 12, 2026 - Jupyter Notebook
Wav2vec 2.0 Self-Supervised Pretraining
Updated Feb 6, 2025 - Python
Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in PyTorch.
Updated May 19, 2023 - Python
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
Updated Dec 6, 2022 - Jupyter Notebook
Scripts used in the research described in the paper "Multimodal Emotion Recognition with High-level Speech and Text Features", accepted at the ASRU 2021 conference.
Updated Sep 14, 2021 - Python
This project provides several approaches for training or fine-tuning an audio gender recognition model. The code can be reused for any other audio classification task by changing the number of classes and the input dataset.
Updated Jan 11, 2025 - Jupyter Notebook
SHAS: Approaching optimal Segmentation for End-to-End Speech Translation
Updated Feb 9, 2023 - Python
[ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations
Updated Dec 18, 2023 - Python