wav2vec2

Here are 172 public repositories matching this topic...

Languages: All 172, Python 98, Jupyter Notebook 63, JavaScript 3, Shell 2, Dart 1, Java 1, PHP 1

Sort: Most stars

PaddlePaddle/PaddleSpeech

Stars: 12.5k

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

tts speech-synthesis transformer voice-recognition speech-recognition whisper asr vocoder conformer sound-classification kws self-supervised-learning code-switch voice-cloning speech-translation punctuation-restoration wav2vec2 streaming-asr speech-alignment streaming-tts

Updated Feb 11, 2026
Python

s3prl/s3prl

Stars: 2.5k

Self-Supervised Speech Pre-training and Representation Learning Toolkit

representation-learning tera cpc apc pase mockingjay self-supervised-learning speech-representation wav2vec speech-pretraining hubert vq-apc vq-wav2vec wav2vec2 decoar distilhubert wavlm unispeech-sat decoar2 data2vec

Updated Jun 13, 2025
Python

audeering/w2v2-how-to

Stars: 539

How to use our public wav2vec2 dimensional emotion model

deep-learning valence arousal onnx speech-emotion-recognition dominance transformer-models wav2vec2 msp-podcast

Updated May 22, 2023
Jupyter Notebook

oliverguhr/wav2vec2-live

Stars: 376

Live speech recognition using Facebook's wav2vec 2.0 model.

pyaudio speech speech-recognition speech-to-text asr wav2vec wav2vec2

Updated Feb 4, 2024
Python

pszemraj/vid2cleantxt

Stars: 218

Python API & command-line tool to easily transcribe speech-based video files into clean text

audio python nlp video speech video-summarization transformer video-processing speech-recognition sentence keyword speech-to-text transcription spelling-correction keyword-extraction whisper audio-processing sentence-boundary-detection video-summarisation wav2vec2

Updated Oct 29, 2024
Jupyter Notebook

inboxpraveen/LLM-Minutes-of-Meeting

Stars: 162

🎤📄 An innovative tool that transforms audio or video files into text transcripts and generates concise meeting minutes. Stay organized and efficient in your meetings, and get ready for Phase 2 where we'll be open for contributions to enable real-time meeting transcription! 🚀

python nlp natural-language-processing web translation transformers web-application speech-recognition speech-to-text whisper meeting-minutes webapplication minutes-of-meeting huggingface huggingface-transformers wav2vec2 llm whisper-ai llm-inference

Updated Dec 24, 2025
Python

khanld/ASR-Wav2vec-Finetune

Stars: 145

⚡ Finetune Wav2vec 2.0 for Speech Recognition

pytorch speech-recognition speech-to-text asr huggingface vietnamese-speech-recognition wav2vec2 finetune-wav2vec

Updated Feb 6, 2025
Python

habla-liaa/ser-with-w2v2

Stars: 140

Official implementation of INTERSPEECH 2021 paper 'Emotion Recognition from Speech Using Wav2vec 2.0 Embeddings'

deep-learning tensorflow speech speech-emotion-recognition wav2vec2

Updated Jan 6, 2025
Jupyter Notebook

vietai/ASR

Stars: 105

End-to-End Vietnamese Speech Recognition using wav2vec 2.0

asr pretrained-weights ctc-loss asr-model end-to-end-speech-recognition wav2vec2

Updated Sep 3, 2021

tuanio/noisy-student-training-asr

Stars: 98

PyTorch implementation of Noisy Student Training for the Automatic Speech Recognition and Automatic Pronunciation Error Detection problems

machine-learning deep-learning pytorch speech-recognition semi-supervised-learning data-augmentation nst conformer pretrained noisy-student wav2vec2 aped

Updated May 30, 2025
Python

thevasudevgupta/gsoc-wav2vec2

Stars: 91

GSoC'2021 | TensorFlow implementation of Wav2Vec2

tensorflow gsoc speech-to-text librispeech-dataset wav2vec2

Updated Jan 11, 2022
Jupyter Notebook

Telegram-Zalo/zac2022-lyric-alignment

Stars: 68

Solution for Zalo AI Challenge 2022 - Lyrics Alignment

deep-learning vietnamese pytorch dynamic-programming forced-alignment wav2vec2 music-alignment

Updated Dec 5, 2022
Python

mikezzb/lyrics-sync

Stars: 64

A deep learning lyrics-to-audio alignment system, generating synchronized lyrics from a song and its lyrics

python music machine-learning ai deep-learning lyrics jupyter-notebook music-information-retrieval demucs wav2vec2

Updated Jan 12, 2026
Jupyter Notebook

khanld/Wav2vec2-Pretraining

Stars: 58

Wav2vec 2.0 Self-Supervised Pretraining

speech-recognition speech-to-text quantization speech-processing asr self-supervised pretraining contrastive-learning wav2vec2

Updated Feb 6, 2025
Python

HarunoriKawano/Wav2vec2.0

Stars: 55

Implementation of the paper "wav2vec 2.0: A Framework for Self-Supervised Learning of Speech Representations" in PyTorch.

pytorch speech-recognition wav2vec2

Updated May 19, 2023
Python

vectominist/MiniASR

Stars: 53

A mini, simple, and fast end-to-end automatic speech recognition toolkit.

minimal pytorch speech-recognition asr ctc fairseq speech-representation hubert wav2vec2 s3prl

Updated Dec 6, 2022
Jupyter Notebook

mmakiuchi/multimodal_emotion_recognition

Stars: 52

Scripts used in the research described in the paper "Multimodal Emotion Recognition with High-level Speech and Text Features", accepted at the ASRU 2021 conference.

emotion-recognition speech-emotion-recognition text-emotion-detection disentanglement-learning wav2vec2 asru2021

Updated Sep 14, 2021
Python

pooya-mohammadi/audio-classification-pytorch

Stars: 43

This project provides several approaches for training/fine-tuning an audio gender recognition model. The code can be reused for any other audio classification task by changing the number of classes and the input dataset.

python deep-learning transformers pytorch lstm audio-classification wav2vec2 deep-utils