hubert
Here are 33 public repositories matching this topic...
so-vits-svc fork with realtime support, improved interface and more features.
- Updated
Mar 16, 2025 - Python
Self-Supervised Speech Pre-training and Representation Learning Toolkit
- Updated
Mar 11, 2025 - Python
Phoneme segmentation using pre-trained speech models
- Updated
Nov 4, 2022 - Python
A mini, simple, and fast end-to-end automatic speech recognition toolkit.
- Updated
Dec 6, 2022 - Jupyter Notebook
[ICASSP 2023] Mingling or Misalignment? Temporal Shift for Speech Emotion Recognition with Pre-trained Representations
- Updated
Dec 18, 2023 - Python
An implementation of Speech Emotion Recognition based on the HuBERT model, trained with PyTorch and the HuggingFace framework, and fine-tuned on the RAVDESS dataset.
- Updated
May 18, 2022 - Jupyter Notebook
This repo contains the source code of the first deep learning-based singing voice beat tracking system. It leverages WavLM and DistilHuBERT pre-trained speech models to create vocal embeddings and trains linear multi-head self-attention layers on top of them to extract vocal beat activations. Then, it uses an HMM decoder to infer singing beats and t…
- Updated
Sep 4, 2022 - Python
Cover Song Powered by SoftVC VITS
- Updated
May 9, 2023 - Jupyter Notebook
The official implementation of the method discussed in the paper Improving Spoken Language Identification with Map-Mix (work accepted at ICASSP-2023)
- Updated
Feb 17, 2023
Google Colab for testing SoftVC VITS Singing Voice Conversion, an AI system capable of changing the singer within music files.
- Updated
Apr 21, 2023 - Jupyter Notebook
Code for our paper DistilALHuBERT: A Distilled Parameter Sharing Audio Representation Model
- Updated
Mar 15, 2023 - Python
Unsupervised scoring of spoken utterances
- Updated
Nov 21, 2023 - Python
Universal Pooling Method for Speaker Verification Utilizing Pre-trained Multi-layer Features, 2025 preprint
- Updated
Sep 19, 2024 - Python
This code uses well-known datasets, such as the Toronto dataset available on Kaggle, to build a sentiment analysis model for the human voice. The model is based on the BERT model and is called HuBERT.
- Updated
Jan 28, 2024 - Jupyter Notebook
A library that helps persist your context in React Native apps
- Updated
Sep 17, 2023 - Java
Advanced Speech Emotion Recognition, based on ExHuBERT: Enhancing HuBERT Through Block Extension and Fine-Tuning on 37 Emotion Datasets and 14 languages (Emotions: Disgust, Neutral, Kind, Anger, Surprise, Joy)
- Updated
Aug 7, 2024 - Jupyter Notebook
Speech keyword detection using the Wav2Vec model
- Updated
Nov 23, 2022 - Python