Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
#

speech-representation

Here are 14 public repositories matching this topic...

s3prl

[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling

  • UpdatedMar 2, 2025
  • Python

[ACL 2024] Official PyTorch code for extracting features and training downstream models with emotion2vec: Self-Supervised Pre-Training for Speech Emotion Representation

  • UpdatedDec 23, 2024
  • Python

A single-layer, streaming codec model providing SOTA audio quality and discrete tokens designed for superior downstream modelability.

  • UpdatedJun 4, 2025
  • Python

This is the code for paper: XY-Tokenizer: Mitigating the Semantic-Acoustic Conflict in Low-Bitrate Speech Codecs. Demos, technical insights and experimental results are presented on

  • UpdatedSep 19, 2025
  • Python

LightHuBERT: Lightweight and Configurable Speech Representation Learning with Once-for-All Hidden-Unit BERT

  • UpdatedSep 26, 2022
  • Python

Ultra-low bitrate speech codec (0.27-1 kbps) with cross-modal alignment and real-time capabilities

  • UpdatedAug 27, 2025
  • Python

Survey of audio language models

  • UpdatedJun 21, 2025
  • Jupyter Notebook
MiniASR

A mini, simple, and fast end-to-end automatic speech recognition toolkit.

  • UpdatedDec 6, 2022
  • Jupyter Notebook

DUSTED: Spoken-Term Discovery using Discrete Speech Units

  • UpdatedOct 2, 2024
  • Jupyter Notebook

Causal Speech Enhancement Based on a Two-Branch Nested U-Net Architecture Using Self-Supervised Speech Embeddings

  • UpdatedJun 6, 2025
  • Python

Semi-supervised spoken language understanding (SLU) via self-supervised speech and language model pretraining

  • UpdatedMar 23, 2021
  • Python

Improve this page

Add a description, image, and links to thespeech-representation topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with thespeech-representation topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2025 Movatter.jp