#speech-language-model

Here are 18 public repositories matching this topic...

LLaMA-Omni is a low-latency and high-quality end-to-end speech interaction model built upon Llama-3.1-8B-Instruct, aiming to achieve speech capabilities at the GPT-4o level.

  • Updated May 19, 2025
  • Python

[ICLR 2025] SOTA discrete acoustic codec models with 40/75 tokens per second for audio language modeling

  • Updated Mar 2, 2025
  • Python

[ICCV 2025] Official Implementation for "Lyra: An Efficient and Speech-Centric Framework for Omni-Cognition"

  • Updated Jan 9, 2025
  • Python

AAAI 2025: Codec Does Matter: Exploring the Semantic Shortcoming of Codec for Audio Language Model

  • Updated Apr 20, 2025
  • Python

SlamKit is an open source tool kit for efficient training of SpeechLMs. It was used for "Slamming: Training a Speech Language Model on One GPU in a Day"

  • Updated May 18, 2025
  • Python

Code for DeSTA2.5-Audio

  • Updated Aug 7, 2025
  • Python

Code and model for ICASSP 2025 Paper "Developing Instruction-Following Speech Language Model Without Speech Instruction-Tuning Data"

  • Updated Jul 15, 2025
  • HTML

A single-layer, streaming codec model providing SOTA audio quality and discrete tokens designed for superior downstream modelability.

  • Updated Jun 4, 2025
  • Python

Streamable Text-to-Speech model using a language modeling approach, without vector quantization

  • Updated May 20, 2025
  • Python

Ultra-low-bitrate Speech Codec for Speech Language Modeling Applications

  • Updated Dec 20, 2024
  • Python

Survey of audio language models

  • Updated Jun 21, 2025
  • Jupyter Notebook

The official code for the SALMon🍣 benchmark (ICASSP 2025 - Oral)

  • Updated Aug 15, 2025
  • Python

Official repository of the IEEE SLT 2024 paper "Self-Supervised Syllable Discovery Based on Speaker-Disentangled HuBERT"

  • Updated Sep 27, 2025
  • Python

Speech Resynthesis and Language Modeling

  • Updated Jun 11, 2025
  • Python

A fully open-source implementation of a GPT-4o-like speech-to-speech video understanding model.

  • Updated Apr 7, 2025
  • Python

[CVPR 2025] OmniMMI: A Comprehensive Multi-modal Interaction Benchmark in Streaming Video Contexts

  • Updated Apr 7, 2025
  • Python
