Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
#

text-to-audio

Here are 68 public repositories matching this topic...

Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

  • UpdatedMay 27, 2025
  • Python

[CVPR 2025] MMAudio: Taming Multimodal Joint Training for High-Quality Video-to-Audio Synthesis

  • UpdatedSep 24, 2025
  • Python

HunyuanVideo-Foley: Multimodal Diffusion with Representation Alignment for High-Fidelity Foley Audio Generation.

  • UpdatedSep 28, 2025
  • Python
tango

A family of diffusion models for text-to-audio generation.

  • UpdatedJul 29, 2025
  • Python

[NeurIPS 2025] PyTorch implementation of [ThinkSound], a unified framework for generating audio from any modality, guided by Chain-of-Thought (CoT) reasoning.

  • UpdatedSep 19, 2025
  • Python

TangoFlux: Super Fast and Faithful Text to Audio Generation with Flow Matching

  • UpdatedJul 29, 2025
  • Jupyter Notebook

PyTorch Implementation of Make-An-Audio (ICML'23) with a Text-to-Audio Generative Model

  • UpdatedMay 22, 2024
  • Python

Implementation of NÜWA, state of the art attention network for text to video synthesis, in Pytorch

  • UpdatedJan 17, 2023
  • Python
mustango

Mustango: Toward Controllable Text-to-Music Generation

  • UpdatedJun 2, 2025
  • Python

High-quality Text-to-Audio Generation with Efficient Diffusion Transformer

  • UpdatedOct 8, 2025
  • Python

AudioStory: Generating Long-Form Narrative Audio with Large Language Models

  • UpdatedSep 21, 2025
  • Jupyter Notebook

Official codes and models of the paper "Auffusion: Leveraging the Power of Diffusion and Large Language Models for Text-to-Audio Generation"

  • UpdatedMar 25, 2024
  • Jupyter Notebook

Word2Wave: a framework for generating short audio samples from a text prompt using WaveGAN and COALA.

  • UpdatedDec 13, 2021
  • Python

Subtitle to audio, generate audio from any subtitle file using Coqui-ai TTS and synchronize the audio timing according to subtitle time.

  • UpdatedDec 14, 2023
  • Python

Pytorch implementation of SoundCTM

  • UpdatedMar 31, 2025
  • Python

Improve this page

Add a description, image, and links to thetext-to-audio topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with thetext-to-audio topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2025 Movatter.jp