Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
#

audio-generation

Here are 174 public repositories matching this topic...

🤖 The free, Open Source alternative to OpenAI, Claude and others. Self-hosted and local-first. Drop-in replacement, running on consumer-grade hardware. No GPU required. Runs gguf, transformers, diffusers and many more. Features: Generate Text, MCP, Audio, Video, Images, Voice Cloning, Distributed, P2P and decentralized inference

  • UpdatedFeb 20, 2026
  • Go

Multi-lingual large voice generation model, providing inference, training and deployment full-stack ability.

  • UpdatedFeb 11, 2026
  • Python
Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

  • UpdatedMay 27, 2025
  • Python

YuE: Open Full-song Music Generation Foundation Model, something similar to Suno.ai but open

  • UpdatedJun 4, 2025
  • Python
TTS-WebUI

A single Gradio + React WebUI with extensions for ACE-Step, Kimi Audio, Piper TTS, GPT-SoVITS, CosyVoice, XTTSv2, DIA, Kokoro, OpenVoice, ParlerTTS, Stable Audio, MMS, StyleTTS2, MAGNet, AudioGen, MusicGen, Tortoise, RVC, Vocos, Demucs, SeamlessM4T, and Bark!

  • UpdatedFeb 19, 2026
  • TypeScript

AudioLDM: Generate speech, sound effects, music and beyond, with text.

  • UpdatedJun 25, 2025
  • Python

A framework for efficient model inference with omni-modality models

  • UpdatedFeb 20, 2026
  • Python

Text-to-Audio/Music Generation

  • UpdatedSep 29, 2024
  • Python

Audio generation using diffusion models, in PyTorch.

  • UpdatedJun 12, 2023
  • Python

A timeline of the latest AI models for audio generation, starting in 2023!

  • UpdatedJan 4, 2024

A fundamental toolkit designed for music, song, and audio generation

  • UpdatedMay 20, 2025
  • Python
tango

A family of diffusion models for text-to-audio generation.

  • UpdatedJul 29, 2025
  • Python

Official PyTorch implementation of BigVGAN (ICLR 2023)

  • UpdatedSep 5, 2024
  • Python

Self-host the powerful Chatterbox TTS model. This server offers a user-friendly Web UI, flexible API endpoints (incl. OpenAI compatible), predefined voices, voice cloning, and large audiobook-scale text processing. Runs accelerated on NVIDIA (CUDA), AMD (ROCm), and CPU.

  • UpdatedFeb 12, 2026
  • Python

AI Audio Datasets (AI-ADS) 🎵, including Speech, Music, and Sound Effects, which can provide training data for Generative AI, AIGC, AI model training, intelligent audio tool development, and audio applications.

  • UpdatedJul 8, 2025

A ComfyUI custom node integration for multi-engine multi-language Text-to-Speech and Voice Conversion. Supports: RVC, Qwen3-TTS, Cozy Voice 3, Step Audio EditX, IndexTTS-2, Chatterbox (classic and multilingual 23-lang), F5-TTS, Higgs Audio 2 and VibeVoice with unlimited text length, SRT timing, Character support, and many audio tools

  • UpdatedFeb 20, 2026
  • Python

[CVPR'23] MM-Diffusion: Learning Multi-Modal Diffusion Models for Joint Audio and Video Generation

  • UpdatedJun 5, 2024
  • Python

FunCodec is a research-oriented toolkit for audio quantization and downstream applications, such as text-to-speech synthesis, music generation et.al.

  • UpdatedJan 25, 2024
  • Python

Audio Development Tools (ADT) is a project for advancing sound, speech, and music technologies, featuring components for machine learning, sound synthesis, speech and music generation, signal processing, game audio, digital audio workstations (DAWs), and more.

  • UpdatedJul 11, 2025

Daily tracking of awesome audio papers, including music generation, zero-shot tts, asr, audio generation

  • UpdatedNov 2, 2025

Improve this page

Add a description, image, and links to theaudio-generation topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with theaudio-generation topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2026 Movatter.jp