Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
#

speech-synthesis

Here are 1,426 public repositories matching this topic...

leon

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

  • UpdatedOct 7, 2025
  • Python

State-of-the-Art Deep Learning scripts organized by models - easy to train and deploy with reproducible accuracy and performance on enterprise-grade infrastructure.

  • UpdatedAug 12, 2024
  • Jupyter Notebook

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

  • UpdatedSep 28, 2025
  • Python

A fast, local neural text to speech system

  • UpdatedAug 26, 2025
  • C++
Amphion

Amphion (/æmˈfaɪən/) is a toolkit for Audio, Music, and Speech Generation. Its purpose is to support reproducible research and help junior researchers and engineers get started in the field of audio, music, and speech generation research and development.

  • UpdatedMay 27, 2025
  • Python

Use Microsoft Edge's online text-to-speech service from Python WITHOUT needing Microsoft Edge or Windows or an API key

  • UpdatedAug 28, 2025
  • Python

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

  • UpdatedAug 13, 2024
  • Python

VITS: Conditional Variational Autoencoder with Adversarial Learning for End-to-End Text-to-Speech

  • UpdatedDec 6, 2023
  • Python

StyleTTS 2: Towards Human-Level Text-to-Speech through Style Diffusion and Adversarial Training with Large Speech Language Models

  • UpdatedAug 10, 2024
  • Python

eSpeak NG is an open source speech synthesizer that supports more than hundred languages and accents.

  • UpdatedSep 15, 2025
  • C

Silero Models: pre-trained speech-to-text, text-to-speech and text-enhancement models made embarrassingly simple

  • UpdatedOct 18, 2023
  • Jupyter Notebook
voice-pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.

  • UpdatedOct 5, 2025
  • Python

DiffSinger: Singing Voice Synthesis via Shallow Diffusion Mechanism (SVS & TTS); AAAI 2022; Official code

  • UpdatedMar 19, 2025
  • Python

An Open Source text-to-speech system built by inverting Whisper.

  • UpdatedJun 8, 2025
  • Jupyter Notebook

Speech To Speech: an effort for an open-sourced and modular GPT4-o

  • UpdatedApr 15, 2025
  • Python

Foundational model for human-like, expressive TTS

  • UpdatedJul 30, 2024
  • Python

Improve this page

Add a description, image, and links to thespeech-synthesis topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with thespeech-synthesis topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2025 Movatter.jp