# speech

Here are 1,948 public repositories matching this topic...

MockingBird

🚀 Clone a voice in 5 seconds to generate arbitrary speech in real-time

  • Updated Jan 7, 2026
  • Python

datasets

🤗 The largest hub of ready-to-use datasets for AI models with fast, easy-to-use and efficient data manipulation tools

  • Updated Feb 18, 2026
  • Python

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

  • Updated Feb 19, 2026
  • Python

Grounded SAM: Marrying Grounding DINO with Segment Anything & Stable Diffusion & Recognize Anything - Automatically Detect, Segment and Generate Anything

  • Updated Sep 5, 2024
  • Jupyter Notebook

kaldi-asr/kaldi is the official location of the Kaldi project.

  • Updated Sep 22, 2025
  • Shell

AudioGPT: Understanding and Generating Speech, Music, Sound, and Talking Head

  • Updated Jul 6, 2024
  • Python

modelscope

ModelScope: bring the notion of Model-as-a-Service to life.

  • Updated Feb 19, 2026
  • Python

EmotiVoice 😊: a Multi-Voice and Prompt-Controlled TTS Engine

  • Updated Aug 13, 2024
  • Python

Officially maintained, supported by PaddlePaddle, including CV, NLP, Speech, Rec, TS, big models and so on.

  • Updated Jan 15, 2025
  • Python

💬 Speech recognition for your site

  • Updated Aug 7, 2024
  • JavaScript

VoxCPM: Tokenizer-Free TTS for Context-Aware Speech Generation and True-to-Life Voice Cloning

  • Updated Feb 11, 2026
  • Python

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

  • Updated Nov 26, 2025
  • Jupyter Notebook

Speech To Speech: an effort for an open-sourced and modular GPT4-o

  • Updated Feb 20, 2026
  • Python

A fast multimodal LLM for real-time voice

  • Updated Dec 12, 2025
  • Python

Low-latency AI inference engine for mobile devices & wearables

  • Updated Feb 20, 2026
  • C
