Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
#

speech-to-text

Here are 4,049 public repositories matching this topic...

whisper.cpp

Port of OpenAI's Whisper model in C/C++

  • UpdatedNov 1, 2025
  • C++

DeepSpeech is an open source embedded (offline, on-device) speech-to-text engine which can run in real time on devices ranging from a Raspberry Pi 4 to high power GPU servers.

  • UpdatedJun 19, 2025
  • C++

Faster Whisper transcription with CTranslate2

  • UpdatedOct 31, 2025
  • Python

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

  • UpdatedOct 21, 2025
  • Python
leon

kaldi-asr/kaldi is the official location of the Kaldi project.

  • UpdatedSep 22, 2025
  • Shell

Translate the video from one language to another and add dubbing.

  • UpdatedNov 4, 2025
  • Python

Speech recognition module for Python, supporting several engines and APIs, online and offline.

  • UpdatedOct 28, 2025
  • Python

A robust, efficient, low-latency speech-to-text library with advanced voice activity detection, wake word activation and instant transcription.

  • UpdatedJul 11, 2025
  • Python

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, RK NPU, Ascend NPU, x86_64 servers, websocket server/client, support 12 programming languages

  • UpdatedNov 5, 2025
  • C++

A Deep-Learning-Based Chinese Speech Recognition System 基于深度学习的中文语音识别系统

  • UpdatedSep 6, 2025
  • Python

💬 Speech recognition for your site

  • UpdatedAug 7, 2024
  • JavaScript

A free, open source, and extensible speech-to-text application that works completely offline.

  • UpdatedNov 6, 2025
  • TypeScript

Open-source, accurate and easy-to-use video speech recognition & clipping tool, LLM based AI clipping intergrated.

  • UpdatedJul 11, 2025
  • Python

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

  • UpdatedOct 14, 2025
  • Jupyter Notebook
voice-pro

Gradio WebUI for creators and developers, featuring key TTS (Edge-TTS, kokoro) and zero-shot Voice Cloning (E2 & F5-TTS, CosyVoice), with Whisper audio processing, YouTube download, Demucs vocal isolation, and multilingual translation.

  • UpdatedOct 5, 2025
  • Python

Improve this page

Add a description, image, and links to thespeech-to-text topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with thespeech-to-text topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2025 Movatter.jp