Movatterモバイル変換


[0]ホーム

URL:


Skip to content

Navigation Menu

Sign in
Appearance settings

Search code, repositories, users, issues, pull requests...

Provide feedback

We read every piece of feedback, and take your input very seriously.

Saved searches

Use saved searches to filter your results more quickly

Sign up
Appearance settings
#

asr

Here are 1,442 public repositories matching this topic...

WhisperX: Automatic Speech Recognition with Word-level Timestamps (& Diarization)

  • UpdatedOct 21, 2025
  • Python

A scalable generative AI framework built for researchers and developers working on Large Language Models, Multimodal, and Speech AI (Automatic Speech Recognition and Text-to-Speech)

  • UpdatedDec 17, 2025
  • Python

Easy-to-use Speech Toolkit including Self-Supervised Learning model, SOTA/Streaming ASR with punctuation, Streaming TTS with text frontend, Speaker Verification System, End-to-End Speech Translation and Keyword Spotting. Won NAACL2022 Best Demo Award.

  • UpdatedOct 20, 2025
  • Python

Speech-to-text, text-to-speech, speaker diarization, speech enhancement, source separation, and VAD using next-gen Kaldi with onnxruntime without Internet connection. Support embedded systems, Android, iOS, HarmonyOS, Raspberry Pi, RISC-V, RK NPU, Ascend NPU, x86_64 servers, websocket server/client, support 12 programming languages

  • UpdatedDec 17, 2025
  • C++
wukong-robot

🤖 wukong-robot 是一个简单、灵活、优雅的中文语音对话机器人/智能音箱项目,支持ChatGPT多轮对话能力,还可能是首个支持脑机交互的开源智能音箱项目。

  • UpdatedOct 25, 2024
  • Python

This is a python API which allows you to get the transcript/subtitles for a given YouTube video. It also works for automatically generated subtitles and it does not require an API key nor a headless browser, like other selenium based solutions do!

  • UpdatedOct 13, 2025
  • Python

html5 js 录音 mp3 wav ogg webm amr g711a g711u 格式,支持pc和Android、iOS部分浏览器、Hybrid App(提供Android iOS App源码)、微信,提供ASR语音识别转文字 H5版语音通话聊天示例 DTMF编码解码

  • UpdatedMar 31, 2025
  • JavaScript

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

  • UpdatedNov 26, 2025
  • Jupyter Notebook

Production First and Production Ready End-to-End Speech Recognition Toolkit

  • UpdatedDec 16, 2025
  • Python

Streamer-Sales 销冠 —— 卖货主播 LLM 大模型🛒🎁,一个能够根据给定的商品特点从激发用户购买意愿角度出发进行商品解说的卖货主播大模型。🚀⭐内含详细的数据生成流程❗ 📦另外还集成了 LMDeploy 加速推理🚀、RAG检索增强生成 📚、TTS文字转语音🔊、数字人生成 🦸、 Agent 使用网络查询实时信息🌐、ASR 语音转文字🎙️、Vue 生态搭建前端🍍、FastAPI 搭建后端🗝️、Docker-compose 打包部署🐋

  • UpdatedMar 8, 2025
  • Python

faster_whisper GUI with PySide6

  • UpdatedDec 8, 2024
  • Python

Whisper & Faster-Whisper standalone executables for those who don't want to bother with Python.

  • UpdatedNov 7, 2025

🐸STT - The deep learning toolkit for Speech-to-Text. Training and deploying STT models has never been so easy.

  • UpdatedMar 11, 2024
  • C++
pytorch-kaldi

pytorch-kaldi is a project for developing state-of-the-art DNN/RNN hybrid speech recognition systems. The DNN part is managed by pytorch, while feature extraction, label computation, and decoding are performed with the kaldi toolkit.

  • UpdatedMar 14, 2022
  • Python

Improve this page

Add a description, image, and links to theasr topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with theasr topic, visit your repo's landing page and select "manage topics."

Learn more


[8]ページ先頭

©2009-2025 Movatter.jp