speechrecognition
Here are 193 public repositories matching this topic...
Language:All
Sort:Most stars
A PyTorch-based Speech Toolkit
- Updated
Dec 15, 2025 - Python
Open source inference code for Rev's model
- Updated
Apr 22, 2025 - Python
The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain users can easily create speech processing systems, ranging from speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.
- Updated
Jun 18, 2025 - HTML
A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.
- Updated
Jan 22, 2024 - Tcl
A Keras CTC implementation of Baidu's DeepSpeech for model experimentation
- Updated
Mar 17, 2018 - Python
SDK & Sample to do speech recognition using websockets in Javascript
- Updated
Mar 25, 2019 - TypeScript
It is a personal assistant chatbot, capable to perform many tasks same as Google Assistant plus more extra features...
- Updated
Jan 9, 2023 - Python
A pytorch based end2end speech recognition system.
- Updated
Jan 16, 2021 - Python
WavEncoder is a Python library for encoding audio signals, transforms for audio augmentation, and training audio classification models with PyTorch backend.
- Updated
Jun 6, 2021 - Python
Repository containing experimentation platform on how to train, infer on wav2vec2 models.
- Updated
Sep 22, 2022 - Python
🙊 Speech Recognition , Text To Speech , Google Translate
- Updated
Sep 10, 2023 - Java
Web Browser Audio Detection/Speech Recording Events API
- Updated
Jul 15, 2022 - JavaScript
A python script COMMAND LINE utility to AUTO GENERATE SUBTITLE FILE (using free Google Speech Recognition API) and TRANSLATED SUBTITLE FILE (using unofficial online Google Translate API) for any video or audio file
- Updated
May 5, 2024 - Python
the first industrial-scale open-source Kazakh speech corpus. KSC2 corpus subsumes the previously introduced two corpora: KSC and KazakhTTS2 and supplements additional data from other sources. KSC2 contains around 1.2k hours of high-quality transcribed data comprising over 600k utterances.
- Updated
Jul 30, 2021 - Shell
Open source projects related to Snipshttps://snips.ai/.
- Updated
Jan 12, 2023 - JavaScript
Making Espnet easier to use
- Updated
Apr 9, 2021 - Python
Pytorch based phoneme recognition (TIMIT phoneme classification)
- Updated
Apr 25, 2018 - Python
A library for using Web Speech API with Angular
- Updated
May 29, 2023 - TypeScript
PySimpleGUI based DESKTOP APP that can RECOGNIZE any live streaming in 23 languages that supported by VOSK then TRANSLATE (using unofficial online Google Translate API) and display it as LIVE CAPTION / LIVE SUBTITLE
- Updated
May 5, 2024 - Python
Improve this page
Add a description, image, and links to thespeechrecognition topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thespeechrecognition topic, visit your repo's landing page and select "manage topics."