speechrecognition

The SpeechBrain project aims to build a novel speech toolkit fully based on PyTorch. With SpeechBrain, users can easily create speech processing systems for tasks including speech recognition (both HMM/DNN and end-to-end), speaker recognition, speech enhancement, speech separation, multi-microphone speech processing, and many others.

deep-learning neural-network speech speech-recognition neural-networks deeplearning speech-to-text speaker-recognition speaker-verification speech-processing speech-recognizer beamforming speech-analysis timit speechrecognition speech-api speech-separation librispeech speech-emotion-recognition speaker-identification

Updated Jun 18, 2025
HTML

SamirPaulb/real-time-voice-translator

Star 363

A desktop application that uses AI to translate voice between languages in real time, while preserving the speaker's tone and emotion.

python machine-learning text-to-speech gui translation ml tkinter speech-to-text speaker-recognition final-year-project gtts speechrecognition playsound translates-audio voice-translator speech-to-speech deep-translator real-time-transcription googletranslator linguasync

Updated Jan 22, 2024
Tcl

robmsmt/KerasDeepSpeech

Star 243

A Keras CTC implementation of Baidu's DeepSpeech for model experimentation

machine-learning deep-learning neural-network keras nn speech neural-networks baidu deeplearning speech-to-text asr ctc speechrecognition coreml deepspeech

Updated Mar 17, 2018
Python

Azure-Samples/SpeechToText-WebSockets-Javascript

Star 220

SDK and sample for speech recognition using WebSockets in JavaScript

javascript microsoft typescript browser sdk recognition js websocket websockets speech ts speech-recognition cognitive-services speechtotext speechrecognition microsoft-speech-service

Updated Mar 25, 2019
TypeScript

roshan9419/PersonalAssistantChatbot

Star 133

A personal assistant chatbot capable of performing many of the same tasks as Google Assistant, plus extra features...

opencv chatbot tkinter speechrecognition pyttsx3 assistatant

Updated Jan 9, 2023
Python

by2101/OpenASR

Star 114

A PyTorch-based end-to-end speech recognition system.

speech transformer speech-recognition las speech-to-text asr speech-recognizer speechrecognition end2end

Updated Jan 16, 2021
Python

shangeth/wavencoder

Star 92

WavEncoder is a Python library for encoding audio signals, applying transforms for audio augmentation, and training audio classification models with a PyTorch backend.

pytorch voice-recognition speech-recognition semi-supervised-learning deeplearning representation-learning unsupervised-learning speaker-recognition hacktoberfest speech-processing audio-processing speechrecognition

Updated Jun 6, 2021
Python

Open-Speech-EkStep/vakyansh-wav2vec2-experimentation

Star 87

Repository containing an experimentation platform for training wav2vec2 models and running inference with them.

open-source speech pytorch speech-recognition asr indic-scripts indic-languages speechrecognition speechrecognition-python speech-recognition-model

Updated Sep 22, 2022
Python

goxr3plus/java-google-speech-api

Star 80

🙊 Speech Recognition, Text To Speech, Google Translate

text-to-speech google-translate speechrecognition

Updated Sep 10, 2023
Java

solyarisoftware/WeBAD

Star 78

Web Browser Audio Detection/Speech Recording Events API

audio javascript browser dom webrtc voice speech microphone voice-recognition recording volume push-to-talk volume-control audio-processing speechrecognition voice-interface recording-button audi-capture

Updated Jul 15, 2022
JavaScript

botbahlul/autosrt

Star 63

A Python command-line utility that automatically generates a subtitle file (using the free Google Speech Recognition API) and a translated subtitle file (using the unofficial online Google Translate API) for any video or audio file

python ffmpeg captions voice-recognition speech-recognition subtitle speechrecognition voicerecognition google-translate-api subriptext auto-caption auto-subtitle srt-subtitle

Updated May 5, 2024
Python

IS2AI/ISSAI_SAIDA_Kazakh_ASR

Star 56

The first industrial-scale open-source Kazakh speech corpus. The KSC2 corpus subsumes two previously introduced corpora, KSC and KazakhTTS2, and supplements them with additional data from other sources. KSC2 contains around 1.2k hours of high-quality transcribed data comprising over 600k utterances.

speech-synthesis speech-recognition speech-to-text speechrecognition

Updated Jul 30, 2021
Shell

syntithenai/opensnips

Star 55

Open source projects related to Snips (https://snips.ai/).

docker nlu dialog speech kaldi audio-server rasa hotwords snowboy snips asr speechrecognition porcupine hark snips-skills

Updated Jan 12, 2023
JavaScript

jindongwang/EasyEspnet

Star 55

Making Espnet easier to use

toolkit speech speech-recognition easy-to-use asr speechrecognition espnet

Updated Apr 9, 2021
Python

rollingstarky/Python-Voice-Assistant

Star 43

A Python based Voice Assistant like Siri

python ai chatbot tts stt speechrecognition

Updated Oct 1, 2020
Python

AppleHolic/PytorchSR

Star 35

PyTorch-based phoneme recognition (TIMIT phoneme classification)

paper pytorch timit speechrecognition minimalgru cbhg

Updated Apr 25, 2018
Python

ng-web-apis/speech

Star 33

A library for using Web Speech API with Angular

text-to-speech angular speech speech-synthesis speech-recognition speech-to-text speechrecognition speech-api

Updated May 29, 2023
TypeScript

botbahlul/pyvosklivesubtitle

Star 29

A PySimpleGUI-based desktop app that recognizes speech from any live stream in the 23 languages supported by VOSK, then translates it (using the unofficial online Google Translate API) and displays it as a live caption or live subtitle

python ffmpeg voice-recognition caption speech-recognition subtitle speechrecognition voicerecognition google-translate-api pysimplegui live-caption vosk auto-caption live-subtitle

Updated May 5, 2024
Python
