rnnt
Here are 10 public repositories matching this topic...
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
- Updated
Oct 1, 2025 - Python
PyTorch implementation of "Transformer Transducer: A Streamable Speech Recognition Model with Transformer Encoders and RNN-T Loss" (ICASSP 2020)
- Updated
Feb 27, 2022 - Python
A curated list of awesome papers on contextualizing E2E ASR outputs
- Updated
May 10, 2023
An efficient implementation of RNN-T Prefix Beam Search in C++/CUDA.
- Updated
Dec 8, 2020 - Cuda
An implementation of RNN-Transducer loss in TF-2.0.
- Updated
Mar 25, 2023 - Python
I'm building an end-to-end Vietnamese Speech Recognition System. I'll deploy it into production with the help of Flask, Uwsgi, Nginx, and AWS ...
- Updated
Sep 9, 2022 - Python
FunASR实时语音识别版,识别麦克风和电脑内播放的声音,电脑语音打字软件
- Updated
Sep 12, 2025
Pure PyTorch implementation of the loss described in "Online Segment to Segment Neural Transduction"https://arxiv.org/abs/1609.08194
- Updated
Mar 12, 2022 - Python
Deep learning-based subtitle generation model that processes audio datasets to generate accurate text transcriptions. Includes audio feature extraction, encoder-decoder architecture, training pipelines, and evaluation metrics for subtitle alignment.
- Updated
Aug 26, 2025
Improve this page
Add a description, image, and links to thernnt topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thernnt topic, visit your repo's landing page and select "manage topics."