speechllm
Here are 9 public repositories matching this topic...
A Fundamental End-to-End Speech Recognition Toolkit and Open Source SOTA Pretrained Models, Supporting Speech Recognition, Voice Activity Detection, Text Post-processing etc.
- Updated
Oct 1, 2025 - Python
Open-source industrial-grade ASR models supporting Mandarin, Chinese dialects and English, achieving a new SOTA on public Mandarin ASR benchmarks, while also offering outstanding singing lyrics recognition capability.
- Updated
Sep 22, 2025 - Python
FunASR实时语音识别版,识别麦克风和电脑内播放的声音,电脑语音打字软件
- Updated
Sep 12, 2025
TASU: A New Style of Alignment of Speech LLM with only Text Training Data, zero-shot on ASR and Other SU tasks
- Updated
Oct 4, 2025 - Python
- Updated
May 9, 2025
- Updated
May 9, 2025
- Updated
May 9, 2025
- Updated
May 9, 2025
SHALLOW, the first hallucination benchmark for ASR models
- Updated
May 23, 2025 - Python
Improve this page
Add a description, image, and links to thespeechllm topic page so that developers can more easily learn about it.
Add this topic to your repo
To associate your repository with thespeechllm topic, visit your repo's landing page and select "manage topics."