- Notifications
You must be signed in to change notification settings - Fork2
STT Service based on Kaldi ASR
License
NotificationsYou must be signed in to change notification settings
mpuels/docker-py-kaldi-asr-and-model
Folders and files
| Name | Name | Last commit message | Last commit date | |
|---|---|---|---|---|
Repository files navigation
This image contains a demo STT service based onKaldi ASR andpy-kaldi-asr. Try it out by followingthese steps.
To start the STT service on your local machine, execute:
$ docker pull quay.io/mpuels/docker-py-kaldi-asr-and-model:kaldi-generic-en-tdnn_sp-r20180815$ docker run --rm -p 127.0.0.1:8080:80/tcp quay.io/mpuels/docker-py-kaldi-asr-and-model:kaldi-generic-en-tdnn_sp-r20180815To transfer an audio file for transcription to the service, in a secondterminal, execute:
$ conda env create -f environment.yml$ source activate py-kaldi-asr-client$ ./asr_client.py asr.wavFor a list of available Kaldi models packaged in Docker containers, seehttps://quay.io/repository/mpuels/docker-py-kaldi-asr-and-model?tab=tags .
For a description of the available models, seehttps://github.com/gooofy/zamia-speech#asr-models .
Docker images are named according to the format
kaldi-generic-<LANG>-tdnn-<SIZE>-<RELEASEDATE><LANG>: There are models for English (en) and German (de).<SIZE>: Kaldi models come in two sizes:sp(standard size) and250(smaller size, suitable for realtime decoding on Raspberry Pi).<RELEASEDATE>: Usually, models released later are trained on more data andhence have a lower word error rate.
The image is part ofZamia Speech.
About
STT Service based on Kaldi ASR
Resources
License
Uh oh!
There was an error while loading.Please reload this page.
Stars
Watchers
Forks
Packages0
No packages published