This paper describes AlfaNum, a new CPU-based automatic speech recognition (ASR) software package that uses a small set of heuristics optimized for heterogeneous operating conditions. AlfaNum is a discrete, speaker-independent ASR product intended for the largest bank-by-phone interactive voice response (IVR) system in Yugoslavia, which serves many customers all over Serbia. The system must therefore cope with a wide variety of dialects, telephone line qualities, and microphones. It has been tested on 500 speakers and achieved an average accuracy of 98.2% in real-life conditions. The software is written entirely in C++. Object-oriented programming gave the software an elegant structure and helped minimize errors, while the power of C++ and its close interaction with the machine made the software fast and efficient.
@inproceedings{obradovic99_eurospeech,
  title     = {A robust speaker-independent CPU-based ASR system},
  author    = {R. Obradovic and D. Pekar and S. Krco and V. Delic and V. Senk},
  year      = {1999},
  booktitle = {6th European Conference on Speech Communication and Technology},
  pages     = {2881--2884},
  doi       = {10.21437/Eurospeech.1999-639},
}
Cite as: Obradovic, R., Pekar, D., Krco, S., Delic, V., Senk, V. (1999) A robust speaker-independent CPU-based ASR system. Proc. 6th European Conference on Speech Communication and Technology, 2881-2884, doi: 10.21437/Eurospeech.1999-639