Semi-supervised learning has been recognized as an effective way to improve acoustic model training in cases where sufficient transcribed data are not available. Different from most of existing approaches only using single acoustic model and focusing on how to refine it, this paper investigates the feasibility of using ensemble methods for semi-supervised acoustic modeling training. Two methods are investigated here, one is a generalized Boosting algorithm, a second one is based on data partitions. Both methods demonstrate substantial improvement over baseline. More than 15% relative reduction of word error rate was observed in our experiments using a large real-world meeting recognition dataset.
@inproceedings{zhang05c_interspeech, title = {Investigations on ensemble based semi-supervised acoustic model training}, author = {Rong Zhang and Ziad Al Bawab and Arthur Chan and Ananlada Chotimongkol and David Huggins-Daines and Alexander I. Rudnicky}, year = {2005}, booktitle = {Interspeech 2005}, pages = {1677--1680}, doi = {10.21437/Interspeech.2005-547}, issn = {2958-1796},}
Cite as:Zhang, R., Bawab, Z.A., Chan, A., Chotimongkol, A., Huggins-Daines, D., Rudnicky, A.I. (2005) Investigations on ensemble based semi-supervised acoustic model training. Proc. Interspeech 2005, 1677-1680, doi: 10.21437/Interspeech.2005-547