Movatterモバイル変換

Improving Acoustic Model Precision by Incorporating a Wide Phonetic Context Based on a Bayesian Framework

Sakriani SAKTI,Satoshi NAKAMURA,Konstantin MARKOV

Full Text Views
0

Cite this

Summary :

Over the last decade, the Bayesian approach has increased in popularity in many application areas. It uses a probabilistic framework which encodes our beliefs or actions in situations of uncertainty. Information from several models can also be combined based on the Bayesian framework to achieve better inference and to better account for modeling uncertainty. The approach we adopted here is to utilize the benefits of the Bayesian framework to improve acoustic model precision in speech recognition systems, which modeling a wider-than-triphone context by approximating it using several less context-dependent models. Such a composition was developed in order to avoid the crucial problem of limited training data and to reduce the model complexity. To enhance the model reliability due to unseen contexts and limited training data, flooring and smoothing techniques are applied. Experimental results show that the proposed Bayesian pentaphone model improves word accuracy in comparison with the standard triphone model.

Publication: IEICE TRANSACTIONS on InformationVol.E89-D No.3 pp.946-953

Publication Date: 2006/03/01

Publicized

Online ISSN: 1745-1361

DOI: 10.1093/ietisy/e89-d.3.946

Type of Manuscript: Special Section PAPER (Special Section on Statistical Modeling for Speech Processing)

Category: Speech Recognition

Authors

Sakriani SAKTI
Satoshi NAKAMURA
Konstantin MARKOV

Keyword

Bayesian framework, wide phonetic context model, acoustic rescoring

Latest Issue

Copyrights notice of machine-translated contents

The copyright of the original papers published on this site belongs to IEICE. Unauthorized use of the original or translated papers is prohibited. SeeIEICE Provisions on Copyright for details.

Cite this

Copy

Sakriani SAKTI, Satoshi NAKAMURA, Konstantin MARKOV, "Improving Acoustic Model Precision by Incorporating a Wide Phonetic Context Based on a Bayesian Framework" in IEICE TRANSACTIONS on Information, vol. E89-D, no. 3, pp. 946-953, March 2006, doi:10.1093/ietisy/e89-d.3.946.
Abstract:Over the last decade, the Bayesian approach has increased in popularity in many application areas. It uses a probabilistic framework which encodes our beliefs or actions in situations of uncertainty. Information from several models can also be combined based on the Bayesian framework to achieve better inference and to better account for modeling uncertainty. The approach we adopted here is to utilize the benefits of the Bayesian framework to improve acoustic model precision in speech recognition systems, which modeling a wider-than-triphone context by approximating it using several less context-dependent models. Such a composition was developed in order to avoid the crucial problem of limited training data and to reduce the model complexity. To enhance the model reliability due to unseen contexts and limited training data, flooring and smoothing techniques are applied. Experimental results show that the proposed Bayesian pentaphone model improves word accuracy in comparison with the standard triphone model.
URL: https://globals.ieice.org/en_transactions/information/10.1093/ietisy/e89-d.3.946/_p

Copy

@ARTICLE{e89-d_3_946,
author={Sakriani SAKTI, Satoshi NAKAMURA, Konstantin MARKOV, },
journal={IEICE TRANSACTIONS on Information},
title={Improving Acoustic Model Precision by Incorporating a Wide Phonetic Context Based on a Bayesian Framework},
year={2006},
volume={E89-D},
number={3},
pages={946-953},
abstract={Over the last decade, the Bayesian approach has increased in popularity in many application areas. It uses a probabilistic framework which encodes our beliefs or actions in situations of uncertainty. Information from several models can also be combined based on the Bayesian framework to achieve better inference and to better account for modeling uncertainty. The approach we adopted here is to utilize the benefits of the Bayesian framework to improve acoustic model precision in speech recognition systems, which modeling a wider-than-triphone context by approximating it using several less context-dependent models. Such a composition was developed in order to avoid the crucial problem of limited training data and to reduce the model complexity. To enhance the model reliability due to unseen contexts and limited training data, flooring and smoothing techniques are applied. Experimental results show that the proposed Bayesian pentaphone model improves word accuracy in comparison with the standard triphone model.},
keywords={},
doi={10.1093/ietisy/e89-d.3.946},
ISSN={1745-1361},
month={March},}

Copy

TY - JOUR
TI - Improving Acoustic Model Precision by Incorporating a Wide Phonetic Context Based on a Bayesian Framework
T2 - IEICE TRANSACTIONS on Information
SP - 946
EP - 953
AU - Sakriani SAKTI
AU - Satoshi NAKAMURA
AU - Konstantin MARKOV
PY - 2006
DO -10.1093/ietisy/e89-d.3.946
JO - IEICE TRANSACTIONS on Information
SN -1745-1361
VL - E89-D
IS - 3
JA - IEICE TRANSACTIONS on Information
Y1 - March 2006
AB -Over the last decade, the Bayesian approach has increased in popularity in many application areas. It uses a probabilistic framework which encodes our beliefs or actions in situations of uncertainty. Information from several models can also be combined based on the Bayesian framework to achieve better inference and to better account for modeling uncertainty. The approach we adopted here is to utilize the benefits of the Bayesian framework to improve acoustic model precision in speech recognition systems, which modeling a wider-than-triphone context by approximating it using several less context-dependent models. Such a composition was developed in order to avoid the crucial problem of limited training data and to reduce the model complexity. To enhance the model reliability due to unseen contexts and limited training data, flooring and smoothing techniques are applied. Experimental results show that the proposed Bayesian pentaphone model improves word accuracy in comparison with the standard triphone model.
ER -