Movatterモバイル変換

Part of the book series:Lecture Notes in Computer Science ((LNISA,volume 5061))

Included in the following conference series:

International Conference on Ubiquitous Intelligence and Computing

1195Accesses
5Citations

Abstract

This paper presents a ubiquitous and robust text-independent speaker recognitionarchitecture for home automation digital life. In this architecture, a multiple microphone configuration is adopted to receive the pervasive speech signals. The multi-channel speech signals are then added together with a mixer. In a ubiquitous computing environment, the received speech signal is usually heavily corrupted by background noises. An SNR-aware subspace speech enhancement approach is used as a pre-processing to enhance the mixed signal. Considering the text-independent speaker recognition, this paper applies a multi-class support vectors machine (SVM)[10][11] instead of conventional Gaussian mixture models (GMMs)[12]. In our experiments, the speaker recognition rate can averagely reach 97.2% with the proposed ubiquitous speaker recognitionarchitecture.

This is a preview of subscription content,log in via an institution to check access.

Access this chapter

Institutional subscriptions

Preview

Unable to display preview. Download preview PDF.

Virtual home assistant for voice based controlling and scheduling with short speech speaker identification

Article23 July 2018

Text-dependent Speaker Recognition System Based on Speaking Frequency Characteristics

iHouse: A Voice-Controlled, Centralized, Retrospective Smart Home

References

Cortes, C., Vapnik, V.: Support vector networks. Machine Learning 20, 273–297 (1995)
MATH Google Scholar
Vapnik, V.: The Nature of Statistical Learning Theory. Springer, New York (1995)
MATH Google Scholar
Vapnik, V.: Statistical Learning Theory. Wiley, New York (1998)
MATH Google Scholar
Schölkopf, B., Mika, S., Burges, C., Knirsch, P., Müller, K.-R., Rätsch, G., Smola, A.: Input space vs. feature space in kernel-based methods. IEEE Transactions on Neural Networks 10(5), 1000–1017 (1999)
Article Google Scholar
Ephraim, Y., Van Trees, H.L.: A signal subspace approach for speech enhancement. IEEE Transactions on Speech and Audio Processing 3(4), 251–266 (1995)
Article Google Scholar
Jia-Ching, W., Hsiao-Ping, L., Jhing-Fa, W., Chung-Hsien, Y.: Critical Band Subspace-Based Speech Enhancement Using SNR and Auditory Masking Aware Technique. IEICE Transactions on Information and Systems E90-D(7), 1055–1062 (2007)
Article Google Scholar
Hui-Ling, H., Fang-Lin, C.: ESVM: Evolutionary support vector machine for automatic feature selection and Classification of micro array data. BioSystems 90, 516–528 (2007)
Article Google Scholar
Shung-Yung, L.: Efficient text independent speaker recognition withwavelet feature selection based multilayered neural network using supervised learning algorithm. Pattern Recognition 40, 3616–3620 (2007)
Article MATH Google Scholar
Shung-Yung, L.: Wavelet feature selection based neural networks with application to the text independent speaker identification. BioSystems 90, 516–528 (2007)
Article Google Scholar
Vincent, W., Steve, R.: Speaker verification using sequence discriminant support vector machines. IEEE transactions on speech and audio processing 13(2) (March 2005)
Google Scholar
Campbell, W.M., Campbell, J.P., Gleason, T.P., Reynolds, D.A., Shen, W.: Speaker Verification Using Support Vector Machines and High-Level Features. IEEE transactions on speech, audio and language processing 15(7) (September 2007)
Google Scholar
Burget, L., Matĕjka, P., Schwarz, P., Glembek, O., Cĕrnocký, J.H.: Analysis of Feature Extraction and Channel Compensation in a GMM Speaker Recognition System. IEEE transactions on speech, audio and language processing 15(7), 1979–1985 (2007)
Google Scholar
Rabiner, L.R., Schafer, R.W.: Digital Processing of Speech Recognition Signals. Prentice-Hall Co. Ltd, Englewood Cliffs (1978)
Google Scholar
Huang, X., Acero, A., Hon, H.: Spoken Language Processing: A Guide to Theory, Algorithm and System Development. Prentice-Hall Co. Ltd, Englewood Cliffs (2001)
Google Scholar

Download references

Author information

Authors and Affiliations

Department of Electrical Engineering, National Cheng-Kung University, No.1, Dasyue Rd., East District, Tainan City, 701, Taiwan, R.O.C.
Jhing-Fa Wang, Ta-Wen Kuan, Jia-chang Wang & Gaung-Hui Gu

Authors

Jhing-Fa Wang
View author publications
You can also search for this author inPubMed Google Scholar
Ta-Wen Kuan
View author publications
You can also search for this author inPubMed Google Scholar
Jia-chang Wang
View author publications
You can also search for this author inPubMed Google Scholar
Gaung-Hui Gu
View author publications
You can also search for this author inPubMed Google Scholar

Editor information

Frode Eika Sandnes Yan Zhang Chunming Rong Laurence T. Yang Jianhua Ma

Rights and permissions

Reprints and permissions

Copyright information

About this paper

Cite this paper

Wang, JF., Kuan, TW., Wang, Jc., Gu, GH. (2008). Ubiquitous and Robust Text-Independent Speaker Recognition for Home Automation Digital Life. In: Sandnes, F.E., Zhang, Y., Rong, C., Yang, L.T., Ma, J. (eds) Ubiquitous Intelligence and Computing. UIC 2008. Lecture Notes in Computer Science, vol 5061. Springer, Berlin, Heidelberg. https://doi.org/10.1007/978-3-540-69293-5_24

Download citation

DOI:https://doi.org/10.1007/978-3-540-69293-5_24
Publisher Name:Springer, Berlin, Heidelberg
Print ISBN:978-3-540-69292-8
Online ISBN:978-3-540-69293-5
eBook Packages:Computer ScienceComputer Science (R0)

Publish with us

Policies and ethics

Movatterモバイル変換

Ubiquitous and Robust Text-Independent Speaker Recognition for Home Automation Digital Life

Abstract

Access this chapter

Preview

Similar content being viewed by others

Virtual home assistant for voice based controlling and scheduling with short speech speaker identification

Text-dependent Speaker Recognition System Based on Speaking Frequency Characteristics

iHouse: A Voice-Controlled, Centralized, Retrospective Smart Home

References

Author information

Authors and Affiliations

Editor information

Rights and permissions

Copyright information

About this paper

Cite this paper

Download citation

Share this paper

Publish with us

Access this chapter