Movatterモバイル変換


[0]ホーム

URL:


Zhang et al., 2000 - Google Patents

A two-stage scoring method combining world and cohort models for speaker verification

Zhang et al., 2000

ViewPDF
Document ID
2377414777626998843
Author
Zhang W
Mak M
He X
Publication year
Publication venue
2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No. 00CH37100)

External Links

Snippet

The cohort and world models are commonly used for scoring normalization in speaker verification. As these models represent different regions of the feature space, a better solution could be obtained by integrating them into a single framework. In this paper, we …
Continue reading atwww.researchgate.net (PDF) (other versions)

Classifications

The classifications are assigned by a computer and are not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the classifications listed.
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/06Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
    • G10L15/065Adaptation
    • G10L15/07Adaptation to the speaker
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification
    • G10L17/04Training, enrolment or model building
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/18Speech classification or search using natural language modelling
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L15/00Speech recognition
    • G10L15/08Speech classification or search
    • G10L15/14Speech classification or search using statistical models, e.g. hidden Markov models [HMMs]
    • G10L15/142Hidden Markov Models [HMMs]
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification
    • G10L17/06Decision making techniques; Pattern matching strategies
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L25/00Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
    • G10L25/78Detection of presence or absence of voice signals
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L17/00Speaker identification or verification
    • G10L17/005Speaker recognisers specially adapted for particular applications
    • GPHYSICS
    • G10MUSICAL INSTRUMENTS; ACOUSTICS
    • G10LSPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
    • G10L19/00Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis

Similar Documents

PublicationPublication DateTitle
Paul et al.Bangla speech recognition system using LPC and ANN
EP2364495B1 (en)Method for verifying the identify of a speaker and related computer readable medium and computer
US9489950B2 (en)Method and system for dual scoring for text-dependent speaker verification
US5950157A (en)Method for establishing handset-dependent normalizing models for speaker recognition
US7822605B2 (en)Method and apparatus for large population speaker identification in telephone interactions
JPH11507443A (en) Speaker identification system
Angkititrakul et al.Discriminative in-set/out-of-set speaker recognition
OzaydinDesign of a text independent speaker recognition system
Rouvier et al.Study on the temporal pooling used in deep neural networks for speaker verification
Chakroun et al.Improving text-independent speaker recognition with GMM
Singh et al.Replay attack: Its effect on GMM-UBM based text-independent speaker verification system
Wildermoth et al.GMM based speaker recognition on readily available databases
Zhang et al.A two-stage scoring method combining world and cohort models for speaker verification
Singh et al.Efficient Modelling Technique based Speaker Recognition under Limited Speech Data
Larcher et al.Imposture classification for text-dependent speaker verification
Gammal et al.Combating reverberation in speaker verification
Karakos et al.Individual ship detection using underwater acoustics
Webb et al.Speaker identification experiments using HMMs
Munteanu et al.Automatic speaker verification experiments using HMM
Zigel et al.On cohort selection for speaker verification.
Bouwman et al.Weighting phone confidence measures for automatic speech recognition
Villalba et al.Bayesian networks to model the variability of speaker verification scores in adverse environments
Mak et al.A new two-stage scoring normalization approach to speaker verification
Tsao et al.An acoustic segment model approach to incorporating temporal information into speaker modeling for text-independent speaker recognition
Chakroun et al.An improved approach for text-independent speaker recognition

[8]
ページ先頭

©2009-2025 Movatter.jp