Zhang et al., 2000 - Google Patents

A two-stage scoring method combining world and cohort models for speaker verification

Zhang et al., 2000

Document ID: 2377414777626998843
Author: Zhang W; Mak M; He X
Publication year: 2000
Publication venue: 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No. 00CH37100)

External Links

Cited by

Snippet

The cohort and world models are commonly used for scoring normalization in speaker verification. As these models represent different regions of the feature space, a better solution could be obtained by integrating them into a single framework. In this paper, we …

Continue reading atwww.researchgate.net (PDF) (other versions)

238000010606normalization0abstractdescription14

Classifications

The classifications are assigned by a computer and are not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the classifications listed.

- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/06—Creation of reference templates; Training of speech recognition systems, e.g. adaptation to the characteristics of the speaker's voice
- G10L15/065—Adaptation
- G10L15/07—Adaptation to the speaker
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/04—Training, enrolment or model building
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/18—Speech classification or search using natural language modelling
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L15/00—Speech recognition
- G10L15/08—Speech classification or search
- G10L15/14—Speech classification or search using statistical models, e.g. hidden Markov models [HMMs]
- G10L15/142—Hidden Markov Models [HMMs]
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/06—Decision making techniques; Pattern matching strategies
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L25/00—Speech or voice analysis techniques not restricted to a single one of groups G10L15/00-G10L21/00
- G10L25/78—Detection of presence or absence of voice signals
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L17/00—Speaker identification or verification
- G10L17/005—Speaker recognisers specially adapted for particular applications
- G—PHYSICS
- G10—MUSICAL INSTRUMENTS; ACOUSTICS
- G10L—SPEECH ANALYSIS OR SYNTHESIS; SPEECH RECOGNITION; SPEECH OR VOICE PROCESSING; SPEECH OR AUDIO CODING OR DECODING
- G10L19/00—Speech or audio signal analysis-synthesis techniques for redundancy reduction, e.g. in vocoders; Coding or decoding of speech or audio signal, using source filter models or psychoacoustic analysis

Similar Documents

Publication	Publication Date	Title
Paul et al.	2009	Bangla speech recognition system using LPC and ANN
EP2364495B1 (en)	2016-10-12	Method for verifying the identify of a speaker and related computer readable medium and computer
US9489950B2 (en)	2016-11-08	Method and system for dual scoring for text-dependent speaker verification
US5950157A (en)	1999-09-07	Method for establishing handset-dependent normalizing models for speaker recognition
US7822605B2 (en)	2010-10-26	Method and apparatus for large population speaker identification in telephone interactions
JPH11507443A (en)	1999-06-29	Speaker identification system
Angkititrakul et al.	2007	Discriminative in-set/out-of-set speaker recognition
Ozaydin	2017	Design of a text independent speaker recognition system
Rouvier et al.	2021	Study on the temporal pooling used in deep neural networks for speaker verification
Chakroun et al.	2016	Improving text-independent speaker recognition with GMM
Singh et al.	2016	Replay attack: Its effect on GMM-UBM based text-independent speaker verification system
Wildermoth et al.	2003	GMM based speaker recognition on readily available databases
Zhang et al.	2000	A two-stage scoring method combining world and cohort models for speaker verification
Singh et al.	2016	Efficient Modelling Technique based Speaker Recognition under Limited Speech Data
Larcher et al.	2014	Imposture classification for text-dependent speaker verification
Gammal et al.	2005	Combating reverberation in speaker verification
Karakos et al.	2018	Individual ship detection using underwater acoustics
Webb et al.	1993	Speaker identification experiments using HMMs
Munteanu et al.	2010	Automatic speaker verification experiments using HMM
Zigel et al.	2003	On cohort selection for speaker verification.
Bouwman et al.	2000	Weighting phone confidence measures for automatic speech recognition
Villalba et al.	2016	Bayesian networks to model the variability of speaker verification scores in adverse environments
Mak et al.	2001	A new two-stage scoring normalization approach to speaker verification
Tsao et al.	2010	An acoustic segment model approach to incorporating temporal information into speaker modeling for text-independent speaker recognition
Chakroun et al.	2016	An improved approach for text-independent speaker recognition

Movatterモバイル変換

Zhang et al., 2000 - Google Patents

External Links

Snippet

Classifications

Similar Documents