Movatterモバイル変換


[0]ホーム

URL:


US20040158465A1 - Speech processing apparatus and method - Google Patents

Speech processing apparatus and method
Download PDF

Info

Publication number
US20040158465A1
US20040158465A1US10/770,421US77042104AUS2004158465A1US 20040158465 A1US20040158465 A1US 20040158465A1US 77042104 AUS77042104 AUS 77042104AUS 2004158465 A1US2004158465 A1US 2004158465A1
Authority
US
United States
Prior art keywords
energy
signal
speech
determining
likelihood
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US10/770,421
Inventor
David Rees
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Canon Inc
Original Assignee
Canon Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Priority claimed from GBGB9822932.1Aexternal-prioritypatent/GB9822932D0/en
Priority claimed from GBGB9822928.9Aexternal-prioritypatent/GB9822928D0/en
Application filed by Canon IncfiledCriticalCanon Inc
Priority to US10/770,421priorityCriticalpatent/US20040158465A1/en
Publication of US20040158465A1publicationCriticalpatent/US20040158465A1/en
Abandonedlegal-statusCriticalCurrent

Links

Images

Classifications

Definitions

Landscapes

Abstract

An apparatus is provided for detecting the presence of speech within an input speech signal. Speech is detected by treating the average frame energy of an input speech signal as a sampled signal and looking for modulations within the sampled signal that are characteristic of speech.

Description

Claims (52)

US10/770,4211998-10-202004-02-04Speech processing apparatus and methodAbandonedUS20040158465A1 (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
US10/770,421US20040158465A1 (en)1998-10-202004-02-04Speech processing apparatus and method

Applications Claiming Priority (6)

Application NumberPriority DateFiling DateTitle
GB9822928.91998-10-20
GBGB9822932.1AGB9822932D0 (en)1998-10-201998-10-20Speech processing apparatus and method
GBGB9822928.9AGB9822928D0 (en)1998-10-201998-10-20Speech processing apparatus and method
GB9822932.11998-10-20
US09/409,247US6711536B2 (en)1998-10-201999-09-30Speech processing apparatus and method
US10/770,421US20040158465A1 (en)1998-10-202004-02-04Speech processing apparatus and method

Related Parent Applications (1)

Application NumberTitlePriority DateFiling Date
US09/409,247DivisionUS6711536B2 (en)1998-10-201999-09-30Speech processing apparatus and method

Publications (1)

Publication NumberPublication Date
US20040158465A1true US20040158465A1 (en)2004-08-12

Family

ID=26314539

Family Applications (2)

Application NumberTitlePriority DateFiling Date
US09/409,247Expired - LifetimeUS6711536B2 (en)1998-10-201999-09-30Speech processing apparatus and method
US10/770,421AbandonedUS20040158465A1 (en)1998-10-202004-02-04Speech processing apparatus and method

Family Applications Before (1)

Application NumberTitlePriority DateFiling Date
US09/409,247Expired - LifetimeUS6711536B2 (en)1998-10-201999-09-30Speech processing apparatus and method

Country Status (4)

CountryLink
US (2)US6711536B2 (en)
EP (1)EP0996110B1 (en)
JP (1)JP4484283B2 (en)
DE (1)DE69926851T2 (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US20030061037A1 (en)*2001-09-272003-03-27Droppo James G.Method and apparatus for identifying noise environments from noisy signals
US20060100866A1 (en)*2004-10-282006-05-11International Business Machines CorporationInfluencing automatic speech recognition signal-to-noise levels
US20080033723A1 (en)*2006-08-032008-02-07Samsung Electronics Co., Ltd.Speech detection method, medium, and system
US20090210224A1 (en)*2007-08-312009-08-20Takashi FukudaSystem, method and program for speech processing
US20120116754A1 (en)*2010-11-102012-05-10Broadcom CorporationNoise suppression in a mel-filtered spectral domain
US20120271632A1 (en)*2011-04-252012-10-25Microsoft CorporationSpeaker Identification
US20130096915A1 (en)*2011-10-172013-04-18Nuance Communications, Inc.System and Method for Dynamic Noise Adaptation for Robust Automatic Speech Recognition
US20170084295A1 (en)*2015-09-182017-03-23Sri InternationalReal-time speaker state analytics platform
US10478111B2 (en)2014-08-222019-11-19Sri InternationalSystems for speech-based assessment of a patient's state-of-mind

Families Citing this family (52)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US6711536B2 (en)*1998-10-202004-03-23Canon Kabushiki KaishaSpeech processing apparatus and method
US6327564B1 (en)*1999-03-052001-12-04Matsushita Electric Corporation Of AmericaSpeech detection using stochastic confidence measures on the frequency spectrum
US6868380B2 (en)*2000-03-242005-03-15Eliza CorporationSpeech recognition system and method for generating phonotic estimates
WO2002029780A2 (en)*2000-10-042002-04-11Clarity, LlcSpeech detection with source separation
JP2002132287A (en)*2000-10-202002-05-09Canon Inc Voice recording method, voice recording device, and storage medium
US6850887B2 (en)*2001-02-282005-02-01International Business Machines CorporationSpeech recognition in noisy environments
ATE288615T1 (en)*2001-03-142005-02-15Ibm METHOD AND PROCESSOR SYSTEM FOR AUDIO SIGNAL PROCESSING
GB2380644A (en)*2001-06-072003-04-09Canon KkSpeech detection
US7299173B2 (en)*2002-01-302007-11-20Motorola Inc.Method and apparatus for speech detection using time-frequency variance
CA2773294C (en)*2002-05-032013-03-12Harman International Industries, IncorporatedSound detection and localization system
US7072828B2 (en)*2002-05-132006-07-04Avaya Technology Corp.Apparatus and method for improved voice activity detection
US20040064314A1 (en)*2002-09-272004-04-01Aubert Nicolas De SaintMethods and apparatus for speech end-point detection
US7895036B2 (en)*2003-02-212011-02-22Qnx Software Systems Co.System for suppressing wind noise
US8073689B2 (en)*2003-02-212011-12-06Qnx Software Systems Co.Repetitive transient noise removal
US7885420B2 (en)*2003-02-212011-02-08Qnx Software Systems Co.Wind noise suppression system
US7949522B2 (en)2003-02-212011-05-24Qnx Software Systems Co.System for suppressing rain noise
US8271279B2 (en)2003-02-212012-09-18Qnx Software Systems LimitedSignature noise removal
US7725315B2 (en)*2003-02-212010-05-25Qnx Software Systems (Wavemakers), Inc.Minimization of transient noises in a voice signal
US8326621B2 (en)2003-02-212012-12-04Qnx Software Systems LimitedRepetitive transient noise removal
JP4348970B2 (en)*2003-03-062009-10-21ソニー株式会社 Information detection apparatus and method, and program
US8918316B2 (en)*2003-07-292014-12-23Alcatel LucentContent identification system
GB2405949A (en)*2003-09-122005-03-16Canon KkVoice activated device with periodicity determination
GB2405948B (en)*2003-09-122006-06-28Canon Res Ct Europ LtdVoice activated device
US7756709B2 (en)*2004-02-022010-07-13Applied Voice & Speech Technologies, Inc.Detection of voice inactivity within a sound stream
WO2006008810A1 (en)*2004-07-212006-01-26Fujitsu LimitedSpeed converter, speed converting method and program
WO2006077626A1 (en)*2005-01-182006-07-27Fujitsu LimitedSpeech speed changing method, and speech speed changing device
FR2881867A1 (en)*2005-02-042006-08-11France Telecom METHOD FOR TRANSMITTING END-OF-SPEECH MARKS IN A SPEECH RECOGNITION SYSTEM
US8219391B2 (en)*2005-02-152012-07-10Raytheon Bbn Technologies Corp.Speech analyzing system with speech codebook
US7962340B2 (en)*2005-08-222011-06-14Nuance Communications, Inc.Methods and apparatus for buffering data for use in accordance with a speech recognition system
US7697827B2 (en)2005-10-172010-04-13Konicek Jeffrey CUser-friendlier interfaces for a camera
WO2008007616A1 (en)*2006-07-132008-01-17Nec CorporationNon-audible murmur input alarm device, method, and program
US8775168B2 (en)*2006-08-102014-07-08Stmicroelectronics Asia Pacific Pte, Ltd.Yule walker based low-complexity voice activity detector in noise suppression systems
KR100897554B1 (en)*2007-02-212009-05-15삼성전자주식회사 Distributed speech recognition system and method and terminal for distributed speech recognition
US8473282B2 (en)2008-01-252013-06-25Yamaha CorporationSound processing device and program
JP5169297B2 (en)*2008-02-222013-03-27ヤマハ株式会社 Sound processing apparatus and program
US8190440B2 (en)*2008-02-292012-05-29Broadcom CorporationSub-band codec with native voice activity detection
US8762150B2 (en)2010-09-162014-06-24Nuance Communications, Inc.Using codec parameters for endpoint detection in speech recognition
CN104221079B (en)*2012-02-212017-03-01塔塔顾问服务有限公司Carry out the improved Mel filter bank structure of phonetic analysiss using spectral characteristic
US9060052B2 (en)2013-03-132015-06-16Accusonus S.A.Single channel, binaural and multi-channel dereverberation
CN104599675A (en)*2015-02-092015-05-06宇龙计算机通信科技(深圳)有限公司Speech processing method, device and terminal
US10134425B1 (en)*2015-06-292018-11-20Amazon Technologies, Inc.Direction-based speech endpointing
CN106157951B (en)*2016-08-312019-04-23北京华科飞扬科技股份公司Carry out the automatic method for splitting and system of audio punctuate
CN106373592B (en)*2016-08-312019-04-23北京华科飞扬科技股份公司Audio holds processing method and the system of making pauses in reading unpunctuated ancient writings of making an uproar
JP2018072723A (en)*2016-11-022018-05-10ヤマハ株式会社Acoustic processing method and sound processing apparatus
US11216724B2 (en)*2017-12-072022-01-04Intel CorporationAcoustic event detection based on modelling of sequence of event subparts
JP6838588B2 (en)*2018-08-282021-03-03横河電機株式会社 Voice analyzers, voice analysis methods, programs, and recording media
WO2020177120A1 (en)*2019-03-072020-09-10Harman International Industries, IncorporatedMethod and system for speech sepatation
CN110136715B (en)2019-05-162021-04-06北京百度网讯科技有限公司 Speech recognition method and device
US11170760B2 (en)*2019-06-212021-11-09Robert Bosch GmbhDetecting speech activity in real-time in audio signal
CN113593539B (en)*2020-04-302024-08-02阿里巴巴集团控股有限公司Stream end-to-end voice recognition method and device and electronic equipment
TWI748587B (en)*2020-08-042021-12-01瑞昱半導體股份有限公司Acoustic event detection system and method
US20230419965A1 (en)*2022-06-222023-12-28Cerence Operating CompanyEmotion detection in barge-in analysis

Citations (25)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US3873926A (en)*1974-05-031975-03-25Motorola IncAudio frequency squelch system
US3873925A (en)*1974-03-071975-03-25Motorola IncAudio frequency squelch system
US4187396A (en)*1977-06-091980-02-05Harris CorporationVoice detector circuit
US4481593A (en)*1981-10-051984-11-06Exxon CorporationContinuous speech recognition
US4484344A (en)*1982-03-011984-11-20Rockwell International CorporationVoice operated switch
US4489434A (en)*1981-10-051984-12-18Exxon CorporationSpeech recognition method and apparatus
US4718092A (en)*1984-03-271988-01-05Exxon Research And Engineering CompanySpeech recognition activation and deactivation method
US4870686A (en)*1987-10-191989-09-26Motorola, Inc.Method for entering digit sequences by voice command
US4956865A (en)*1985-01-301990-09-11Northern Telecom LimitedSpeech recognition
US5305422A (en)*1992-02-281994-04-19Panasonic Technologies, Inc.Method for determining boundaries of isolated words within a speech signal
US5473726A (en)*1993-07-061995-12-05The United States Of America As Represented By The Secretary Of The Air ForceAudio and amplitude modulated photo data collection for speech recognition
US5572623A (en)*1992-10-211996-11-05Sextant AvioniqueMethod of speech detection
US5617508A (en)*1992-10-051997-04-01Panasonic Technologies Inc.Speech detection device for the detection of speech end points based on variance of frequency band limited energy
US5638487A (en)*1994-12-301997-06-10Purespeech, Inc.Automatic speech recognition
US5649055A (en)*1993-03-261997-07-15Hughes ElectronicsVoice activity detector for speech signals in variable background noise
US5692104A (en)*1992-12-311997-11-25Apple Computer, Inc.Method and apparatus for detecting end points of speech activity
US5778342A (en)*1996-02-011998-07-07Dspc Israel Ltd.Pattern recognition system and method
US5794195A (en)*1994-06-281998-08-11Alcatel N.V.Start/end point detection for word recognition
US5812973A (en)*1994-09-301998-09-22Motorola, Inc.Method and system for recognizing a boundary between contiguous sounds for use with a speech recognition system
US5842161A (en)*1996-06-251998-11-24Lucent Technologies Inc.Telecommunications instrument employing variable criteria speech recognition
US6138095A (en)*1998-09-032000-10-24Lucent Technologies Inc.Speech recognition
US6249757B1 (en)*1999-02-162001-06-193Com CorporationSystem for detecting voice activity
US6411925B1 (en)*1998-10-202002-06-25Canon Kabushiki KaishaSpeech processing apparatus and method for noise masking
US6560575B1 (en)*1998-10-202003-05-06Canon Kabushiki KaishaSpeech processing apparatus and method
US6711536B2 (en)*1998-10-202004-03-23Canon Kabushiki KaishaSpeech processing apparatus and method

Family Cites Families (9)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
JPS5868097A (en)*1981-10-201983-04-22日産自動車株式会社Voice recognition equipment for vehicle
JPS6048100A (en)*1983-08-261985-03-15松下電器産業株式会社Voice recognition equipment
JPS60200300A (en)*1984-03-231985-10-09松下電器産業株式会社Voice head/end detector
JPS6148898A (en)*1984-08-161986-03-10松下電器産業株式会社 Voiced/unvoiced determination device
JPH0619498A (en)*1992-07-011994-01-28Fujitsu LtdSpeech detector
JPH07273738A (en)*1994-03-281995-10-20Toshiba Corp Voice transmission control circuit
US6570991B1 (en)1996-12-182003-05-27Interval Research CorporationMulti-feature speech/music discrimination system
JP2000047697A (en)*1998-07-302000-02-18Nec Eng LtdNoise canceler
JP3310225B2 (en)*1998-09-292002-08-05松下電器産業株式会社 Noise level time variation calculation method and apparatus, and noise reduction method and apparatus

Patent Citations (25)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US3873925A (en)*1974-03-071975-03-25Motorola IncAudio frequency squelch system
US3873926A (en)*1974-05-031975-03-25Motorola IncAudio frequency squelch system
US4187396A (en)*1977-06-091980-02-05Harris CorporationVoice detector circuit
US4481593A (en)*1981-10-051984-11-06Exxon CorporationContinuous speech recognition
US4489434A (en)*1981-10-051984-12-18Exxon CorporationSpeech recognition method and apparatus
US4484344A (en)*1982-03-011984-11-20Rockwell International CorporationVoice operated switch
US4718092A (en)*1984-03-271988-01-05Exxon Research And Engineering CompanySpeech recognition activation and deactivation method
US4956865A (en)*1985-01-301990-09-11Northern Telecom LimitedSpeech recognition
US4870686A (en)*1987-10-191989-09-26Motorola, Inc.Method for entering digit sequences by voice command
US5305422A (en)*1992-02-281994-04-19Panasonic Technologies, Inc.Method for determining boundaries of isolated words within a speech signal
US5617508A (en)*1992-10-051997-04-01Panasonic Technologies Inc.Speech detection device for the detection of speech end points based on variance of frequency band limited energy
US5572623A (en)*1992-10-211996-11-05Sextant AvioniqueMethod of speech detection
US5692104A (en)*1992-12-311997-11-25Apple Computer, Inc.Method and apparatus for detecting end points of speech activity
US5649055A (en)*1993-03-261997-07-15Hughes ElectronicsVoice activity detector for speech signals in variable background noise
US5473726A (en)*1993-07-061995-12-05The United States Of America As Represented By The Secretary Of The Air ForceAudio and amplitude modulated photo data collection for speech recognition
US5794195A (en)*1994-06-281998-08-11Alcatel N.V.Start/end point detection for word recognition
US5812973A (en)*1994-09-301998-09-22Motorola, Inc.Method and system for recognizing a boundary between contiguous sounds for use with a speech recognition system
US5638487A (en)*1994-12-301997-06-10Purespeech, Inc.Automatic speech recognition
US5778342A (en)*1996-02-011998-07-07Dspc Israel Ltd.Pattern recognition system and method
US5842161A (en)*1996-06-251998-11-24Lucent Technologies Inc.Telecommunications instrument employing variable criteria speech recognition
US6138095A (en)*1998-09-032000-10-24Lucent Technologies Inc.Speech recognition
US6411925B1 (en)*1998-10-202002-06-25Canon Kabushiki KaishaSpeech processing apparatus and method for noise masking
US6560575B1 (en)*1998-10-202003-05-06Canon Kabushiki KaishaSpeech processing apparatus and method
US6711536B2 (en)*1998-10-202004-03-23Canon Kabushiki KaishaSpeech processing apparatus and method
US6249757B1 (en)*1999-02-162001-06-193Com CorporationSystem for detecting voice activity

Cited By (19)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US20050071157A1 (en)*2001-09-272005-03-31Microsoft CorporationMethod and apparatus for identifying noise environments from noisy signals
US6959276B2 (en)*2001-09-272005-10-25Microsoft CorporationIncluding the category of environmental noise when processing speech signals
US7266494B2 (en)2001-09-272007-09-04Microsoft CorporationMethod and apparatus for identifying noise environments from noisy signals
US20030061037A1 (en)*2001-09-272003-03-27Droppo James G.Method and apparatus for identifying noise environments from noisy signals
US20060100866A1 (en)*2004-10-282006-05-11International Business Machines CorporationInfluencing automatic speech recognition signal-to-noise levels
US20080033723A1 (en)*2006-08-032008-02-07Samsung Electronics Co., Ltd.Speech detection method, medium, and system
US9009048B2 (en)*2006-08-032015-04-14Samsung Electronics Co., Ltd.Method, medium, and system detecting speech using energy levels of speech frames
US8812312B2 (en)*2007-08-312014-08-19International Business Machines CorporationSystem, method and program for speech processing
US20090210224A1 (en)*2007-08-312009-08-20Takashi FukudaSystem, method and program for speech processing
US8942975B2 (en)*2010-11-102015-01-27Broadcom CorporationNoise suppression in a Mel-filtered spectral domain
US20120116754A1 (en)*2010-11-102012-05-10Broadcom CorporationNoise suppression in a mel-filtered spectral domain
US8719019B2 (en)*2011-04-252014-05-06Microsoft CorporationSpeaker identification
US20120271632A1 (en)*2011-04-252012-10-25Microsoft CorporationSpeaker Identification
US20130096915A1 (en)*2011-10-172013-04-18Nuance Communications, Inc.System and Method for Dynamic Noise Adaptation for Robust Automatic Speech Recognition
US8972256B2 (en)*2011-10-172015-03-03Nuance Communications, Inc.System and method for dynamic noise adaptation for robust automatic speech recognition
US9741341B2 (en)2011-10-172017-08-22Nuance Communications, Inc.System and method for dynamic noise adaptation for robust automatic speech recognition
US10478111B2 (en)2014-08-222019-11-19Sri InternationalSystems for speech-based assessment of a patient's state-of-mind
US20170084295A1 (en)*2015-09-182017-03-23Sri InternationalReal-time speaker state analytics platform
US10706873B2 (en)*2015-09-182020-07-07Sri InternationalReal-time speaker state analytics platform

Also Published As

Publication numberPublication date
EP0996110B1 (en)2005-08-24
JP2000132177A (en)2000-05-12
DE69926851D1 (en)2005-09-29
EP0996110A1 (en)2000-04-26
DE69926851T2 (en)2006-06-08
US20030055639A1 (en)2003-03-20
JP4484283B2 (en)2010-06-16
US6711536B2 (en)2004-03-23

Similar Documents

PublicationPublication DateTitle
US6711536B2 (en)Speech processing apparatus and method
US6411925B1 (en)Speech processing apparatus and method for noise masking
JP3604393B2 (en) Voice detection device
US7756707B2 (en)Signal processing apparatus and method
US6289309B1 (en)Noise spectrum tracking for speech enhancement
US6415253B1 (en)Method and apparatus for enhancing noise-corrupted speech
US8165880B2 (en)Speech end-pointer
US7542900B2 (en)Noise reduction using correction vectors based on dynamic aspects of speech and noise normalization
US6453285B1 (en)Speech activity detector for use in noise reduction system, and methods therefor
US5970441A (en)Detection of periodicity information from an audio signal
US6560575B1 (en)Speech processing apparatus and method
US5579431A (en)Speech detection in presence of noise by determining variance over time of frequency band limited energy
US20020165713A1 (en)Detection of sound activity
JP3451146B2 (en) Denoising system and method using spectral subtraction
JP3105465B2 (en) Voice section detection method
US7165031B2 (en)Speech processing apparatus and method using confidence scores
KR100429896B1 (en)Speech detection apparatus under noise environment and method thereof
US20030046069A1 (en)Noise reduction system and method
JP2000163099A (en) Noise removal device, speech recognition device, and storage medium
JPH0844390A (en) Voice recognition device
GB2354363A (en)Apparatus detecting the presence of speech
CN1131472A (en)Speech detection device
WO2001031640A1 (en)Elimination of noise from a speech signal
JP2002189495A (en) Speech recognition feature extractor and speech recognition device

Legal Events

DateCodeTitleDescription
STCBInformation on status: application discontinuation

Free format text:ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION


[8]ページ先頭

©2009-2025 Movatter.jp