Movatterモバイル変換


[0]ホーム

URL:


EP0167364A1 - Speech-silence detection with subband coding - Google Patents

Speech-silence detection with subband coding
Download PDF

Info

Publication number
EP0167364A1
EP0167364A1EP85304627AEP85304627AEP0167364A1EP 0167364 A1EP0167364 A1EP 0167364A1EP 85304627 AEP85304627 AEP 85304627AEP 85304627 AEP85304627 AEP 85304627AEP 0167364 A1EP0167364 A1EP 0167364A1
Authority
EP
European Patent Office
Prior art keywords
speech
value
statistic
detection
step size
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Withdrawn
Application number
EP85304627A
Other languages
German (de)
French (fr)
Inventor
Marc Bernard Donvito
Brian William Schoenherr
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
AT&T Corp
Original Assignee
American Telephone and Telegraph Co Inc
AT&T Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by American Telephone and Telegraph Co Inc, AT&T CorpfiledCriticalAmerican Telephone and Telegraph Co Inc
Publication of EP0167364A1publicationCriticalpatent/EP0167364A1/en
Withdrawnlegal-statusCriticalCurrent

Links

Images

Classifications

Definitions

Landscapes

Abstract

Speech detection is accomplished in conjunction with two-band subband encoding. A detection statistic T (iTo), used to estimate the short-term speech energy, is developed from energy estimates made in each subband. A speech presence energy threshold λON a speech silence energy threshold λOFF and λOFF are computed which adapt to the long-term speech level. The detection statistic is compared to the thresholds to make a decision concerning the presence or absence of speech.
Also disclosed are considerations for extrapolating the detection to result in an arrangement with more than two subbands.

Description

Claims (9)

1. Signal encoding apparatus
CHARACTERIZED BY
means for encoding a plurality of frequency subband portions of a signal, including means for generating voltage step size values for signal samples of each subband;
means for computing speech statistic values based on the voltage step size values for the one frequency subband and the voltage step size values for another of the frequency subbands; and
means for comparing speech presence energy threshold values and speech silence energy threshold values to the speech statistic values to selectively generate speech presence output signals.
2. The apparatus defined in claim 1 wherein said speech statistic value computing means is
CHARACTERIZED BY
means for multiplying the step size values of each subband by a corresponding speech detection coefficient to generate respective speech detection value products;
means for summing the speech detection value products to generate speech detection value sums, and
means for smoothing the speech detection value sum.
3. The apparatus defined in claim 2
CHARACTERIZED IN THAT
said smoothing means comprises means for summing each speech detection value sum with a delay value to generate a speech detection statistic output value, the delay value being the product of a detection constant and a previous detection statistic output value.
4. The apparatus defined in claim 3
CHARACTERIZED BY
means for computing speech energy threshold values and speech silence threshold values based on the voltage step size values for one of the subbands.
5. The apparatus defined in claim 4 wherein said speech statistic value computing means is
CHARACTERIZED BY
means for generating a speech presence threshold value and a speech silence value from a maximum energy level value, the maximum energy level value being generated by choosing the maximum of first and second energy levels, the first energy level being the product of a step size value of the low frequency subband and the second energy level being the larger of the previous sample maximum energy level value multiplied by a coefficient and a lower limit.
6. The apparatus defined in claim 5
CHARACTERIZED BY
switch means which connect either the speech threshold value or the speech silence value from the generating means to a one input of a comparator in response to a control signal, the other input of the comparator being connected to receive the speech detection statistic, and
feedback means including a one-sample delay means connected between the output of said comparator and said switch for generating the control signals.
7. A method of detecting the presence of speech content in a signal,
CHARACTERIZED BY
computing a short term speech statistic from the step size value information of at least two of the subbands, and
comparing the speech statistic to a long term speech energy threshold to selectively generate a speech presence indication signal.
8. The method defined in claim 7 further
CHARACTERIZED BY
computing a long term speech energy threshold from the step size information of at least one of the subbands.
9. The method defined in claim 8
CHARACTERIZED BY
giving greater weight to the step size values for a higher frequency subband than to those of a lower frequency subband when computing the short term speech statistic.
EP85304627A1984-07-061985-06-28Speech-silence detection with subband codingWithdrawnEP0167364A1 (en)

Applications Claiming Priority (2)

Application NumberPriority DateFiling DateTitle
US62858384A1984-07-061984-07-06
US6285832003-07-29

Publications (1)

Publication NumberPublication Date
EP0167364A1true EP0167364A1 (en)1986-01-08

Family

ID=24519496

Family Applications (1)

Application NumberTitlePriority DateFiling Date
EP85304627AWithdrawnEP0167364A1 (en)1984-07-061985-06-28Speech-silence detection with subband coding

Country Status (2)

CountryLink
EP (1)EP0167364A1 (en)
JP (1)JPS6132900A (en)

Cited By (7)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
EP0341128A1 (en)*1988-05-041989-11-08Thomson-CsfMethod and apparatus for speech signal detection
EP0565947A1 (en)*1992-04-131993-10-20NOKIA TECHNOLOGY GmbHProcedure for including digital information in an audio signal prior to channel coding
WO1996002911A1 (en)*1992-10-051996-02-01Matsushita Electric Industrial Co., Ltd.Speech detection device
WO1996005592A1 (en)*1994-08-101996-02-22Qualcomm IncorporatedMethod and apparatus for selecting an encoding rate in a variable rate vocoder
US5826230A (en)*1994-07-181998-10-20Matsushita Electric Industrial Co., Ltd.Speech detection device
WO2000042600A3 (en)*1999-01-182000-09-28Nokia Mobile Phones LtdMethod in speech recognition and a speech recognition device
US6182035B1 (en)*1998-03-262001-01-30Telefonaktiebolaget Lm Ericsson (Publ)Method and apparatus for detecting voice activity

Citations (3)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
FR2451680A1 (en)*1979-03-121980-10-10Soumagne Joel SPEECH / SILENCE DISCRIMINATOR FOR SPEECH INTERPOLATION
DE3235279A1 (en)*1981-09-251983-04-21Nissan Motor Co., Ltd., Yokohama, Kanagawa VOICE RECOGNITION DEVICE
EP0110467A1 (en)*1982-11-231984-06-13Philips Kommunikations Industrie AGArrangement for the detection of speech intervals

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
FR2451680A1 (en)*1979-03-121980-10-10Soumagne Joel SPEECH / SILENCE DISCRIMINATOR FOR SPEECH INTERPOLATION
DE3235279A1 (en)*1981-09-251983-04-21Nissan Motor Co., Ltd., Yokohama, Kanagawa VOICE RECOGNITION DEVICE
EP0110467A1 (en)*1982-11-231984-06-13Philips Kommunikations Industrie AGArrangement for the detection of speech intervals

Non-Patent Citations (3)

* Cited by examiner, † Cited by third party
Title
IEEE TRANSACTIONS ON ACOUSTICS, SPEECH, AND SIGNAL PROCESSING, vol. ASSP-28, no. 5, October 1980, pages 550-561, IEEE, New York, US; B.V. COX et al.: "Nonparametric rank-order statistics applied to robust-voiced-unvoiced-silence classification"*
IEEE TRANSACTIONS ON COMMUNICATIONS, vol. COM-24, no. 5, May 1976, pages 563-567, New York, US; R.W. SCHAFER et al.: "Detecting the presence of speech using ADPCM coding"*
TELECOMMUNICATIONS AND RADIO ENGINEERING, vol. 4, April 1965, pages 70-72, Washington, US; V.N. TETEREV: "A combinatorial method of detecting speech signals in a background of smooth noise"*

Cited By (15)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
EP0341128A1 (en)*1988-05-041989-11-08Thomson-CsfMethod and apparatus for speech signal detection
FR2631147A1 (en)*1988-05-041989-11-10Thomson Csf METHOD AND DEVICE FOR DETECTING VOICE SIGNALS
US4982341A (en)*1988-05-041991-01-01Thomson CsfMethod and device for the detection of vocal signals
EP0565947A1 (en)*1992-04-131993-10-20NOKIA TECHNOLOGY GmbHProcedure for including digital information in an audio signal prior to channel coding
WO1996002911A1 (en)*1992-10-051996-02-01Matsushita Electric Industrial Co., Ltd.Speech detection device
US5826230A (en)*1994-07-181998-10-20Matsushita Electric Industrial Co., Ltd.Speech detection device
US5742734A (en)*1994-08-101998-04-21Qualcomm IncorporatedEncoding rate selection in a variable rate vocoder
WO1996005592A1 (en)*1994-08-101996-02-22Qualcomm IncorporatedMethod and apparatus for selecting an encoding rate in a variable rate vocoder
AU711401B2 (en)*1994-08-101999-10-14Qualcomm IncorporatedMethod and apparatus for selecting an encoding rate in a variable rate vocoder
EP1233408A1 (en)*1994-08-102002-08-21QUALCOMM IncorporatedMethod and apparatus for selecting an encoding rate in a variable rate vocoder
KR100455826B1 (en)*1994-08-102005-04-06콸콤 인코포레이티드 Method and apparatus for selecting encoding rate of variable rate vocoder
EP1530201A3 (en)*1994-08-102005-08-10QUALCOMM IncorporatedMethod and apparatus for selecting an encoding rate in a variable rate vocoder
CN1320521C (en)*1994-08-102007-06-06高通股份有限公司 Method and apparatus for selecting a coding rate in a variable-rate vocoder
US6182035B1 (en)*1998-03-262001-01-30Telefonaktiebolaget Lm Ericsson (Publ)Method and apparatus for detecting voice activity
WO2000042600A3 (en)*1999-01-182000-09-28Nokia Mobile Phones LtdMethod in speech recognition and a speech recognition device

Also Published As

Publication numberPublication date
JPS6132900A (en)1986-02-15

Similar Documents

PublicationPublication DateTitle
EP0145332B1 (en)Digital audio transmission
CA1181857A (en)Silence editing speech processor
CA1139884A (en)Half duplex integral vocoder modem system
US4912763A (en)Process for multirate encoding signals and device for implementing said process
EP0099397B1 (en)Adaptive differential pcm coding
US5125030A (en)Speech signal coding/decoding system based on the type of speech signal
KR950014622B1 (en) Input signal processing method
US7933216B2 (en)Method and apparatus for coding modem signals for transmission over voice networks
EP0228696A2 (en)ADPCM coder-decoder including partial band energy transition detection
NO146521B (en) PROCEDURE AND DEVICE FOR DETECTING PRESENCE OR REQUIREMENTS OF A VOICE SIGNAL ON A VOICE CHANNEL
US4464782A (en)Transmission process and device for implementing the so-improved process
EP1042861B1 (en)Device and method for detecting pcm upstream digital impairments in a communication network
US6424940B1 (en)Method and system for determining gain scaling compensation for quantization
US4319082A (en)Adaptive prediction differential-PCM transmission method and circuit using filtering by sub-bands and spectral analysis
CA1288867C (en)Adaptive differential pulse code modulation system
EP0049271B1 (en)Predictive signals coding with partitioned quantization
HK1040342A1 (en)Method and apparatus for identifying the encoding type of a central office codec
EP0167364A1 (en)Speech-silence detection with subband coding
US5298899A (en)PCM encoder and decoder using exkrema
CA1321025C (en)Speech signal coding/decoding system
US4912765A (en)Voice band data rate detector
Raulin et al.A 60 Channel PCM-ADPCM Converter
US6553074B1 (en)Method and device for combating PCM line impairments
JPH0242258B2 (en)
CointotA 32-kbit/sec ADPCM coder robust to channel errors

Legal Events

DateCodeTitleDescription
PUAIPublic reference made under article 153(3) epc to a published international application that has entered the european phase

Free format text:ORIGINAL CODE: 0009012

AKDesignated contracting states

Designated state(s):DE FR

17PRequest for examination filed

Effective date:19860611

17QFirst examination report despatched

Effective date:19871020

STAAInformation on the status of an ep patent application or granted ep patent

Free format text:STATUS: THE APPLICATION IS DEEMED TO BE WITHDRAWN

18DApplication deemed to be withdrawn

Effective date:19880104

RIN1Information on inventor provided before grant (corrected)

Inventor name:SCHOENHERR, BRIAN WILLIAM

Inventor name:DONVITO, MARC BERNARD


[8]ページ先頭

©2009-2025 Movatter.jp