Movatterモバイル変換


[0]ホーム

URL:


US7254535B2 - Method and apparatus for equalizing a speech signal generated within a pressurized air delivery system - Google Patents

Method and apparatus for equalizing a speech signal generated within a pressurized air delivery system
Download PDF

Info

Publication number
US7254535B2
US7254535B2US10/882,450US88245004AUS7254535B2US 7254535 B2US7254535 B2US 7254535B2US 88245004 AUS88245004 AUS 88245004AUS 7254535 B2US7254535 B2US 7254535B2
Authority
US
United States
Prior art keywords
noise
speech signal
inhalation
mask
noise model
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Lifetime, expires
Application number
US10/882,450
Other versions
US20060020451A1 (en
Inventor
William M. Kushner
Sara M. Harton
Mark A. Jasiuk
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Motorola Solutions Inc
Original Assignee
Motorola Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Motorola IncfiledCriticalMotorola Inc
Assigned to MOTOROLA, INC.reassignmentMOTOROLA, INC.ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS).Assignors: HARTON, SARA M., JASIUK, MARK A., KUSHNER, WILLIAM M.
Priority to US10/882,450priorityCriticalpatent/US7254535B2/en
Priority to JP2007519235Aprioritypatent/JP4468447B2/en
Priority to DE602005008974Tprioritypatent/DE602005008974D1/en
Priority to MXPA06015235Aprioritypatent/MXPA06015235A/en
Priority to CA2572715Aprioritypatent/CA2572715C/en
Priority to PCT/US2005/019827prioritypatent/WO2006007290A2/en
Priority to CNA2005800219855Aprioritypatent/CN101010731A/en
Priority to AU2005262623Aprioritypatent/AU2005262623B2/en
Priority to EP05756478Aprioritypatent/EP1769493B1/en
Publication of US20060020451A1publicationCriticalpatent/US20060020451A1/en
Publication of US7254535B2publicationCriticalpatent/US7254535B2/en
Application grantedgrantedCritical
Assigned to MOTOROLA SOLUTIONS, INC.reassignmentMOTOROLA SOLUTIONS, INC.CHANGE OF NAME (SEE DOCUMENT FOR DETAILS).Assignors: MOTOROLA, INC
Adjusted expirationlegal-statusCritical
Expired - Lifetimelegal-statusCriticalCurrent

Links

Images

Classifications

Definitions

Landscapes

Abstract

A method for equalizing a speech signal generated within a pressurized air delivery system, the method including the steps of: generating an inhalation noise model (1152) based on inhalation noise; receiving an input signal (802) that includes a speech signal; and equalizing the speech signal (1156) based on the noise model.

Description

REFERENCE TO RELATED APPLICATIONS
The present invention is related to the following U.S. applications commonly owned together with this application by Motorola, Inc.:
Ser. No. 10/882,452, filed Jun. 30, 2004, titled “Method an Apparatus for Detecting and Attenuating Inhalation Noise in a Communication System” by Harton, et al.; and
Ser. No. 10/882,715, filed Jun. 30, 2004, titled “Method and Apparatus for Characterizing Inhalation Noise and Calculating Parameters Based on the Characterization” by Kushner, et al.
FIELD OF THE INVENTION
The present invention relates generally to a pressurized air delivery system coupled to a communication system.
BACKGROUND OF THE INVENTION
Good, reliable communications among personnel engaged in hazardous environmental activities, such as fire fighting, are essential for accomplishing their missions while maintaining their own health and safety. Working conditions may require the use of a pressurized air delivery system such as, for instance, a Self Contained Breathing Apparatus (SCBA) mask and air delivery system, a Self Contained Underwater Breathing Apparatus (SCUBA) mask and air delivery system, or an aircraft oxygen mask system. However, even while personnel are using such pressurized air delivery systems, it is desirable that good, reliable communications be maintained and personnel health and safety be effectively monitored.
FIG. 1 illustrates a simple block diagram of aprior art system100 that includes a pressurizedair delivery system110 coupled to acommunication system130. The pressurized air delivery system typically includes: abreathing mask112, such as a SCBA mask; an air cylinder (not shown); aregulator118; and ahigh pressure hose120 connecting theregulator118 to the air cylinder. Depending upon the type ofair delivery system110 being used, thesystem110 may provide protection to a user by, for example: providing the user with clean breathing air; keeping harmful toxins from reaching the user's lungs; protecting the user's lungs from being burned by superheated air inside of a burning structure; protecting the user's lungs from water; and providing protection to the user from facial and respiratory burns. Moreover, in general the mask is considered a pressure demand breathing system because air is typically only supplied when the mask wearer inhales.
Communication system130 typically includes aconventional microphone132 that is designed to record the speech of the mask wearer and that may be mounted inside the mask, outside and attached to the mask, or held in the hand over a voicemitter port on themask112.Communication system130 further includes acommunication unit134 such as a two-way radio that the mask wearer can use to communicate her speech, for example, to other communication units. Themask microphone device132 may be connected directly to theradio134 or through an intermediaryelectronic processing device138. This connection may be through a conventional wire cable (e.g.,136), or could be done wirelessly using a conventional RF, infrared, or ultrasonic short-range transmitter/receiver system. The intermediaryelectronic processing device138 may be implemented, for instance, as a digital signal processor and may contain interface electronics, audio amplifiers, and battery power for the device and for the mask microphone.
There are some shortcomings associated with the use of systems such assystem100. These limitations will be described, for ease of illustration, by reference to the block diagram ofFIG. 2, which illustrates the mask-to-radio audio path ofsystem100 illustrated inFIG. 1. Speech input210 (e.g., Si(f)) from the lips enters the mask (e.g. a SCBA mask), which has an acoustic transfer function220 (e.g., MSK(f)) that is characterized by acoustic resonances and nulls. These resonances and nulls are due to the mask cavity volume and reflections of the sound from internal mask surfaces. These effects characterized by the transfer function MSK(f) distort the input speech waveform Si(f) and alter its spectral content. Another sound source isnoise230 generated from the breathing equipment (e.g. regulator inhalation noise) that also enters the mask and is affected by MSK(f). Another transfer function240 (e.g., NPk(f)) accounts for the fact that the noise is generated from a slightly different location in the mask than that of the speech. The speech and noise S(f) are converted from acoustical energy to an electronic signal by a microphone which has its own transfer function250 (e.g., MIC(f)). The microphone signal then typically passes through an audio amplifier and other circuitry, which also has a transfer function260 (e.g., MAA(f)). An output signal270 (e.g., So(f)) from MAA(f) may then be input into a radio for further processing and transmission.
Returning to the shortcomings of systems such assystem100, an example of such a shortcoming relates to the generation by these systems of loud acoustic noises as part of their operation. More specifically, these noises can significantly degrade the quality of communications, especially when used with electronic systems such as radios. One such noise that is a prominent audio artifact introduced by a pressurized air delivery system, like a SCBA system, is regulator inhalation noise, which is illustrated inFIG. 2 asbox230.
The regulator inhalation noise occurs as a broadband noise burst occurring every time the mask wearer inhales. Negative pressure in the mask causes the air regulator valve to open, allowing high-pressure air to enter the mask and producing a loud hissing sound. This noise is picked up by the mask communications system microphone along with ensuing speech, and has about the same energy as the speech. The inhalation noise generally does not mask the speech since it typically occurs only upon inhalation. However, it can cause problems—examples of which are described as follows. For example, the inhalation noise can trigger VOX (voice-operated switch) circuits, thereby opening and occupying radio channels and potentially interfering with other speakers on the same radio channel. Moreover, in communication systems that use digital radios, the inhalation noise can trigger VAD (Voice Activity Detector) algorithms causing noise estimate confusion in noise suppression algorithms farther down the radio signal processing chain. In addition, the inhalation noise is, in general, annoying to a listener.
A second shortcoming of systems such assystem100 is described below. These systems use masks that typically encompass the nose and mouth, or the entire face. The air system mask forms an enclosed air cavity of fixed geometry that exhibits a particular set of acoustic resonances and anti-resonances (nulls) that are a function of mask volume and internal reflective surface geometries, and that alters the spectral properties of speech produced within the mask. More specifically, in characterizing the air mask audio path (FIG. 2), the most challenging part of the system is the acoustic transfer function (220) from the speaker's lips to the mask microphone. These spectral distortions can significantly degrade the performance of attached speech communication systems, especially systems using parametric digital codecs that are not optimized to handle corrupted speech. Acoustic mask distortion has been shown to affect communication system quality and intelligibility, especially when parametric digital codecs are involved. Generally, aside from the inhalation noise, the air system effects causing the largest loss of speech quality appear to be due to the poor acoustics of the mask.
FIG. 3 illustrates an example of a measured spectral magnitude response inside the mask (320) and at the mask microphone output (310) and a calculated combined transfer function (330) for the mask, microphone, and microphone amplifier. These particular data were obtained using a SCBA mask mounted on a head and torso simulator. The acoustic excitation consisted of a 3 Hz-10 KHz swept sine wave driving an artificial mouth simulator. AsFIG. 3 illustrates, the spectrum is significantly attenuated at frequencies below 500 Hz and above 4.0 KHz, mostly due to a preamp band pass filter in the microphone, and contains a number of strong spectral peaks and notches in the significant speech pass band region between 50 and 4.0 KHz. These spectral peaks and notches are generally caused by reflections inside the mask that cause comb filtering, and by cavity resonance conditions. The significant spectral peaking and notching modulate the speech pitch components and formants as they move back and forth through the pass band, resulting in degraded quality and distorted speech. It may be desirable to determine a transfer function or transfer functions characterizing such a system with such transfer functions being used to define an equalization system to reduce speech distortion.
A number of proven techniques exist to adaptively determine a system transfer function and equalize a transmission channel. One effective method to determine a system transfer function is to use a broadband reference signal to excite the system and determine the system parameters. A problem in estimating the transfer function of many speech transmission environments is that a suitable broadband excitation signal is not readily available. One common approach is to use the long-term average speech spectrum as a reference. However, adaptation time using this reference can take a long time, particularly if the speech input is sparse. In addition, the long-term speech spectrum can vary considerably for and among individuals in public service activities that frequently involve shouting and emotional stress that can alter the speech spectrum considerably.
Another shortcoming associated with systems such assystem100 is the lack of more efficient methods and apparatus for measuring certain parameters of the mask wearer including, for example, biometric parameters. Measurement of such parameters of individuals working in hazardous environments, who may be using systems such assystem100, is important for monitoring the safety and performance of those individuals. For example, measurements of the individual's respiration rate and air consumption are important parameters that characterize his work-load, physiological fitness, stress level, and consumption of the stored air supply (i.e. available working time). Conventional methods of measuring respiration involve the use of chest impedance plethysmography or airflow temperature measurements using a thermistor sensor. However, getting reliable measurements, using these conventional methods, from individuals working in physically demanding environments such as firefighting is more difficult due to intense physical movement that can cause displacement of body-mounted sensors and artifacts typically used to take the measurements.
Thus, there exists a need for methods and apparatus for effectively detecting and attenuating inhalation noise, equalizing speech (i.e., removing distortion effects), and measuring parameters associated with users in a system that includes a pressurized air delivery system coupled to a communication system.
BRIEF DESCRIPTION OF THE FIGURES
A preferred embodiment of the invention is now described, by way of example only, with reference to the accompanying figures in which:
FIG. 1 illustrates a simple block diagram of a prior art system that includes a pressurized air delivery system for breathing coupled to a communication system;
FIG. 2 illustrates the mask-to-radio audio path of the system illustrated inFIG. 1;
FIG. 3 illustrates an example of a measured spectral magnitude response inside a mask and at the mask microphone output and a calculated combined transfer function for the mask, microphone, and microphone amplifier;
FIG. 4 illustrates an example of an inhalation noise generated by a SCBA air regulator;
FIG. 5 illustrates the long-term magnitude spectrum of the inhalation noise illustrated inFIG. 4;
FIG. 6 illustrates four overlapping spectra of inhalation noises generated by a single speaker wearing a given SCBA mask;
FIG. 7 illustrates audio output from a SCBA microphone showing inhalation noise bursts intermingled with speech;
FIG. 8 illustrates a simple block diagram of a method for detecting and eliminating inhalation noise in accordance with one embodiment of the present invention;
FIG. 9 illustrates a simple block diagram of one embodiment of a spectral matcher used in the method ofFIG. 8;
FIG. 10 illustrates a simple block diagram of another embodiment of a spectral matcher used in the method ofFIG. 8;
FIG. 11 illustrates a simple block diagram of a method for equalizing a speech signal in accordance with another embodiment of the present invention;
FIG. 12 illustrates an inhalation noise spectrum before equalization as compared to the spectra after 14thorder and 20thorder LPC inverse filter equalization in accordance with the present invention;
FIG. 13 illustrates a simple block diagram of a method for determining the duration of frequency of inhalation noise and determining respiration rate and air usage volume in accordance with another embodiment of the present invention for use in measuring biometric parameters;
FIG. 14 illustrates a signal from a microphone input that contains speech and air regulation inhalation noise;
FIG. 15 illustrates the average normalized model error of the signal illustrated inFIG. 14 as determined by the method illustrated inFIG. 13;
FIG. 16 illustrates the inhalation noise detector output signal as generated by the method illustrated inFIG. 13; and
FIG. 17 illustrates the integrated inhalation detector output as generated by the method illustrated inFIG. 13.
DETAILED DESCRIPTION OF THE INVENTION
While this invention is representative of embodiments in many different forms, there are shown in the figures and will herein be described in detail specific embodiments, with the understanding that the present disclosure is to be considered as an example of the principles of the invention and not intended to limit the invention to the specific embodiments shown and described. Further, the terms and words used herein are not to be considered limiting, but rather merely descriptive. It will also be appreciated that for simplicity and clarity of illustration, elements shown in the figures have not necessarily been drawn to scale. For example, the dimensions of some of the elements are exaggerated relative to each other. Further, where considered appropriate, reference numerals have been repeated among the figures to indicate corresponding elements.
Before describing in detail the various aspects of the present invention, it would be useful in the understanding of the invention to provide a more detailed description of the air regulator inhalation noise that was briefly described above. Inhalation noise is a result of high-pressure air entering a SCBA or other pressurized air delivery system mask when a person inhales and the regulator valve opens. Turbulence at the valve creates a very loud, broadband hissing noise, directly coupled into the SCBA mask, which is comparable in amplitude at the microphone with the speech signal. An example of atypical inhalation noise400 recorded inside of a SCBA mask and its wide-band spectrogram500 are shown, respectively, inFIGS. 4 and 5.
As can be seen inFIG. 5, the noise spectrum is broadband with prominent spectral peaks occurring at approximately 500, 1700, 2700, and 6000 Hz. The peaks are due to resonances within the mask and comb filtering due to internal mask reflections, and may vary in frequency and magnitude with different mask models, sizes, and configurations. The coloration of the noise spectrum is typically stationary for a particular mask/wearer combination since the gross internal geometry is essentially constant once the mask is placed on the face. This is demonstrated inFIG. 6 where the spectra of three separate inhalation noises (610,620 and630) from a SCBA mask microphone, for the same speaker wearing a given SCBA mask, are shown superimposed. This consistency has also been observed for different speakers and for masks from different manufacturers. Moreover, high spectral similarity of the air regulator noise from different speakers wearing the same mask was also observed.
Finally,FIG. 7, illustrates an example ofspeech710 recorded from a SCBA system. AsFIG. 7 demonstrates, the effects of inhalation noise720 are not on the speech itself, since people do not normally try to speak while inhaling. However, the noise is of sufficient energy and spectrum to cause problems with speech detector and noise suppression circuitry in radios and to present a listening annoyance.
In a first aspect of the present invention is a method and apparatus for detecting and eliminating inhalation noise in a pressurized air delivery system coupled to a communication system, such as asystem100 illustrated inFIG. 1. The method in accordance with this embodiment of the present invention is also referred to herein as the ARINA (Air Regulator Inhalation Noise Attenuator) method. The basis of the ARINA method for identifying and eliminating air regulator inhalation noise is the relative stationarity of the noise as compared to speech and as compared to other types of noise such as, for instance, various environmental noises. A block diagram of theARINA method800 is shown inFIG. 8 and can be divided into four sections:Noise Model Matching810,Noise Detection830,Noise Attenuation850, andNoise Model Updating870.
The basic methodology of theARINA method800 can be summarized as follows.Method800 models the inhalation noise preferably using a digital filter (e.g. an all pole linear predictive coding (LPC) digital filter).Method800 then filters the audio input signal (i.e., speech and noise picked up by the mask microphone) using an inverse of the noise model filter and compares the energy of the output of the inverse noise model filter with that of the input signal or other energy reference. During the signal periods in which a close spectral match occurs between the input signal and the model, the regulator inhalation noise comprising the input signal may be attenuated to any desired level.
Turning to the specifics of theARINA method800 as illustrated inFIG. 8, the first step in the processing is to detect the occurrence of the inhalation noise by continuously comparing aninput signal802 against a reference noise model via the NoiseModel Matching section810 ofmethod800, which may in the preferred embodiment be implemented in accordance withFIG. 9 orFIG. 10 depending on the complexity of implementation that can be tolerated. However, those of ordinary skill in the art will realize that alternative spectral matching methods may be used. The two preferred matching methods indicated above as illustrated inFIG. 9 andFIG. 10 are referred to herein as the Normalized Model Error (or NME) method and the Itakura-Saito (or I-S) distortion method. In both methods, the reference noise model is represented by a digital filter (912,1012) that approximates the spectral characteristics of the inhalation noise. In the preferred embodiment, this model is represented as an all-pole (autoregressive) filter specified by a set of LPC coefficients. However, those of ordinary skill in the art will realize that alternate filter models may be used in place of the all-pole model such as, for instance, a known ARMA (autoregressive moving average) model.
The reference noise model filter coefficients are obtained from a set of autocorrelation coefficients derived from at least one digitized sample of the inhalation noise. An initial noise sample and corresponding initial autocorrelation coefficients (872) may be obtained off-line from any number of noise pre-recordings and is not critical to the implementation of the present invention. Moreover, experiments have shown that the initial noise sample from one SCBA mask, for example, also works well for other masks of the same design and in some cases for masks of different designs. The autocorrelation coefficients can be calculated directly from raw sampled noise data, or derived from other commonly used spectral parameter representations such as LPC or reflection coefficients, using common methods well known to those skilled in the art.
In the preferred embodiment, the noise model autocorrelation coefficients are calculated according to the following standard formula:
Ri=n=1N-ixnxn+ii=0,1,2,,p,pNEQ-1
where Riis the ith coefficient of a maximum of p autocorrelation coefficients, xnis the nth sample of a typical inhalation noise signal sample segment in which there are a maximum of N samples, and R0represents the energy of the entire segment. The order of the autocorrelation function, p, is typically between 10 and 20 with the value for the preferred embodiment being 14. Moreover, ideally the N signal samples are windowed using a Hamming window before the autocorrelation is performed to smooth the spectral estimate. The Hamming window is described by:
w(n)=0.54−0.46 cos(2πn/N),n=0, 1, 2, . . . , N−1.  EQ-2
Those of ordinary skill in the art will realize that other windowing methods may also be used.
The noise model autocorrelation coefficients are next used to determine a set of 10thorder noise model LPC coefficients, a1, a2. . . , ap, representing an all-pole linear predictive model filter with a z-domain representation transfer function of:
H(z)=11+a1z-1+a2z-2++apz-p,EQ-3
where z=e−jnωTis the z-transform variable. In this example 10thorder LPC coefficients were determined. However, a different order of LPC coefficients may be selected based on the particular implementation. The autocorrelation-to-LPC parameter transformation (step912,1012) may be done using any number of parameter transformation techniques known to those skilled in the art. In the preferred embodiment, the LPC parameters are derived from the autocorrelation parameters using the Durbin method well known to those skilled in the art.
Turning now to the specifics of the NME spectral matching method illustrated inFIG. 9, the derived all-pole LPC noise filter model is inverted to form an inverse LPC filter (step914):
Ĥ(z)=1+a1z−1+a2z−2+ . . . +apz−p.   EQ-4
Ideally a low-pass filtered and sampledaudio input signal802 obtained from the mask microphone and containing speech and inhalation noise, S(z), is passed through the inverse filter Ĥ(z) (step914) to obtain an output signal,
Y(z)=S(z)Ĥ(z).  EQ-5
The energies, Ein, Eout, of the inverse filter input and output signals are then calculated (respectively atsteps918 and916) and a distortion measure D is calculated atstep920 and functions as a similarity measure between the noise model and the input signal. The theoretical lower bound on D is zero for an infinite order, but in practice, the lower bound will be determined by the input signal and how well it matches the noise model of finite order. In this implementation, the distortion measure is defined by a ratio of Eoutto Ein, referred to as the normalized model error (NME), calculated atstep920 as:
D=NME=EoutEin=Y(z)2S(z)2.EQ-6
The energy of the input signal may then be removed in accordance to how well it matches the noise model. In the preferred embodiment, the above described signal filtering is done via convolution in the time domain although it could also be done in the frequency domain as indicated in the preceding equations.
The signal processing for theARINA method800 is generally done on a segmented frame basis. In the preferred embodiment, theinput signal802 is low-pass filtered, sampled at 8.0 KHz, buffered into blocks of 80 samples (10 msec), and passed through the inverse noise model filter (EQ-5). Thus, all filtering is ideally done on consecutive, 80 sample segments of theinput signal802. The normalized model error (NME) of the inverse noise model filter is then calculated by dividing the filter output frame energy by the input signal frame energy (EQ-6). This calculation, however, is ideally done on a sub-frame basis for better time resolution. Thus, each 80-point frame is divided into sub-frames, for example 4, 20-point sub-frames, although alternative sub-frame divisions may be used depending on the degree of accuracy required. The overall normalized model error signal (NME) may then be smoothed by averaging the output filter energy Eoutof the last 16 sub-frames and dividing that quantity by the average of the corresponding time-aligned 16 sub-frame input filter energies Ein. This does not add any delay to the analysis but helps remove transient dropouts and the effects of other loud background noises that may alter the regulator noise spectrum. The average NME value is thereby used, in this implementation of the present invention, as a measure of the noise model to input signal spectral similarity.
In the preferred embodiment, the second, more complex but more accurate noisemodel matching method810 as illustrated inFIG. 10 is a modification of the Itakura-Saito distortion method. The I-S method of determining the spectral similarity between two signals is well known by those skilled in the art. In this method the residual noise model inverse filter energy is compared with the residual energy of the “optimal” signal filter instead of with the input signal energy as in the previously described NME method. The filter is “optimal” in the sense that it best matches the spectrum of the current signal segment.
The residual energy corresponding to the optimally filtered signal is calculated using steps1018-1024. In the I-S method atstep1018, ideally two consecutive 80 sample buffers of theinput signal802 are combined into a single 160 sample segment. The 160 sample segment is windowed preferably using a 160 point Hamming window given by:
w(n)=0.54−0.46 cos(2πn/160),n=0, 1, 2, . . . , 159.  EQ-7
The windowed signal data is then autocorrelated using the method described in EQ-1. These autocorrelation coefficients generated instep1018 are designated as {circumflex over (R)}i, i=0, 1, 2, . . . , p. A corresponding set of LPC coefficients is derived from the autocorrelation coefficients preferably using the Durbin algorithm instep1020 in the same manner as used for generating the reference noise model parameters instep1012. The signal model LPC coefficients generated instep1020 are designated as âii=1, 2, . . . , p. Instep1022, these LPC coefficients (step1020) are autocorrelated according to EQ-9 below yielding {circumflex over (b)}i. Using these parameters, the residual energy of the signal, Es, passing through this filter is calculated atstep1024 as:
Es=b^0R^0+2i=1pb^iR^i,EQ-8b^i=j=0p-ia^ja^j+i,0ip,a^0=1.EQ-9
The energy of the input signal passing through the noise model is calculated using steps1012-1016. Atstep1012 the noise model LPC coefficients are calculated from the noise model autocorrelation coefficients (874) as described above. These LPC coefficients generated atstep1012 are designated as aii=1, 2, . . . , p. Atstep1014, the LPC coefficients (from step1012) are autocorrelated according to EQ-11 below yielding bi. Using these parameters and the autocorrelation sequence calculated atstep1018, {circumflex over (R)}i, the energy of the signal passing through the reference noise model is calculated atstep1016 as given by EQ-10:
Em=b0R^0+2i=1pbiR^i,EQ-10bi=j=0p-iajaj+i,0ip,a0=1.EQ-11
A measure of the spectral distortion, D, of the “optimal” signal model to the reference noise model is calculated atstep1028 as defined as:
D=EmEs.EQ-12
The more similar the signal model is to the reference noise model the closer the distortion measure is to 1.0 which is the lower bound. This distortion measure is used by theNoise Detection section830 of theARINA method800 to determine the presence of inhalation noise. The I-S distortion measure is calculated using 160 samples in the preferred embodiment. The inhalation noise classification as determined by the I-S distortion measure is associated with each 80 sample frame of the 160 sample segment. Moreover, steps1012 and1014 need only be performed to generate an initial noise model (e.g., based on initial autocorrelation coefficients872) or to update the noise model in accordance with the NoiseModel Updating section870 referred to above and described in detail below.
In theNoise Detection portion830 of theARINA method800, the value derived from the spectral match810 (i.e. the NME or the I-S distortion measure which represents the similarity measure between the input signal and the noise model) is then compared (step832) to an empirically derived threshold value (e.g., Dmin1). This detection threshold is selected to detect the presence of inhalation noise while not misclassifying speech or other types of noise as inhalation noise.
Moreover, depending on the specificity of the noise filter model, the spectral variations of the inhalation noise, and the similarity of some speech sounds to the noise model, for instance, false detections can occur. Therefore, since the duration of a true air regulator inhalation noise is fairly long compared to the speech artifacts, a noise duration threshold test is ideally also applied (step834). Thus, the detection threshold must be met for a predetermined number of consecutive frames “K1” (e.g. 4 frames) before detection is validated. Relative signal energy, waveform zero-crossings, and other feature parameter information may be included in the detection scheme to improve speech/inhalation noise discrimination. Thus if both threshold criteria are met (fromsteps832 and834), the spectral match is deemed acceptably close and an inhalation noise is assumed currently present.
In theNoise Attenuation portion850 of theARINA method800, the output of theNoise Detection portion830 is used to gate an output signal multiplier (852) through which theinput signal802 is passed. If the inhalation noise was detected, the multiplier gain G is set atstep854 to some desired attenuation value “Gmin”. This attenuation gain value may be 0.0 to completely eliminate the noise or may be set to a higher value to not completely eliminate the inhalation noise but to suppress it. Total suppression may not be desired to assure a listener that the air regulator is functioning. In the preferred embodiment Gminhas a value of 0.05. Otherwise if inhalation noise is not detected, the gain G is ideally set to 1.0 such as not to attenuate the speech signal. Variations of this gating/multiplying scheme can be employed. For example variations may be employed that would enable that the attack and decay of the gating to be less abrupt, reducing the possibility of attenuating speech that may occur directly before or after an inhalation noise, thereby improving the perceived quality of the speech. Moreover as can be readily seen frommethod800, an important benefit of this invention is that the original signal is not altered except when regulator noise is detected, unlike conventional, continuous noise filtering methods.
An important component of theARINA method800 is the ability to periodically update the noise model for detection purposes. For example, over time, movement of the air mask on the face may cause changes in its effects on the acoustic transfer function. Also, an air mask worn by different people or the use of different masks will mean that the spectrum of initial reference noise model may deviate from the actual inhalation noise spectrum. By periodically updating the original reference noise model, an accurate current reference noise model can be maintained. Accordingly, the NoiseModel Updating Section870 of theARINA method800 is used to update the noise model.
The NoiseModel Updating section870 uses the output of theNoise Detection section830 to determine when the reference LPC filter model of the regulator inhalation noise should be updated. For example, the output from theNoise Detection section830 may be compared to a second empirically determined threshold value (e.g., Dmin2) atstep876 to determine whether to update the noise model. When the threshold is met, a number of consecutive sub-frames detected as inhalation noise may be counted (step878), and the signal samples in each sub-frame stored in a buffer. When the number of consecutive noise sub-frames exceeds a threshold number “K2” (e.g., 8 sub-frames, 160 samples in the preferred embodiment) a decision is made to update the noise model atstep880. If a non-noise sub-frame is detected (e.g., at any ofsteps832,834 and876), the noise frame count is reset to zero atstep884, and the noise frame count is updated atstep878. The autocorrelation coefficients for the “K2” consecutive signal sub-frames representing the currently detected inhalation noise may then be calculated atstep882 using the previously stated formulas EQ-1 and EQ-2.
These new autocorrelation coefficients are used to update the noise model autocorrelation coefficients atstep874. Ideally the autocorrelation coefficients calculated atstep882 are averaged with the previous noise model autocorrelation coefficients atstep874 using a simple weighting formula such as, for instance:
RiREF=αRiREF+(1−α)RiNEW,  EQ-13
where RiREFare the autocorrelation coefficients of the current reference noise model, RiNEWare the autocorrelation coefficients of the currently detected inhalation noise sample, and α is a weighting factor between 1.0 and 0.0 that determines how fast the initial reference model is updated. This weighting factor can be adjusted depending on how fast the spectral characteristics of the inhalation noise change, which as noted previously, is usually slow. A new set of LPC coefficients for the noise model inverse filter is then recalculated from the updated model autocorrelations atsteps912 and1012. Constraints can be placed on the adjustment to the noise model so that large deviations from the noise model cannot occur due to false detections. In addition, the initial reference noise model coefficients (872) are stored so that the system can be reset to the initial model state if necessary. The adaptation capability ofmethod800 described above by reference to the NoiseModel Updating section870 enables the system to adapt to the characteristics of a particular mask and regulator and enables optimal detection performance.
Advantages of theARINA method800 include that the speech signal itself is not irreversibly affected by the processing algorithm, as is the case in algorithms employing conventional continuous filtering. An additional advantage is that the LPC modeling used here is simple, easily adaptable in real-time, is straightforward, and computationally efficient. Those of ordinary skill in the art will realize that the above advantages were not meant to encompass all of the advantages associated with the ARINA embodiment of the present invention but only meant to serve as being representative thereof.
In a second aspect of the present invention is a method and apparatus for equalizing a speech signal in a pressurized air delivery system coupled to a communication system, such as asystem100 illustrated inFIG. 1. The method in accordance with this embodiment of the present invention is also referred to herein as the AMSE (Air Mask Speech Equalizer) method. The basis of the AMSE method for equalization is the relative stationarity of the noise as compared to speech and as compared to other types of noise such as, for instance, various environmental noises. Since the same mask resonance conditions affect both the regulator noise and a speech signal, equalizing for the noise should also yield an equalizer appropriate for equalizing the speech signal, although peaks and nulls due to sound reflections will be slightly different between the noise and the speech due to source location differences between the speech and the noise.
The AMSE method uses the broadband air regulator inhalation noise, present in all mask-type pressurized air breathing systems (e.g. an SCBA), to estimate the acoustic resonance spectral peaks and nulls (i.e. spectral magnitude acoustic transfer function) produced by the mask cavity and structures. This spectral knowledge is then used to construct a compensating digital inverse filter in real time, which is applied to equalize the spectrally distorted speech signal and produce an output signal approximating the undistorted speech that would be produced without the mask. This action improves the quality of the audio obtained from the mask microphone and can result in improved communications intelligibility.
Turning to the specifics of the AMSE method, a block diagram of the method1100 is shown inFIG. 11 and can be divided into four sections:Noise Model Matching1110,Noise Detection1130,Mask Speech Equalization1150, andNoise Model Updating1170. The Noise Model Matching, Noise Detection and Noise Model Updating sections of the AMSE method are ideally identical to the corresponding sections of the ARINA method that were described above in detail. Therefore, for the sake of brevity, a detailed description of these three sections will not be repeated here. However, following is a detailed description of the Mask Speech Equalization section1150 (within the dashed area) of the AMSE method1100.
Using theSpeech Equalization Section1150 of the AMSE method1100, the inhalation noise reference autocorrelation coefficients are used to generate an nth order LPC model of the noise atstep1152 using EQ-3 above. The LPC model generated instep1152 characterizes the transfer function of the mask, e.g., MSK(f) inFIG. 2, and for the inhalation noise also includes the noise path transfer function NP(f). Preferably a 14thorder model is suitable but any order can be used. Those of ordinary skill in the art will realize that alternate filter models may be used in place of the all-pole model such as, for instance, a known ARMA (autoregressive moving average) model. Moreover, the filtering operations may be implemented in the frequency domain as opposed to the time domain filtering operations described above with respect to the preferred embodiment of the present invention.
The LPC model coefficients are then preferably used in an inverse filter (in accordance with EQ-4) through which the speech signal is passed atstep1156. Passing the speech signal through the inverse filter effectively equalizes the input signal, thereby removing the spectral distortions (peaks and notches) caused by the mask transfer function MSK(f) inFIG. 2. Post filtering atstep1158 using a suitable fixed post-filter is ideally performed on the equalized signal to correct for any non-whiteness of the inhalation noise, or to give the speech signal a specified tonal quality to optimally match the requirements of a following specific codec or radio. This post-filtering may also be used to compensate for the noise path transfer function NP(f) inFIG. 2.
The effect of the equalizer of theAMSE method800 on air regulator noise is shown inFIG. 12 for two different order equalization filters. Specifically,FIG. 12 illustrates aspectral representation1210 of an inhalation noise burst before equalization. Further illustrated are the spectra of the inhalation noise after equalization using a 14thorder equalization filter (1220) and a 20thorder equalization filter (1230). As can be seen, the spectral peaking is flattened extremely well by the 20thorder equalization filter and reasonable well using the 14thorder equalization filter. Moreover, listening tests on mask speech equalized by these filters showed that the quality of speech was significantly improved by use of the equalization filters as compared to the un-equalized speech. In addition, little difference in perceived quality of the speech was found between the two filter orders.
Advantages of the AMSE algorithm approach include: 1) it uses a regular, spectrally stable, broadband regulator noise inherent in an air-mask system as an excitation source for determining mask acoustic resonance properties; 2) system transfer function modeling is accomplished in real-time using simple, well established, efficient techniques; 3) equalization is accomplished in real-time using the same efficient techniques; and 4) the system transfer function model is continuously adaptable to changing conditions in real time. Those of ordinary skill in the art will realize that the above advantages were not meant to encompass all of the advantages associated with the AMSE embodiment of the present invention but only meant to serve as being representative thereof.
In a third aspect of the present invention is a method and apparatus for determining the duration and frequency of inhalation noise and determining respiration rate and air usage volume in a pressurized air delivery system coupled to a communication system, such as asystem100 illustrated inFIG. 1. The method in accordance with this embodiment of the present invention is also referred to herein as the INRRA (Inhalation Noise Respirator Rate Analyzer) method. The INRRA method is essentially an indirect way of measuring respiration by monitoring the sound produced by the air regulator instead of measuring breathing sounds from a person. The basis of the INRRA method is that a pressurized air breathing system such as an SCBA has one-way airflow. Air can enter the system only from the air source and regulator, and exit only through an exhaust valve. The intake and exhaust valves cannot be open at the same time. Thus, regulator intake valve action is directly related to the user's respiration cycle.
One indicator of the opening of the regulator intake valve is the regulator inhalation noise. Inhalation noise is a result of higher-pressure air entering an SCBA or other pressurized air delivery system mask. The mask is airtight so when a person inhales it produces a slight negative pressure within the mask that causes the regulator valve to open and pressurized tank air to enter. Air turbulence across the valve creates a loud, broadband hissing noise that is directly coupled into the SCBA mask, can be picked up by a microphone, and occurs for every inhalation. As explained previously, the noise is abrupt and has a very constant amplitude over the duration of the inhalation, providing very good start and end time resolution. For a given mask type and wearer, the spectral characteristics of the inhalation noise are very stable, as opposed to direct human breath sounds which vary considerably based on factors such as the size of the mouth opening, vocal tract condition, and lung airflow. INRRA capitalizes on the stability of the air regulator inhalation noise as a measure of respiratory rate.
INRRA uses a matched filtering scheme to identify the presence of an inhalation noise by its entire spectral characteristic. In addition, INRRA is capable of adapting to changes in the spectral characteristics of the noise should they occur, thus providing optimal differentiation between the inhalation noise and other sounds. By calculating the start of each inhalation, the instantaneous respiration rate and it's time average can be easily calculated from the inhalation noise occurances. In addition, by measuring the end and calculating the duration of each inhalation noise, and providing some information about the predictable mask regulator flow rate, the system can provide an estimate of the airflow volume. This may be accomplished using only the signal from the microphone recording the inhalation noise.
A block diagram of theINRRA method1300 is shown inFIG. 13 and can be divided into five sections:Noise Model Matching1310,Noise Detection1330,Inhalation Breath Definer1350,Parameter Estimator1370 and Noise Model Updating1390. The Noise Model Matching, Noise Detection and Noise Model Updating sections of the INRRA method are ideally identical to the corresponding sections of the ARINA method that were described above in detail. Therefore, for the sake of brevity, a detailed description of these three sections will not be repeated here. However, following is a detailed description of theInhalation Breath Definer1350 andParameter Estimator1370 sections of theINRRA method1300.
First, theInhalation Breath Definer1350 will be described. The purpose ofsection1350 of theINRRA method1300 is to characterize the inhalation noise based on at least one factor, for example, in this case based on a set of endpoints and a duration for one or more complete inhalation noise bursts which correspond with inhalation breaths. The decision from the InhalationNoise Detection section1330 is used to generate a preferably binary signal, INMm, m=0, 1, 2, . . . , M−1, instep1352 that represents the presence or absence of inhalation noise as a function of time index m using values of ones and zeros. This binary signal is stored in a rotating buffer of length M samples, M being large enough to store enough samples of the binary signal to encompass the time period of at least two inhalation noise bursts, or breaths at the slowest expected breathing rate. In the preferred embodiment, this amounts to about 15 seconds. The time resolution of this binary signal and the value of M will be determined by the smallest sub-frame time used in the InhalationNoise Detection section1330, described previously, which depends on the Inhalation Noise Model Matching section, and is either 20 samples (2.5 msec) or 80 samples (10 msec), depending on which spectral matching method is used instep1310.
Since the inhalation noise detector output from1330 will not always be perfect, detection mistakes may occur during the detection of an inhalation noise causing some ambiguity as to the true start, and duration times of the noise. Thus, the binary inhalation noise signal generated bystep1352 is integrated using a well known moving-average type or other suitable filter atstep1354. This filter smoothes out any short duration detection mistakes and produces a more accurate signal that defines complete inhalation noise bursts, which correspond with respiratory breaths. From this signal generated atstep1354, at least one factor including accurate start time, Si, end time, Ei, and breath duration time, Di, for each noise burst may be determined within processing frame duration accuracy atstep1356. The start and end times of the inhalation noise bursts as represented by the binary signal INMm, are obtained by noting their relative indices within the signal buffer. The duration Diis defined for a single inhalation noise burst as:
Di=Ei−Si, i=0, 1, 2, . . . , IT,  EQ-14
where i designates the ith of ITinhalation noise bursts present in the binary signal buffer of length M and time period T seconds. These inhalation noise burst factor values are ideally stored in a rotating, finite length buffer, one set of parameters per noise burst/breath. Some results of SCBA mask microphone speech processed by theINRRA algorithm sections1310,1330,1352, and1354 are shown inFIGS. 14-17, which are based on speech from a male speaker wearing an SCBA and recorded in a quiet room.FIG. 14 shows theinput speech1420 intermingled with noise bursts1410.FIG. 15 shows a time-amplitude representation1500 of the spectral distortion measure D output of Inhalation NoiseModel Matching section1310.FIG. 16 shows a time-amplitude representation1600 of the binary output of the inhalation noise detector,1330.FIG. 17 shows a time-amplitude representation1700 of the output of the moving average filter component,1354, of thebreath definer algorithm1350 that integrates the raw detector output and accurately defines the duration of each inhalation.
TheParameter Estimator1370 section describes examples of parameters that may be estimated based on the characterization factors of the inhalation noise by the InhalationBreath Definer section1350. Two such examples of parameters that may be determined are the respiration rate of the user and the approximate inhalation air flow volume. Respiration rate may be easily determined using the sequential start time information, Si, of successive inhalation noise bursts that may be determined in the Inhalation Breath Definer Section. For example, the “instantaneous” respiration rate per minute may be calculated as:
IRR=60(Si-Si-1),EQ-15
where the Siare two successive noise bursts (inhalation breaths) start times in seconds. An average respiration rate may accordingly be calculated as:
RR=60ITi=1IT(Si-Si-1),EQ-16
where ITis the number of detected consecutive breaths (inhalation noise bursts) in a specified time period T.
The approximate airflow volume during an inhalation breath may be estimated from the duration of the breath that may be determined by the Inhalation Breath Definer section, and from some additional information concerning the initial air tank fill pressure and the regulator average flow rate that may be determined off-line, for instance. When the intake valve is open, the air regulator admits a volume of air at nearly constant pressure to the facemask (a function of the ambient air/water pressure) as long as the air supply tank pressure remains above the minimal input pressure level for the air regulator. Moreover, the airflow rate into the mask is approximately constant while the mask regulator intake valve is open. The amount of air removed from the tank supply and delivered to the breather is thus proportional to the time that the intake valve is open. The time that the valve is open can be measured by the duration of each inhalation noise.
The initial quantity of air in the supply tank when filled is a function of the tank volume V0, the fill pressure P0, the gas temperature T0, and the universal gas constant R, the mass of the gas in moles Nm, and can be calculated from the well-known ideal gas equation, PV=NmRT. Since the initial fill pressure and tank cylinder volume may be known, and assuming the temperature of the tank gas and mask gas are the same, the volume of air available for breathing at the mask pressure may be given as:
VM=P0V0PM.EQ-17
The approximate volume of air delivered to the user during inhalation event i is then:
IVi≈KRDi,  EQ-18
where IViis the air volume, Diis the duration of the inhalation event as determined from the inhalation noise, and KRis a calibration factor related to the airflow rate for a particular air regulator. KRcould be derived empirically for an individual system or perhaps determined from manufacturer's data. From the individual inhalation volumes, IVi, the approximate total amount of air used up to a time T, VT, may be defined as:
VTi=1ITIVi,EQ-19
where ITis the total number of inhalations up to a time T. The remaining tank supply air is accordingly:
VR≈VM−VT.  EQ-20
Some advantages of the INRRA method include that any microphone signal that picks up the breath noise over a minimal speech bandwidth can be used, and no special sensors are needed. Another advantage is that the respiration detector is based on detecting the noise produced by the air regulator which has stable spectral characteristics, and not human breath noises which are variable in character. Yet another advantage is that the respiration detector is not locked to examining specific frequencies as are other types of acoustic breath analyzers. Moreover, the system adapts automatically to changes in environment and to different users and pressurized air respirator mask systems. Thus, the INRRA method can provide continuously, instantaneous or average respiration rate and approximate air use volume data, which is valuable information that can be automatically sent outside ofsystem100, for example, via a radio data channel to a monitor. Those of ordinary skill in the art will realize that the above advantages were not meant to encompass all of the advantages associated with the INRRA embodiment of the present invention but only meant to serve as being representative thereof.
All three methods in accordance with the present invention (ARINA, AMSE and INRRA) are preferably implemented as software algorithms stored on a memory device (that would be included in a system in accordance withsystem100 described above) and the steps of which implemented in a suitable processing device such as, forinstance DSP138 ofsystem100. The algorithms corresponding to the autocorrelation and LPC filtering methods of the present invention would likely take up the majority of the processor time. However, these algorithms or the entirety of the algorithms corresponding to the ARINA, AMSE and INRRA methods may, alternatively, be efficiently implemented in a small hardware footprint. Moreover, since the AMSE method uses many of the methodologies as the ARINA method, in another embodiment of the present invention, they may be efficiently combined.
While the invention has been described in conjunction with specific embodiments thereof, additional advantages and modifications will readily occur to those skilled in the art. The invention, in its broader aspects, is therefore not limited to the specific details, representative apparatus, and illustrative examples shown and described. Various alterations, modifications and variations will be apparent to those skilled in the art in light of the foregoing description. For example, although a method for identifying and attenuating inhalation noise was described above, the methodologies presented with respect to the present invention may be applied to other types of noise, such as exhalation noise or other types of noises with pseudo-stationary spectral characteristics lending themselves to efficient detection using the above methods. Thus, it should be understood that the invention is not limited by the foregoing description, but embraces all such alterations, modifications and variations in accordance with the spirit and scope of the appended claims.

Claims (15)

US10/882,4502004-06-302004-06-30Method and apparatus for equalizing a speech signal generated within a pressurized air delivery systemExpired - LifetimeUS7254535B2 (en)

Priority Applications (9)

Application NumberPriority DateFiling DateTitle
US10/882,450US7254535B2 (en)2004-06-302004-06-30Method and apparatus for equalizing a speech signal generated within a pressurized air delivery system
CNA2005800219855ACN101010731A (en)2004-06-302005-06-06Method and apparatus for equalizing a speech signal generated within a self-contained breathing apparatus system
EP05756478AEP1769493B1 (en)2004-06-302005-06-06Method and apparatus for equalizing a speech signal generated within a self-contained breathing apparatus system
MXPA06015235AMXPA06015235A (en)2004-06-302005-06-06Method and apparatus for equalizing a speech signal generated within a self-contained breathing apparatus system.
CA2572715ACA2572715C (en)2004-06-302005-06-06Method and apparatus for equalizing a speech signal generated within a self-contained breathing apparatus system
PCT/US2005/019827WO2006007290A2 (en)2004-06-302005-06-06Method and apparatus for equalizing a speech signal generated within a self-contained breathing apparatus system
JP2007519235AJP4468447B2 (en)2004-06-302005-06-06 Method and apparatus for equalizing audio signals generated within a self-contained breathing apparatus system
AU2005262623AAU2005262623B2 (en)2004-06-302005-06-06Method and apparatus for equalizing a speech signal generated within a self-contained breathing apparatus system
DE602005008974TDE602005008974D1 (en)2004-06-302005-06-06 METHOD AND ARRANGEMENT FOR DECORATING A LANGUAGE SIGNAL GENERATED WITHIN A RESIN-INDEPENDENT RESPIRATORY EQUIPMENT

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
US10/882,450US7254535B2 (en)2004-06-302004-06-30Method and apparatus for equalizing a speech signal generated within a pressurized air delivery system

Publications (2)

Publication NumberPublication Date
US20060020451A1 US20060020451A1 (en)2006-01-26
US7254535B2true US7254535B2 (en)2007-08-07

Family

ID=35658374

Family Applications (1)

Application NumberTitlePriority DateFiling Date
US10/882,450Expired - LifetimeUS7254535B2 (en)2004-06-302004-06-30Method and apparatus for equalizing a speech signal generated within a pressurized air delivery system

Country Status (9)

CountryLink
US (1)US7254535B2 (en)
EP (1)EP1769493B1 (en)
JP (1)JP4468447B2 (en)
CN (1)CN101010731A (en)
AU (1)AU2005262623B2 (en)
CA (1)CA2572715C (en)
DE (1)DE602005008974D1 (en)
MX (1)MXPA06015235A (en)
WO (1)WO2006007290A2 (en)

Cited By (17)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US20060025989A1 (en)*2004-07-282006-02-02Nima MesgaraniDiscrimination of components of audio signals based on multiscale spectro-temporal modulations
US20090192799A1 (en)*2008-01-292009-07-30Digital Voice Systems, Inc.Breathing Apparatus Speech Enhancement
US20090306972A1 (en)*2006-12-072009-12-10Martin OpitzDropout Concealment for a Multi-Channel Arrangement
US20100322442A1 (en)*2009-06-222010-12-23Motorola, Inc.Pressure activated remote microphone
US20120010881A1 (en)*2010-07-122012-01-12Carlos AvendanoMonaural Noise Suppression Based on Computational Auditory Scene Analysis
US8965756B2 (en)2011-03-142015-02-24Adobe Systems IncorporatedAutomatic equalization of coloration in speech recordings
US9185487B2 (en)2006-01-302015-11-10Audience, Inc.System and method for providing noise suppression utilizing null processing noise subtraction
US9343056B1 (en)2010-04-272016-05-17Knowles Electronics, LlcWind noise detection and suppression
US9438992B2 (en)2010-04-292016-09-06Knowles Electronics, LlcMulti-microphone robust noise suppression
US9502048B2 (en)2010-04-192016-11-22Knowles Electronics, LlcAdaptively reducing noise to limit speech distortion
US9558755B1 (en)2010-05-202017-01-31Knowles Electronics, LlcNoise suppression assisted automatic speech recognition
US9640194B1 (en)2012-10-042017-05-02Knowles Electronics, LlcNoise suppression for speech processing based on machine-learning mask estimation
US9799330B2 (en)2014-08-282017-10-24Knowles Electronics, LlcMulti-sourced noise suppression
US10166416B2 (en)2013-02-012019-01-013M Innovative Properties CompanyRespirator mask speech enhancement apparatus and method
US20220223145A1 (en)*2021-01-112022-07-14Ford Global Technologies, LlcSpeech filtering for masks
US12254895B2 (en)2021-07-022025-03-18Digital Voice Systems, Inc.Detecting and compensating for the presence of a speaker mask in a speech signal
US12415099B2 (en)2020-07-102025-09-163M Innovative Properties CompanyBreathing apparatus and method of communicating using breathing apparatus

Families Citing this family (18)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US7748380B1 (en)2004-04-062010-07-06Sti Licensing CorporationCombined air-supplying/air-purifying system
US7658190B1 (en)2004-04-062010-02-09Sti Licensing Corp.Portable air-purifying system utilizing enclosed filters
JP4866958B2 (en)*2006-05-042012-02-01株式会社ソニー・コンピュータエンタテインメント Noise reduction in electronic devices with farfield microphones on the console
EP4272809A3 (en)2009-07-172024-01-17Implantica Patent Ltd.Voice control of a medical implant
JP5294085B2 (en)*2009-11-062013-09-18日本電気株式会社 Information processing apparatus, accessory apparatus thereof, information processing system, control method thereof, and control program
US9183844B2 (en)*2012-05-222015-11-10Harris CorporationNear-field noise cancellation
JP6207615B2 (en)*2012-09-242017-10-11ジョン ハミルトン Communication and speech improvement system
US9943712B2 (en)2012-09-242018-04-17Dolores Speech Products LlcCommunication and speech enhancement system
US9498658B2 (en)*2013-02-012016-11-223M Innovative Properties CompanyRespirator mask speech enhancement apparatus and method
FR3005823B1 (en)*2013-05-142016-10-14Elno MICROPHONE COMPRISING A MUTE SWITCH, AND BREATHING MASK COMPRISING SUCH A MICROPHONE
GB2519117A (en)*2013-10-102015-04-15Nokia CorpSpeech processing
US10163453B2 (en)*2014-10-242018-12-25Staton Techiya, LlcRobust voice activity detector system for use with an earphone
US9843859B2 (en)2015-05-282017-12-12Motorola Solutions, Inc.Method for preprocessing speech for digital audio quality improvement
US11631421B2 (en)*2015-10-182023-04-18Solos Technology LimitedApparatuses and methods for enhanced speech recognition in variable environments
CN105405447B (en)*2015-10-272019-05-24航宇救生装备有限公司One kind sending words respiratory noise screen method
CN106935247A (en)*2017-03-082017-07-07珠海中安科技有限公司It is a kind of for positive-pressure air respirator and the speech recognition controlled device and method of narrow and small confined space
IT201700090078A1 (en)*2017-08-032019-02-03Mestel Safety S R L MASK FOR UNDERWATER USE, IN PARTICULAR OF GRANFACIAL TYPE EQUIPPED WITH COMMUNICATION DEVICE.
FR3142639B1 (en)*2022-11-282024-11-2952 Hertz METHOD FOR CHARACTERIZING A FILTER FOR PROCESSING AN INDIVIDUAL'S VOICE, COMMUNICATION DEVICE

Citations (38)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US3347229A (en)1965-02-231967-10-17Sierra Eng CoLatch operated microphone switch for breathing mask
US3415245A (en)1965-03-081968-12-10Y2 AssociatesNoise-suppression diving apparatus
US3850168A (en)1971-09-211974-11-26Puritan Bennett CorpOxygen mask apparatus
US4154981A (en)1977-12-161979-05-15The United States Of America As Represented By The Secretary Of The NavyTelephone system for diver communication
US4376916A (en)*1980-05-291983-03-15Cbs Inc.Signal compression and expansion system
US4720802A (en)1983-07-261988-01-19Lear SieglerNoise compensation arrangement
US4799263A (en)1986-03-191989-01-17Dragerwerk AgSpeaking and hearing system for breathing apparatus
US4862503A (en)1988-01-191989-08-29Syracuse UniversityVoice parameter extractor using oral airflow
US4958638A (en)1988-06-301990-09-25Georgia Tech Research CorporationNon-contact vital signs monitor
US5143078A (en)1987-08-041992-09-01Colin Electronics Co., Ltd.Respiration rate monitor
US5222190A (en)1991-06-111993-06-22Texas Instruments IncorporatedApparatus and method for identifying a speech pattern
US5307793A (en)1992-06-291994-05-03Puritan-Bennett CorporationMicrophone signal attenuating apparatus for oxygen masks
US5444786A (en)1993-02-091995-08-22Snap Laboratories L.L.C.Snoring suppression system
US5579284A (en)1995-07-211996-11-26May; David F.Scuba diving voice and communication system using bone conducted sound
WO1997003723A1 (en)1995-07-181997-02-06Nellcor Puritan Bennett IncorporatedMicrophone attenuation device for use in oxygen breathing masks
US5606145A (en)1994-06-091997-02-25Lg Electronics Inc.Code changing method for electronic music instrument with automatic accompaniment function and slur processing
US5727074A (en)1996-03-251998-03-10Harold A. HildebrandMethod and apparatus for digital filtering of audio signals
US5730140A (en)1995-04-281998-03-24Fitch; William Tecumseh S.Sonification system using synthesized realistic body sounds modified by other medically-important variables for physiological monitoring
US5734090A (en)1996-03-271998-03-31Alcohol Sensors International, Ltd.Method and apparatus for sonic breath determination
US5768398A (en)1995-04-031998-06-16U.S. Philips CorporationSignal amplification system with automatic equalizer
US5822366A (en)1995-04-211998-10-13Nokia Mobile Phones Ltd.Transceiver and method for generating and processing complex I/Q-signals
US5890111A (en)1996-12-241999-03-30Technology Research Association Of Medical Welfare ApparatusEnhancement of esophageal speech by injection noise rejection
US5905971A (en)1996-05-031999-05-18British Telecommunications Public Limited CompanyAutomatic speech recognition
US5912965A (en)1995-11-221999-06-15U.S. Philips CorporationTelephone set which can be adjusted in response to ambient noise
US6295364B1 (en)*1998-03-302001-09-25Digisonix, LlcSimplified communication system
US6304844B1 (en)2000-03-302001-10-16Verbaltek, Inc.Spelling speech recognition apparatus and method for communications
US6324499B1 (en)1999-03-082001-11-27International Business Machines Corp.Noise recognizer for speech recognition systems
US20020018573A1 (en)1998-05-042002-02-14Schwartz Stephen R.Microphone-tailored equalizing system
US20020086653A1 (en)2000-12-282002-07-04Lg Electronics Inc.Mobile communication terminal with equalizer function
US20020116186A1 (en)2000-09-092002-08-22Adam StraussVoice activity detector for integrated telecommunications processing
US6470315B1 (en)1996-09-112002-10-22Texas Instruments IncorporatedEnrollment and modeling method and apparatus for robust speaker dependent speech models
US20020168077A1 (en)2001-05-142002-11-14Vertex Standard Co., Ltd.Microphone characteristic adjustment device
US20020198704A1 (en)2001-06-072002-12-26Canon Kabushiki KaishaSpeech processing system
US20030062046A1 (en)1998-08-142003-04-03Wiesmann William PaulIntegrated physiologic sensor system
US6575163B1 (en)1996-09-232003-06-10Resmed Ltd.Method for calculating the instantaneous inspired volume of a subject during ventilatory assistance
US20030163054A1 (en)2002-02-222003-08-28Dekker Andreas Lubbertus Aloysius JohannesMonitoring respiration based on plethysmographic heart rate signal
US6868378B1 (en)1998-11-202005-03-15Thomson-Csf SextantProcess for voice recognition in a noisy acoustic signal and system implementing this process
US6892175B1 (en)*2000-11-022005-05-10International Business Machines CorporationSpread spectrum signaling for speech watermarking

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US5822360A (en)*1995-09-061998-10-13Solana Technology Development CorporationMethod and apparatus for transporting auxiliary data in audio signals

Patent Citations (39)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US3347229A (en)1965-02-231967-10-17Sierra Eng CoLatch operated microphone switch for breathing mask
US3415245A (en)1965-03-081968-12-10Y2 AssociatesNoise-suppression diving apparatus
US3850168A (en)1971-09-211974-11-26Puritan Bennett CorpOxygen mask apparatus
US4154981A (en)1977-12-161979-05-15The United States Of America As Represented By The Secretary Of The NavyTelephone system for diver communication
US4376916A (en)*1980-05-291983-03-15Cbs Inc.Signal compression and expansion system
US4720802A (en)1983-07-261988-01-19Lear SieglerNoise compensation arrangement
US4799263A (en)1986-03-191989-01-17Dragerwerk AgSpeaking and hearing system for breathing apparatus
US5143078A (en)1987-08-041992-09-01Colin Electronics Co., Ltd.Respiration rate monitor
US4862503A (en)1988-01-191989-08-29Syracuse UniversityVoice parameter extractor using oral airflow
US4958638A (en)1988-06-301990-09-25Georgia Tech Research CorporationNon-contact vital signs monitor
US5222190A (en)1991-06-111993-06-22Texas Instruments IncorporatedApparatus and method for identifying a speech pattern
US5307793A (en)1992-06-291994-05-03Puritan-Bennett CorporationMicrophone signal attenuating apparatus for oxygen masks
US5444786A (en)1993-02-091995-08-22Snap Laboratories L.L.C.Snoring suppression system
US5606145A (en)1994-06-091997-02-25Lg Electronics Inc.Code changing method for electronic music instrument with automatic accompaniment function and slur processing
US5768398A (en)1995-04-031998-06-16U.S. Philips CorporationSignal amplification system with automatic equalizer
US5822366A (en)1995-04-211998-10-13Nokia Mobile Phones Ltd.Transceiver and method for generating and processing complex I/Q-signals
US5730140A (en)1995-04-281998-03-24Fitch; William Tecumseh S.Sonification system using synthesized realistic body sounds modified by other medically-important variables for physiological monitoring
WO1997003723A1 (en)1995-07-181997-02-06Nellcor Puritan Bennett IncorporatedMicrophone attenuation device for use in oxygen breathing masks
US5829431A (en)1995-07-181998-11-03Puritan-Bennett CorporationMicrophone attenuation device for use in oxygen breathing masks
US5579284A (en)1995-07-211996-11-26May; David F.Scuba diving voice and communication system using bone conducted sound
US5912965A (en)1995-11-221999-06-15U.S. Philips CorporationTelephone set which can be adjusted in response to ambient noise
US5727074A (en)1996-03-251998-03-10Harold A. HildebrandMethod and apparatus for digital filtering of audio signals
US5734090A (en)1996-03-271998-03-31Alcohol Sensors International, Ltd.Method and apparatus for sonic breath determination
US5905971A (en)1996-05-031999-05-18British Telecommunications Public Limited CompanyAutomatic speech recognition
US6470315B1 (en)1996-09-112002-10-22Texas Instruments IncorporatedEnrollment and modeling method and apparatus for robust speaker dependent speech models
US6575163B1 (en)1996-09-232003-06-10Resmed Ltd.Method for calculating the instantaneous inspired volume of a subject during ventilatory assistance
US5890111A (en)1996-12-241999-03-30Technology Research Association Of Medical Welfare ApparatusEnhancement of esophageal speech by injection noise rejection
US6295364B1 (en)*1998-03-302001-09-25Digisonix, LlcSimplified communication system
US20020018573A1 (en)1998-05-042002-02-14Schwartz Stephen R.Microphone-tailored equalizing system
US20030062046A1 (en)1998-08-142003-04-03Wiesmann William PaulIntegrated physiologic sensor system
US6868378B1 (en)1998-11-202005-03-15Thomson-Csf SextantProcess for voice recognition in a noisy acoustic signal and system implementing this process
US6324499B1 (en)1999-03-082001-11-27International Business Machines Corp.Noise recognizer for speech recognition systems
US6304844B1 (en)2000-03-302001-10-16Verbaltek, Inc.Spelling speech recognition apparatus and method for communications
US20020116186A1 (en)2000-09-092002-08-22Adam StraussVoice activity detector for integrated telecommunications processing
US6892175B1 (en)*2000-11-022005-05-10International Business Machines CorporationSpread spectrum signaling for speech watermarking
US20020086653A1 (en)2000-12-282002-07-04Lg Electronics Inc.Mobile communication terminal with equalizer function
US20020168077A1 (en)2001-05-142002-11-14Vertex Standard Co., Ltd.Microphone characteristic adjustment device
US20020198704A1 (en)2001-06-072002-12-26Canon Kabushiki KaishaSpeech processing system
US20030163054A1 (en)2002-02-222003-08-28Dekker Andreas Lubbertus Aloysius JohannesMonitoring respiration based on plethysmographic heart rate signal

Non-Patent Citations (7)

* Cited by examiner, † Cited by third party
Title
A. Bruce Carlson, "Communications Systems: An Introduction to Signals and Noise in Electrical Communication", McGraw Hill, NY 1968, pp. 60-61.
Lee, K. Reddy, R., "Automatic Speech Recognition", Norwell, MA, Kluwer Academic Publishers, pp. 17-26, Copyright 1989.
Oppenheim, Alan V., Schafer, Ronald W., "Digital Signal Processing", Prentice Hall, Inc., Englewood Cliffs, New Jersey, pp. 345-353.
Quatieri, T., "Discrete-Time Speech Signal Processing", Upper Saddle River, NJ: Prentice Hall PTR, 2002.
Rabiner, Lawrence R., Schafer, Ronald W., "Digital Processing of Speech Signals", Prentice Hall, Inc., Englewood Cliffs, New Jersey, pp. 431-436.
Roe, D., Wilpon, J., "Voice Communication Between Humans and Machines", Washington, DC, National Academy Press, pp. 165-175, 1994.
Steams, S., "Digital Signal Analysis", Rochelle Park, New Jersey, Hayden Book Company, Inc. pp. 233-237, Copyright 1975.

Cited By (25)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US7505902B2 (en)*2004-07-282009-03-17University Of MarylandDiscrimination of components of audio signals based on multiscale spectro-temporal modulations
US20060025989A1 (en)*2004-07-282006-02-02Nima MesgaraniDiscrimination of components of audio signals based on multiscale spectro-temporal modulations
US9185487B2 (en)2006-01-302015-11-10Audience, Inc.System and method for providing noise suppression utilizing null processing noise subtraction
US8260608B2 (en)*2006-12-072012-09-04Akg Acoustics GmbhDropout concealment for a multi-channel arrangement
US20090306972A1 (en)*2006-12-072009-12-10Martin OpitzDropout Concealment for a Multi-Channel Arrangement
US20090192799A1 (en)*2008-01-292009-07-30Digital Voice Systems, Inc.Breathing Apparatus Speech Enhancement
US8265937B2 (en)*2008-01-292012-09-11Digital Voice Systems, Inc.Breathing apparatus speech enhancement using reference sensor
US8245706B2 (en)2009-06-222012-08-21Motorola Solutions, Inc.Pressure activated remote microphone
US20100322442A1 (en)*2009-06-222010-12-23Motorola, Inc.Pressure activated remote microphone
US9502048B2 (en)2010-04-192016-11-22Knowles Electronics, LlcAdaptively reducing noise to limit speech distortion
US9343056B1 (en)2010-04-272016-05-17Knowles Electronics, LlcWind noise detection and suppression
US9438992B2 (en)2010-04-292016-09-06Knowles Electronics, LlcMulti-microphone robust noise suppression
US9558755B1 (en)2010-05-202017-01-31Knowles Electronics, LlcNoise suppression assisted automatic speech recognition
US20120010881A1 (en)*2010-07-122012-01-12Carlos AvendanoMonaural Noise Suppression Based on Computational Auditory Scene Analysis
US8447596B2 (en)*2010-07-122013-05-21Audience, Inc.Monaural noise suppression based on computational auditory scene analysis
US9431023B2 (en)*2010-07-122016-08-30Knowles Electronics, LlcMonaural noise suppression based on computational auditory scene analysis
US20130231925A1 (en)*2010-07-122013-09-05Carlos AvendanoMonaural Noise Suppression Based on Computational Auditory Scene Analysis
US8965756B2 (en)2011-03-142015-02-24Adobe Systems IncorporatedAutomatic equalization of coloration in speech recordings
US9640194B1 (en)2012-10-042017-05-02Knowles Electronics, LlcNoise suppression for speech processing based on machine-learning mask estimation
US10166416B2 (en)2013-02-012019-01-013M Innovative Properties CompanyRespirator mask speech enhancement apparatus and method
US9799330B2 (en)2014-08-282017-10-24Knowles Electronics, LlcMulti-sourced noise suppression
US12415099B2 (en)2020-07-102025-09-163M Innovative Properties CompanyBreathing apparatus and method of communicating using breathing apparatus
US20220223145A1 (en)*2021-01-112022-07-14Ford Global Technologies, LlcSpeech filtering for masks
US11404061B1 (en)*2021-01-112022-08-02Ford Global Technologies, LlcSpeech filtering for masks
US12254895B2 (en)2021-07-022025-03-18Digital Voice Systems, Inc.Detecting and compensating for the presence of a speaker mask in a speech signal

Also Published As

Publication numberPublication date
CA2572715C (en)2012-04-03
CN101010731A (en)2007-08-01
DE602005008974D1 (en)2008-09-25
WO2006007290B1 (en)2006-08-24
US20060020451A1 (en)2006-01-26
EP1769493A2 (en)2007-04-04
AU2005262623A1 (en)2006-01-19
AU2005262623B2 (en)2008-07-03
EP1769493A4 (en)2007-08-22
WO2006007290A2 (en)2006-01-19
JP4468447B2 (en)2010-05-26
CA2572715A1 (en)2006-01-19
MXPA06015235A (en)2007-03-26
EP1769493B1 (en)2008-08-13
WO2006007290A3 (en)2006-06-01
JP2008505538A (en)2008-02-21

Similar Documents

PublicationPublication DateTitle
US7254535B2 (en)Method and apparatus for equalizing a speech signal generated within a pressurized air delivery system
US7139701B2 (en)Method for detecting and attenuating inhalation noise in a communication system
US7155388B2 (en)Method and apparatus for characterizing inhalation noise and calculating parameters based on the characterization
AU2018266253B2 (en)System and method for determining cardiac rhythm and/or respiratory rate
US20120084084A1 (en)Noise cancellation device for communications in high noise environments
US11295759B1 (en)Method and apparatus for measuring distortion and muffling of speech by a face mask
Iyer et al.Autoregressive modeling of lung sounds: characterization of source and transmission
Kushner et al.The distorting effects of SCBA equipment on speech and algorithms for mitigation
CN1263423C (en)Method and device for determining respiratory system condition by using respiratory system produced sound
Kushner et al.The acoustic properties of SCBA equipment and its effects on speech communication
Shilling et al.Underwater communications
JPS647784B2 (en)
SingerThe effects of microphones and facemasks on LPC vocoder performance

Legal Events

DateCodeTitleDescription
ASAssignment

Owner name:MOTOROLA, INC., ILLINOIS

Free format text:ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNORS:KUSHNER, WILLIAM M.;HARTON, SARA M.;JASIUK, MARK A.;REEL/FRAME:016232/0151

Effective date:20040630

STCFInformation on status: patent grant

Free format text:PATENTED CASE

FPAYFee payment

Year of fee payment:4

ASAssignment

Owner name:MOTOROLA SOLUTIONS, INC., ILLINOIS

Free format text:CHANGE OF NAME;ASSIGNOR:MOTOROLA, INC;REEL/FRAME:026081/0001

Effective date:20110104

FPAYFee payment

Year of fee payment:8

MAFPMaintenance fee payment

Free format text:PAYMENT OF MAINTENANCE FEE, 12TH YEAR, LARGE ENTITY (ORIGINAL EVENT CODE: M1553); ENTITY STATUS OF PATENT OWNER: LARGE ENTITY

Year of fee payment:12


[8]ページ先頭

©2009-2025 Movatter.jp