Background
In the process of simultaneously carrying out local sound amplification and remote interaction by adopting a hoisting microphone, the local sound amplification can generate howling, the remote interaction can generate Echo, and the howling and the Echo are the two most difficult problems in audio processing, so that the Feedback inhibition (AFC) and the Echo Cancellation (AEC) are hot points of audio research, the hoisting microphone belongs to remote sound pickup, the sensitivity of the microphone is very high, the Echo generated by the local sound amplification and the remote interaction is more serious, and no audio system equipment can really realize the local sound amplification and the remote interaction at the same time.
The source of the generation of howling and echo is that sound emitted by a sound box is picked up by a microphone, a hoisting microphone is adopted to pick up sound in a long distance, the microphone has high sensitivity and more serious howling and echo, and to realize the simultaneous local sound amplification and remote interaction of the hoisting microphone, the most effective method is to remove the sound fed back to the microphone by the sound box through an algorithm, only near-end audio is reserved, the technology of removing the sound fed back to the microphone by the sound box is changed into an acoustic feedback technology, but the removal of the sound fed back to the microphone by the sound box faces the following technical problems:
(1) Correlation in feedback suppression algorithms
In the local amplification process, because the voice coming out of the loudspeaker box and the local speaker speaking are the voice of the same person, the two voices have strong correlation, and the correlation can cause the convergence mismatch of the adaptive filter.
(2) Nonlinear effects of microphones, sound boxes and sound transmission channels in practical applications
In the process of local sound amplification and remote interaction, due to the fact that nonlinearity exists in a sound field, the self-adaptive filter is difficult to really match with an actual sound field environment.
Disclosure of Invention
The invention aims to overcome the defects of the prior art, provides a system and a method for carrying out local sound amplification and remote interaction by adopting a hoisting microphone, and effectively realizes the acoustic feedback elimination based on a pre-filtering method, variable step length self-adaptive signal processing and nonlinear processing in real time.
The method is realized by the following technical scheme: on the one hand, a hoisting microphone is adoptedMethod for local sound amplification and remote interaction, sound signal collected by microphone
Mix the near-end audio>
And the audio frequency which is fed back to the microphone by the sound box>
The method is characterized by comprising the following steps: for the sound signal>
Performing analysis filtering processing to divide the signal into several sub-band signals->
;
Pre-filtering: first pass whitening filter
Carrying out smooth half-wave rectification treatment;
step-length-variable adaptive signal processing: for each sub-band to whiten the filter
The processed signals are respectively processed by variable step length adaptive filtering to obtain->
;
And (3) nonlinear processing: by non-linear processing filters
For the signal after the adaptive signal processing with variable step length
Performing nonlinear filtering to obtain each sub-band signal->
;/>
For each sub-band signal to be actually output
Comprehensively filtering to obtain the near-end audio frequency after feedback elimination>
。
Further, the whitening filter is
In the formula (I), the compound is shown in the specification,
in order to whiten the filtered sub-band signal,
smooth() Is a smooth rectification function.
Further, the sub-band signal processing specifically includes the following steps,
s31: sound signal collected by microphone
Performing processing sub-band analysis filtering processing to obtain sub-band input of microphone as ^ or ^>
The sound box with sub-band feeds back sound signal of->
And the near end audio of a sub-band +>
,
In, is greater than or equal to>
Is a subband signal;
S32:
and &>
Passes through a whitening filter->
After processing, get>
,
;
S33: to pair
And &>
Send as input to adaptive filter to background filter +>
;
S34: background filter
Assign a value to the foreground filter>
Get->
,
Is sent to a non-linear filter->
Get->
;
S35: by passing
Judging whether the near-end audio is mute, if so, executing S36, and if not, executing S31;
s36: the sub-bands are subjected to comprehensive filtering to obtain the near-end audio frequency after feedback elimination
。
Further, in said S31, for
Pre-emphasis processing, framing processing, low-pass filtering processing and analysis filtering processing are sequentially carried out.
Further, the S31 includes 256 subbands.
On the other hand, in some embodiments, the system further comprises a system for local sound amplification and remote interaction by using a ceiling microphone, wherein the sound signal collected by the microphone comprises a plurality of sub-band signals
Wherein each sub-band signal->
In which a near-end audio is mixed>
And the signal fed back to the microphone by the sound box>
Sub-band signal->
Has an actual output of->
The system comprises a first whitening filter->
The second whitening filter->
The foreground filter->
Background filter
Non-linear processing filter->
And a feedback acoustic analog filter->
;
The first whitening filter
For each sub-band signal->
The output is->
;
The second whitening filter
Is the actual output->
And the output is->
;
The background filter
Is the output of the second whitening filter;
the foreground filter
Is inputted as->
;
The foreground filter
In or on the output>
And sub-band signal>
Combining as the non-linear processing filter>
The input of (a) is performed,
the nonlinear processing filter
The output being the actual output of each subband>
,/>
By means of said feedback acoustic analogue filter
Signal fed back to microphone by analog loudspeaker box>
,
Computing actual outputs of subbands
Comprehensively filtering to obtain a near-end time domain signal after feedback elimination>
。
Further, the foreground filter
Coefficient passing background filter>
Is obtained by the update of (c).
Further, the second whitening filter
For attenuating the actual output>
The sound box feeds back the sound signal->
。
Further, the first whitening filter
And said second whitening filter>
Smooth half-wave rectification is adopted.
Further, the foreground filter
Is greater or less than>
For feeding back signals from loudspeaker to microphone
Is estimated.
The invention has the beneficial effects that:
(1) In order to effectively remove the audio frequency fed back to the microphone by the sound box
In the method, a signal collected by a lifting microphone is firstly collected>
Performing subband (subband) analysis filtering processing, dividing the full-band audio signal into a plurality of subbands with equal bandwidth, and performing acoustic feedback elimination on each subband by using the method shown in FIG. 1;
(2) The method adopts a foreground and background operation mode, i.e. a foreground filter
Adaptive filter with coefficients passing through the background->
Is updated to obtain and is sent to the background adaptive filter>
Is passed through a first whitening filter->
To weaken the signal correlation and thereby effectively weaken the near-end audio->
And the signal fed back to the microphone by the sound box>
The strong correlation of the adaptive filter increases the robustness of the adaptive filter, and signals running in the foreground are not subjected to corresponding whitening processing, and the output tone quality of the system is not influenced.
Detailed Description
The technical solutions in the embodiments of the present invention are clearly and completely described below with reference to fig. 1, fig. 2 and fig. 3 of the present invention, and it is obvious that the described embodiments are only a part of the embodiments of the present invention, and not all of the embodiments. All other implementations made by those of ordinary skill in the art based on the embodiments of the present invention are obtained without inventive efforts.
In the description of the present invention, it is to be understood that the terms "counterclockwise", "clockwise", "longitudinal", "lateral", "up", "down", "front", "back", "left", "right", "vertical", "horizontal", "top", "bottom", "inner", "outer", etc., indicate orientations or positional relationships based on those shown in the drawings, and are used for convenience of description only, and do not indicate or imply that the referenced devices or elements must have a particular orientation, be constructed and operated in a particular orientation, and thus, are not to be considered as limiting.
Example 1:
method for local sound amplification and remote interaction by using hoisting microphone, signal d (n) collected by hoisting microphone (sound signal collected by microphone)
) Including near-end tonesFrequently v (n) (proximal tone @)>
) And the signal y (n) fed back to the microphone by the sound box (the audio frequency fed back to the microphone by the sound box &)>

) Namely: d (n) = v (n) + y (n), the main purpose of the invention is to remove the signal y (n) fed back to the microphone by the sound box in d (n), and only keep the near-end audio v (n), thereby realizing the purpose of no howling in local sound amplification and no echo in remote interaction. In order to effectively remove y (n), sub-band (subband) analysis filtering processing is firstly carried out on a signal d (n) collected by a hoisting microphone, a full-band audio signal is divided into a plurality of sub-bands, each sub-band is subjected to acoustic feedback elimination by adopting the method shown in the figure 1, the core technology of the method mainly comprises a sub-band pre-filtering method, variable step length self-adaptive signal processing and nonlinear processing, and a good application effect is achieved in practical application.
A pre-filtering method
The pre-filtering method is similar to the whitening process in signal processing, in that the signal is passed through a fixed whitening filter
The frequency spectrum distribution of the signal sent into the adaptive filter, namely the correlation characteristic of the signal itself, is changed, and the purpose of weakening the signal correlation is achieved. Fig. 1 shows a basic schematic diagram thereof. The method adopts a foreground and background operation mode, i.e. a filter of the foreground
Coefficient passing background filter>
Is updated to obtain and is sent to the background filter>
By means of a whitening filter>
To attenuate the signal correlation and thereby effectively attenuate the near-end audio &>
And the signal fed back to the microphone by the sound box>

The strong correlation of the method increases the robustness of the adaptive filter, the signals operated by the foreground do not carry out corresponding whitening processing, so the output tone quality of the system cannot be influenced, extra calculation and storage space are inevitably added to meet the operation of the foreground and the background, but along with the continuous development of chip technology, the CPU calculation capability is continuously enhanced, and the influence of the increased calculation and storage space on the product is very small, and the whitening filter provided by the invention has the advantages that the signal operated by the foreground does not carry out corresponding whitening processing, the output tone quality of the system is not influenced, and the method has the advantages that the CPU calculation capability is continuously enhanced, the calculation capability and the storage space are greatly influenced>

By means of smooth half-wave rectification,
in the formula (I), the compound is shown in the specification,
are variables of the function in the whitening filter,
smooth() To smooth the rectification function, the rectification function is, in this embodiment,
for a sub-band signal>
。
Step-length-variable adaptive signal processing
The linear processing part is to whiten the filter in each sub-band
The processed signals are respectively processed in variable step length and nonlinear processing, and sub-band signal processing is mainly carried out according to the following formula:
in the formula, T is a matrix transpose symbol,
is the subband signal (the input variable value in the function).
Wherein m is the iteration times, m is a natural integer, m is more than or equal to 0,
for a step factor (step constant), ->
Adaptive filter for background->
Linear part of the processed results.
Where L is the length (number of iterations) of the background adaptive filter for each subband,
for smoothing coefficients>
For background adaptive filter, <' > based on the background adaptive filter>
For foreground adaptive filter,>
for the result processed by the linear part of the background adaptive filter, then>
Is an input signal of a microphone>
For the input signal of a microphoneResults processed by an over-whitening filter>
Is the signal which is fed back to the microphone by the sound box>
In combination with a predetermined number of previous evaluations>
Is a variable-step coefficient whose magnitude is input +>
And processing result>
Is determined.
Three, non-linear processing
Due to the non-linearity of the system, a non-linear filtering process is required to be carried out on each sub-band, and a non-linear processing filter
The calculation method is as follows:
wherein
Is smoothed out>
For the autocorrelation of a subband transformation in the microphone input signal, a value is determined>
A cross-correlation of sub-band transforms for the microphone input signal.
Example 2:
referring to fig. 2, in this embodiment, the following steps are obtained by applying the above method to specific subbands:
the sub-band analysis filtering processing includes pre-emphasis processing, framing processing, low-pass filtering processing and analysis filtering processing on the signal d (n) collected by the near-end microphone, in this embodiment, 32KHz sampling is adopted to divide the signal into 256 sub-bands.
Microphone input for each sub-band
And output->
And performing pre-filtering treatment.
Thirdly, carrying out variable step length self-adaptive signal processing and nonlinear processing on the pre-filtered signal of each sub-band
Wherein
For smoothing out the factor, this embodiment->
=0.95. Again, the above parameters are illustrated, where L is the length of the background adaptive filter for each subband, and ` H `>
For smoothing coefficients>
For background adaptive filter, <' > based on the background adaptive filter>
For foreground adaptive filter,>
for results processed in the linear part of the background adaptive filter>
Is an input signal of a microphone>
For the result of the processing of the input signal of the microphone by means of the whitening filter, is->
Is the signal that the sound box feeds back to the microphone->
In combination with a predetermined number of previous evaluations>
Is a variable-step coefficient whose magnitude is input +>
And (b) aBased on results>
Is determined. m is the number of iterations, and m is a natural integer, m ≧ 0, < >>
Is a step size coefficient (step size constant). T is the matrix transposed symbol for the background adaptive filter for L iterations, for->
For a sub-band signal>
Adaptive filter for background->
Linear part of the processed results.
Obtaining foreground self-adaptive filter through background
Calculating the actual output &' for each sub-band>
Fifthly, obtaining the processed near-end audio signal through sub-band comprehensive filtering
。
It should be noted that, referring to fig. 3, the near-end audio, i.e. the near-end audio 102 (near-end audio signal) in the embodiment, is included
) The audio signal fed back to the microphone by the sound box is greater or less than>
(howling signal 101) and also echo signal 100 generated by the far-end audio. The present embodiment focuses on eliminating howling signals generated by a ceiling microphone.
In summary, the acoustic feedback elimination based on the pre-filtering method, the variable-step-size adaptive signal processing and the nonlinear processing is realized, the frequency of the near-end audio is not changed while the howling noise generated by the hoisting microphone is removed, only the howling signal mixed in the near-end audio is removed, and the effective near-end audio signal is reserved.
Those skilled in the art will recognize that the functionality described in this disclosure may be implemented in a combination of hardware and software in one or more of the examples described above. When software is applied, the corresponding functionality may be stored on or transmitted over as one or more instructions or code on a computer-readable medium. Computer-readable media includes both computer storage media and communication media including any medium that facilitates transfer of a computer program from one place to another. A storage media may be any available media that can be accessed by a general purpose or special purpose computer.