






Technical Field
This specification relates to the field of audio signal processing, and in particular to an audio signal processing method and system for suppressing echo.
Background
Vibration sensors are increasingly used in electronic products such as earphones, where they serve as bone conduction microphones that receive voice signals. When a person speaks, the bones and skin vibrate. These vibrations are bone-conducted voice signals, which a bone conduction microphone can pick up to generate a signal. The system converts the vibration signal collected by the bone conduction microphone into an electrical signal or another type of signal and transmits it to the electronic device to implement sound pickup. More and more electronic devices combine air conduction microphones and bone conduction microphones, which have different characteristics: the air conduction microphone picks up external audio signals, the bone conduction microphone picks up vibration signals from the sound-producing part of the body, and the picked-up signals then undergo speech enhancement and fusion. When a bone conduction microphone is placed in an earphone or another electronic device that has a speaker, it receives not only the vibration signal produced when the person speaks but also the vibration signal produced when the speaker plays sound, which results in an echo signal. An echo cancellation algorithm is then needed. Differences in the speaker's echo signal also affect the voice quality of the microphone. For example, when the speaker input signal is strong, the speaker vibration signal received by the bone conduction microphone is large, far larger than the vibration signal produced by the person speaking, and a conventional echo cancellation algorithm has difficulty removing the echo from the bone conduction microphone signal. In that case, using the microphone signals output by both the air conduction microphone and the bone conduction microphone as the sound source signal yields poor voice quality. It is therefore unreasonable to ignore the speaker's echo signal when selecting the microphone's sound source signal.
Therefore, a new audio signal processing method and system for suppressing echo is needed, one that switches the input sound source signal according to the speaker input signal, so as to improve echo cancellation and voice quality.
Summary
This specification provides a new audio signal processing method and system for suppressing echo, so as to improve echo cancellation and voice quality.
In a first aspect, this specification provides an audio signal processing method for suppressing echo, including: selecting a target audio processing mode of an electronic device from a first mode and a second mode based at least on a speaker signal, the speaker signal being an audio signal sent by a control device to the electronic device; generating target audio by processing a microphone signal in the target audio processing mode, so as to at least reduce echo in the target audio, the microphone signal being an output signal of a microphone module acquired by the electronic device, the microphone module including at least one first-type microphone and at least one second-type microphone, where the first mode includes performing signal processing on a first audio signal output by the at least one first-type microphone and a second audio signal output by the at least one second-type microphone, and the second mode includes performing signal processing on the second audio signal; and outputting the target audio.
In some embodiments, selecting the target audio processing mode of the electronic device from the first mode and the second mode based at least on the speaker signal includes: generating, based at least on the strength of the speaker signal, a control signal corresponding to the speaker signal, the control signal including a first control signal or a second control signal; and selecting, based on the control signal, the target audio processing mode corresponding to the control signal, where the first mode corresponds to the first control signal and the second mode corresponds to the second control signal.
In some embodiments, generating the control signal corresponding to the speaker signal based at least on the strength of the speaker signal includes: determining that the strength of the speaker signal is lower than a preset speaker threshold, and generating the first control signal; or determining that the strength of the speaker signal is higher than the speaker threshold, and generating the second control signal.
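The threshold comparison above can be illustrated with a minimal sketch. It rests on several assumptions that the specification does not fix: the strength of the speaker signal is taken to be the RMS level of one frame, the numeric codes 1 and 2 stand for the first and second control signals, a strength equal to the threshold is treated as not exceeding it, and the names `rms` and `control_from_speaker` are hypothetical.

```python
import math

FIRST_CONTROL = 1   # assumed code for the first control signal (first mode)
SECOND_CONTROL = 2  # assumed code for the second control signal (second mode)


def rms(frame):
    """Root-mean-square level of one frame; an assumed measure of signal strength."""
    if not frame:
        return 0.0
    return math.sqrt(sum(s * s for s in frame) / len(frame))


def control_from_speaker(frame, speaker_threshold):
    """Return the first control signal when the speaker frame does not exceed the
    threshold, otherwise the second control signal."""
    if rms(frame) <= speaker_threshold:
        return FIRST_CONTROL
    return SECOND_CONTROL
```

For example, a quiet frame such as `[0.01, -0.01]` with a threshold of `0.1` would select the first control signal, whereas `[0.5, -0.5]` would select the second.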
In some embodiments, generating the control signal corresponding to the speaker signal based at least on the strength of the speaker signal includes: generating the corresponding control signal based on the strength of the speaker signal and the microphone signal.
In some embodiments, generating the corresponding control signal based on the strength of the speaker signal and the microphone signal includes: acquiring an evaluation parameter of the microphone signal, the evaluation parameter including an ambient noise evaluation parameter, which includes at least one of an ambient noise level and a signal-to-noise ratio; and generating the control signal based on the strength of the speaker signal and the evaluation parameter.
In some embodiments, generating the control signal based on the strength of the speaker signal and the evaluation parameter includes one of the following: determining that the strength of the speaker signal is higher than a preset speaker threshold, and generating the second control signal; determining that the strength of the speaker signal is lower than the speaker threshold and that the ambient noise evaluation parameter is outside a preset noise evaluation range, and generating the first control signal; and determining that the strength of the speaker signal is lower than the speaker threshold and that the ambient noise evaluation parameter is within the noise evaluation range, and generating the first control signal or the second control signal.
In some embodiments, the ambient noise evaluation parameter being within the noise evaluation range includes at least one of the following: the ambient noise level is lower than a preset ambient noise threshold; and the signal-to-noise ratio is higher than a preset signal-to-noise ratio threshold.
In some embodiments, the evaluation parameter further includes a voice signal strength, and generating the control signal based on the strength of the speaker signal and the evaluation parameter includes one of the following: determining that the strength of the speaker signal is higher than a preset speaker threshold, that the voice signal strength exceeds a preset voice threshold, and that the ambient noise evaluation parameter is outside a preset noise evaluation range, and generating the first control signal; determining that the strength of the speaker signal is higher than the speaker threshold, that the voice signal strength exceeds the voice threshold, and that the ambient noise evaluation parameter is within the preset noise evaluation range, and generating the second control signal; determining that the strength of the speaker signal is higher than the speaker threshold and that the voice signal strength is lower than the voice threshold, and generating the second control signal; determining that the strength of the speaker signal is lower than the speaker threshold and that the ambient noise evaluation parameter is outside the noise evaluation range, and generating the first control signal; and determining that the strength of the speaker signal is lower than the speaker threshold and that the ambient noise evaluation parameter is within the noise evaluation range, and generating the first control signal or the second control signal.
In some embodiments, the ambient noise evaluation parameter being within the noise evaluation range includes at least one of the following: the ambient noise level is lower than a preset ambient noise threshold; and the signal-to-noise ratio is higher than a preset signal-to-noise ratio threshold.
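The five cases that combine speaker strength, voice signal strength, and the ambient noise evaluation can be written as a small decision function. The following is only a sketch under stated assumptions: all function and parameter names are hypothetical, the codes 1 and 2 stand for the two control signals, values exactly equal to a threshold are handled arbitrarily because the text does not define them, and `quiet_default` is an assumed parameter for the case in which either control signal is permitted.

```python
FIRST_CONTROL = 1   # assumed code for the first control signal
SECOND_CONTROL = 2  # assumed code for the second control signal


def noise_in_range(noise_level, snr, noise_threshold, snr_threshold):
    """Within the noise evaluation range when at least one condition holds:
    low ambient noise level, or high signal-to-noise ratio."""
    return noise_level < noise_threshold or snr > snr_threshold


def select_control(speaker, voice, in_range, speaker_threshold, voice_threshold,
                   quiet_default=FIRST_CONTROL):
    """Map speaker strength, voice strength, and the noise evaluation to a control signal."""
    if speaker > speaker_threshold:
        # Strong speaker signal: the first control signal is generated only when
        # the voice is strong and the ambient noise is outside the range.
        if voice > voice_threshold and not in_range:
            return FIRST_CONTROL
        return SECOND_CONTROL
    # Weak speaker signal.
    if not in_range:
        return FIRST_CONTROL
    # Either control signal is allowed here; the choice is left to the caller.
    return quiet_default
```

Leaving the last case as a parameter reflects that the text allows either control signal there; an implementation could, for instance, keep whichever mode is already active to avoid unnecessary switching.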
In some embodiments, generating the target audio includes: performing signal processing on the first audio signal and the second audio signal by a first algorithm in the first mode to generate first target audio; or performing signal processing on the second audio signal by a second algorithm in the second mode to generate second target audio, where the target audio includes one of the first target audio and the second target audio.
In some embodiments, outputting the target audio includes: smoothing the target audio, where, when the target audio switches between the first target audio and the second target audio, the smoothing is applied at the junction between the first target audio and the second target audio; and outputting the smoothed target audio.
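The specification does not say how the junction is smoothed. One common option, shown here purely as an assumed example, is a linear crossfade over a short overlap in which the outgoing target audio fades out while the incoming target audio fades in. The function name `crossfade` and the overlap handling are hypothetical.

```python
def crossfade(outgoing, incoming):
    """Linearly blend two equal-length overlap segments.

    `outgoing` holds the last samples of the target audio being switched away from;
    `incoming` holds the first samples of the target audio being switched to.
    """
    if len(outgoing) != len(incoming):
        raise ValueError("overlap segments must have the same length")
    n = len(outgoing)
    blended = []
    for i in range(n):
        w = (i + 1) / (n + 1)  # weight of the incoming audio rises from 0 toward 1
        blended.append((1.0 - w) * outgoing[i] + w * incoming[i])
    return blended
```

A longer overlap gives a gentler transition at the cost of a slightly slower switch between the two target audio streams.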
In some embodiments, the method further includes: controlling, based on the control signal, the strength of a speaker input signal of a speaker.
In some embodiments, controlling the strength of the speaker input signal of the speaker based on the control signal includes: determining that the control signal is the first control signal, and reducing the strength of the speaker input signal fed to the speaker, thereby reducing the strength of the sound output by the speaker.
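A minimal sketch of this gain control follows. The attenuation factor of 0.5, the codes 1 and 2 for the control signals, and the name `scale_speaker_input` are assumptions chosen for illustration; the specification only states that the strength is reduced when the control signal is the first control signal.

```python
FIRST_CONTROL = 1   # assumed code for the first control signal
SECOND_CONTROL = 2  # assumed code for the second control signal


def scale_speaker_input(frame, control, reduced_gain=0.5):
    """Attenuate the speaker input frame when the first control signal is active;
    pass it through unchanged otherwise."""
    gain = reduced_gain if control == FIRST_CONTROL else 1.0
    return [s * gain for s in frame]
```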
In some embodiments, the at least one first-type microphone is configured to collect human body vibration signals, and the at least one second-type microphone is configured to collect air vibration signals.
In a second aspect, this specification further provides an audio signal processing system for suppressing echo, including at least one storage medium and at least one processor. The at least one storage medium stores at least one instruction set for audio signal processing for suppressing echo. The at least one processor is communicatively connected to the at least one storage medium, where, when the system runs, the at least one processor reads the at least one instruction set and, as directed by the at least one instruction set, performs the audio signal processing method for suppressing echo described in the first aspect of this specification.
As the above technical solutions show, the audio signal processing method and system for suppressing echo provided in this specification can generate a control signal corresponding to the speaker signal according to the strength of the speaker signal, and can control or switch the audio processing mode according to the control signal, so that signal processing is performed on the sound source signal corresponding to the audio processing mode and better voice quality is obtained. When the speaker signal does not exceed the threshold, the system generates the first control signal, selects the first mode, uses the first audio signal and the second audio signal as a first sound source signal, and performs signal processing on the first sound source signal to obtain the first target audio. When the speaker signal exceeds the threshold, the speaker echo in the first audio signal is large. In this case, the system generates the second control signal, selects the second mode, uses the second audio signal as a second sound source signal, and performs signal processing on the second sound source signal to obtain the second target audio. The method and system can thus switch between audio processing modes according to the speaker signal, and thereby switch the sound source signal taken from the microphone signal, so as to obtain better voice quality across different scenarios.
Other functions of the audio signal processing method and system for suppressing echo provided in this specification are set out in part in the following description. From that description, the content presented in the following figures and examples will be apparent to those of ordinary skill in the art. The inventive aspects of the audio signal processing method and system for suppressing echo provided in this specification can be fully explained by practicing or using the methods, apparatuses, and combinations described in the detailed examples below.
Brief Description of the Drawings
To describe the technical solutions in the embodiments of this specification more clearly, the drawings needed for describing the embodiments are briefly introduced below. The drawings described below are merely some embodiments of this specification, and those of ordinary skill in the art can derive other drawings from them without creative effort.
FIG. 1 is a schematic diagram of an application scenario of an audio signal processing system for suppressing echo according to some embodiments of this specification;
FIG. 2 is a schematic diagram of an electronic device according to some embodiments of this specification;
FIG. 3 is a schematic diagram of the operation of a first mode according to some embodiments of this specification;
FIG. 4 is a schematic diagram of the operation of a second mode according to some embodiments of this specification;
FIG. 5 is a flowchart of an audio signal processing method for suppressing echo according to some embodiments of this specification;
FIG. 6 is a flowchart of an audio signal processing method for suppressing echo according to some embodiments of this specification; and
FIG. 7 is a flowchart of an audio signal processing method for suppressing echo according to some embodiments of this specification.
Detailed Description
The following description sets out specific application scenarios and requirements of this specification, with the aim of enabling those skilled in the art to make and use its content. Various modifications to the disclosed embodiments will be apparent to those skilled in the art, and the general principles defined here may be applied to other embodiments and applications without departing from the spirit and scope of this specification. This specification is therefore not limited to the embodiments shown, but is to be accorded the widest scope consistent with the claims.
The terminology used here is only for describing particular example embodiments and is not limiting. For example, unless the context clearly indicates otherwise, the singular forms "a," "an," and "the" as used here may also include the plural. As used in this specification, the terms "include," "comprise," and/or "contain" mean that the associated integers, steps, operations, elements, and/or components are present, but do not exclude the presence of one or more other features, integers, steps, operations, elements, components, and/or groups, nor the addition of other features, integers, steps, operations, elements, components, and/or groups to the system or method.
These and other features of this specification, as well as the operation and function of the related structural elements, the combination of parts, and the economies of manufacture, may become more apparent upon consideration of the following description with reference to the accompanying drawings, all of which form part of this specification. It should be clearly understood, however, that the drawings are for illustration and description only and are not intended to limit the scope of this specification. It should also be understood that the drawings are not drawn to scale.
The flowcharts used in this specification illustrate operations implemented by systems according to some embodiments of this specification. It should be clearly understood that the operations of a flowchart need not be performed in the order shown; they may instead be performed in reverse order or simultaneously. In addition, one or more other operations may be added to a flowchart, and one or more operations may be removed from it.
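FIG. 1 is a schematic diagram of an application scenario of an audio signal processing system 100 for suppressing echo (hereinafter, system 100) according to some embodiments of this specification. System 100 may include an electronic device 200 and a control device 400.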
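Electronic device 200 may store data or instructions for performing the audio signal processing method for suppressing echo described in this specification, and may execute the data and/or instructions. In some embodiments, electronic device 200 may be a wireless earphone, a wired earphone, or a smart wearable device, such as smart glasses, a smart helmet, or a smart watch, that has both voice capture and voice playback functions. Electronic device 200 may also be a mobile device, a tablet computer, a laptop computer, a device built into a motor vehicle, or the like, or any combination thereof. In some embodiments, the mobile device may include a smart home device, a smart mobile device, or the like, or any combination thereof. For example, the smart mobile device may include a mobile phone, a personal digital assistant, a gaming device, a navigation device, an ultra-mobile personal computer (UMPC), or the like, or any combination thereof. In some embodiments, the smart home device may include a smart TV, a desktop computer, or the like, or any combination thereof. In some embodiments, the device built into a motor vehicle may include an on-board computer, an on-board TV, or the like.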
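Control device 400 may be a remote device that exchanges audio signals with electronic device 200 over a wired and/or wireless connection. Control device 400 may also be a local device communicatively connected to electronic device 200. Electronic device 200 may capture a local audio signal and output it to control device 400. Electronic device 200 may also receive and output a far-end audio signal sent by control device 400. The far-end audio signal may also be referred to as the speaker signal. Control device 400 may likewise be a device with voice capture and voice playback functions, such as a mobile phone, a tablet computer, a laptop computer, an earphone, a smart wearable device, a device built into a motor vehicle, or the like, or any combination thereof. For example, when electronic device 200 is an earphone, control device 400 may be a terminal device communicatively connected to the earphone, such as a mobile phone or a computer.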
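As shown in FIG. 1, electronic device 200 may include a microphone module 240 and a speaker 280. Microphone module 240 may be configured to acquire a local audio signal and output a microphone signal, that is, an electrical signal carrying audio information. Microphone module 240 may be an out-of-ear microphone module or an in-ear microphone module. For example, microphone module 240 may be a microphone placed outside the ear canal or a microphone placed inside the ear canal. Microphone module 240 may include at least one first-type microphone 242 and at least one second-type microphone 244. First-type microphone 242 differs from second-type microphone 244. First-type microphone 242 may be a microphone that directly collects human body vibration signals, such as a bone conduction microphone. Second-type microphone 244 may be a microphone that directly collects air vibration signals, such as an air conduction microphone. First-type microphone 242 and second-type microphone 244 may also be other types of microphones. For example, first-type microphone 242 may be an optical microphone, and second-type microphone 244 may be a microphone that receives electromyographic signals. Because first-type microphone 242 differs from second-type microphone 244, the two sense audio signals differently, so the noise and echo components in their respective audio signals differ. For ease of presentation, the following description uses a bone conduction microphone as the example of first-type microphone 242 and an air conduction microphone as the example of second-type microphone 244.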
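The bone conduction microphone may include a vibration sensor, such as an optical vibration sensor or an acceleration sensor. The vibration sensor can collect a mechanical vibration signal (for example, a signal produced by the vibration of the skin or bones when user 002 speaks) and convert it into an electrical signal. The mechanical vibration signal here mainly refers to vibration propagated through solids. The bone conduction microphone contacts the skin or bones of user 002 through the vibration sensor, or through a vibration component connected to the vibration sensor, so as to collect the vibration signal produced by the bones or skin when user 002 makes a sound and convert it into an electrical signal. In some embodiments, the vibration sensor may be a device that is sensitive to mechanical vibration but insensitive to air vibration (that is, its responsiveness to mechanical vibration exceeds its responsiveness to air vibration). Because the bone conduction microphone picks up the vibration signal directly from the sound-producing part of the body, it can reduce the influence of ambient noise.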
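The air conduction microphone collects the air vibration signal caused when user 002 makes a sound and converts it into an electrical signal. The air conduction microphone may be a single air conduction microphone or a microphone array made up of two or more air conduction microphones. The microphone array may be a beamforming microphone array or another similar microphone array. A microphone array can collect sound from different directions or positions in space.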
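First-type microphone 242 may output a first audio signal 243. Second-type microphone 244 may output a second audio signal 245. The microphone signal includes first audio signal 243 and second audio signal 245. In a low-noise scenario, second audio signal 245 has better voice quality than first audio signal 243. In a scenario with high ambient noise, first audio signal 243 has better voice quality in the low-frequency part, while second audio signal 245 has better voice quality in the high-frequency part. Therefore, in a scenario with high ambient noise, the audio signal obtained by feature fusion of first audio signal 243 and second audio signal 245 has good voice quality. In actual use, the ambient noise may change at any moment, moving back and forth between the low-noise scenario and the high-noise scenario.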
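Speaker 280 may convert an electrical signal into an audio signal. Speaker 280 may be configured to receive the speaker signal from control device 400 and output it. For ease of description, the audio signal fed to speaker 280 is defined as the speaker input signal. In some embodiments, the speaker input signal may be the speaker signal itself. In some embodiments, electronic device 200 may perform signal processing on the speaker signal and send the processed audio signal to speaker 280 for output. In that case, the speaker input signal may be the audio signal obtained after electronic device 200 processes the speaker signal.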
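The sound that speaker 280 outputs from the speaker input signal may be delivered to user 002 by air conduction or bone conduction. Speaker 280 may be a speaker that delivers sound by transmitting vibration signals to the human body, such as a bone conduction speaker, or a speaker that transmits vibration signals through the air, such as an air conduction speaker. A bone conduction speaker generates mechanical vibration through a vibration module and conducts the mechanical vibration to the ear through the bones. For example, speaker 280 may contact the head of user 002 directly or through a specific medium (for example, one or more panels) and deliver the audio signal to the user's auditory nerve by vibrating the skull. An air conduction speaker generates vibration in the air through a vibration module and conducts the air vibration to the ear through the air. Speaker 280 may also be a combination of a bone conduction speaker and an air conduction speaker, or another type of speaker. The sound that speaker 280 outputs from the speaker input signal may be picked up by microphone module 240 and form an echo. The stronger the speaker input signal, the louder the sound output by speaker 280 and the stronger the echo signal.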
It should be noted that microphone module 240 and speaker 280 may be integrated into electronic device 200 or may be external devices attached to electronic device 200.
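In operation, first-type microphone 242 and second-type microphone 244 pick up not only the sound made by user 002 but also ambient noise and the sound emitted by speaker 280. Electronic device 200 may collect audio signals through microphone module 240 and generate the microphone signal. The microphone signal may include first audio signal 243 and second audio signal 245. The voice quality of first audio signal 243 and second audio signal 245 differs from scenario to scenario. To ensure the quality of voice communication, electronic device 200 may select a target audio processing mode from multiple audio processing modes according to the application scenario, so as to select from the microphone signal the audio signal with better voice quality as the sound source signal, process the sound source signal in the target audio processing mode, and output the result to control device 400. The sound source signal may be the input signal of the target audio processing mode. In some embodiments, the signal processing may include noise suppression to reduce the noise signal. In some embodiments, the signal processing may include echo suppression to reduce the echo signal. In some embodiments, the signal processing may include both noise suppression and echo suppression. In some embodiments, the signal processing may simply output the sound source signal directly. For ease of presentation, the following description assumes that the signal processing includes echo suppression. Those skilled in the art will understand that other forms of signal processing also fall within the scope of protection of this specification.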
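The selection of the target audio processing mode by electronic device 200 depends not only on the ambient noise but also on the speaker signal. In some scenarios, for example when the speaker signal is small and the sound output by speaker 280 is correspondingly quiet, the audio signal obtained by feature fusion of first audio signal 243 output by first-type microphone 242 and second audio signal 245 output by second-type microphone 244 has better voice quality than second audio signal 245 alone.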
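In some special scenarios, however, for example when the speaker signal is large and the sound output by speaker 280 is correspondingly loud, first audio signal 243 output by first-type microphone 242 is strongly affected, and the echo in first audio signal 243 is large. In some embodiments, the echo signal in first audio signal 243 may exceed the voice signal of user 002. The echo signal in first audio signal 243 is especially pronounced when speaker 280 is a bone conduction speaker. A conventional echo cancellation algorithm has difficulty removing the echo signal from first audio signal 243 and cannot guarantee the echo cancellation result. In this case, second audio signal 245 output by second-type microphone 244 has better voice quality than the audio signal obtained by feature fusion of first audio signal 243 and second audio signal 245.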
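Therefore, electronic device 200 may select the target audio processing mode from the multiple audio processing modes based on the speaker signal and use it to process the microphone signal. The multiple audio processing modes may include at least a first mode 1 and a second mode 2.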
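First mode 1 may perform signal processing on first audio signal 243 and second audio signal 245. As noted above, in some embodiments the signal processing may include noise suppression to reduce the noise signal. In some embodiments, the signal processing may include echo suppression to reduce the echo signal. In some embodiments, the signal processing may include both noise suppression and echo suppression. For ease of presentation, the following description assumes that the signal processing includes echo suppression. Those skilled in the art will understand that other forms of signal processing also fall within the scope of protection of this specification.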
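Second mode 2 may perform signal processing on second audio signal 245. In some embodiments, the signal processing may include noise suppression to reduce the noise signal. In some embodiments, the signal processing may include echo suppression to reduce the echo signal. In some embodiments, the signal processing may include both noise suppression and echo suppression. For ease of presentation, the following description assumes that the signal processing includes echo suppression. Those skilled in the art will understand that other forms of signal processing also fall within the scope of protection of this specification.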
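The target audio processing mode is one of first mode 1 and second mode 2. The multiple audio processing modes may also include other modes, for example a processing mode that performs signal processing on first audio signal 243.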
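Therefore, when the speaker signal is small, to ensure that the voice used for voice communication is of high quality, electronic device 200 selects first mode 1, uses first audio signal 243 and second audio signal 245 as the sound source signal, performs signal processing on the sound source signal, and generates and outputs first target audio 291 for voice communication. When the speaker signal is large, to ensure that the voice used for voice communication is of high quality, electronic device 200 selects second mode 2, uses second audio signal 245 as the sound source signal, performs signal processing on the sound source signal, and generates and outputs second target audio 292 for voice communication.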
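Electronic device 200 may execute the data or instructions for the audio signal processing method for suppressing echo described in this specification and acquire the microphone signal and the speaker signal. Electronic device 200 may select, based on the signal strength of the speaker signal, the corresponding target audio processing mode for processing the microphone signal. Specifically, electronic device 200 may select, from the multiple audio processing modes, the target audio processing mode corresponding to the strength of the speaker signal; select from first audio signal 243 and second audio signal 245 the audio signal with better voice quality, or a combination of the two, as the sound source signal; process the sound source signal with the corresponding signal processing algorithm (for example, echo cancellation and noise reduction); and generate and output the target audio, so as to reduce the echo in the target audio. The target audio may include one of first target audio 291 and second target audio 292. Electronic device 200 may output the target audio to control device 400.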
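In summary, to ensure the voice quality of communication, electronic device 200 may control and select the target audio processing mode based on the strength of the speaker signal, thereby selecting the audio signal with better voice quality as the sound source signal of electronic device 200, and may process that sound source signal to obtain different target audio for different usage scenarios, so that the target audio has the best available voice quality in each scenario.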
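FIG. 2 is a schematic diagram of an electronic device 200. Electronic device 200 may perform the audio signal processing method for suppressing echo described in this specification. The method is described elsewhere in this specification, for example in the description of FIG. 5 to FIG. 7.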
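As shown in FIG. 2, electronic device 200 may include microphone module 240 and speaker 280. In some embodiments, electronic device 200 may further include at least one storage medium 230 and at least one processor 220.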
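Storage medium 230 may include a data storage device. The data storage device may be a non-transitory storage medium or a transitory storage medium. For example, the data storage device may include one or more of a magnetic disk, a read-only memory (ROM), or a random access memory (RAM). Storage medium 230 further includes at least one instruction set, stored in the data storage device, for audio signal processing for suppressing echo. The instructions are computer program code, which may include programs, routines, objects, components, data structures, procedures, modules, and the like that perform the audio signal processing method for suppressing echo provided in this specification.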
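As shown in FIG. 2, the at least one instruction set may include control instructions, issued by a control module 231 and configured to generate a control signal corresponding to the speaker signal based on the speaker signal, or on the speaker signal and the microphone signal. The control signal includes a first control signal or a second control signal. The first control signal corresponds to first mode 1, and the second control signal corresponds to second mode 2. The control signal may be any signal; for example, the first control signal may be signal 1 and the second control signal may be signal 2. The control instructions issued by control module 231 may generate the corresponding control signal according to the signal strength of the speaker signal, or according to the signal strength of the speaker signal together with the evaluation parameter of the microphone signal. The correspondence between the control signal and the speaker signal, or the speaker signal and the microphone signal, is described in detail later. Control module 231 may also select, according to the control signal, the target audio processing mode corresponding to the control signal. When the control signal is the first control signal, control module 231 selects first mode 1; when the control signal is the second control signal, control module 231 selects second mode 2.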
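In some embodiments, the at least one instruction set may further include echo processing instructions, issued by an echo processing module 233 and configured to perform signal processing (for example, echo suppression or noise reduction) on the microphone signal in the target audio processing mode of electronic device 200 based on the control signal. When the control signal is the first control signal, echo processing module 233 processes the microphone signal in first mode 1. When the control signal is the second control signal, echo processing module 233 processes the microphone signal in second mode 2.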
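Echo processing module 233 may include a first algorithm 233-1 and a second algorithm 233-8. First algorithm 233-1 corresponds to the first control signal and first mode 1. Second algorithm 233-8 corresponds to the second control signal and second mode 2.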
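In first mode 1, electronic device 200 uses first algorithm 233-1 to process first audio signal 243 and second audio signal 245 separately, performs feature fusion on the processed first audio signal 243 and second audio signal 245, and outputs first target audio 291.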
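FIG. 3 is a schematic diagram of the operation of a first mode 1 according to some embodiments of this specification. As shown in FIG. 3, in first mode 1, first algorithm 233-1 may receive first audio signal 243, second audio signal 245, and the speaker input signal. First algorithm 233-1 may use a first echo cancellation module 233-2 to perform echo cancellation on first audio signal 243 based on the speaker input signal. The speaker input signal may be an audio signal that has undergone noise reduction. First echo cancellation module 233-2 receives first audio signal 243 and the speaker input signal, and outputs the echo-cancelled first audio signal 243. First echo cancellation module 233-2 may be a single-microphone echo cancellation algorithm.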
In some embodiments, first algorithm 233-1 may use a second echo cancellation module 233-3 to perform echo cancellation on second audio signal 245 based on the speaker input signal. Second echo cancellation module 233-3 receives second audio signal 245 and the speaker input signal, and outputs the echo-cancelled second audio signal 245. Second echo cancellation module 233-3 may be a single-microphone echo cancellation algorithm or a multi-microphone echo cancellation algorithm. First echo cancellation module 233-2 and second echo cancellation module 233-3 may be the same or different.
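The specification does not name a particular single-microphone echo cancellation algorithm. As an assumed illustration only, the sketch below uses a normalized least-mean-squares (NLMS) adaptive filter, a widely used technique that estimates the echo path from the speaker input signal (the reference) and subtracts the estimated echo from the microphone signal. The function name, tap count, and step size are hypothetical choices, not values from the specification.

```python
def nlms_echo_cancel(mic, ref, taps=8, mu=0.5, eps=1e-8):
    """Subtract an adaptively estimated echo of `ref` from `mic`, sample by sample.

    mic: microphone samples that contain an echo of the reference
    ref: speaker input samples used as the echo reference
    Returns the residual (echo-reduced) samples.
    """
    weights = [0.0] * taps   # current estimate of the echo path
    history = [0.0] * taps   # most recent reference samples, newest first
    residual = []
    for d, x in zip(mic, ref):
        history = [x] + history[:-1]
        echo_estimate = sum(w * h for w, h in zip(weights, history))
        e = d - echo_estimate
        power = sum(h * h for h in history) + eps
        # Normalized update: the step is scaled by the reference power.
        weights = [w + mu * e * h / power for w, h in zip(weights, history)]
        residual.append(e)
    return residual
```

In practice such a filter is usually paired with double-talk detection so that the user's own voice does not disturb the adaptation; that part is omitted here.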
In some embodiments, first algorithm 233-1 may use a first noise suppression module 233-4 to perform noise suppression on the echo-cancelled first audio signal 243 and second audio signal 245. First noise suppression module 233-4 suppresses the noise signals in first audio signal 243 and second audio signal 245. It receives the echo-cancelled first audio signal 243 and second audio signal 245, and outputs the noise-suppressed first audio signal 243 and second audio signal 245. First noise suppression module 233-4 may denoise first audio signal 243 and second audio signal 245 separately or jointly.
In some embodiments, first algorithm 233-1 may use a feature fusion module 233-5 to perform feature fusion on the noise-suppressed first audio signal 243 and second audio signal 245. Feature fusion module 233-5 receives the noise-reduced first audio signal 243 and second audio signal 245. Feature fusion module 233-5 may analyze the voice quality of first audio signal 243 and second audio signal 245. For example, feature fusion module 233-5 may analyze the effective voice signal strength, noise signal strength, echo signal strength, signal-to-noise ratio, and so on of first audio signal 243 and second audio signal 245, judge the voice quality of the two signals, fuse first audio signal 243 and second audio signal 245 into first target audio 291, and output it.
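The fusion rule itself is not specified. One simple possibility, offered only as an assumed sketch, follows the earlier observation that in noisy scenarios the first audio signal is better at low frequencies and the second at high frequencies: take the low band from the first signal and the complementary high band from the second. The one-pole low-pass filter, the coefficient `alpha`, and the function names are all hypothetical.

```python
def one_pole_lowpass(signal, alpha):
    """Simple recursive low-pass filter: y[n] = y[n-1] + alpha * (x[n] - y[n-1])."""
    out, state = [], 0.0
    for x in signal:
        state += alpha * (x - state)
        out.append(state)
    return out


def fuse_bands(first, second, alpha=0.1):
    """Combine the low band of `first` with the complementary high band of `second`."""
    low_first = one_pole_lowpass(first, alpha)
    low_second = one_pole_lowpass(second, alpha)
    # High band of `second` is whatever its own low-pass output leaves behind.
    return [lf + (s - ls) for lf, s, ls in zip(low_first, second, low_second)]
```

Because the two bands are complementary, feeding the same signal into both inputs returns that signal essentially unchanged, which is a convenient sanity check for this kind of crossover.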
In some embodiments, the first algorithm 233-1 may also use a second noise suppression module 233-6 to suppress noise in the speaker signal. The second noise suppression module 233-6 receives the speaker signal sent by the control device 400, removes noise signals such as far-end noise, channel noise, and electronic noise within the electronic device 200 from the speaker signal, and outputs the denoised speaker processing signal.
It should be noted that FIG. 3 is only an example. Those skilled in the art should understand that, in some embodiments, the first algorithm 233-1 may include the feature fusion module 233-5. In other embodiments, the first algorithm 233-1 may further include any one or any combination of the first echo cancellation module 233-2, the second echo cancellation module 233-3, the first noise suppression module 233-4, and the second noise suppression module 233-6.
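In the second mode 2, the electronic device 200 uses a second algorithm 233-8 to process the second audio signal 245 and outputs the second target audio 292.
FIG. 4 shows a working schematic diagram of the second mode 2 according to an embodiment of this specification. As shown in FIG. 4, in the second mode 2, the second algorithm 233-8 may receive the second audio signal 245 and the speaker input signal. The second algorithm 233-8 may use a third echo cancellation module 233-9 to perform echo cancellation on the second audio signal 245 based on the speaker input signal. The third echo cancellation module 233-9 receives the second audio signal 245 and the speaker input signal and outputs the echo-cancelled second audio signal 245. The third echo cancellation module 233-9 may be the same as or different from the second echo cancellation module 233-3.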
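As an illustration only, echo cancellation of a microphone signal against a speaker reference can be sketched with a normalized least-mean-squares (NLMS) adaptive filter. The specification does not state which echo cancellation algorithm the third echo cancellation module 233-9 uses; NLMS is a commonly used technique substituted here as an assumption, and the function name `nlms_echo_cancel` and its parameters `taps`, `mu`, and `eps` are hypothetical.

```python
def nlms_echo_cancel(mic, ref, taps=8, mu=0.5, eps=1e-8):
    """Subtract an adaptive estimate of the echo of `ref` from `mic`.

    mic: microphone samples (stands in for the second audio signal 245)
    ref: speaker reference samples (stands in for the speaker input signal)
    Returns the residual, i.e. the echo-reduced microphone signal.
    NLMS is an assumed technique, not one named by the specification.
    """
    weights = [0.0] * taps   # adaptive estimate of the echo path
    history = [0.0] * taps   # most recent reference samples, newest first
    residual = []
    for d, x in zip(mic, ref):
        history = [x] + history[:-1]
        echo_estimate = sum(w * h for w, h in zip(weights, history))
        e = d - echo_estimate                       # echo-reduced sample
        power = sum(h * h for h in history) + eps   # normalization term
        weights = [w + mu * e * h / power for w, h in zip(weights, history)]
        residual.append(e)
    return residual
```

In this sketch the filter adapts continuously; a practical module would typically freeze or slow adaptation while the near-end user is speaking, which is outside the scope of the example.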
In some embodiments, the second algorithm 233-8 may use a third noise suppression module 233-10 to suppress noise in the echo-cancelled second audio signal 245. The third noise suppression module 233-10 receives the echo-cancelled second audio signal 245 and outputs the noise-suppressed second audio signal 245 as the second target audio 292. The third noise suppression module 233-10 may be the same as or different from the first noise suppression module 233-4.
In some embodiments, the second algorithm 233-8 may also use a fourth noise suppression module 233-11 to suppress noise in the speaker signal. The fourth noise suppression module 233-11 receives the speaker signal sent by the control device 400, removes noise signals such as far-end noise, channel noise, and electronic noise within the electronic device 200 from the speaker signal, and outputs the denoised speaker processing signal. The fourth noise suppression module 233-11 may be the same as or different from the second noise suppression module 233-6.
It should be noted that FIG. 4 is only an example. Those skilled in the art should understand that, in some embodiments, the second algorithm 233-8 may include any one or any combination of the third echo cancellation module 233-9, the third noise suppression module 233-10, and the fourth noise suppression module 233-11. In other embodiments, the second algorithm 233-8 may include none of these signal processing modules and may output the second audio signal 245 directly.
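Only one of the first mode 1 and the second mode 2 may run at a time, to save computing resources: when the first mode 1 is running, the second mode 2 may be turned off, and when the second mode 2 is running, the first mode 1 may be turned off. The first mode 1 and the second mode 2 may also run simultaneously; while one mode is running, the other may update its algorithm parameters. When the electronic device 200 switches between the first mode 1 and the second mode 2, some parameters may be shared between the two modes (for example, noise parameters obtained by a noise estimation algorithm, voice parameters obtained by a voice estimation algorithm, and signal-to-noise ratio parameters obtained by a signal-to-noise ratio algorithm), which saves computing resources and makes the results more accurate. The first algorithm 233-1 and the second algorithm 233-8 may likewise share such parameters with the control instructions issued by the control module 231, with the same benefits.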
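In some embodiments, the at least one instruction set may further include microphone control instructions, executed by a microphone control module 235 and configured to smooth the target audio and output the smoothed target audio to the control device 400. The microphone control module 235 may receive the control signal generated by the control module 231 together with the target audio, and smooth the target audio based on the control signal. When the control signal is the first control signal, the first mode 1 runs and the first target audio 291 output by the first algorithm 233-1 is used as the input signal; when the control signal is the second control signal, the second mode 2 runs and the second target audio 292 output by the second algorithm 233-8 is used as the input signal. When the control signal switches between the first control signal and the second control signal, so that the target audio processing mode switches between the first mode 1 and the second mode 2, the microphone control module 235 may smooth the target audio to avoid a discontinuity caused by switching between the first target audio 291 and the second target audio 292. Specifically, the microphone control module 235 may adjust parameters of the first target audio 291 and the second target audio 292 so that the target audio is continuous. The parameters may be stored in advance in the at least one storage medium 230 and may include amplitude, phase, frequency response, and so on. The adjustment may include adjusting the volume, the EQ equalization, the residual noise, and so on of the target audio. In this way, the target audio remains a continuous signal when the target audio processing mode switches between the first mode 1 and the second mode 2, so that user 002 is unlikely to perceive the switch.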
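One simple way to picture the smoothing at a mode switch is a short linear crossfade from the outgoing target audio to the incoming one. This is a minimal sketch under assumptions: the specification only says that parameters such as amplitude, phase, and frequency response may be adjusted, and does not name crossfading; the function `crossfade` and the parameter `fade_len` are hypothetical.

```python
def crossfade(outgoing, incoming, fade_len):
    """Join two sample sequences, overlapping `fade_len` samples with linear ramps.

    outgoing: samples of the target audio used before the switch
    incoming: samples of the target audio used after the switch
    A linear crossfade is an assumed smoothing technique for illustration.
    """
    if fade_len <= 0 or fade_len > min(len(outgoing), len(incoming)):
        raise ValueError("fade_len must be in 1..min(len(outgoing), len(incoming))")
    head = list(outgoing[:-fade_len])       # untouched part before the overlap
    tail = list(incoming[fade_len:])        # untouched part after the overlap
    start = len(outgoing) - fade_len
    blended = []
    for i in range(fade_len):
        w = (i + 1) / (fade_len + 1)        # weight of the incoming signal, in (0, 1)
        blended.append((1.0 - w) * outgoing[start + i] + w * incoming[i])
    return head + blended + tail
```

For example, fading a constant 1.0 signal into a constant 0.0 signal over three samples yields a stepwise ramp rather than an abrupt jump.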
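In some embodiments, the at least one instruction set may further include speaker control instructions, executed by a speaker control module 237 and configured to adjust the speaker processing signal to obtain the speaker input signal and to output the speaker input signal to the speaker 280 for sound output. The speaker control module 237 may receive the speaker processing signal output by the first algorithm 233-1 or the second algorithm 233-8, together with the control signal. When the control signal is the first control signal, the speaker control module 237 may attenuate or mute the speaker processing signal output by the first algorithm 233-1 before it is output to the speaker 280, so as to lower the sound output by the speaker 280, reduce the echo, and improve the echo cancellation performance of the first algorithm 233-1. When the control signal is the second control signal, the speaker control module 237 may leave the speaker processing signal output by the second algorithm 233-8 unadjusted. When the control signal switches between the first control signal and the second control signal, the speaker control module 237 may smooth the speaker processing signals output by the first algorithm 233-1 and the second algorithm 233-8 to avoid a discontinuity in the sound output by the speaker 280, keeping the transition as continuous as possible so that user 002 is unlikely to perceive the switch.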
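The behavior described above can be sketched as a gain stage whose gain moves gradually toward a target that depends on the control signal. This is an illustrative sketch, not the module's actual implementation; the function `apply_speaker_control`, the values `reduced_gain` and `max_step`, and the encoding of the control signal as the integers 1 and 2 are all assumptions.

```python
def apply_speaker_control(samples, control_signal, gain, reduced_gain=0.25, max_step=0.05):
    """Scale speaker samples, ramping the gain to avoid audible jumps.

    control_signal: 1 for the first control signal (attenuate the speaker),
                    2 for the second control signal (leave it unchanged).
    gain: gain in effect at the end of the previous block.
    Returns (output samples, gain at the end of this block).
    reduced_gain and max_step are hypothetical tuning values.
    """
    target = reduced_gain if control_signal == 1 else 1.0
    out = []
    for x in samples:
        # Move at most max_step per sample toward the target gain.
        if gain < target:
            gain = min(target, gain + max_step)
        elif gain > target:
            gain = max(target, gain - max_step)
        out.append(gain * x)
    return out, gain
```

Carrying the returned gain into the next call keeps the ramp continuous across block boundaries, which is the point of the smoothing step.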
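In the first mode 1, the first algorithm 233-1 prioritizes the voice quality of user 002 as picked up by the near-end microphone module 240. When the speaker processing signal is too strong, the speaker control module 237 processes it to lower the speaker input signal, thereby reducing the sound output by the speaker 280 and the resulting echo, and ensuring near-end voice quality. The second algorithm 233-8 prioritizes the speaker input signal of the speaker 280: it does not use the first audio signal 243 output by the first-type microphone 242, so that the voice quality and intelligibility of the speaker input signal of the speaker 280 are preserved.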
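The at least one processor 220 may be communicatively connected with the at least one storage medium 230, the microphone module 240, and the speaker 280. A communication connection is any form of connection capable of receiving information directly or indirectly. The at least one processor 220 is configured to execute the at least one instruction set. When the system 100 is running, the at least one processor 220 reads the at least one instruction set and, as instructed by it, obtains data from the microphone module 240 and the speaker 280 and executes the audio signal processing method for suppressing echoes provided in this specification. The processor 220 may execute all steps of that method. The processor 220 may take the form of one or more processors. In some embodiments, the processor 220 may include one or more hardware processors, such as a microcontroller, a microprocessor, a reduced instruction set computer (RISC), an application-specific integrated circuit (ASIC), an application-specific instruction-set processor (ASIP), a central processing unit (CPU), a graphics processing unit (GPU), a physics processing unit (PPU), a microcontroller unit, a digital signal processor (DSP), a field-programmable gate array (FPGA), an advanced RISC machine (ARM), a programmable logic device (PLD), any circuit or processor capable of performing one or more functions, or any combination thereof. For illustration only, this specification describes a single processor 220 in the electronic device 200. It should be noted, however, that the electronic device 200 may also include multiple processors, so the operations and/or method steps disclosed in this specification may be performed by one processor as described, or jointly by multiple processors. For example, if the processor 220 of the electronic device 200 is described as performing step A and step B, it should be understood that step A and step B may also be performed jointly or separately by two different processors 220 (for example, a first processor performs step A and a second processor performs step B, or the first and second processors jointly perform steps A and B).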
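In some embodiments, the system 100 may select the target audio processing mode of the electronic device 200 according to the signal strength of the speaker signal. In some embodiments, the system 100 may select the target audio processing mode of the electronic device 200 according to the signal strength of the speaker signal and the microphone signal.
FIG. 5 shows a flowchart of an audio signal processing method P100 for suppressing echoes according to an embodiment of this specification. In the method P100, the system 100 selects the target audio processing mode of the electronic device 200 according to the signal strength of the speaker signal. As shown in FIG. 5, the method P100 may include the following steps, performed by the at least one processor 220: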
S120: Select the target audio processing mode of the electronic device 200 from the first mode 1 and the second mode 2, based at least on the speaker signal. As described above, the target audio processing mode may be one of the first mode 1 and the second mode 2. Specifically, step S120 may include:
S121: Acquire the speaker signal.
S122: Generate a control signal corresponding to the speaker signal, based at least on the strength of the speaker signal. The control signal is either a first control signal or a second control signal. Specifically, the electronic device 200 may receive the speaker signal sent by the control device 400, compare the strength of the speaker signal with a preset speaker threshold, and generate the control signal according to the comparison result. Step S122 may include one of the following cases:
S122-2: Determine that the strength of the speaker signal is lower than the speaker threshold, and generate the first control signal; or
S122-4: Determine that the strength of the speaker signal is higher than the speaker threshold, and generate the second control signal.
Step S120 may further include:
S124: Based on the control signal, select the target audio processing mode corresponding to the control signal. The first control signal corresponds to the first mode 1, and the second control signal corresponds to the second mode 2. When the control signal is the first control signal, the first mode 1 is selected; when the control signal is the second control signal, the second mode 2 is selected.
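Steps S122 and S124 amount to a threshold comparison followed by a fixed mapping from control signal to mode. The following is a minimal sketch under assumptions: the names `ControlSignal`, `generate_control_signal`, and `select_mode` are hypothetical, the strength is treated as a single number, and the case of exact equality (which the specification leaves open) is resolved to the first control signal here purely for illustration.

```python
from enum import Enum


class ControlSignal(Enum):
    FIRST = 1    # corresponds to the first mode 1
    SECOND = 2   # corresponds to the second mode 2


def generate_control_signal(speaker_strength, speaker_threshold):
    """Step S122: compare the speaker signal strength with the preset threshold."""
    if speaker_strength > speaker_threshold:
        return ControlSignal.SECOND   # S122-4: strength above the threshold
    # S122-2: strength below the threshold (equality resolved here by assumption)
    return ControlSignal.FIRST


def select_mode(control_signal):
    """Step S124: map the control signal to the target audio processing mode."""
    return 1 if control_signal is ControlSignal.FIRST else 2
```

For example, with a threshold of 0.5, a strength of 0.8 selects the second mode 2 and a strength of 0.2 selects the first mode 1.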
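When the strength of the speaker signal is higher than the speaker threshold, the first algorithm 233-1 of the first mode 1 cannot remove the echo from the first audio signal 243 and the second audio signal 245 while preserving the human voice signal well, so the voice quality of the resulting first target audio 291 is poor; by contrast, the second target audio 292 obtained by processing the second audio signal 245 with the second algorithm 233-8 of the second mode 2 has good quality. Therefore, when the strength of the speaker signal is higher than the speaker threshold, the electronic device 200 generates the second control signal corresponding to the second mode 2.
When the strength of the speaker signal is lower than the speaker threshold, the first algorithm 233-1 of the first mode 1 can remove the echo from the first audio signal 243 and the second audio signal 245 while preserving the human voice signal well, so the voice quality of the resulting first target audio 291 is good; the second target audio 292 obtained by processing the second audio signal 245 with the second algorithm 233-8 of the second mode 2 also has good quality. Therefore, when the strength of the speaker signal is lower than the speaker threshold, the electronic device 200 generates the first control signal corresponding to the first mode 1, and may alternatively generate the second control signal corresponding to the second mode 2.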
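The control signal is generated by the control module 231. Specifically, the electronic device 200 may monitor the strength of the speaker signal in real time and compare it with the speaker threshold. The electronic device 200 may also check the strength of the speaker signal periodically and compare it with the speaker threshold. Alternatively, the electronic device 200 may compare the speaker signal with the speaker threshold only after detecting that the strength of the speaker signal has changed significantly, by more than a preset range.
When the strength of the speaker signal is higher than the speaker threshold, the electronic device 200 generates the second control signal; when the speaker signal then changes so that its strength falls below the speaker threshold, the electronic device 200 generates the first control signal. Conversely, when the strength of the speaker signal is lower than the speaker threshold, the electronic device 200 generates the first control signal; when the speaker signal then changes so that its strength rises above the speaker threshold, the electronic device 200 generates the second control signal.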
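To keep switching of the control signal from being perceived by user 002, the speaker threshold may be a range rather than a single value. The speaker threshold may lie within the range bounded by a first speaker critical value and a second speaker critical value, where the first speaker critical value is smaller than the second speaker critical value. The strength of the speaker signal being higher than the speaker threshold may then mean that it is higher than the second speaker critical value, and the strength being lower than the speaker threshold may mean that it is lower than the first speaker critical value.
When the speaker signal strength is equal to the speaker threshold, the electronic device 200 may generate either the first control signal or the second control signal. When the speaker signal strength is higher than the second speaker critical value, the electronic device 200 generates the second control signal; when the strength then decreases into the range between the first and second speaker critical values, the electronic device 200 may continue to generate the second control signal. When the speaker signal strength is lower than the first speaker critical value, the electronic device 200 generates the first control signal; when the strength then increases into the range between the first and second speaker critical values, the electronic device 200 may continue to generate the first control signal.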
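The two critical values describe a hysteresis band: the decision changes only when the strength leaves the band, and the previous decision is kept while it is inside. A minimal sketch follows, assuming the control signals are encoded as the integers 1 and 2; the names `update_control_signal`, `track`, `low_critical`, and `high_critical` are hypothetical.

```python
def update_control_signal(strength, low_critical, high_critical, previous):
    """Return 1 (first control signal) or 2 (second control signal).

    low_critical / high_critical stand in for the first and second speaker
    critical values; `previous` is the control signal currently in effect.
    """
    if strength > high_critical:
        return 2
    if strength < low_critical:
        return 1
    return previous  # inside the band: keep the last decision


def track(strengths, low_critical, high_critical, initial=1):
    """Apply the hysteresis rule to a sequence of strength measurements."""
    signal = initial
    history = []
    for s in strengths:
        signal = update_control_signal(s, low_critical, high_critical, signal)
        history.append(signal)
    return history
```

With critical values 0.3 and 0.7, the strengths 0.1, 0.5, 0.9, 0.5, 0.1 produce the control signals 1, 1, 2, 2, 1: the value 0.5 maps to whichever signal was already in effect, so small fluctuations around a single threshold do not cause repeated switching.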
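The electronic device 200 may also obtain a control model through machine learning; the speaker signal is input into the control model, and the control model outputs the control signal.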
The method P100 may further include the following steps, performed by the at least one processor 220:
S140: Process the microphone signal in the target audio processing mode to generate the target audio, so as to at least reduce the echo in the microphone signal. Specifically, step S140 may include one of the following cases:
S142: Determine that the control signal is the first control signal and, using the first algorithm 233-1 of the first mode 1 corresponding to the first control signal, perform signal processing and feature fusion on the first audio signal 243 and the second audio signal 245 based on the speaker input signal to generate the first target audio 291. The details are as described above and are not repeated here.
S144: Determine that the control signal is the second control signal and, using the second algorithm 233-8 of the second mode 2 corresponding to the second control signal, perform signal processing on the second audio signal 245 based on the speaker input signal to generate the second target audio 292. The details are as described above and are not repeated here.
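S160: Output the target audio. The electronic device 200 may output the target audio directly. The electronic device 200 may also smooth the target audio so that user 002 does not perceive a switch of the target audio between the first target audio 291 and the second target audio 292. Specifically, step S160 may include smoothing the target audio and outputting the smoothed target audio.
Specifically, the electronic device 200 may smooth the target audio through the microphone control module 235. When the target audio switches between the first target audio 291 and the second target audio 292, the microphone control module 235 may smooth the junction between them, that is, adjust the first target audio 291 and the second target audio 292 so that the transition at the junction is smooth.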
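The method P100 may further include:
S180: Based on the control signal, control the strength of the speaker input signal of the speaker 280. Specifically, step S180 may be performed by the speaker control module 237. In step S180, the speaker control module 237 may determine that the control signal is the first control signal and then process the speaker processing signal to lower the strength of the speaker input signal fed to the speaker 280. This lowers the strength of the sound output by the speaker 280, reduces the echo signal in the microphone signal, and improves the voice quality of the first target audio 291.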
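Table 1 shows the target audio processing modes resulting from the method of FIG. 5. As shown in Table 1, for ease of comparison the scenarios are divided into four. First: the near-end sound signal is below its threshold (for example, user 002 is not speaking) and the speaker signal does not exceed the speaker threshold. Second: the near-end sound signal is above its threshold (for example, user 002 is speaking) and the speaker signal does not exceed the speaker threshold. Third: the near-end sound signal is below its threshold (for example, user 002 is not speaking) and the speaker signal exceeds the speaker threshold. Fourth: the near-end sound signal is above its threshold (for example, user 002 is speaking) and the speaker signal exceeds the speaker threshold. Whether the near-end sound signal is above its threshold may be determined by the control module 231 from the microphone signal; it may mean that the strength of the audio signal produced by user 002 exceeds a preset threshold. The first and second scenarios correspond to the first mode 1; the third and fourth scenarios correspond to the second mode 2.
In the method P100, the electronic device 200 selects its target audio processing mode according to the speaker signal, so that in each scenario the selected mode yields the best voice quality among the available modes, thereby safeguarding call quality.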
In some embodiments, the selection of the target audio processing mode depends not only on the echo of the speaker signal but also on ambient noise. The ambient noise may be evaluated by at least one of an ambient noise level and a signal-to-noise ratio of the microphone signal.
FIG. 6 shows a flowchart of an audio signal processing method P200 for suppressing echoes according to an embodiment of this specification. In the method P200, the system 100 selects the target audio processing mode of the electronic device 200 according to the signal strength of the speaker signal and the microphone signal. Specifically, the system 100 selects the target audio processing mode according to the speaker signal and at least one of the ambient noise level and the signal-to-noise ratio of the microphone signal. The method P200 may include the following steps, performed by the at least one processor 220:
S220: Select the target audio processing mode of the electronic device 200 from the first mode 1 and the second mode 2, based at least on the speaker signal. Specifically, step S220 may include:
S222: Generate a control signal corresponding to the speaker signal, based at least on the strength of the speaker signal. The control signal is either a first control signal or a second control signal. Specifically, in step S222 the electronic device 200 may generate the control signal based on the strength of the speaker signal and the noise in the microphone signal. Step S222 may include:
S222-2: Acquire the speaker signal and evaluation parameters of the microphone signal. The evaluation parameters may be ambient noise evaluation parameters of the microphone signal, which may include at least one of an ambient noise level and a signal-to-noise ratio. The electronic device 200 may obtain the ambient noise evaluation parameters of the microphone signal through the control module 231. Specifically, the electronic device 200 may obtain them from at least one of the first audio signal 243 and the second audio signal 245, using a noise estimation algorithm to obtain the ambient noise level or the signal-to-noise ratio; this is not described further in this specification.
S222-4: Generate the control signal based on the strength of the speaker signal and the ambient noise evaluation parameters. Specifically, the electronic device 200 may compare the strength of the speaker signal with a preset speaker threshold, compare the ambient noise evaluation parameters with a preset noise evaluation range, and generate the control signal according to the comparison results. Step S222-4 may include one of the following cases:
S222-5: Determine that the strength of the speaker signal is higher than the preset speaker threshold, and generate the second control signal;
S222-6: Determine that the strength of the speaker signal is lower than the speaker threshold and that the ambient noise evaluation parameters are outside the preset noise evaluation range, and generate the first control signal;
S222-7: Determine that the strength of the speaker signal is lower than the speaker threshold and that the ambient noise evaluation parameters are within the noise evaluation range, and generate the first control signal or the second control signal.
The ambient noise evaluation parameters being within the noise evaluation range may include at least one of the following: the ambient noise level is lower than a preset ambient noise threshold, or the signal-to-noise ratio is higher than a preset signal-to-noise ratio threshold. In this case the ambient noise is low. The ambient noise evaluation parameters being outside the noise evaluation range may include at least one of the following: the ambient noise level is higher than the preset ambient noise threshold, or the signal-to-noise ratio is lower than the preset signal-to-noise ratio threshold. In this case the ambient noise is high. When the ambient noise evaluation parameters are outside the noise evaluation range, that is, in a noisy environment, the voice quality of the first target audio 291 is better than that of the second target audio 292. When the ambient noise evaluation parameters are within the noise evaluation range, the voice quality of the first target audio 291 differs little from that of the second target audio 292.
Step S220 may further include:
S224: Based on the control signal, select the target audio processing mode corresponding to the control signal. The first control signal corresponds to the first mode 1, and the second control signal corresponds to the second mode 2. When the control signal is the first control signal, the first mode 1 is selected; when the control signal is the second control signal, the second mode 2 is selected.
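When the strength of the speaker signal is higher than the speaker threshold, the first algorithm 233-1 of the first mode 1 cannot remove the echo from the first audio signal 243 and the second audio signal 245 while preserving the human voice signal well, so the voice quality of the resulting first target audio 291 is poor; by contrast, the second target audio 292 obtained by processing the second audio signal 245 with the second algorithm 233-8 of the second mode 2 has good quality. Therefore, when the strength of the speaker signal is higher than the speaker threshold, the electronic device 200 generates the second control signal corresponding to the second mode 2 regardless of the ambient noise.
When the strength of the speaker signal is lower than the speaker threshold, the first algorithm 233-1 of the first mode 1 can remove the echo from the first audio signal 243 and the second audio signal 245 while preserving the human voice signal well, so the voice quality of the resulting first target audio 291 is good; the second target audio 292 obtained by processing the second audio signal 245 with the second algorithm 233-8 of the second mode 2 also has good quality. Therefore, when the strength of the speaker signal is lower than the speaker threshold, the control signal generated by the electronic device 200 depends on the ambient noise.
When the ambient noise level is higher than the ambient noise threshold or the signal-to-noise ratio is lower than the signal-to-noise ratio threshold, the ambient noise in the microphone signal is high. In that case, the first algorithm 233-1 of the first mode 1 can reduce the noise in the first audio signal 243 and the second audio signal 245 while preserving the human voice signal well, so the voice quality of the resulting first target audio 291 is good, whereas the voice quality of the second target audio 292 obtained by processing the second audio signal 245 with the second algorithm 233-8 of the second mode 2 is worse than that of the first target audio 291. Therefore, when the strength of the speaker signal is lower than the speaker threshold, and the ambient noise level is higher than the ambient noise threshold or the signal-to-noise ratio is lower than the signal-to-noise ratio threshold, the electronic device 200 generates the first control signal corresponding to the first mode 1.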
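It should be noted that when the ambient noise is low, that is, when the ambient noise evaluation parameters are within the noise evaluation range, the voice quality of the first target audio 291 differs little from that of the second target audio 292. In that case, the electronic device 200 may always generate the second control signal, selecting the second algorithm 233-8 of the second mode 2 to process the second audio signal 245, which reduces computation and saves resources while maintaining the voice quality of the target audio.
When the ambient noise level is lower than the ambient noise threshold or the signal-to-noise ratio is higher than the signal-to-noise ratio threshold, the ambient noise in the microphone signal is low. Both the first target audio 291, obtained by processing the first audio signal 243 and the second audio signal 245 with the first algorithm 233-1 of the first mode 1, and the second target audio 292, obtained by processing the second audio signal 245 with the second algorithm 233-8 of the second mode 2, then have good voice quality. Therefore, when the strength of the speaker signal is lower than the speaker threshold, and the ambient noise level is lower than the ambient noise threshold or the signal-to-noise ratio is higher than the signal-to-noise ratio threshold, the electronic device 200 generates either the first control signal or the second control signal. Specifically, the electronic device 200 may determine the control signal for the current scenario from the control signal of the previous scenario: if it generated the first control signal in the previous scenario, it also generates the first control signal in the current scenario, and likewise for the second control signal, which keeps the signal continuous.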
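The three cases of step S222-4, together with the option of keeping the previous control signal in the low-noise case, can be sketched as a small decision function. This is an illustrative sketch under assumptions: the control signals are encoded as the integers 1 and 2, the noise evaluation is reduced to a single boolean `noisy` (true when the ambient noise evaluation parameters are outside the noise evaluation range), and the function name `control_signal_p200` is hypothetical.

```python
def control_signal_p200(speaker_strength, speaker_threshold, noisy, previous):
    """Return 1 (first control signal) or 2 (second control signal).

    noisy: True when the ambient noise evaluation parameters are outside
           the preset noise evaluation range (high ambient noise).
    previous: control signal generated for the previous scenario.
    """
    if speaker_strength > speaker_threshold:
        return 2          # S222-5: strong speaker signal, regardless of noise
    if noisy:
        return 1          # S222-6: weak speaker signal, high ambient noise
    # S222-7: weak speaker signal, low ambient noise. Either signal is allowed;
    # keeping the previous one preserves continuity. Always returning 2 here
    # would instead favor lower computation, as the text also permits.
    return previous
```

The choice made in the last branch is a design decision the specification leaves open; the sketch picks continuity.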
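The control signal is generated by the control module 231. Specifically, the electronic device 200 may monitor the strength of the speaker signal and the ambient noise evaluation parameters in real time and compare them with the speaker threshold and the noise evaluation range. The electronic device 200 may also check them periodically and perform the same comparison. Alternatively, the electronic device 200 may perform the comparison only after detecting that the strength of the speaker signal or the ambient noise evaluation parameters have changed significantly, by more than a preset range.
To keep switching of the control signal from being perceived by user 002, the speaker threshold, the ambient noise threshold, and the signal-to-noise ratio threshold may each be a range. The speaker threshold is as described above and is not repeated here. The ambient noise threshold may lie within the range bounded by a first noise critical value and a second noise critical value, where the first noise critical value is smaller than the second noise critical value. The ambient noise level being higher than the ambient noise threshold may mean that it is higher than the second noise critical value, and the ambient noise level being lower than the ambient noise threshold may mean that it is lower than the first noise critical value. The signal-to-noise ratio threshold may lie within the range bounded by a first signal-to-noise ratio critical value and a second signal-to-noise ratio critical value, where the first signal-to-noise ratio critical value is smaller than the second. The signal-to-noise ratio being higher than the signal-to-noise ratio threshold may mean that it is higher than the second signal-to-noise ratio critical value, and the signal-to-noise ratio being lower than the threshold may mean that it is lower than the first signal-to-noise ratio critical value.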
The method P200 may include the following steps, performed by the at least one processor 220:
S240: Process the microphone signal in the target audio processing mode to generate the target audio, so as to at least reduce the echo in the microphone signal. Specifically, step S240 may include one of the following cases:
S242: Determine that the control signal is the first control signal, select the first mode 1, and perform signal processing on the first audio signal 243 and the second audio signal 245 to generate the first target audio 291. Step S242 may be the same as step S142 and is not described again here.
S244: Determine that the control signal is the second control signal, select the second mode 2, and perform echo suppression on the second audio signal 245 to generate the second target audio 292. Step S244 may be the same as step S144 and is not described again here.
S260: Output the target audio. Step S260 may be the same as step S160 and is not described again here.
The method P200 may further include:
S280: Based on the control signal, control the strength of the speaker input signal of the speaker 280. Step S280 may be the same as step S180 and is not described again here.
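Table 2 shows the target audio processing modes resulting from the method of FIG. 6. As shown in Table 2, for ease of comparison the scenarios are divided into eight. First: the near-end sound signal is below its threshold (for example, user 002 is not speaking), the speaker signal does not exceed the speaker threshold, and the ambient noise is low. Second: the near-end sound signal is above its threshold (for example, user 002 is speaking), the speaker signal does not exceed the speaker threshold, and the ambient noise is low. Third: the near-end sound signal is below its threshold, the speaker signal exceeds the speaker threshold, and the ambient noise is low. Fourth: the near-end sound signal is above its threshold, the speaker signal exceeds the speaker threshold, and the ambient noise is low. Fifth: the near-end sound signal is below its threshold, the speaker signal does not exceed the speaker threshold, and the ambient noise is high. Sixth: the near-end sound signal is above its threshold, the speaker signal does not exceed the speaker threshold, and the ambient noise is high. Seventh: the near-end sound signal is below its threshold, the speaker signal exceeds the speaker threshold, and the ambient noise is high. Eighth: the near-end sound signal is above its threshold, the speaker signal exceeds the speaker threshold, and the ambient noise is high. Whether the near-end sound signal is above its threshold may be determined by the control module 231 from the microphone signal; it may mean that the strength of the audio signal produced by user 002 exceeds a preset threshold. The fifth and sixth scenarios correspond to the first mode 1; the third, fourth, seventh, and eighth scenarios correspond to the second mode 2; the remaining scenarios correspond to either the first mode 1 or the second mode 2.
The method P200 controls the target audio processing mode of the electronic device 200 according to both the speaker signal and the near-end ambient noise signal, so that in each scenario the voice signal output by the electronic device 200 has the best voice quality among the available modes, thereby safeguarding call quality.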
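In some embodiments, the selection of the target audio processing mode depends not only on the echo of the speaker signal and the ambient noise but also on the voice signal of user 002 when speaking. The ambient noise signal may be evaluated by at least one of an ambient noise level and a signal-to-noise ratio of the microphone signal. The voice signal of user 002 when speaking may be evaluated by the human voice signal strength in the microphone signal. The human voice signal strength may be obtained by a noise estimation algorithm, or may be the strength of the audio signal obtained after noise reduction.
FIG. 7 shows a flowchart of an audio signal processing method P300 for suppressing echoes according to an embodiment of this specification. In the method P300, the system 100 selects the target audio processing mode of the electronic device 200 according to the signal strength of the speaker signal and the microphone signal. Specifically, the system 100 selects the target audio processing mode according to the speaker signal, the human voice signal strength in the microphone signal, and at least one of the ambient noise level and the signal-to-noise ratio. The method P300 may include the following steps, performed by the at least one processor 220: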
S320: Select the target audio processing mode of the electronic device 200 from the first mode 1 and the second mode 2, based at least on the speaker signal. Specifically, step S320 may include:
S322: Generate a control signal corresponding to the speaker signal, based at least on the strength of the speaker signal. The control signal is either a first control signal or a second control signal. In step S322, the electronic device 200 may generate the control signal based on the strength of the speaker signal, the noise in the microphone signal, and the human voice signal strength in the microphone signal. Specifically, step S322 may include:
S322-2: Acquire the speaker signal and evaluation parameters of the microphone signal. The evaluation parameters may include ambient noise evaluation parameters of the microphone signal and may also include the human voice signal strength in the microphone signal. The ambient noise evaluation parameters may include at least one of an ambient noise level and a signal-to-noise ratio. The electronic device 200 may obtain the ambient noise evaluation parameters and the human voice signal strength through the control module 231. Specifically, the electronic device 200 may obtain the evaluation parameters from at least one of the first audio signal 243 and the second audio signal 245, using a noise estimation algorithm to obtain the human voice signal strength, the ambient noise level, and the signal-to-noise ratio; this is not described further in this specification.
S322-4: Generate the control signal based on the strength of the speaker signal and the evaluation parameters. Specifically, the electronic device 200 may compare the strength of the speaker signal with a preset speaker threshold, compare the ambient noise evaluation parameters with a preset noise evaluation range, compare the human voice signal strength with a preset human voice threshold, and generate the control signal according to the comparison results. Step S322-4 may include one of the following cases:
S322-5: Determine that the strength of the speaker signal is higher than the preset speaker threshold, that the human voice signal strength exceeds the human voice threshold, and that the ambient noise evaluation parameters are outside the preset noise evaluation range, and generate the first control signal;
S322-6: Determine that the strength of the speaker signal is higher than the speaker threshold, that the human voice signal strength exceeds the human voice threshold, and that the ambient noise evaluation parameters are within the noise evaluation range, and generate the second control signal;
S322-7: Determine that the strength of the speaker signal is higher than the speaker threshold and that the human voice signal strength is lower than the human voice threshold, and generate the second control signal;
S322-8: Determine that the strength of the speaker signal is lower than the speaker threshold and that the ambient noise evaluation parameters are outside the noise evaluation range, and generate the first control signal;
S322-9: Determine that the strength of the speaker signal is lower than the speaker threshold and that the ambient noise evaluation parameters are within the noise evaluation range, and generate the first control signal or the second control signal.
The ambient noise evaluation parameters being within the noise evaluation range may include at least one of the following: the ambient noise level is lower than a preset ambient noise threshold, or the signal-to-noise ratio is higher than a preset signal-to-noise ratio threshold. In this case the ambient noise is low. The ambient noise evaluation parameters being outside the noise evaluation range may include at least one of the following: the ambient noise level is higher than the preset ambient noise threshold, or the signal-to-noise ratio is lower than the preset signal-to-noise ratio threshold. In this case the ambient noise is high. When the ambient noise evaluation parameters are outside the noise evaluation range, that is, in a noisy environment, the voice quality of the first target audio 291 is better than that of the second target audio 292. When the ambient noise evaluation parameters are within the noise evaluation range, the voice quality of the first target audio 291 differs little from that of the second target audio 292. The speaker threshold, the ambient noise threshold, and the signal-to-noise ratio threshold are as described above and are not repeated here.
The human voice signal strength exceeding the human voice threshold indicates that user 002 is speaking. In that case, to ensure the voice quality of user 002, the electronic device 200 may generate the first control signal and lower the speaker signal so as to ensure the voice quality of the first target audio 291.
The speaker threshold, the ambient noise threshold, the signal-to-noise ratio threshold, and the human voice threshold may be stored in advance in the electronic device 200.
Step S320 may further include:
S324: Based on the control signal, select the target audio processing mode corresponding to the control signal. The first control signal corresponds to the first mode 1, and the second control signal corresponds to the second mode 2. When the control signal is the first control signal, the first mode 1 is selected; when the control signal is the second control signal, the second mode 2 is selected.
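When the strength of the speaker signal is higher than the speaker threshold, the human voice signal strength exceeds the human voice threshold, and the ambient noise evaluation parameters are outside the preset noise evaluation range, user 002 is speaking while both the echo and the noise are strong. To ensure the voice quality and intelligibility of user 002, the electronic device 200 may lower or even switch off the speaker input signal fed to the speaker 280, reducing the echo in the microphone signal and ensuring the voice quality of the target audio. In this case, the first target audio 291 obtained by processing the first audio signal 243 and the second audio signal 245 with the first algorithm 233-1 of the first mode 1 has better voice quality than the second target audio 292 obtained by processing the second audio signal 245 with the second algorithm 233-8 of the second mode 2. The electronic device 200 therefore generates the first control signal corresponding to the first mode 1. In this way the electronic device 200 can ensure the voice quality and intelligibility of the near-end user 002. Although part of the speaker input signal is lost, the electronic device 200 can retain most of the voice quality and intelligibility of the speaker input signal, improving the quality of voice communication for both parties.
When the strength of the speaker signal is higher than the speaker threshold and either the human voice signal strength is lower than the human voice threshold, or the human voice signal strength exceeds the human voice threshold while the ambient noise evaluation parameters are within the preset noise evaluation range, user 002 is either not speaking or speaking in low noise. In this case, the first target audio 291 obtained by processing the first audio signal 243 and the second audio signal 245 with the first algorithm 233-1 of the first mode 1 has worse voice quality than the second target audio 292 obtained by processing the second audio signal 245 with the second algorithm 233-8 of the second mode 2. The electronic device 200 therefore generates the second control signal corresponding to the second mode 2.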
The other cases of step S322-4 are essentially the same as in step S222-4 and are not repeated here.
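The five cases S322-5 to S322-9 can be collected into one decision function. The following is a minimal sketch under assumptions: each comparison is reduced to a boolean, the control signals are encoded as the integers 1 and 2, the "either signal" case keeps the previous control signal, and the function name `control_signal_p300` is hypothetical.

```python
def control_signal_p300(speaker_high, voice_present, noisy, previous):
    """Return 1 (first control signal) or 2 (second control signal).

    speaker_high:  speaker signal strength is above the speaker threshold
    voice_present: human voice signal strength exceeds the human voice threshold
    noisy:         ambient noise evaluation parameters are outside the range
    previous:      control signal generated for the previous scenario
    """
    if speaker_high:
        if voice_present and noisy:
            return 1      # S322-5: user speaking, strong echo, high noise
        return 2          # S322-6 (speaking, low noise) and S322-7 (not speaking)
    if noisy:
        return 1          # S322-8: weak speaker signal, high noise
    return previous       # S322-9: either signal is allowed; keep continuity
```

Compared with the case analysis of method P200, the only additional branch is the first one, where a speaking user in a noisy environment takes priority over a strong speaker signal.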
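The control signal is generated by the control module 231. Specifically, the electronic device 200 may monitor the strength of the speaker signal and the evaluation parameters in real time and compare them with the speaker threshold, the noise evaluation range, and the human voice threshold. The electronic device 200 may also check them periodically and perform the same comparison. Alternatively, the electronic device 200 may perform the comparison only after detecting that the strength of the speaker signal or the evaluation parameters have changed significantly, by more than a preset range.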
The method P300 may include the following steps, performed by the at least one processor 220:
S340: Process the microphone signal in the target audio processing mode to generate the target audio, so as to at least reduce the echo in the microphone signal. Specifically, step S340 may include one of the following cases:
S342: Determine that the control signal is the first control signal, select the first mode 1, and perform signal processing on the first audio signal 243 and the second audio signal 245 to generate the first target audio 291. Step S342 may be the same as step S142 and is not described again here.
S344: Determine that the control signal is the second control signal, select the second mode 2, and perform signal processing on the second audio signal 245 to generate the second target audio 292. Step S344 may be the same as step S144 and is not described again here.
The method P300 may also include the following step, performed by the at least one processor 220:
S360: Output the target audio. Step S360 may be the same as step S160 and is not described again here.
The method P300 may further include:
S380: Based on the control signal, control the strength of the speaker input signal of the speaker 280. Step S380 may be the same as step S180 and is not described again here.
Table 3 shows the target audio processing mode results corresponding to FIG. 7. As shown in Table 3, for ease of comparison, the scenarios are divided into eight cases. First: the near-end sound signal is below the threshold (for example, the user 002 is not speaking), the speaker signal does not exceed the speaker threshold, and the ambient noise is low. Second: the near-end sound signal is above the threshold (for example, the user 002 is speaking), the speaker signal does not exceed the speaker threshold, and the ambient noise is low. Third: the near-end sound signal is below the threshold, the speaker signal exceeds the speaker threshold, and the ambient noise is low. Fourth: the near-end sound signal is above the threshold, the speaker signal exceeds the speaker threshold, and the ambient noise is low. Fifth: the near-end sound signal is below the threshold, the speaker signal does not exceed the speaker threshold, and the ambient noise is high. Sixth: the near-end sound signal is above the threshold, the speaker signal does not exceed the speaker threshold, and the ambient noise is high. Seventh: the near-end sound signal is below the threshold, the speaker signal exceeds the speaker threshold, and the ambient noise is high. Eighth: the near-end sound signal is above the threshold, the speaker signal exceeds the speaker threshold, and the ambient noise is high. Whether the near-end sound signal is above the threshold may be determined by the control module 231 based on the microphone signal. The near-end sound signal being above the threshold may mean that the strength of the audio signal produced by the user 002 exceeds a preset threshold. The target audio processing modes for the eight scenarios are as follows: the fifth, sixth, and eighth scenarios correspond to the first mode 1; the third, fourth, and seventh scenarios correspond to the second mode 2; and the remaining scenarios (the first and second) correspond to either the first mode 1 or the second mode 2.
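The scenario-to-mode mapping described for Table 3 can be sketched as a small lookup. This is only an illustrative sketch: the names `Mode`, `scenario_index`, and `target_mode` are hypothetical and do not appear in the specification, and the three boolean inputs are assumed to have been computed elsewhere (for example, by comparing signal strengths against their thresholds).

```python
from enum import Enum


class Mode(Enum):
    """Target audio processing modes (names are illustrative)."""
    FIRST = 1    # first mode 1
    SECOND = 2   # second mode 2
    EITHER = 0   # the text allows either the first or the second mode


def scenario_index(near_end_above: bool, speaker_above: bool, noise_high: bool) -> int:
    """Map the three threshold decisions to the scenario number 1..8 used in the text."""
    return 1 + int(near_end_above) + 2 * int(speaker_above) + 4 * int(noise_high)


# Mapping as stated in the description of Table 3:
# scenarios 5, 6, 8 -> first mode; 3, 4, 7 -> second mode; 1, 2 -> either.
TABLE_3 = {
    1: Mode.EITHER, 2: Mode.EITHER,
    3: Mode.SECOND, 4: Mode.SECOND,
    5: Mode.FIRST, 6: Mode.FIRST,
    7: Mode.SECOND, 8: Mode.FIRST,
}


def target_mode(near_end_above: bool, speaker_above: bool, noise_high: bool) -> Mode:
    """Look up the target audio processing mode for one scenario."""
    return TABLE_3[scenario_index(near_end_above, speaker_above, noise_high)]
```

For instance, `target_mode(True, True, True)` corresponds to the eighth scenario and returns `Mode.FIRST`. Using a lookup table rather than nested conditionals keeps the code in one-to-one correspondence with the rows of Table 3.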
It should be noted that the method P200 and the method P300 suit different application scenarios. Where the speaker signal matters more than the near-end voice quality, the method P200 may be selected to preserve the quality and intelligibility of the speaker signal. Where the near-end voice quality matters more than the speaker signal, the method P300 may be selected to preserve the quality and intelligibility of the near-end voice.
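In summary, the system 100, the method P100, the method P200, and the method P300 can, for different scenarios, control the target audio processing mode of the electronic device 200 according to the speaker signal, and thereby control the sound source signal of the electronic device 200, so that the target audio has the best voice quality in each scenario and the quality of voice communication is improved.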
It should be noted that the signal strength of ambient noise differs across frequencies, and so does the voice quality of the first target audio 291 and the second target audio 292. For example, at a first frequency, the first target audio 291, obtained by processing the first audio signal 243 and the second audio signal 245 with the first algorithm 233-1, has better voice quality than the second target audio 292, obtained by processing the second audio signal 245 with the second algorithm 233-8. At frequencies other than the first frequency, the voice quality of the first target audio 291 is close to that of the second target audio 292. In this case, the electronic device 200 may also generate the control signal according to the frequency of the ambient noise: it generates the first control signal at the first frequency and the second control signal at frequencies other than the first frequency.
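The frequency-dependent rule above can be sketched as follows. This is a minimal illustration under stated assumptions: the signal is assumed to be already split into frequency bands, the "first frequency" is assumed to be given as a range in Hz, and the names `FIRST_CONTROL`, `SECOND_CONTROL`, and `control_signal_per_band` are hypothetical, not taken from the specification.

```python
FIRST_CONTROL = 1   # stands in for the first control signal (illustrative constant)
SECOND_CONTROL = 2  # stands in for the second control signal (illustrative constant)


def control_signal_per_band(band_centers_hz, first_range_hz):
    """Return one control signal per band.

    Bands whose center frequency lies inside first_range_hz (the assumed
    "first frequency" range) get the first control signal; all other bands
    get the second control signal.
    """
    low_hz, high_hz = first_range_hz
    return [
        FIRST_CONTROL if low_hz <= center <= high_hz else SECOND_CONTROL
        for center in band_centers_hz
    ]
```

With band centers of 250 Hz, 1000 Hz, and 4000 Hz and an assumed first-frequency range of 500 Hz to 2000 Hz, only the 1000 Hz band would receive the first control signal.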
When the ambient noise is low-frequency noise (for example, on a subway or a bus), the first target audio 291, obtained by processing the first audio signal 243 and the second audio signal 245 with the first algorithm 233-1, may have poor speech quality at low frequencies; that is, the speech intelligibility of the first target audio 291 is poor at low frequencies and higher at high frequencies. In this case, the electronic device 200 may control the selection of the target audio processing mode according to the frequency of the ambient noise. For example, in the low-frequency range, the electronic device 200 may select the method P300 to control the target audio processing mode, ensuring that the voice of the near-end user 002 is picked up and thereby preserving near-end voice quality; in the high-frequency range, the electronic device 200 may select the method P200 to control the target audio processing mode, ensuring that the near-end user 002 can hear the speaker signal.
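A rough sketch of this band-wise method selection is given below. The specification does not say how low-frequency noise is detected or where the low and high ranges meet, so the energy-ratio test, the `cutoff_hz` boundary, the `ratio` threshold, and the function names are all assumptions made for illustration only.

```python
def is_low_frequency_noise(band_energy, cutoff_hz, ratio=0.6):
    """Guess whether ambient noise is dominated by low frequencies.

    band_energy maps a band center frequency in Hz to the noise energy in
    that band. The noise counts as low-frequency when at least `ratio` of
    the total energy lies below cutoff_hz (an assumed criterion).
    """
    total = sum(band_energy.values())
    if total <= 0:
        return False
    low = sum(energy for center, energy in band_energy.items() if center < cutoff_hz)
    return low / total >= ratio


def methods_per_band(band_centers_hz, cutoff_hz):
    """Assign method P300 to bands below cutoff_hz and method P200 to the rest."""
    return {
        center: "P300" if center < cutoff_hz else "P200"
        for center in band_centers_hz
    }
```

A caller might first check `is_low_frequency_noise(noise, 1000.0)` and, only if it returns `True`, apply `methods_per_band(noise, 1000.0)`; what to do otherwise is left open here because the text does not address that case.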
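Another aspect of this specification provides a non-transitory storage medium storing at least one set of executable instructions for sound source signal control. When executed by a processor, the executable instructions direct the processor to perform the steps of the audio signal processing method for echo suppression described in this specification. In some possible implementations, aspects of this specification may also be implemented as a program product that includes program code. When the program product runs on the electronic device 200, the program code causes the electronic device 200 to perform the sound source signal control steps described in this specification. The program product for implementing the above method may take the form of a portable compact disc read-only memory (CD-ROM) containing the program code and may run on the electronic device 200. However, the program product of this specification is not limited to this; in this specification, a readable storage medium may be any tangible medium that contains or stores a program, and the program may be used by or in connection with an instruction execution system (for example, the processor 220). The program product may use any combination of one or more readable media. A readable medium may be a readable signal medium or a readable storage medium. A readable storage medium may be, for example but not limited to, an electrical, magnetic, optical, electromagnetic, infrared, or semiconductor system, apparatus, or device, or any combination of the above. More specific examples of readable storage media include: an electrical connection with one or more wires, a portable disk, a hard disk, random access memory (RAM), read-only memory (ROM), erasable programmable read-only memory (EPROM or flash memory), an optical fiber, a portable compact disc read-only memory (CD-ROM), an optical storage device, a magnetic storage device, or any suitable combination of the above. A readable signal medium may include a data signal propagated in baseband or as part of a carrier wave, carrying readable program code. Such a propagated data signal may take many forms, including but not limited to an electromagnetic signal, an optical signal, or any suitable combination of the above. A readable signal medium may also be any readable medium other than a readable storage medium that can send, propagate, or transport a program for use by or in connection with an instruction execution system, apparatus, or device. Program code contained on a readable medium may be transmitted over any suitable medium, including but not limited to wireless, wired, optical cable, RF, or any suitable combination of the above. Program code for performing the operations of this specification may be written in any combination of one or more programming languages, including object-oriented programming languages such as Java and C++, and conventional procedural programming languages such as C or similar languages. The program code may execute entirely on the electronic device 200, partly on the electronic device 200, as a stand-alone software package, partly on the electronic device 200 and partly on a remote computing device, or entirely on a remote computing device.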
The foregoing describes specific embodiments of this specification. Other embodiments are within the scope of the appended claims. In some cases, the actions or steps recited in the claims may be performed in an order different from that in the embodiments and still achieve the desired results. In addition, the processes depicted in the drawings do not necessarily require the particular order shown, or a sequential order, to achieve the desired results. In some implementations, multitasking and parallel processing are also possible or may be advantageous.
In summary, after reading this detailed disclosure, those skilled in the art will understand that the foregoing detailed disclosure is presented by way of example only and is not limiting. Although not explicitly stated here, those skilled in the art will understand that this specification is intended to cover various reasonable changes, improvements, and modifications to the embodiments. Such changes, improvements, and modifications are intended to be suggested by this specification and are within the spirit and scope of its exemplary embodiments.
In addition, certain terms in this specification have been used to describe its embodiments. For example, "one embodiment," "an embodiment," and/or "some embodiments" mean that a particular feature, structure, or characteristic described in connection with that embodiment may be included in at least one embodiment of this specification. It is therefore emphasized, and should be understood, that two or more references to "an embodiment," "one embodiment," or "an alternative embodiment" in various parts of this specification do not necessarily all refer to the same embodiment. Furthermore, particular features, structures, or characteristics may be combined as appropriate in one or more embodiments of this specification.
It should be understood that, in the foregoing description of the embodiments, this specification groups various features into a single embodiment, drawing, or description thereof in order to aid understanding of a feature and to simplify the specification. This does not mean that the combination of these features is required; a person skilled in the art reading this specification may well extract some of the features and understand them as a separate embodiment. In other words, the embodiments in this specification may also be understood as a combination of multiple sub-embodiments, and each sub-embodiment remains valid even when it contains fewer than all the features of a single previously disclosed embodiment.
Each patent, patent application, publication of a patent application, and other material cited herein, such as articles, books, specifications, publications, documents, and things, may be incorporated herein by reference in its entirety for all purposes, except for any prosecution file history associated with it, any of it that is inconsistent with or in conflict with this document, and any of it that may have a limiting effect on the broadest scope of the claims now or later associated with this document. For example, if there is any inconsistency or conflict between the description, definition, and/or use of a term associated with any of the incorporated material and that associated with this document, the description, definition, and/or use of the term in this document shall prevail.
Finally, it should be understood that the embodiments of the application disclosed herein illustrate the principles of the embodiments of this specification. Other modified embodiments are also within the scope of this specification. The embodiments disclosed in this specification are therefore examples only and not limitations. Those skilled in the art may adopt alternative configurations based on the embodiments in this specification to implement the application in this specification. Accordingly, the embodiments of this specification are not limited to those precisely described in the application.
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN202011581826.3A (CN114697785B) | 2020-12-28 | 2020-12-28 | Audio signal processing method and system for echo suppression |
| Publication Number | Publication Date |
|---|---|
| CN114697785A | 2022-07-01 |
| CN114697785B | 2025-03-14 |
| Code | Title |
|---|---|
| PB01 | Publication |
| SE01 | Entry into force of request for substantive examination |
| GR01 | Patent grant |