CN108520760B

Movatterモバイル変換

Info

Publication number: CN108520760B
Application number: CN201810259017.7A
Authority: CN
Inventors: 符升升
Original assignee: Vivo Mobile Communication Co Ltd
Current assignee: Vivo Mobile Communication Co Ltd
Priority date: 2018-03-27
Filing date: 2018-03-27
Publication date: 2020-07-24
Anticipated expiration: 2038-03-27
Also published as: CN108520760A

Abstract

The embodiment of the invention discloses a voice signal processing method and a terminal. The method is applied to the terminal and comprises the following steps: the method comprises the steps of detecting voice signals received by a terminal in the process of voice communication, determining the content corresponding to a first voice signal according to a second voice signal when detecting that the first voice signal in the voice signals does not meet parameter conditions, wherein the second voice signal comprises at least one of the voice signal with preset duration received before and the voice signal with preset duration received after the first voice signal in the voice signals, and displaying the content corresponding to the first voice signal on a display interface for a user to refer to, so that the mutual interruption of the two voice communication parties and the operation of repeating the voice content are reduced, the smooth operation of the voice communication is ensured, and the user experience is improved.

Description

Translated fromChinese

一种语音信号处理方法及终端A kind of voice signal processing method and terminal

技术领域technical field

本发明实施例涉及信息处理技术领域，尤其涉及一种语音信号处理方法及终端。Embodiments of the present invention relate to the technical field of information processing, and in particular, to a voice signal processing method and terminal.

背景技术Background technique

社交软件具有实时交互的功能，利用网络实现，用户互为好友后可以进行短信、语音、视频等多种形式的实时交互操作。Social software has the function of real-time interaction, which is realized by using the network. After users become friends with each other, they can conduct real-time interactive operations in various forms such as text messages, voice, and video.

当用户使用社交软件与其他好友进行语音通话时，若终端所连接的网络出现波动，则语音信号会发生畸变，导致用户无法听清楚好友的语音信息。这时用户为知晓好友的语音信息，通常会中断与好友的语音通话，让好友重新复述语音信息，然而语音信息的复述操作增加了好友的工作量，增加了语音通话的耗时，降低了用户体验。When a user uses a social software to make a voice call with other friends, if the network connected to the terminal fluctuates, the voice signal will be distorted, so that the user cannot hear the friend's voice information clearly. At this time, in order to know the voice information of the friend, the user usually interrupts the voice call with the friend and asks the friend to repeat the voice information. However, the repeat operation of the voice information increases the workload of the friend, increases the time-consuming of voice calls, and reduces the user experience.

发明内容SUMMARY OF THE INVENTION

本发明提供了一种语音信号处理方法，以解决语音信号发生畸变时，语音信息的复述操作增加了好友的工作量，增加了语音通话的耗时，降低了用户体验的问题。The present invention provides a voice signal processing method to solve the problem that when the voice signal is distorted, the repetition operation of voice information increases the workload of friends, increases the time consuming of voice calls, and reduces user experience.

第一方面，提供了一种语音信号处理方法，应用于终端，包括：In a first aspect, a voice signal processing method is provided, applied to a terminal, including:

在进行语音通话的过程中，对所述终端接收的语音信号进行检测；In the process of making a voice call, the voice signal received by the terminal is detected;

当检测到所述语音信号中的第一语音信号不满足参数条件时，根据第二语音信号确定所述第一语音信号对应的内容，所述第二语音信号包括所述语音信号中在所述第一语音信号之前接收到的预设时长的语音信号和之后接收到的预设时长的语音信号中的至少一种；When it is detected that the first voice signal in the voice signal does not meet the parameter condition, the content corresponding to the first voice signal is determined according to the second voice signal, and the second voice signal includes at least one of a voice signal of a preset duration received before the first voice signal and a voice signal of a preset duration received after;

将所述第一语音信号对应的内容显示在显示界面上。The content corresponding to the first voice signal is displayed on the display interface.

第二方面，提供了一种终端，包括：In a second aspect, a terminal is provided, including:

信号检测模块，用于在进行语音通话的过程中，对所述终端接收的语音信号进行检测；a signal detection module, configured to detect the voice signal received by the terminal during a voice call;

内容确定模块，用于当检测到所述语音信号中的第一语音信号不满足参数条件时，根据第二语音信号确定所述第一语音信号对应的内容，所述第二语音信号包括所述语音信号中在所述第一语音信号之前接收到的预设时长的语音信号和之后接收到的预设时长的语音信号中的至少一种；a content determination module, configured to determine the content corresponding to the first voice signal according to the second voice signal when it is detected that the first voice signal in the voice signal does not meet the parameter condition, the second voice signal includes the at least one of a voice signal of a preset duration received before the first voice signal and a voice signal of a preset duration received after the voice signal;

内容显示模块，用于将所述第一语音信号对应的内容显示在显示界面上。The content display module is used for displaying the content corresponding to the first voice signal on the display interface.

这样，本发明实施例中，在进行语音通话的过程中，对终端接收的语音信号进行检测，检测到语音信号中的第一语音信号不满足参数条件时，判定第一语音信号发生畸变，随后根据在第一语音信号之前或之后接收到的第二语音信号，确定第一语音信号对应的内容并将该内容显示在显示界面上供用户参考，从而减少了语音通话双方的相互打断以及复述语音内容的操作，保证了语音通话的顺畅进行，提高了用户体验。In this way, in the embodiment of the present invention, in the process of making a voice call, the voice signal received by the terminal is detected, and when it is detected that the first voice signal in the voice signal does not meet the parameter conditions, it is determined that the first voice signal is distorted, and then the first voice signal is distorted. According to the second voice signal received before or after the first voice signal, the content corresponding to the first voice signal is determined and displayed on the display interface for the user's reference, thereby reducing the mutual interruption and repetition of the two parties in the voice call The operation of the voice content ensures the smooth progress of the voice call and improves the user experience.

上述说明仅是本发明技术方案的概述，为了能够更清楚了解本发明的技术手段，而可依照说明书的内容予以实施，并且为了让本发明的上述和其它目的、特征和优点能够更明显易懂，以下特举本发明的具体实施方式。The above description is only an overview of the technical solutions of the present invention, in order to be able to understand the technical means of the present invention more clearly, it can be implemented according to the content of the description, and in order to make the above and other purposes, features and advantages of the present invention more obvious and easy to understand , the following specific embodiments of the present invention are given.

附图说明Description of drawings

为了更清楚地说明本发明实施例的技术方案，下面将对本发明实施例的描述中所需要使用的附图作简单地介绍，显而易见地，下面描述中的附图仅仅是本发明的一些实施例，对于本领域普通技术人员来讲，在不付出创造性劳动性的前提下，还可以根据这些附图获得其他的附图。In order to illustrate the technical solutions of the embodiments of the present invention more clearly, the following briefly introduces the drawings that are used in the description of the embodiments of the present invention. Obviously, the drawings in the following description are only some embodiments of the present invention. , for those of ordinary skill in the art, other drawings can also be obtained from these drawings without creative labor.

图1是本发明一个实施例的语音信号处理方法的流程图；1 is a flowchart of a speech signal processing method according to an embodiment of the present invention;

图2是本发明另一个实施例的语音信号处理方法的流程图；2 is a flowchart of a voice signal processing method according to another embodiment of the present invention;

图3是本发明一个示例的语音信号处理方法的流程图；Fig. 3 is the flow chart of the speech signal processing method of an example of the present invention;

图4是本发明一个实施例的终端的框图；4 is a block diagram of a terminal according to an embodiment of the present invention;

图5是本发明一个实施例的移动终端的硬件结构示意图。FIG. 5 is a schematic diagram of a hardware structure of a mobile terminal according to an embodiment of the present invention.

具体实施方式Detailed ways

下面将结合本发明实施例中的附图，对本发明实施例中的技术方案进行清楚、完整地描述，显然，所描述的实施例是本发明一部分实施例，而不是全部的实施例。基于本发明中的实施例，本领域普通技术人员在没有作出创造性劳动前提下所获得的所有其他实施例，都属于本发明保护的范围。The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are part of the embodiments of the present invention, but not all of the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

实施例一Example 1

图1是本发明一个实施例的语音信号处理方法的流程图。图1所示的语音信号处理方法应用于终端，所述方法包括：FIG. 1 is a flowchart of a speech signal processing method according to an embodiment of the present invention. The voice signal processing method shown in FIG. 1 is applied to a terminal, and the method includes:

步骤101、在进行语音通话的过程中，对终端接收的语音信号进行检测。Step 101: Detect the voice signal received by the terminal during the voice call.

终端可以有多种，例如固定终端、移动终端，固定终端可以有多种，例如台式电脑，移动终端可以有多种，例如手机、笔记本、平板等。There may be various types of terminals, such as fixed terminals and mobile terminals. There may be various types of stationary terminals, such as desktop computers, and various types of mobile terminals, such as mobile phones, notebooks, and tablets.

社交软件具有实时交互的功能，安装在终端上，当终端连接网络时，如连接移动网络或无线网络时，用户可以使用社交软件与其好友进行语音通话、文字通信等方式的实时交互。当用户使用社交软件与其好友进行语音通话时，用户使用的终端与好友使用的终端之间互发语音信号，用户使用的终端接收用户使用的终端发送的语音信号。Social software has the function of real-time interaction and is installed on the terminal. When the terminal is connected to a network, such as a mobile network or a wireless network, users can use the social software to interact with their friends in real-time by voice calls and text communications. When a user uses the social software to make a voice call with a friend, the terminal used by the user and the terminal used by the friend send voice signals to each other, and the terminal used by the user receives the voice signal sent by the terminal used by the user.

在进行语音通话的过程中，由于网络质量、环境噪声以及用户发音语速等多个因素的影响，使得语音信号发生畸变，导致用户无法听清楚好友的通话内容。During a voice call, due to the influence of various factors such as network quality, environmental noise, and the speed of the user's pronunciation, the voice signal is distorted, so that the user cannot hear the content of the friend's call clearly.

为解决上述问题，本发明对终端接收的语音信号进行检测，检测语音信号是否发生畸变，当检测到语音信号发生畸变，则对语音信号进行处理，使得用户能够听清楚好友的通话内容。In order to solve the above problem, the present invention detects the voice signal received by the terminal, and detects whether the voice signal is distorted.

步骤102、当检测到语音信号中的第一语音信号不满足参数条件时，根据第二语音信号确定第一语音信号对应的内容，第二语音信号包括语音信号中在第一语音信号之前接收到的预设时长的语音信号和之后接收到的预设时长的语音信号中的至少一种。Step 102, when it is detected that the first voice signal in the voice signal does not meet the parameter conditions, determine the content corresponding to the first voice signal according to the second voice signal, and the second voice signal includes the voice signal received before the first voice signal. at least one of a voice signal of preset duration and a voice signal of preset duration received afterward.

第一语音信号是终端接收的语音信号的一部分。第二语音信号是终端接收的语音信号的另一部分，是在第一语音信号之前接收到的预设时长的语音信号、之后接收的预设时长的语音信号、或者之前接收到的预设时长的语音信号和之后接收到的预设时长的语音信号的组合。The first voice signal is a part of the voice signal received by the terminal. The second voice signal is another part of the voice signal received by the terminal, and is a voice signal of a preset duration received before the first voice signal, a voice signal of a preset duration received after, or a voice signal of a preset duration received before A combination of a voice signal and a voice signal of a preset duration received afterward.

上述预设时长所限定的内容可以有多种，例如，可以是预先设置的指定时长，也可以是与实际通话情况对应的非固定的时长。通常情况下，为提高语音信号的处理效果，预设时长优选为与实际通话情况对应的非固定的时长。The content limited by the preset duration may be various, for example, may be a preset specified duration, or may be a non-fixed duration corresponding to the actual call situation. Generally, in order to improve the processing effect of the voice signal, the preset duration is preferably a non-fixed duration corresponding to the actual call situation.

本发明预先设置了语音信号的参数条件，终端对接收的语音信号进行判断，当语音信号满足该参数条件时，判定语音信号未发生畸变，当语音信号不满足该参数条件时，判定语音信号发生畸变。The present invention presets the parameter conditions of the voice signal, and the terminal judges the received voice signal. When the voice signal meets the parameter condition, it is judged that the voice signal is not distorted, and when the voice signal does not meet the parameter condition, it is judged that the voice signal has occurred. distortion.

检测到第一语音信号不满足参数条件后，判定第一语音信号发生畸变，终端的用户无法听清楚好友的通话内容，随后根据未发生畸变的第二语音信号对应的内容确定第一语音信号对应的内容。具体地，可以根据第二语音信号对应的内容语义推测第一语音信号对应的内容。After detecting that the first voice signal does not meet the parameter conditions, determine that the first voice signal is distorted, and the user of the terminal cannot hear the content of the friend's call clearly, and then determine the corresponding content of the first voice signal according to the content of the undistorted second voice signal. Content. Specifically, the content corresponding to the first speech signal can be inferred according to the content semantics corresponding to the second speech signal.

实际中，检测到第一语音信号不满足参数条件后，可以直接根据第二语音信号对应的内容确定第一语音信号对应的内容，也可以先采用语音识别的方式对第一语音信号进行识别，在未能语音识别出第一语音信号对应的内容后，再根据第二语音信号对应的内容确定第一语音信号对应的内容。In practice, after it is detected that the first voice signal does not meet the parameter conditions, the content corresponding to the first voice signal may be directly determined according to the content corresponding to the second voice signal, or the first voice signal may be recognized first by means of voice recognition, After the content corresponding to the first speech signal cannot be recognized by speech, the content corresponding to the first speech signal is then determined according to the content corresponding to the second speech signal.

参数条件可以有多种，例如频率条件、杂音占比条件以及对应语速条件中的至少一种，除上述举例的参数条件外，还可以是其他的参数条件，可以根据实际进行设置。当参数条件包括频率条件时，频率条件可以限定频率范围、频率变化幅度范围等；当参数条件包括杂音占比条件时，杂音占比条件可以限定杂音占比范围等；当参数条件包括语音信号对应的语速条件时，语速条件可以限定语速范围、语速变化幅度范围等。There can be various parameter conditions, such as at least one of frequency conditions, noise ratio conditions, and corresponding speech rate conditions. In addition to the parameter conditions exemplified above, other parameter conditions can also be set, which can be set according to actual conditions. When the parameter condition includes the frequency condition, the frequency condition can limit the frequency range, the range of the frequency change amplitude, etc.; when the parameter condition includes the noise ratio condition, the noise ratio condition can limit the noise ratio range, etc.; when the parameter condition includes the corresponding voice signal When the speech rate condition is specified, the speech rate condition may limit the speech rate range, the speech rate variation range, and the like.

步骤103、将第一语音信号对应的内容显示在显示界面上。Step 103: Display the content corresponding to the first voice signal on the display interface.

确定出第一语音信号对应的内容后，将该内容显示在终端的显示界面上供用户参考，从而减少了语音通话的过程中通话双方的不必要的复述，保证了语音通话的顺畅进行，提升了语音通话的用户体验。确定出的第一语音信号对应的内容可以以文字形式、图画形式、文字和图画组合形式等形式显示在显示界面上，供用户查看。After the content corresponding to the first voice signal is determined, the content is displayed on the display interface of the terminal for the user's reference, thereby reducing unnecessary repetition of both parties during the voice call, ensuring the smooth progress of the voice call, and improving Improve the user experience of voice calls. The determined content corresponding to the first voice signal may be displayed on the display interface in the form of text, a picture, a combination of text and pictures, etc., for the user to view.

为方便用户对语音通话内容的整体理解，可以按照信号的接收时间顺序，将第一语音信号对应的内容和第二语音信号对应的内容同时显示在显示界面上供用户查看。In order to facilitate the user's overall understanding of the content of the voice call, the content corresponding to the first voice signal and the content corresponding to the second voice signal may be simultaneously displayed on the display interface for the user to view according to the time sequence of signal reception.

语音信号处理功能的开启方式有多种，例如，可以在用户与好友开始进行语音通话时，开启语音信号处理功能，也可以在终端接收到用户执行的功能开启指示后开启该功能，如接收到用户对预设选项或按钮的选择操作后开启该功能，还可以是其他适用的功能开启方式，本发明在此不做限制。There are many ways to enable the voice signal processing function. For example, the voice signal processing function can be enabled when the user starts a voice call with a friend, or the function can be enabled after the terminal receives the function enable instruction performed by the user. After the user selects and operates the preset option or button, the function is activated, and other applicable function activation methods are also possible, which are not limited in the present invention.

依据本发明实施例，在进行语音通话的过程中，对终端接收的语音信号进行检测，检测到语音信号中的第一语音信号不满足参数条件时，判定第一语音信号发生畸变，随后根据在第一语音信号之前或之后接收到的第二语音信号，确定第一语音信号对应的内容并将该内容显示在显示界面上供用户参考，从而减少了语音通话双方的相互打断以及复述语音内容的操作，保证了语音通话的顺畅进行，提高了用户体验。According to the embodiment of the present invention, in the process of making a voice call, the voice signal received by the terminal is detected, and when it is detected that the first voice signal in the voice signal does not meet the parameter conditions, it is determined that the first voice signal is distorted, and then according to the The second voice signal received before or after the first voice signal, determines the content corresponding to the first voice signal and displays the content on the display interface for the user's reference, thereby reducing the interruption of the voice call and the repetition of the voice content The operation ensures the smooth progress of the voice call and improves the user experience.

实施例二Embodiment 2

图2是本发明另一个实施例的语音信号处理方法的流程图。图2所示的语音信号处理方法应用于终端，所述方法包括：FIG. 2 is a flowchart of a voice signal processing method according to another embodiment of the present invention. The voice signal processing method shown in FIG. 2 is applied to a terminal, and the method includes:

步骤201、在进行语音通话的过程中，对终端接收的语音信号进行检测。Step 201: Detect the voice signal received by the terminal during the voice call.

在进行语音通话的过程中，用户使用的终端会接收到好友使用的终端发送的语音信号，用户的终端对接收的语音信号进行检测，检测语音信号是否正常。During a voice call, the terminal used by the user will receive the voice signal sent by the terminal used by the friend, and the terminal of the user will detect the received voice signal to detect whether the voice signal is normal.

步骤202、当检测到语音信号中的第一语音信号不满足参数条件时，根据第二语音信号对应的内容构造待修正语句，待修正语句在第一语音信号对应的语句位置处存在空缺。Step 202: When it is detected that the first voice signal in the voice signal does not meet the parameter conditions, construct a sentence to be revised according to the content corresponding to the second voice signal, and the sentence to be revised has a vacancy at the sentence position corresponding to the first voice signal.

第二语音信号是在第一语音信号之前接收的语音信号和之后接收的语音信号中的一种或组合，第二语音信号满足参数条件，能够确定对应的内容。The second voice signal is one or a combination of a voice signal received before the first voice signal and a voice signal received after the first voice signal, the second voice signal satisfies the parameter conditions, and the corresponding content can be determined.

本发明实施例中，终端在语音通话的过程中，将语音通话内容存储至指定存储位置。语音通话内容存储方式有多种，例如，存储整个语音通话过程中好友/双方的语音通话内容，或者，存储预设历史时长如过去半分钟内好友/双方的语音通话内容，还可以是其他适用的存储方式，其中符号“/”代表的含义为或。In this embodiment of the present invention, the terminal stores the content of the voice call in a designated storage location during a voice call. There are various ways to store the content of the voice call. For example, the content of the voice call between friends/both parties during the entire voice call is stored, or the content of the voice call between friends/both parties is stored for a preset historical duration, such as the content of the voice call between friends/both parties in the past half minute, or other applicable content. The storage method, where the symbol "/" represents the meaning of or.

检测到第一语音信号不满足参数条件后，从终端的指定存储位置提取第二语音信号对应的内容，并根据第二语音信号对应的内容构建待修正语句，待修正语句在第一语音信号对应的语句位置处存在空缺。After detecting that the first voice signal does not meet the parameter conditions, extract the content corresponding to the second voice signal from the designated storage location of the terminal, and construct the sentence to be revised according to the content corresponding to the second voice signal, and the sentence to be revised is corresponding to the first voice signal. There is a gap in the statement position of .

由于第二语音信号相对于第一语音信号的接收顺序有多种，因此空缺在待修正语句中的位置有多种，具体地，空缺可能位于待修正语句的中部、末端、前端等。Since the second speech signal is received in various order relative to the first speech signal, there are various positions of the vacancy in the sentence to be revised. Specifically, the vacancy may be located in the middle, end, and front of the sentence to be revised.

根据第二语音信号对应的内容构造待修正语句的方式有多种，例如，首先，根据第二语音信号的信号时长与其内容长度的对应关系，确定第一语音信号的与其信号时长匹配的内容长度；其次，根据第二语音信号对应的内容和对应内容长度的空缺，构造待修正语句。There are many ways to construct the sentence to be revised according to the content corresponding to the second voice signal. For example, first, according to the corresponding relationship between the signal duration of the second voice signal and its content length, determine the content length of the first voice signal that matches its signal duration. ; secondly, according to the corresponding content of the second speech signal and the vacancy of the corresponding content length, construct the sentence to be revised.

示例性地，在检测到接收的语音信号中第一语音信号不满足参数条件后，即判定第一语音信号发生畸变后，根据未发生畸变的第二语音信号对应的内容的字符数(N1)和其信号时长(t1)，确定第二语音信号的一个字符所对应的信号时长(t1/N1)，使用发生畸变的第一语音信号的信号时长(t2)除以(t1/N1)，得到第一语音信号对应的内容所包含的字符数(N1·t2/t1)，确定空缺对应的字符数(N1·t2/t1)，根据对应字符数为(N1·t2/t1)的空缺以及第二语音信号对应的内容，构造待修正语句。Exemplarily, after it is detected that the first voice signal in the received voice signal does not meet the parameter conditions, that is, after it is determined that the first voice signal is distorted, according to the number of characters (N1) of the content corresponding to the undistorted second voice signal and its signal duration (t1), determine the signal duration (t1/N1) corresponding to a character of the second speech signal, and divide the signal duration (t2) of the distorted first speech signal by (t1/N1) to obtain The number of characters (N1 t2/t1) contained in the content corresponding to the first speech signal, determine the number of characters corresponding to the vacancy (N1 t2/t1), and according to the corresponding number of characters, the vacancy of (N1 t2/t1) and the first The content corresponding to the two speech signals is used to construct the sentence to be corrected.

步骤203、从语句数据库中，查找与待修正语句匹配的目标语句。Step 203: Search for a target sentence matching the sentence to be revised from the sentence database.

预先设置语句数据库，并在语句数据库中记录大量语句。在根据第二语音信号对应的内容构造待修正语句后，从语句数据库中，查找与待修正语句匹配的目标语句。A statement database is set up in advance, and a large number of statements are recorded in the statement database. After constructing the sentence to be revised according to the content corresponding to the second speech signal, a target sentence matching the sentence to be revised is searched from the sentence database.

步骤204、将目标语句中空缺对应的内容作为第一语音信号对应的内容。Step 204: Use the content corresponding to the vacancy in the target sentence as the content corresponding to the first speech signal.

从语句数据库中匹配出的目标语句中，与待修正语句的空缺对应的内容即为第一语音信号对应的内容。Among the target sentences matched from the sentence database, the content corresponding to the vacancy of the sentence to be corrected is the content corresponding to the first speech signal.

从语句数据库中匹配出的目标语句可能包括一个或两个以上。当目标语句包括两个以上时，可以将所有目标语句中空缺对应的内容作为第一语音信号对应的内容，在后续步骤中将多个内容显示在终端的显示界面上供用户查看；也可以在查找到两个以上的目标语句后，先对两个以上的目标语句进行排序，之后将排序前N的目标语句中空缺对应的的内容作为第一语音信号对应的内容，其中，N为大于或等于1的正整数，N的大小可以根据实际进行设置。The target sentence matched from the sentence database may include one or more than two sentences. When there are more than two target sentences, the content corresponding to the vacancies in all target sentences can be used as the content corresponding to the first voice signal, and in subsequent steps, multiple contents are displayed on the display interface of the terminal for the user to view; After more than two target sentences are found, first sort the two or more target sentences, and then use the content corresponding to the vacancy in the N target sentences before sorting as the content corresponding to the first speech signal, where N is greater than or A positive integer equal to 1, the size of N can be set according to the actual situation.

排序两个以上目标语句的方式有多种，例如，根据第一语音信号的接收时间、终端所处的位置信息以及第一语音信号对应的发音效果中的至少一种，对两个以上目标语句进行排序。还可以根据其他参数对两个以上目标语句进行排序，本发明实施例在此不做限制。There are many ways to sort more than two target sentences. For example, according to at least one of the reception time of the first voice signal, the location information of the terminal, and the pronunciation effect corresponding to the first voice signal, the two or more target sentences are sorted. put in order. It is also possible to sort more than two target sentences according to other parameters, which is not limited in this embodiment of the present invention.

例如，在第一语音信号之前接收的第二语音信号对应的内容为“你”，在第一语音信号之后接收的第二语音信号对应的内容为“早晨吃饭了么？”，根据第二语音信号的信号时长和其内容字符数，以及根据第一语音信号的信号时长，推测第一语音信号对应的内容包括两个字符，这时构造的待修正语句为“你**早晨吃饭了么？”，每个“*”代表一个字符。构造出待修正语句后，将待修正语句与语句数据库进行匹配，获得五个与待修正语句匹配的目标语句，对五个目标语句进行排序，得到目标语句的排序结果为：“你今天早晨吃饭了么？”、“你昨天早晨吃饭了么？”、“你周末早晨吃饭了么？”、“你记得早晨吃饭了么？”以及“你不在早晨吃饭了么？”，选取排序最前的3个目标语句(“你今天早晨吃饭了么？”、“你昨天早晨吃饭了么？”以及“你周末早晨吃饭了么？”)作为目标语句。For example, the content corresponding to the second voice signal received before the first voice signal is "you", and the content corresponding to the second voice signal received after the first voice signal is "Have you eaten in the morning?", according to the second voice signal According to the signal duration of the signal and the number of characters in its content, and according to the signal duration of the first voice signal, it is inferred that the content corresponding to the first voice signal includes two characters. At this time, the constructed sentence to be corrected is "Have you had dinner in the morning? ", each "*" represents a character. After constructing the to-be-corrected sentence, match the to-be-corrected sentence with the sentence database to obtain five target sentences matching the to-be-corrected sentence, sort the five target sentences, and obtain the sorting result of the target sentence: "You have dinner this morning. Did you eat?", "Did you eat yesterday morning?", "Did you eat on weekend mornings?", "Do you remember eating in the morning?" and "Have you not eaten in the morning?", select the top 3 target sentences ("Did you eat this morning?", "Did you eat yesterday morning?", and "Did you eat on weekend morning?") as target sentences.

步骤205、将第一语音信号对应的内容显示在显示界面上。Step 205: Display the content corresponding to the first voice signal on the display interface.

将目标语句中空缺对应的内容作为第一语音信号对应的内容后，将第一语音信号对应的内容显示在显示界面上供用户查看。例如，将步骤204中举例的“你今天早晨吃饭了么？”、“你昨天早晨吃饭了么？”以及“你周末早晨吃饭了么？”按顺序依次显示在显示界面上。After the content corresponding to the vacancy in the target sentence is used as the content corresponding to the first voice signal, the content corresponding to the first voice signal is displayed on the display interface for the user to view. For example, "Did you eat this morning?", "Did you eat yesterday morning?" and "Did you eat on weekend morning?" exemplified instep 204 are displayed on the display interface in sequence.

操作中，可以将目标语句中空缺对应的内容显示在显示界面上供用户查看，也可以将包括空缺对应的内容的目标语句显示在显示界面上，即将第一语音信号对应的内容和第二语音信号对应的内容同时显示在显示界面上，由于目标语句包括了较多的通话内容，因此方便用户对通话内容的整体理解。In operation, the content corresponding to the vacancy in the target sentence can be displayed on the display interface for the user to view, or the target sentence including the content corresponding to the vacancy can be displayed on the display interface, that is, the content corresponding to the first voice signal and the second voice. The content corresponding to the signal is displayed on the display interface at the same time. Since the target sentence includes more content of the call, it is convenient for the user to understand the content of the call as a whole.

为使本领域技术人员更加清楚地理解本发明，现通过以下示例对本发明实施例所述的语音信号处理方法进行详细说明。In order for those skilled in the art to understand the present invention more clearly, the following examples are used to describe the speech signal processing method according to the embodiment of the present invention in detail.

图3为是本发明一个示例的语音信号处理方法的流程图。参照图3所示，语音信号处理方法包括：FIG. 3 is a flow chart of a voice signal processing method according to an example of the present invention. Referring to Figure 3, the speech signal processing method includes:

S1、检测到社交软件的语音通话功能开启。S1. It is detected that the voice calling function of the social software is enabled.

S2、开启网络质量检测。S2. Enable network quality detection.

对终端所连接的网络的质量进行检测。Detect the quality of the network to which the terminal is connected.

S3、构造缓冲池，将语音通话双方在预设时长n秒内的语音信号对应的内容记录在缓冲池内。S3. Construct a buffer pool, and record the content corresponding to the voice signals of both parties in the voice call within a preset duration of n seconds in the buffer pool.

S4、判断终端连接的网络是否存在异常，如果否，则执行步骤S5，如果是，则执行步骤S6。S4. Determine whether the network to which the terminal is connected is abnormal, if not, execute step S5, and if yes, execute step S6.

S5、判断语音通话是否结束，如果是，则方法结束，如果不是，则执行步骤S4。S5. Determine whether the voice call ends, and if so, the method ends, and if not, execute step S4.

S6、判断新添加至缓冲池内的当前语音信号是否发生畸变，如果不满足，则执行步骤S5，如果满足，则执行步骤S7。S6. Determine whether the current voice signal newly added to the buffer pool is distorted, if not, execute step S5, and if so, execute step S7.

本示例中，当前语音信号为终端当前新接收的语音信号，当前语音信号包括两部分，一部分语音信号满足参数条件，另一部分语音信号不满足参数条件。In this example, the current voice signal is the voice signal currently newly received by the terminal, and the current voice signal includes two parts, one part of the voice signal satisfies the parameter condition, and the other part of the voice signal does not meet the parameter condition.

本示例中的语音信号发生畸变是指语音信号不满足预设的参数条件。参数条件的内容可以参照本发明实施例前述描述。The speech signal distortion in this example means that the speech signal does not meet the preset parameter conditions. For the content of the parameter conditions, reference may be made to the foregoing descriptions of the embodiments of the present invention.

S7、对当前语音信号中不发生畸变的部分语音信号进行语音识别，识别出不发生畸变的部分语音信号对应的内容即文本。S7. Perform speech recognition on the part of the speech signal that is not distorted in the current speech signal, and identify the content corresponding to the part of the speech signal that is not distorted, that is, text.

S8、判断当前语音信号中发生畸变的部分语音信号是否可以语音识别，如果不可以，则执行步骤S9-S11，如果可以，则执行步骤S12。S8. Determine whether the distorted part of the speech signal in the current speech signal can be recognized by speech, if not, execute steps S9-S11, and if yes, execute step S12.

S9、根据不发生畸变的部分语音信号对应的内容，对发生畸变的部分语音信号对应的内容进行自然语言上下文推断。S9. Perform natural language context inference on the content corresponding to the distorted partial speech signal according to the content corresponding to the undistorted partial speech signal.

可以采用上述记录的语句数据库查找的方式，对发生畸变的部分语音信号对应的内容进行推断，还可以是其他适用的推断方式。The content corresponding to the distorted part of the speech signal may be inferred by using the above-mentioned method of searching the recorded sentence database, or other applicable inference methods.

S10、将推断出的多个内容进行排序，筛选出排序前m的内容，m为大于或等于1的正整数。S10. Sort the inferred multiple contents, and filter out the contents of m before the sorting, where m is a positive integer greater than or equal to 1.

S11、将推断出的m个内容置于指定位置，生成当前语音信号对应的修正内容，其中，指定位置为当前语音信号对应的语句中发生畸变的部分语音信号对应的语句位置。S11. Place the inferred m contents in a specified position to generate the corrected contents corresponding to the current voice signal, wherein the specified position is the sentence position corresponding to the distorted part of the voice signal in the sentence corresponding to the current voice signal.

步骤S11结束后，执行S13。After step S11 ends, S13 is executed.

S12、将语音识别出的内容置于指定位置，生成当前语音信号对应的修正内容，其中，指定位置为当前语音信号对应的语句中发生畸变的部分语音信号对应的语句位置。S12. Place the speech-recognized content at a specified position to generate a correction content corresponding to the current speech signal, wherein the specified position is the sentence position corresponding to the distorted part of the speech signal in the sentence corresponding to the current speech signal.

步骤S12结束后，执行S13。After step S12 ends, S13 is executed.

S13、将修正后的当前语音信号对应的内容显示在显示界面上，供用户参考。S13. Display the content corresponding to the corrected current speech signal on the display interface for the user's reference.

步骤S13结束后，执行S5。After step S13 ends, S5 is executed.

本示例利用语音记录、语音识别和自然语言上下文推断技术，对因网络波动造成的出现畸变的短暂语音信号对应的内容进行辅助修正，并将修正后的内容展示在显示界面上供用户参考，用户可以通过查看显示界面上的内容大致了解发生畸变的语音信号对应的内容，从而减少语音通话过程中通话双方的不必要的复述，保证了语音通话的顺畅进行，提升了网络语音通话的用户体验。This example uses voice recording, voice recognition and natural language context inference technology to assist in the correction of the content corresponding to the distorted short-term voice signal caused by network fluctuations, and displays the corrected content on the display interface for user reference. The content corresponding to the distorted voice signal can be roughly understood by viewing the content on the display interface, thereby reducing unnecessary repetition of both parties during the voice call, ensuring the smooth progress of the voice call, and improving the user experience of the network voice call.

实施例三Embodiment 3

图4是本发明一个实施例的终端的框图。图4所示的终端包括：FIG. 4 is a block diagram of a terminal according to an embodiment of the present invention. The terminal shown in Figure 4 includes:

信号检测模块301，用于在进行语音通话的过程中，对所述终端接收的语音信号进行检测。The signal detection module 301 is configured to detect the voice signal received by the terminal during the voice call.

内容确定模块302，用于当检测到所述语音信号中的第一语音信号不满足参数条件时，根据第二语音信号确定所述第一语音信号对应的内容，所述第二语音信号包括所述语音信号中在所述第一语音信号之前接收到的预设时长的语音信号和之后接收到的预设时长的语音信号中的至少一种。The content determination module 302 is configured to determine the content corresponding to the first voice signal according to the second voice signal when it is detected that the first voice signal in the voice signal does not meet the parameter condition, and the second voice signal includes the at least one of a voice signal of a preset duration received before the first voice signal and a voice signal of a preset duration received after the voice signal.

内容显示模块303，用于将所述第一语音信号对应的内容显示在显示界面上。The content display module 303 is configured to display the content corresponding to the first voice signal on the display interface.

本发明实施例中，优选地，所述内容确定模块302包括：In this embodiment of the present invention, preferably, the content determination module 302 includes:

语句构造子模块，用于根据所述第二语音信号对应的内容构造待修正语句，所述待修正语句在所述第一语音信号对应的语句位置处存在空缺；a sentence construction submodule, configured to construct a sentence to be revised according to the content corresponding to the second speech signal, and the sentence to be revised has a vacancy at the position of the sentence corresponding to the first speech signal;

目标语句查找子模块，用于从语句数据库中，查找与所述待修正语句匹配的目标语句；A target statement search submodule is used to search for a target statement matching the statement to be revised from the statement database;

内容得到子模块，用于将所述目标语句中所述空缺对应的内容作为所述第一语音信号对应的内容。The content obtaining sub-module is configured to use the content corresponding to the vacancy in the target sentence as the content corresponding to the first speech signal.

本发明实施例中，优选地，所述语句构造子模块包括：In the embodiment of the present invention, preferably, the sentence construction submodule includes:

内容长度确定子模块，用于根据所述第二语音信号的信号时长与其内容长度的对应关系，确定所述第一语音信号的与其信号时长匹配的内容长度；a content length determination submodule, configured to determine the content length of the first voice signal that matches the signal duration of the first voice signal according to the correspondence between the signal duration of the second voice signal and its content length;

语句获得子模块，用于根据所述第二语音信号对应的内容和对应所述内容长度的空缺，构造所述待修正语句。The sentence obtaining sub-module is configured to construct the sentence to be revised according to the content corresponding to the second speech signal and the vacancy corresponding to the length of the content.

本发明实施例中，优选地，所述内容确定模块302还包括：In this embodiment of the present invention, preferably, the content determination module 302 further includes:

语句排序子模块，用于在所述从语句数据库中，查找与所述待修正语句匹配的目标语句之后，对至少两个所述目标语句进行排序，从所述数据库中查找出的目标语句为至少两个；The statement sorting submodule is used to sort at least two of the target statements after searching for the target statement matching the statement to be revised in the slave statement database, and the target statement found from the database is: at least two;

所述内容得到子模块，具体用于将排序前N的目标语句中所述空缺对应的内容作为所述第一语音信号对应的内容，其中，N为大于或等于1的正整数。The content obtaining sub-module is specifically configured to use the content corresponding to the vacancy in the top N target sentences as the content corresponding to the first speech signal, where N is a positive integer greater than or equal to 1.

本发明实施例中，优选地，所述语句排序子模块，具体用于根据所述第一语音信号的接收时间、所述终端所处的位置信息以及所述第一语音信号对应的发音效果中的至少一种，对多个至少两个目标语句进行排序。In the embodiment of the present invention, preferably, the sentence sorting sub-module is specifically configured to perform the following steps according to the reception time of the first voice signal, the location information of the terminal, and the pronunciation effect corresponding to the first voice signal. At least one of a plurality of at least two target sentences is sorted.

本发明实施例中，优选地，所述内容显示模块303，具体用于将包括所述空缺对应的内容的所述目标语句显示在所述显示界面上。In the embodiment of the present invention, preferably, the content display module 303 is specifically configured to display the target sentence including the content corresponding to the vacancy on the display interface.

图5为实现本发明各个实施例的一种移动终端的硬件结构示意图。FIG. 5 is a schematic diagram of a hardware structure of a mobile terminal implementing various embodiments of the present invention.

该移动终端400包括但不限于：射频单元401、网络模块402、音频输出单元403、输入单元404、传感器405、显示单元406、用户输入单元407、接口单元408、存储器409、处理器410、以及电源411等部件。本领域技术人员可以理解，图5中示出的移动终端结构并不构成对移动终端的限定，移动终端可以包括比图示更多或更少的部件，或者组合某些部件，或者不同的部件布置。在本发明实施例中，移动终端包括但不限于手机、平板电脑、笔记本电脑、掌上电脑、车载终端、可穿戴设备、以及计步器等。Themobile terminal 400 includes but is not limited to: aradio frequency unit 401, anetwork module 402, anaudio output unit 403, aninput unit 404, asensor 405, adisplay unit 406, auser input unit 407, aninterface unit 408, amemory 409, aprocessor 410, andPower 411 and other components. Those skilled in the art can understand that the structure of the mobile terminal shown in FIG. 5 does not constitute a limitation on the mobile terminal, and the mobile terminal may include more or less components than the one shown, or combine some components, or different components layout. In this embodiment of the present invention, the mobile terminal includes, but is not limited to, a mobile phone, a tablet computer, a notebook computer, a palmtop computer, a vehicle-mounted terminal, a wearable device, a pedometer, and the like.

其中，射频单元401，用于在进行语音通话的过程，接收与终端进行语音通话的对端发送的语音信号。Wherein, theradio frequency unit 401 is configured to receive a voice signal sent by the opposite terminal that conducts the voice call with the terminal during the voice call process.

处理器410，用于在进行语音通话的过程中，对所述终端接收的语音信号进行检测，当检测到所述语音信号中的第一语音信号不满足参数条件时，根据第二语音信号确定所述第一语音信号对应的内容，所述第二语音信号包括所述语音信号中在所述第一语音信号之前接收到的预设时长的语音信号和之后接收到的预设时长的语音信号中的至少一种，将所述第一语音信号对应的内容显示在显示界面上。Theprocessor 410 is configured to detect the voice signal received by the terminal during the voice call, and when it is detected that the first voice signal in the voice signal does not meet the parameter conditions, determine according to the second voice signal Content corresponding to the first voice signal, and the second voice signal includes a voice signal of a preset duration received before the first voice signal and a voice signal of a preset duration received after the voice signal At least one of the content corresponding to the first voice signal is displayed on the display interface.

应理解的是，本发明实施例中，射频单元401可用于收发信息或通话过程中，信号的接收和发送，具体的，将来自基站的下行数据接收后，给处理器410处理；另外，将上行的数据发送给基站。通常，射频单元401包括但不限于天线、至少一个放大器、收发信机、耦合器、低噪声放大器、双工器等。此外，射频单元401还可以通过无线通信系统与网络和其他设备通信。It should be understood that, in this embodiment of the present invention, theradio frequency unit 401 can be used for receiving and sending signals during sending and receiving information or during a call. Specifically, after receiving the downlink data from the base station, it is processed by theprocessor 410; The uplink data is sent to the base station. Generally, theradio frequency unit 401 includes, but is not limited to, an antenna, at least one amplifier, a transceiver, a coupler, a low noise amplifier, a duplexer, and the like. In addition, theradio frequency unit 401 can also communicate with the network and other devices through a wireless communication system.

移动终端通过网络模块402为用户提供了无线的宽带互联网访问，如帮助用户收发电子邮件、浏览网页和访问流式媒体等。The mobile terminal provides the user with wireless broadband Internet access through thenetwork module 402, such as helping the user to send and receive emails, browse web pages, access streaming media, and the like.

音频输出单元403可以将射频单元401或网络模块402接收的或者在存储器409中存储的音频数据转换成音频信号并且输出为声音。而且，音频输出单元403还可以提供与移动终端400执行的特定功能相关的音频输出(例如，呼叫信号接收声音、消息接收声音等等)。音频输出单元403包括扬声器、蜂鸣器以及受话器等。Theaudio output unit 403 may convert audio data received by theradio frequency unit 401 or thenetwork module 402 or stored in thememory 409 into audio signals and output as sound. Also, theaudio output unit 403 may also provide audio output related to a specific function performed by the mobile terminal 400 (eg, call signal reception sound, message reception sound, etc.). Theaudio output unit 403 includes a speaker, a buzzer, a receiver, and the like.

输入单元404用于接收音频或视频信号。输入单元404可以包括图形处理器(Graphics Processing Unit，GPU)4041和麦克风4042，图形处理器4041对在视频捕获模式或图像捕获模式中由图像捕获装置(如摄像头)获得的静态图片或视频的图像数据进行处理。处理后的图像帧可以显示在显示单元906上。经图形处理器4041处理后的图像帧可以存储在存储器409(或其它存储介质)中或者经由射频单元401或网络模块402进行发送。麦克风4042可以接收声音，并且能够将这样的声音处理为音频数据。处理后的音频数据可以在电话通话模式的情况下转换为可经由射频单元401发送到移动通信基站的格式输出。Theinput unit 404 is used to receive audio or video signals. Theinput unit 404 may include a graphics processor (Graphics Processing Unit, GPU) 4041 and amicrophone 4042. Thegraphics processor 4041 captures images of still pictures or videos obtained by an image capture device (such as a camera) in a video capture mode or an image capture mode data is processed. The processed image frames may be displayed on the display unit 906 . The image frames processed by thegraphics processor 4041 may be stored in the memory 409 (or other storage medium) or transmitted via theradio frequency unit 401 or thenetwork module 402 . Themicrophone 4042 can receive sound and can process such sound into audio data. The processed audio data can be converted into a format that can be transmitted to a mobile communication base station via theradio frequency unit 401 for output in the case of a telephone call mode.

移动终端400还包括至少一种传感器405，比如光传感器、运动传感器以及其他传感器。具体地，光传感器包括环境光传感器及接近传感器，其中，环境光传感器可根据环境光线的明暗来调节显示面板4061的亮度，接近传感器可在移动终端400移动到耳边时，关闭显示面板4061和/或背光。作为运动传感器的一种，加速计传感器可检测各个方向上(一般为三轴)加速度的大小，静止时可检测出重力的大小及方向，可用于识别移动终端姿态(比如横竖屏切换、相关游戏、磁力计姿态校准)、振动识别相关功能(比如计步器、敲击)等；传感器405还可以包括指纹传感器、压力传感器、虹膜传感器、分子传感器、陀螺仪、气压计、湿度计、温度计、红外线传感器等，在此不再赘述。Themobile terminal 400 also includes at least onesensor 405, such as a light sensor, a motion sensor, and other sensors. Specifically, the light sensor includes an ambient light sensor and a proximity sensor, wherein the ambient light sensor can adjust the brightness of thedisplay panel 4061 according to the brightness of the ambient light, and the proximity sensor can turn off thedisplay panel 4061 and thedisplay panel 4061 when themobile terminal 400 is moved to the ear. / or backlight. As a kind of motion sensor, the accelerometer sensor can detect the magnitude of acceleration in all directions (usually three axes), and can detect the magnitude and direction of gravity when stationary, and can be used to identify the posture of mobile terminals (such as horizontal and vertical screen switching, related games , magnetometer attitude calibration), vibration recognition related functions (such as pedometer, tapping), etc.; thesensor 405 may also include a fingerprint sensor, a pressure sensor, an iris sensor, a molecular sensor, a gyroscope, a barometer, a hygrometer, a thermometer, Infrared sensors, etc., are not repeated here.

显示单元406用于显示由用户输入的信息或提供给用户的信息。显示单元906可包括显示面板4061，可以采用液晶显示器(Liquid Crystal Display，LCD)、有机发光二极管(Organic Light-Emitting Diode,OLED)等形式来配置显示面板4061。Thedisplay unit 406 is used to display information input by the user or information provided to the user. The display unit 906 may include adisplay panel 4061, and thedisplay panel 4061 may be configured in the form of a Liquid Crystal Display (LCD), an Organic Light-Emitting Diode (OLED), or the like.

用户输入单元407可用于接收输入的数字或字符信息，以及产生与移动终端的用户设置以及功能控制有关的键信号输入。具体地，用户输入单元905包括触控面板4071以及其他输入设备4072。触控面板4071，也称为触摸屏，可收集用户在其上或附近的触摸操作(比如用户使用手指、触笔等任何适合的物体或附件在触控面板4071上或在触控面板4071附近的操作)。触控面板4071可包括触摸检测装置和触摸控制器两个部分。其中，触摸检测装置检测用户的触摸方位，并检测触摸操作带来的信号，将信号传送给触摸控制器；触摸控制器从触摸检测装置上接收触摸信息，并将它转换成触点坐标，再送给处理器410，接收处理器410发来的命令并加以执行。此外，可以采用电阻式、电容式、红外线以及表面声波等多种类型实现触控面板4071。除了触控面板4071，用户输入单元407还可以包括其他输入设备4072。具体地，其他输入设备4072可以包括但不限于物理键盘、功能键(比如音量控制按键、开关按键等)、轨迹球、鼠标、操作杆，在此不再赘述。Theuser input unit 407 may be used to receive input numerical or character information, and generate key signal input related to user settings and function control of the mobile terminal. Specifically, the user input unit 905 includes atouch panel 4071 andother input devices 4072 . Thetouch panel 4071, also referred to as a touch screen, can collect the user's touch operations on or near it (such as the user's finger, stylus, etc., any suitable object or accessory on or near the touch panel 4071). operate). Thetouch panel 4071 may include two parts, a touch detection device and a touch controller. Among them, the touch detection device detects the user's touch orientation, detects the signal brought by the touch operation, and transmits the signal to the touch controller; the touch controller receives the touch information from the touch detection device, converts it into contact coordinates, and then sends it to the touch controller. To theprocessor 410, the command sent by theprocessor 410 is received and executed. In addition, thetouch panel 4071 can be implemented in various types such as resistive, capacitive, infrared, and surface acoustic waves. In addition to thetouch panel 4071 , theuser input unit 407 may also includeother input devices 4072 . Specifically,other input devices 4072 may include, but are not limited to, physical keyboards, function keys (such as volume control keys, switch keys, etc.), trackballs, mice, and joysticks, which will not be repeated here.

进一步的，触控面板4071可覆盖在显示面板4061上，当触控面板4071检测到在其上或附近的触摸操作后，传送给处理器410以确定触摸事件的类型，随后处理器410根据触摸事件的类型在显示面板4061上提供相应的视觉输出。虽然在图5中，触控面板4071与显示面板4061是作为两个独立的部件来实现移动终端的输入和输出功能，但是在某些实施例中，可以将触控面板4071与显示面板4061集成而实现移动终端的输入和输出功能，具体此处不做限定。Further, thetouch panel 4071 can be covered on thedisplay panel 4061. When thetouch panel 4071 detects a touch operation on or near it, it transmits it to theprocessor 410 to determine the type of the touch event, and then theprocessor 410 determines the type of the touch event according to the touch The type of event provides corresponding visual output ondisplay panel 4061. Although in FIG. 5, thetouch panel 4071 and thedisplay panel 4061 are used as two independent components to realize the input and output functions of the mobile terminal, in some embodiments, thetouch panel 4071 and thedisplay panel 4061 may be integrated The input and output functions of the mobile terminal are implemented, which is not specifically limited here.

接口单元408为外部装置与移动终端400连接的接口。例如，外部装置可以包括有线或无线头戴式耳机端口、外部电源(或电池充电器)端口、有线或无线数据端口、存储卡端口、用于连接具有识别模块的装置的端口、音频输入/输出(I/O)端口、视频I/O端口、耳机端口等等。接口单元408可以用于接收来自外部装置的输入(例如，数据信息、电力等等)并且将接收到的输入传输到移动终端400内的一个或多个元件或者可以用于在移动终端400和外部装置之间传输数据。Theinterface unit 408 is an interface for connecting an external device to themobile terminal 400 . For example, external devices may include wired or wireless headset ports, external power (or battery charger) ports, wired or wireless data ports, memory card ports, ports for connecting devices with identification modules, audio input/output (I/O) ports, video I/O ports, headphone ports, and more. Theinterface unit 408 may be used to receive input (eg, data information, power, etc.) from an external device and transmit the received input to one or more elements within themobile terminal 400 or may be used between themobile terminal 400 and the external Transfer data between devices.

存储器409可用于存储软件程序以及各种数据。存储器409可主要包括存储程序区和存储数据区，其中，存储程序区可存储操作系统、至少一个功能所需的应用程序(比如声音播放功能、图像播放功能等)等；存储数据区可存储根据手机的使用所创建的数据(比如音频数据、电话本等)等。此外，存储器409可以包括高速随机存取存储器，还可以包括非易失性存储器，例如至少一个磁盘存储器件、闪存器件、或其他易失性固态存储器件。Thememory 409 may be used to store software programs as well as various data. Thememory 409 may mainly include a stored program area and a stored data area, wherein the stored program area may store an operating system, an application program (such as a sound playback function, an image playback function, etc.) required for at least one function, and the like; Data created by the use of the mobile phone (such as audio data, phone book, etc.), etc. Additionally,memory 409 may include high-speed random access memory, and may also include non-volatile memory, such as at least one magnetic disk storage device, flash memory device, or other volatile solid state storage device.

处理器410是移动终端的控制中心，利用各种接口和线路连接整个移动终端的各个部分，通过运行或执行存储在存储器409内的软件程序和/或模块，以及调用存储在存储器409内的数据，执行移动终端的各种功能和处理数据，从而对移动终端进行整体监控。处理器410可包括一个或多个处理单元；优选的，处理器410可集成应用处理器和调制解调处理器，其中，应用处理器主要处理操作系统、用户界面和应用程序等，调制解调处理器主要处理无线通信。可以理解的是，上述调制解调处理器也可以不集成到处理器410中。Theprocessor 410 is the control center of the mobile terminal, uses various interfaces and lines to connect various parts of the entire mobile terminal, runs or executes the software programs and/or modules stored in thememory 409, and calls the data stored in thememory 409. , perform various functions of the mobile terminal and process data, so as to monitor the mobile terminal as a whole. Theprocessor 410 may include one or more processing units; preferably, theprocessor 410 may integrate an application processor and a modem processor, wherein the application processor mainly processes the operating system, user interface, and application programs, etc., and the modem The processor mainly handles wireless communication. It can be understood that, the above-mentioned modulation and demodulation processor may not be integrated into theprocessor 410.

移动终端400还可以包括给各个部件供电的电源411(比如电池)，优选的，电源411可以通过电源管理系统与处理器410逻辑相连，从而通过电源管理系统实现管理充电、放电、以及功耗管理等功能。Themobile terminal 400 may also include a power supply 411 (such as a battery) for supplying power to various components. Preferably, thepower supply 411 may be logically connected to theprocessor 410 through a power management system, so as to manage charging, discharging, and power consumption management through the power management system. and other functions.

另外，移动终端400包括一些未示出的功能模块，在此不再赘述。In addition, themobile terminal 400 includes some unshown functional modules, which are not repeated here.

优选的，本发明实施例还提供一种终端，包括处理器410，存储器409，存储在存储器409上并可在所述处理器410上运行的计算机程序，该计算机程序被处理器410执行时实现上述语音信号处理方法实施例的各个过程，且能达到相同的技术效果，为避免重复，这里不再赘述。Preferably, an embodiment of the present invention further provides a terminal, including aprocessor 410, amemory 409, a computer program stored in thememory 409 and running on theprocessor 410, and the computer program is implemented when theprocessor 410 executes it. The various processes of the above embodiments of the speech signal processing method can achieve the same technical effect, and are not repeated here to avoid repetition.

本发明实施例还提供一种计算机可读存储介质，计算机可读存储介质上存储有计算机程序，该计算机程序被处理器执行时实现上述语音信号处理方法实施例的各个过程，且能达到相同的技术效果，为避免重复，这里不再赘述。其中，所述的计算机可读存储介质，如只读存储器(Read-Only Memory，简称ROM)、随机存取存储器(Random Access Memory，简称RAM)、磁碟或者光盘等。Embodiments of the present invention further provide a computer-readable storage medium, where a computer program is stored on the computer-readable storage medium. When the computer program is executed by a processor, each process of the above embodiments of the voice signal processing method can be achieved, and the same can be achieved. The technical effect, in order to avoid repetition, will not be repeated here. The computer-readable storage medium is, for example, a read-only memory (Read-Only Memory, ROM for short), a random access memory (Random Access Memory, RAM for short), a magnetic disk, or an optical disk.

需要说明的是，在本文中，术语“包括”、“包含”或者其任何其他变体意在涵盖非排他性的包含，从而使得包括一系列要素的过程、方法、物品或者装置不仅包括那些要素，而且还包括没有明确列出的其他要素，或者是还包括为这种过程、方法、物品或者装置所固有的要素。在没有更多限制的情况下，由语句“包括一个……”限定的要素，并不排除在包括该要素的过程、方法、物品或者装置中还存在另外的相同要素。It should be noted that, herein, the terms "comprising", "comprising" or any other variation thereof are intended to encompass non-exclusive inclusion, such that a process, method, article or device comprising a series of elements includes not only those elements, It also includes other elements not expressly listed or inherent to such a process, method, article or apparatus. Without further limitation, an element qualified by the phrase "comprising a..." does not preclude the presence of additional identical elements in a process, method, article or apparatus that includes the element.

通过以上的实施方式的描述，本领域的技术人员可以清楚地了解到上述实施例方法可借助软件加必需的通用硬件平台的方式来实现，当然也可以通过硬件，但很多情况下前者是更佳的实施方式。基于这样的理解，本发明的技术方案本质上或者说对现有技术做出贡献的部分可以以软件产品的形式体现出来，该计算机软件产品存储在一个存储介质(如ROM/RAM、磁碟、光盘)中，包括若干指令用以使得一台终端(可以是手机，计算机，服务器，空调器，或者网络设备等)执行本发明各个实施例所述的方法。From the description of the above embodiments, those skilled in the art can clearly understand that the method of the above embodiment can be implemented by means of software plus a necessary general hardware platform, and of course can also be implemented by hardware, but in many cases the former is better implementation. Based on this understanding, the technical solutions of the present invention can be embodied in the form of software products in essence or the parts that make contributions to the prior art, and the computer software products are stored in a storage medium (such as ROM/RAM, magnetic disk, CD-ROM), including several instructions to make a terminal (which may be a mobile phone, a computer, a server, an air conditioner, or a network device, etc.) execute the methods described in the various embodiments of the present invention.

上面结合附图对本发明的实施例进行了描述，但是本发明并不局限于上述的具体实施方式，上述的具体实施方式仅仅是示意性的，而不是限制性的，本领域的普通技术人员在本发明的启示下，在不脱离本发明宗旨和权利要求所保护的范围情况下，还可做出很多形式，均属于本发明的保护之内。The embodiments of the present invention have been described above in conjunction with the accompanying drawings, but the present invention is not limited to the above-mentioned specific embodiments, which are merely illustrative rather than restrictive. Under the inspiration of the present invention, without departing from the spirit of the present invention and the scope protected by the claims, many forms can be made, which all belong to the protection of the present invention.