技术领域technical field
本发明涉及通信技术领域,特别是涉及一种唤醒语音助手的方法、唤醒语音助手的装置。The invention relates to the technical field of communication, in particular to a method for waking up a voice assistant and a device for waking up the voice assistant.
背景技术Background technique
语音识别技术从20世纪50年代开始出现,刚开始发展比较缓慢,只能识别少量的孤立的词汇,直至90年代,这项技术开始在应用和产品化方面取得了比较大的突破,成为了技术研究的重点,其中应用较广泛的有苹果公司的Siri(苹果智能语音助手),国内的科大讯飞、百度语音、搜狗语音助手等。Speech recognition technology began to appear in the 1950s, and it developed slowly at the beginning, and could only recognize a small number of isolated words. Until the 1990s, this technology began to make relatively great breakthroughs in application and productization, and became a technology The focus of the research, which is widely used is Apple's Siri (Apple's intelligent voice assistant), domestic iFlytek, Baidu voice, Sogou voice assistant, etc.
在现有的各语音助手中,一般需要输入特定的语音完成触发,从而使语音助手处于语音待输入状态。例如在接电源的情况下,只要对着带Siri的ios(苹果公司的移动操作系统)设备说一声Hey Siri,就会唤醒Siri服务。语音触发作为语音输入的开启动作,不需要接触设备,很好地解决了在特定环境下进行语音输入的问题。In the existing voice assistants, it is generally necessary to input a specific voice to complete the trigger, so that the voice assistant is in a voice waiting state. For example, when the power is connected, as long as you say Hey Siri to the ios (Apple's mobile operating system) device with Siri, the Siri service will be awakened. Voice trigger is used as the opening action of voice input, and does not need to touch the device, which solves the problem of voice input in a specific environment.
但是,现有的语音助手在使用时需要特定语音唤醒,而长时间开启语音唤醒功能会消耗较多的电量,并且用户需要先通过特定语音唤醒语音助手,然后再向语音助手输入相应的语音操作指令,过程较为冗余。However, existing voice assistants require a specific voice wake-up when in use, and turning on the voice wake-up function for a long time will consume more power, and the user needs to wake up the voice assistant through a specific voice first, and then input the corresponding voice operation to the voice assistant Instructions, the process is more redundant.
发明内容Contents of the invention
基于此,有必要针对上述问题,提供一种操作简单的唤醒语音助手的方法及装置。Based on this, it is necessary to provide an easy-to-operate method and device for waking up a voice assistant to address the above problems.
一种唤醒语音助手的方法,包括步骤:A method for waking up a voice assistant, comprising steps:
获取人脸图像以及对应的环境声音;Obtain the face image and the corresponding ambient sound;
检测所述人脸图像是否满足第一预设条件;Detecting whether the face image satisfies a first preset condition;
若所述人脸图像满足第一预设条件,检测所述环境声音是否满足第二预设条件;If the face image satisfies the first preset condition, detecting whether the ambient sound satisfies the second preset condition;
若所述环境声音满足第二预设条件,则唤醒语音助手,并将所述环境声音作为语音操作指令输入语音助手。If the ambient sound satisfies the second preset condition, wake up the voice assistant, and input the ambient sound as a voice operation instruction into the voice assistant.
一种唤醒语音助手的装置,包括:A device for waking up a voice assistant, comprising:
人脸图像获取模块,用于获取人脸图像;A human face image acquisition module is used to obtain a human face image;
环境声音获取模块,用于获取与人脸图像对应的环境声音;The ambient sound acquisition module is used to acquire the ambient sound corresponding to the face image;
人脸图像检测模块,用于检测所述人脸图像是否满足第一预设条件;A face image detection module, configured to detect whether the face image satisfies a first preset condition;
环境声音检测模块,用于在所述人脸图像满足第一预设条件时,检测所述环境声音是否满足第二预设条件;An ambient sound detection module, configured to detect whether the ambient sound satisfies a second preset condition when the face image satisfies a first preset condition;
唤醒模块,用于在所述环境声音满足第二预设条件时,唤醒语音助手,并将所述环境声音作为语音操作指令输入语音助手。A wake-up module, configured to wake up the voice assistant when the ambient sound satisfies a second preset condition, and input the ambient sound as a voice operation instruction into the voice assistant.
本发明唤醒语音助手的方法及装置,在人脸图像和环境声音满足相应的预设条件时,自动唤醒语音助手,并同时将所述环境声音作为语音操作指令输入语音助手,也即是用户可以直接下发操作指令,免去了语音触发的冗余步骤,实现了通信的简化,并节省了安装有语音助手的设备的电量。The method and device for waking up the voice assistant of the present invention automatically wakes up the voice assistant when the face image and the ambient sound meet the corresponding preset conditions, and at the same time inputs the ambient sound as a voice operation command into the voice assistant, that is, the user can Sending operation instructions directly eliminates the redundant steps of voice triggering, simplifies communication, and saves the power of devices equipped with voice assistants.
附图说明Description of drawings
图1为本发明方法实施例的流程示意图;Fig. 1 is the schematic flow sheet of the method embodiment of the present invention;
图2为本发明步骤S120具体实施例的流程示意图;FIG. 2 is a schematic flow chart of a specific embodiment of step S120 of the present invention;
图3为本发明步骤S130具体实施例的流程示意图;FIG. 3 is a schematic flow chart of a specific embodiment of step S130 of the present invention;
图4为本发明装置实施例的结构示意图;Fig. 4 is the structural representation of the device embodiment of the present invention;
图5为本发明人脸图像检测模块实施例的结构示意图;Fig. 5 is the structural representation of embodiment of face image detection module of the present invention;
图6为本发明第一判断单元实施例的结构示意图;FIG. 6 is a schematic structural diagram of an embodiment of a first judging unit of the present invention;
图7为本发明环境声音检测模块实施例的结构示意图。FIG. 7 is a schematic structural diagram of an embodiment of an ambient sound detection module of the present invention.
具体实施方式Detailed ways
下面结合附图对本发明唤醒语音助手的方法的具体实施方式做详细描述。The specific implementation of the method for waking up the voice assistant in the present invention will be described in detail below in conjunction with the accompanying drawings.
如图1所示,一种唤醒语音助手的方法,包括步骤:As shown in Figure 1, a method for waking up a voice assistant includes steps:
S110、获取人脸图像以及对应的环境声音;S110. Acquire a face image and a corresponding ambient sound;
S120、检测所述人脸图像是否满足第一预设条件;S120. Detect whether the face image satisfies a first preset condition;
S130、若所述人脸图像满足第一预设条件,检测所述环境声音是否满足第二预设条件;S130. If the face image satisfies the first preset condition, detect whether the ambient sound satisfies the second preset condition;
S140、若所述环境声音满足第二预设条件,则唤醒语音助手,并将所述环境声音作为语音操作指令输入语音助手。S140. If the ambient sound satisfies the second preset condition, wake up the voice assistant, and input the ambient sound as a voice operation instruction into the voice assistant.
人脸图像可以根据摄像头获取,环境声音可以根据麦克风等获取。获取的摄像头图像和环境声音需进行存储,以便后续的条件检测。为了节省存储空间,可以设置人脸图像和环境声音的保留存储时间,以便一次操作完成后,无用的数据及时清除,腾出存储空间。The face image can be obtained by the camera, and the ambient sound can be obtained by the microphone and the like. The acquired camera images and ambient sounds need to be stored for subsequent condition detection. In order to save storage space, you can set the retention storage time of face images and environmental sounds, so that after an operation is completed, useless data will be cleared in time to free up storage space.
获取人脸图像和环境声音后,可以先检测人脸图像是否满足第一预设条件,其中第一预设条件可以根据用户需要设置为各种具体形式。例如,如图2所示,步骤S120可以包括步骤:After the face image and ambient sound are acquired, it may be detected first whether the face image satisfies a first preset condition, wherein the first preset condition can be set in various specific forms according to user needs. For example, as shown in Figure 2, step S120 may include the steps of:
S1201、判断人脸图像是否为正面人脸图像,若是,进入步骤S1202,否则返回步骤S110;S1201, determine whether the face image is a frontal face image, if so, enter step S1202, otherwise return to step S110;
S1202、判断人脸图像中的嘴部是否有动作,若有动作,进入步骤S1203,否则返回步骤S110;S1202. Determine whether the mouth in the face image has any movement, if there is movement, go to step S1203, otherwise return to step S110;
S1203、判定所述人脸图像满足第一预设条件。S1203. Determine that the face image satisfies a first preset condition.
用户在使用语音助手时,一般正面面对智能手机或平板等,然后通过嘴部发出语音指令,也即是摄像头捕捉到的人脸图像是正面人脸图像,且检测到嘴部有动作时,按照经验,一般使用语音助手的可能性较大。判断人脸图像是否为正面人脸图像的方式有很多种,例如,步骤S1201包括步骤:When using the voice assistant, the user generally faces the smartphone or tablet, etc., and then sends a voice command through the mouth, that is, the face image captured by the camera is a frontal face image, and when the mouth is detected to move, According to experience, it is generally more likely to use voice assistants. There are many ways to judge whether the face image is a frontal face image, for example, step S1201 includes steps:
获取人脸图像中双眼的距离;Obtain the distance between the eyes in the face image;
判断双眼的距离是否在预设范围内;Determine whether the distance between the eyes is within the preset range;
若是,则判定人脸图像是正面人脸图像,否则判定人脸图像不是正面人脸图像。If so, it is determined that the human face image is a frontal human face image, otherwise it is determined that the human face image is not a frontal human face image.
由于现实中使用语音助手时,用户不可能完全正向摄像头,所以本发明检测时允许一定的偏值,即本发明不限制于确定人脸图像是正面人脸图像,也可以判断人脸图像是不是近似正面人脸图像。同时确定人脸图像是不是正面人脸图像也不仅仅限制于上述提供的方法,还可以根据现有技术中其它方式实现。Since it is impossible for the user to face the camera completely when using the voice assistant in reality, the present invention allows a certain bias during detection, that is, the present invention is not limited to determining whether the face image is a frontal face image, but can also determine whether the face image is It is not an approximate frontal face image. At the same time, determining whether the face image is a frontal face image is not limited to the method provided above, and can also be implemented in other ways in the prior art.
嘴部是人的主要声音源,当需要向语音助手发出语音指令时,会伴随着嘴部的动作。所以在判定人脸图像是正面人脸图像后,加入嘴部动作特征的判定,能够提高准确性。根据获取的人脸图像判断嘴部是否有动作可以根据现有技术中已有的方式实现。The mouth is the main source of human voice, and when it is necessary to issue voice commands to the voice assistant, it will be accompanied by the movement of the mouth. Therefore, after judging that the face image is a frontal face image, adding the judgment of mouth movement features can improve the accuracy. Judging whether there is any movement in the mouth according to the acquired face image can be realized according to existing methods in the prior art.
在检测到人脸图像时正面人脸图像且嘴部有动作时,即可以进入环境声音的检测步骤。当然用户可以根据需要添加其它的面部验证条件,从而进一步提高准确性,本发明在此不予详述。When the face image is detected and the face image is frontal and the mouth moves, the detection step of the ambient sound can be entered. Of course, the user can add other facial verification conditions as required, thereby further improving the accuracy, which will not be described in detail in the present invention.
如图3所示,步骤S130可以包括步骤:As shown in Figure 3, step S130 may include steps:
S1301、判断所述环境声音的音量是否在预设范围内;S1301. Determine whether the volume of the ambient sound is within a preset range;
S1302、判断与所述环境声音的声源的距离是否小于预设阈值;S1302. Determine whether the distance to the sound source of the ambient sound is smaller than a preset threshold;
S1303、若所述音量在预设范围内且与所述声源的距离小于预设阈值,则判定所述环境声音满足第二预设条件,否则返回步骤S110。S1303. If the volume is within a preset range and the distance from the sound source is smaller than a preset threshold, determine that the ambient sound satisfies a second preset condition; otherwise, return to step S110.
一般用户使用语音助手时,声音的音量不会太大或者太小,而且距离设备的距离不会太大,所以本发明加入了音量大小以及距离的判断。音量可以根据现有技术中的音量检测仪等得到,设备距离声源的距离可以根据声音在空气中的定性的衰减公式确定。在根据环境声音得到音量和距离后,即可以判断音量是否在预设范围内,距离是否小于预设阈值,从而确定环境声音是否满足第二预设条件,其中预设范围和预设阈值均可以根据经验自行设定。Generally, when a user uses a voice assistant, the volume of the voice will not be too loud or too low, and the distance from the device will not be too large, so the present invention adds the judgment of volume and distance. The volume can be obtained according to the volume detector in the prior art, and the distance between the device and the sound source can be determined according to the qualitative attenuation formula of sound in the air. After obtaining the volume and distance according to the ambient sound, it can be judged whether the volume is within the preset range and whether the distance is less than the preset threshold, so as to determine whether the ambient sound meets the second preset condition, wherein both the preset range and the preset threshold can be Set it yourself based on experience.
在环境声音满足第二预设条件时,确定用户需要对智能设备进行语音输入,唤醒语音助手,并同时将环境声音作为语音操作指令输入语音助手,语音助手直接执行相应的操作,免去了特定语音唤醒语音助手的步骤,用户操作简单。When the ambient sound satisfies the second preset condition, it is determined that the user needs to perform voice input to the smart device, wake up the voice assistant, and at the same time input the ambient sound as a voice operation command into the voice assistant, and the voice assistant directly performs the corresponding operation, eliminating the need for specific The steps of waking up the voice assistant by voice are easy for users to operate.
需要说明的是,本发明并不对人脸图像及环境声音的检测顺序加以限定,另外,用户可以根据实际情况进行其他对话特征的排查,例如检测到用户是在哼歌或自言自语时认为不满足第二预设条件等,本发明并不对判定条件加以限定。It should be noted that the present invention does not limit the detection sequence of face images and environmental sounds. In addition, users can check other dialogue features according to actual conditions, for example, when it is detected that the user is humming or talking to himself, he thinks that If the second preset condition and the like are not satisfied, the present invention does not limit the determination condition.
为了更好的理解本发明的实施过程,下面结合一个具体应用场景进行说明。In order to better understand the implementation process of the present invention, a specific application scenario will be described below.
在厨房中做菜时,我们想借助平板电脑进行菜谱展示,从而按照上面的步骤完成一道自己不熟悉的菜式。当做完一道工序之后我们希望平板电脑中的软件能够展示下一道工序,这时候便要向它输入指令,但是这时候有可能双手都在忙或者沾上油渍不方便触碰平板电脑,这时我们只需要脸转向平板电脑说一声“下一步”,然后本发明提供的语音唤醒软件判断出我们是在向平板电脑的语音助手输入指令,而不是对别的人说话,这样便直接完成了语音助手的唤醒步骤,并且语音助手可以立即将语音解析出来,并使菜谱软件的展示翻到下一步。When cooking in the kitchen, we want to use the tablet computer to display the recipe, so as to complete a dish that we are not familiar with according to the above steps. After finishing a process, we hope that the software in the tablet computer can display the next process. At this time, we need to input instructions to it. You only need to turn your face to the tablet computer and say "next step", and then the voice wake-up software provided by the present invention can judge that we are inputting instructions to the voice assistant of the tablet computer instead of talking to other people, so that the voice assistant is directly completed. The wake-up step, and the voice assistant can immediately analyze the voice and turn the display of the recipe software to the next step.
基于同一发明构思,本发明还提供一种唤醒语音助手的装置,下面结合附图对本发明装置的具体实施方式做详细描述。Based on the same inventive concept, the present invention also provides a device for waking up a voice assistant. The specific implementation of the device of the present invention will be described in detail below with reference to the accompanying drawings.
如图4所示,一种唤醒语音助手的装置,包括:As shown in Figure 4, a device for waking up a voice assistant includes:
人脸图像获取模块410,用于获取人脸图像;Facial image acquisition module 410, used to acquire facial images;
环境声音获取模块420,用于获取与人脸图像对应的环境声音;The ambient sound acquisition module 420 is used to acquire the ambient sound corresponding to the face image;
人脸图像检测模块430,用于检测所述人脸图像是否满足第一预设条件;A face image detection module 430, configured to detect whether the face image satisfies a first preset condition;
环境声音检测模块440,用于在所述人脸图像满足第一预设条件时,检测所述环境声音是否满足第二预设条件;An ambient sound detection module 440, configured to detect whether the ambient sound satisfies a second preset condition when the face image satisfies a first preset condition;
唤醒模块450,用于在所述环境声音满足第二预设条件时,唤醒语音助手,并将所述环境声音作为语音操作指令输入语音助手。The wakeup module 450 is configured to wake up the voice assistant when the ambient sound satisfies a second preset condition, and input the ambient sound as a voice operation instruction into the voice assistant.
人脸图像获取模块410可以根据摄像头获取人脸图像,环境声音获取模块420可以根据麦克风等获取环境声音。人脸图像获取模块410和环境声音获取模块420获取的摄像头图像和环境声音需进行存储,以便后续的条件检测。为了节省存储空间,可以设置人脸图像和环境声音的保留存储时间,以便一次操作完成后,无用的数据及时清除,腾出存储空间。The face image acquisition module 410 may acquire the face image through the camera, and the ambient sound acquisition module 420 may acquire the ambient sound through the microphone or the like. The camera images and ambient sounds acquired by the face image acquisition module 410 and the environmental sound acquisition module 420 need to be stored for subsequent condition detection. In order to save storage space, you can set the retention storage time of face images and environmental sounds, so that after an operation is completed, useless data will be cleared in time to free up storage space.
获取人脸图像和环境声音后,人脸图像检测模块430检测人脸图像是否满足第一预设条件,其中第一预设条件可以根据用户需要设置为各种具体形式。例如,如图5所示,所述人脸图像检测模块430可以包括:After acquiring the face image and ambient sound, the face image detection module 430 detects whether the face image satisfies a first preset condition, wherein the first preset condition can be set in various specific forms according to user needs. For example, as shown in Figure 5, the face image detection module 430 may include:
第一判断单元4301,用于判断人脸图像是否为正面人脸图像;The first judging unit 4301 is used to judge whether the face image is a frontal face image;
第二判断单元4302,用于在人脸图像时正面人脸图像时,判断人脸图像中的嘴部是否有动作;The second judging unit 4302 is used to judge whether the mouth in the face image is moving when the face image is a frontal face image;
判定单元4303,用于在嘴部有动作时,判定所述人脸图像满足第一预设条件。The judging unit 4303 is configured to judge that the face image satisfies the first preset condition when the mouth moves.
第一判断单元4301判断人脸图像是否为正面人脸图像的方式有很多种,例如,如图6所示,所述第一判断单元4301可以包括:There are many ways for the first judging unit 4301 to judge whether the face image is a frontal face image, for example, as shown in Figure 6, the first judging unit 4301 may include:
双眼距离获取单元43011,用于获取人脸图像中双眼的距离;The binocular distance acquisition unit 43011 is used to acquire the binocular distance in the face image;
双眼距离判断单元43012,用于判断双眼的距离是否在预设范围内;A binocular distance judging unit 43012, used to judge whether the binocular distance is within a preset range;
人脸图像判定单元43013,用于在双眼的距离在预设范围内时,判定人脸图像是正面人脸图像,否则判定人脸图像不是正面人脸图像。The human face image judging unit 43013 is configured to judge that the human face image is a frontal human face image when the distance between the eyes is within a preset range, otherwise it is judged that the human face image is not a frontal human face image.
第一判断单元4301不限制于确定人脸图像是正面人脸图像,也可以判断人脸图像是不是近似正面人脸图像。同时第一判断单元4301确定人脸图像是不是正面人脸图像也不仅仅限制于如图6所示的方式,还可以根据现有技术中其它方式实现。在判定人脸图像是正面人脸图像后,第二判断单元4302加入嘴部动作特征的判定,能够提高准确性,其中第二判断单元4302根据获取的人脸图像判断嘴部是否有动作可以根据现有技术中已有的方式实现。The first judging unit 4301 is not limited to determining whether the face image is a frontal face image, and may also judge whether the face image is an approximate frontal face image. At the same time, the first judging unit 4301 determines whether the face image is a frontal face image is not limited to the manner shown in FIG. 6 , and can also be implemented in other manners in the prior art. After judging that the face image is a frontal face image, the second judging unit 4302 adds the judgment of mouth movement features, which can improve the accuracy, wherein the second judging unit 4302 judges whether the mouth has movements according to the acquired face image. Existing mode realizes in the prior art.
环境声音检测模块440在人脸图像满足第一预设条件时,检测环境声音是否满足第二预设条件。需要说明的是,本发明并不对环境声音检测模块440和人脸图像检测模块430的执行顺序加以限定。环境声音检测模块440的实现方式有很多种,例如,如图7所示,所述环境声音检测模块440可以包括:The ambient sound detection module 440 detects whether the ambient sound satisfies the second preset condition when the face image satisfies the first preset condition. It should be noted that the present invention does not limit the execution sequence of the ambient sound detection module 440 and the face image detection module 430 . There are many ways to implement the environmental sound detection module 440. For example, as shown in FIG. 7, the environmental sound detection module 440 may include:
音量判断单元4401,用于判断所述环境声音的音量是否在预设范围内,其中音量可以根据现有技术中的音量检测仪等得到;A volume judging unit 4401, configured to judge whether the volume of the ambient sound is within a preset range, wherein the volume can be obtained according to a volume detector in the prior art;
距离判断单元4402,用于判断与所述环境声音的声源的距离是否小于预设阈值,其中距离判断单元4402可以根据声音在空气中的定性的衰减公式确定与声源的距离;A distance judging unit 4402, configured to judge whether the distance to the sound source of the ambient sound is less than a preset threshold, wherein the distance judging unit 4402 can determine the distance to the sound source according to a qualitative attenuation formula of sound in the air;
声音判定单元4403,用于在所述音量在预设范围内且与所述声源的距离小于预设阈值时,判定所述环境声音满足第二预设条件,否则人脸图像获取模块410和环境声音获取模块420重新获取的摄像头图像和环境声音,其中预设范围和预设阈值均可以根据经验自行设定。A sound determination unit 4403, configured to determine that the ambient sound satisfies a second preset condition when the volume is within a preset range and the distance from the sound source is less than a preset threshold, otherwise the face image acquisition module 410 and The camera image and ambient sound re-acquired by the ambient sound acquisition module 420, wherein the preset range and the preset threshold can be set according to experience.
唤醒模块450在环境声音满足第二预设条件时,确定用户需要对智能设备进行语音输入,唤醒语音助手,并同时将环境声音作为语音操作指令输入语音助手,语音助手直接执行相应的操作,免去了特定语音唤醒语音助手的步骤,用户操作简单。When the ambient sound meets the second preset condition, the wake-up module 450 determines that the user needs to perform voice input to the smart device, wakes up the voice assistant, and at the same time inputs the ambient sound as a voice operation command into the voice assistant, and the voice assistant directly executes the corresponding operation without Go to the steps of waking up the voice assistant with a specific voice, and the user operation is simple.
以上所述实施例的各技术特征可以进行任意的组合,为使描述简洁,未对上述实施例中的各个技术特征所有可能的组合都进行描述,然而,只要这些技术特征的组合不存在矛盾,都应当认为是本说明书记载的范围。The technical features of the above-mentioned embodiments can be combined arbitrarily. To make the description concise, all possible combinations of the technical features in the above-mentioned embodiments are not described. However, as long as there is no contradiction in the combination of these technical features, should be considered as within the scope of this specification.
以上所述实施例仅表达了本发明的几种实施方式,其描述较为具体和详细,但并不能因此而理解为对发明专利范围的限制。应当指出的是,对于本领域的普通技术人员来说,在不脱离本发明构思的前提下,还可以做出若干变形和改进,这些都属于本发明的保护范围。因此,本发明专利的保护范围应以所附权利要求为准。The above-mentioned embodiments only express several implementation modes of the present invention, and the descriptions thereof are relatively specific and detailed, but should not be construed as limiting the patent scope of the invention. It should be pointed out that those skilled in the art can make several modifications and improvements without departing from the concept of the present invention, and these all belong to the protection scope of the present invention. Therefore, the protection scope of the patent for the present invention should be based on the appended claims.
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201510227622.2ACN104820556A (en) | 2015-05-06 | 2015-05-06 | Method and device for waking up voice assistant |
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201510227622.2ACN104820556A (en) | 2015-05-06 | 2015-05-06 | Method and device for waking up voice assistant |
| Publication Number | Publication Date |
|---|---|
| CN104820556Atrue CN104820556A (en) | 2015-08-05 |
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201510227622.2APendingCN104820556A (en) | 2015-05-06 | 2015-05-06 | Method and device for waking up voice assistant |
| Country | Link |
|---|---|
| CN (1) | CN104820556A (en) |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN105204628A (en)* | 2015-09-01 | 2015-12-30 | 涂悦 | Voice control method based on visual awakening |
| CN105700363A (en)* | 2016-01-19 | 2016-06-22 | 深圳创维-Rgb电子有限公司 | Method and system for waking up smart home equipment voice control device |
| CN105912092A (en)* | 2016-04-06 | 2016-08-31 | 北京地平线机器人技术研发有限公司 | Voice waking up method and voice recognition device in man-machine interaction |
| WO2017035768A1 (en)* | 2015-09-01 | 2017-03-09 | 涂悦 | Voice control method based on visual wake-up |
| CN106782524A (en)* | 2016-11-30 | 2017-05-31 | 深圳讯飞互动电子有限公司 | One kind mixing awakening method and system |
| CN106847285A (en)* | 2017-03-31 | 2017-06-13 | 上海思依暄机器人科技股份有限公司 | A kind of robot and its audio recognition method |
| CN107315561A (en)* | 2017-06-30 | 2017-11-03 | 联想(北京)有限公司 | A kind of data processing method and electronic equipment |
| CN107517313A (en)* | 2017-08-22 | 2017-12-26 | 珠海市魅族科技有限公司 | Awakening method and device, terminal and readable storage medium storing program for executing |
| CN107679506A (en)* | 2017-10-12 | 2018-02-09 | Tcl通力电子(惠州)有限公司 | Awakening method, intelligent artifact and the computer-readable recording medium of intelligent artifact |
| CN107678793A (en)* | 2017-09-14 | 2018-02-09 | 珠海市魅族科技有限公司 | Voice assistant starts method and device, terminal and computer-readable recording medium |
| CN108055617A (en)* | 2017-12-12 | 2018-05-18 | 广东小天才科技有限公司 | Microphone awakening method and device, terminal equipment and storage medium |
| CN108098767A (en)* | 2016-11-25 | 2018-06-01 | 北京智能管家科技有限公司 | Method and device for waking up a robot |
| CN108154878A (en)* | 2017-12-12 | 2018-06-12 | 北京小米移动软件有限公司 | Control the method and device of monitoring device |
| CN108154140A (en)* | 2018-01-22 | 2018-06-12 | 北京百度网讯科技有限公司 | Voice awakening method, device, equipment and computer-readable medium based on lip reading |
| CN108363557A (en)* | 2018-02-02 | 2018-08-03 | 刘国华 | Man-machine interaction method, device, computer equipment and storage medium |
| CN109671426A (en)* | 2018-12-06 | 2019-04-23 | 珠海格力电器股份有限公司 | Voice control method and device, storage medium and air conditioner |
| CN109710131A (en)* | 2018-12-28 | 2019-05-03 | 联想(北京)有限公司 | A kind of information control method and device |
| CN109741738A (en)* | 2018-12-10 | 2019-05-10 | 平安科技(深圳)有限公司 | Voice control method, device, computer equipment and storage medium |
| CN109992237A (en)* | 2018-01-03 | 2019-07-09 | 腾讯科技(深圳)有限公司 | Intelligent voice device control method, device, computer equipment and storage medium |
| CN110164444A (en)* | 2018-02-12 | 2019-08-23 | 优视科技有限公司 | Voice input starting method, apparatus and computer equipment |
| CN110188179A (en)* | 2019-05-30 | 2019-08-30 | 浙江远传信息技术股份有限公司 | Speech-oriented identifies exchange method, device, equipment and medium |
| CN110277094A (en)* | 2018-03-14 | 2019-09-24 | 阿里巴巴集团控股有限公司 | Awakening method, device and the electronic equipment of equipment |
| CN110941455A (en)* | 2019-11-27 | 2020-03-31 | 北京声智科技有限公司 | Active wake-up method and device and electronic equipment |
| CN111243583A (en)* | 2019-12-31 | 2020-06-05 | 深圳市瑞讯云技术有限公司 | System awakening method and device |
| CN111651135A (en)* | 2020-04-27 | 2020-09-11 | 珠海格力电器股份有限公司 | Sound awakening method and device, storage medium and electrical equipment |
| WO2020187050A1 (en)* | 2019-03-15 | 2020-09-24 | 海信视像科技股份有限公司 | Display device |
| US11158314B2 (en) | 2018-06-04 | 2021-10-26 | Pegatron Corporation | Voice control device and method |
| CN114187904A (en)* | 2020-08-25 | 2022-03-15 | 广州华凌制冷设备有限公司 | Similarity threshold acquisition method, voice household appliance and computer readable storage medium |
| WO2025091960A1 (en)* | 2023-10-31 | 2025-05-08 | 华为技术有限公司 | Voice assistant interaction method and electronic device |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2000347692A (en)* | 1999-06-07 | 2000-12-15 | Sanyo Electric Co Ltd | Person detecting method, person detecting device, and control system using it |
| EP1215658A2 (en)* | 2000-12-05 | 2002-06-19 | Hewlett-Packard Company | Visual activation of voice controlled apparatus |
| CN102298443A (en)* | 2011-06-24 | 2011-12-28 | 华南理工大学 | Smart home voice control system combined with video channel and control method thereof |
| CN102945672A (en)* | 2012-09-29 | 2013-02-27 | 深圳市国华识别科技开发有限公司 | Voice control system for multimedia equipment, and voice control method |
| CN103472994A (en)* | 2013-09-06 | 2013-12-25 | 乐得科技有限公司 | Operation control achieving method, device and system based on voice |
| US20140222436A1 (en)* | 2013-02-07 | 2014-08-07 | Apple Inc. | Voice trigger for a digital assistant |
| CN104078041A (en)* | 2014-06-26 | 2014-10-01 | 美的集团股份有限公司 | Voice recognition method and system |
| CN104103274A (en)* | 2013-04-11 | 2014-10-15 | 纬创资通股份有限公司 | Speech processing apparatus and speech processing method |
| CN104428832A (en)* | 2012-07-09 | 2015-03-18 | Lg电子株式会社 | Speech recognition apparatus and method |
| CN104423992A (en)* | 2013-09-03 | 2015-03-18 | 冠捷投资有限公司 | Starting method for voice recognition of display |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| JP2000347692A (en)* | 1999-06-07 | 2000-12-15 | Sanyo Electric Co Ltd | Person detecting method, person detecting device, and control system using it |
| EP1215658A2 (en)* | 2000-12-05 | 2002-06-19 | Hewlett-Packard Company | Visual activation of voice controlled apparatus |
| US6970824B2 (en)* | 2000-12-05 | 2005-11-29 | Hewlett-Packard Development Company, L.P. | Enabling voice control of voice-controlled apparatus using a head mounted camera system |
| CN102298443A (en)* | 2011-06-24 | 2011-12-28 | 华南理工大学 | Smart home voice control system combined with video channel and control method thereof |
| CN104428832A (en)* | 2012-07-09 | 2015-03-18 | Lg电子株式会社 | Speech recognition apparatus and method |
| CN102945672A (en)* | 2012-09-29 | 2013-02-27 | 深圳市国华识别科技开发有限公司 | Voice control system for multimedia equipment, and voice control method |
| US20140222436A1 (en)* | 2013-02-07 | 2014-08-07 | Apple Inc. | Voice trigger for a digital assistant |
| CN104103274A (en)* | 2013-04-11 | 2014-10-15 | 纬创资通股份有限公司 | Speech processing apparatus and speech processing method |
| CN104423992A (en)* | 2013-09-03 | 2015-03-18 | 冠捷投资有限公司 | Starting method for voice recognition of display |
| CN103472994A (en)* | 2013-09-06 | 2013-12-25 | 乐得科技有限公司 | Operation control achieving method, device and system based on voice |
| CN104078041A (en)* | 2014-06-26 | 2014-10-01 | 美的集团股份有限公司 | Voice recognition method and system |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN105204628A (en)* | 2015-09-01 | 2015-12-30 | 涂悦 | Voice control method based on visual awakening |
| WO2017035768A1 (en)* | 2015-09-01 | 2017-03-09 | 涂悦 | Voice control method based on visual wake-up |
| CN105700363A (en)* | 2016-01-19 | 2016-06-22 | 深圳创维-Rgb电子有限公司 | Method and system for waking up smart home equipment voice control device |
| CN105700363B (en)* | 2016-01-19 | 2018-10-26 | 深圳创维-Rgb电子有限公司 | A kind of awakening method and system of smart home device phonetic controller |
| CN105912092A (en)* | 2016-04-06 | 2016-08-31 | 北京地平线机器人技术研发有限公司 | Voice waking up method and voice recognition device in man-machine interaction |
| CN108098767A (en)* | 2016-11-25 | 2018-06-01 | 北京智能管家科技有限公司 | Method and device for waking up a robot |
| CN106782524A (en)* | 2016-11-30 | 2017-05-31 | 深圳讯飞互动电子有限公司 | One kind mixing awakening method and system |
| CN106847285A (en)* | 2017-03-31 | 2017-06-13 | 上海思依暄机器人科技股份有限公司 | A kind of robot and its audio recognition method |
| CN106847285B (en)* | 2017-03-31 | 2020-05-05 | 上海思依暄机器人科技股份有限公司 | Robot and voice recognition method thereof |
| CN107315561A (en)* | 2017-06-30 | 2017-11-03 | 联想(北京)有限公司 | A kind of data processing method and electronic equipment |
| CN107517313A (en)* | 2017-08-22 | 2017-12-26 | 珠海市魅族科技有限公司 | Awakening method and device, terminal and readable storage medium storing program for executing |
| CN107678793A (en)* | 2017-09-14 | 2018-02-09 | 珠海市魅族科技有限公司 | Voice assistant starts method and device, terminal and computer-readable recording medium |
| CN107679506A (en)* | 2017-10-12 | 2018-02-09 | Tcl通力电子(惠州)有限公司 | Awakening method, intelligent artifact and the computer-readable recording medium of intelligent artifact |
| CN108055617B (en)* | 2017-12-12 | 2020-12-15 | 广东小天才科技有限公司 | A wake-up method, device, terminal device and storage medium for a microphone |
| CN108154878A (en)* | 2017-12-12 | 2018-06-12 | 北京小米移动软件有限公司 | Control the method and device of monitoring device |
| CN108055617A (en)* | 2017-12-12 | 2018-05-18 | 广东小天才科技有限公司 | Microphone awakening method and device, terminal equipment and storage medium |
| CN114860187B (en)* | 2018-01-03 | 2025-03-18 | 腾讯科技(深圳)有限公司 | Intelligent voice device control method, device, computer device and storage medium |
| CN114860187A (en)* | 2018-01-03 | 2022-08-05 | 腾讯科技(深圳)有限公司 | Intelligent voice equipment control method and device, computer equipment and storage medium |
| CN109992237B (en)* | 2018-01-03 | 2022-04-22 | 腾讯科技(深圳)有限公司 | Intelligent voice equipment control method and device, computer equipment and storage medium |
| CN109992237A (en)* | 2018-01-03 | 2019-07-09 | 腾讯科技(深圳)有限公司 | Intelligent voice device control method, device, computer equipment and storage medium |
| CN108154140A (en)* | 2018-01-22 | 2018-06-12 | 北京百度网讯科技有限公司 | Voice awakening method, device, equipment and computer-readable medium based on lip reading |
| US20190228212A1 (en)* | 2018-01-22 | 2019-07-25 | Beijing Baidu Netcom Science And Technology Co., Ltd. | Wakeup method, apparatus and device based on lip reading, and computer readable medium |
| JP2019128938A (en)* | 2018-01-22 | 2019-08-01 | ベイジン バイドゥ ネットコム サイエンス テクノロジー カンパニー リミテッド | Lip reading based voice wakeup method, apparatus, arrangement and computer readable medium |
| US10810413B2 (en) | 2018-01-22 | 2020-10-20 | Beijing Baidu Netcom Science And Technology Co., Ltd. | Wakeup method, apparatus and device based on lip reading, and computer readable medium |
| WO2019149160A1 (en)* | 2018-02-02 | 2019-08-08 | 刘国华 | Human-machine interaction method and device, computer apparatus, and storage medium |
| CN108363557A (en)* | 2018-02-02 | 2018-08-03 | 刘国华 | Man-machine interaction method, device, computer equipment and storage medium |
| US11483657B2 (en) | 2018-02-02 | 2022-10-25 | Guohua Liu | Human-machine interaction method and device, computer apparatus, and storage medium |
| JP7066877B2 (en) | 2018-02-02 | 2022-05-13 | 國華 劉 | Human-machine interaction methods, devices, computer devices and storage media |
| JP2021513123A (en)* | 2018-02-02 | 2021-05-20 | 劉 國華LIU, Guohua | Human-machine interaction methods, devices, computer devices and storage media |
| CN108363557B (en)* | 2018-02-02 | 2020-06-12 | 刘国华 | Human-computer interaction method and device, computer equipment and storage medium |
| CN110164444A (en)* | 2018-02-12 | 2019-08-23 | 优视科技有限公司 | Voice input starting method, apparatus and computer equipment |
| CN110277094A (en)* | 2018-03-14 | 2019-09-24 | 阿里巴巴集团控股有限公司 | Awakening method, device and the electronic equipment of equipment |
| US11158314B2 (en) | 2018-06-04 | 2021-10-26 | Pegatron Corporation | Voice control device and method |
| CN109671426A (en)* | 2018-12-06 | 2019-04-23 | 珠海格力电器股份有限公司 | Voice control method and device, storage medium and air conditioner |
| CN109741738A (en)* | 2018-12-10 | 2019-05-10 | 平安科技(深圳)有限公司 | Voice control method, device, computer equipment and storage medium |
| CN109710131A (en)* | 2018-12-28 | 2019-05-03 | 联想(北京)有限公司 | A kind of information control method and device |
| WO2020187050A1 (en)* | 2019-03-15 | 2020-09-24 | 海信视像科技股份有限公司 | Display device |
| CN110188179A (en)* | 2019-05-30 | 2019-08-30 | 浙江远传信息技术股份有限公司 | Speech-oriented identifies exchange method, device, equipment and medium |
| CN110941455B (en)* | 2019-11-27 | 2024-02-20 | 北京声智科技有限公司 | Active wake-up method and device and electronic equipment |
| CN110941455A (en)* | 2019-11-27 | 2020-03-31 | 北京声智科技有限公司 | Active wake-up method and device and electronic equipment |
| CN111243583B (en)* | 2019-12-31 | 2023-03-10 | 深圳市瑞讯云技术有限公司 | System awakening method and device |
| CN111243583A (en)* | 2019-12-31 | 2020-06-05 | 深圳市瑞讯云技术有限公司 | System awakening method and device |
| CN111651135A (en)* | 2020-04-27 | 2020-09-11 | 珠海格力电器股份有限公司 | Sound awakening method and device, storage medium and electrical equipment |
| CN111651135B (en)* | 2020-04-27 | 2021-05-25 | 珠海格力电器股份有限公司 | Sound awakening method and device, storage medium and electrical equipment |
| CN114187904A (en)* | 2020-08-25 | 2022-03-15 | 广州华凌制冷设备有限公司 | Similarity threshold acquisition method, voice household appliance and computer readable storage medium |
| WO2025091960A1 (en)* | 2023-10-31 | 2025-05-08 | 华为技术有限公司 | Voice assistant interaction method and electronic device |
| Publication | Publication Date | Title |
|---|---|---|
| CN104820556A (en) | Method and device for waking up voice assistant | |
| US10643621B2 (en) | Speech recognition using electronic device and server | |
| CN108509119B (en) | Operation method of electronic device for function execution and electronic device supporting same | |
| KR102293063B1 (en) | Customizable wake-up voice commands | |
| KR102414122B1 (en) | Electronic device for processing user utterance and method for operation thereof | |
| CN110199350B (en) | Method for sensing the end of speech and electronic device implementing the method | |
| KR102405793B1 (en) | Method for recognizing voice signal and electronic device supporting the same | |
| CN108121490B (en) | Electronic device, method and server for processing multi-mode input | |
| CN108023934B (en) | Electronic device and control method thereof | |
| KR102398649B1 (en) | Electronic device for processing user utterance and method for operation thereof | |
| KR102815504B1 (en) | Method and electronic device for providing contents | |
| US8452597B2 (en) | Systems and methods for continual speech recognition and detection in mobile computing devices | |
| US10217477B2 (en) | Electronic device and speech recognition method thereof | |
| EP3567584A1 (en) | Electronic apparatus and method for operating same | |
| CN104580699B (en) | Acoustic control intelligent terminal method and device when a kind of standby | |
| KR20180083587A (en) | Electronic device and operating method thereof | |
| CN108829235A (en) | Voice data processing method and the electronic equipment for supporting this method | |
| KR102414173B1 (en) | Speech recognition using Electronic Device and Server | |
| KR20180109624A (en) | Method for operating speech recognition service and electronic device supporting the same | |
| KR20180109625A (en) | Method for operating speech recognition service and electronic device supporting the same | |
| US11620995B2 (en) | Voice interaction processing method and apparatus | |
| CN106412312A (en) | Method and system for automatically awakening camera shooting function of intelligent terminal, and intelligent terminal | |
| CN107526522A (en) | Blank screen gesture identification method and device, and mobile terminal, storage medium | |
| CN105718019A (en) | Information processing method and electronic device | |
| CN109949815B (en) | Electronic Devices |
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| EXSB | Decision made by sipo to initiate substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| RJ01 | Rejection of invention patent application after publication | Application publication date:20150805 | |
| RJ01 | Rejection of invention patent application after publication |