CN105045122A - Intelligent household natural interaction system based on audios and videos - Google Patents

Intelligent household natural interaction system based on audios and videos

Publication number
CN105045122A
Authority
CN
China
Prior art keywords
module
information
cloud server
signal processing
user
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201510355845.7A
Other languages
Chinese (zh)
Inventor
张子兴
陈宇翔
黄力
林子楠
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2015-06-24
Filing date
2015-06-24
Publication date
2015-11-11
Application filed by Individual
Priority to CN201510355845.7A
Publication of CN105045122A
Legal status: Pending


Abstract

The invention discloses an intelligent household natural interaction system based on audio and video. The system comprises four main parts: a front end, a central processing unit, a back end, and a cloud end. The front end comprises a microphone system, a camera system, a third-party sensor interface, and a feedback module; it collects sound and image information and displays system feedback. The central processing unit comprises an audio signal processing and information extraction module, a video signal processing and information extraction module, a third-party signal processing and information extraction module, and an information fusion module; it processes the acquired audio and visual signals and uses machine learning methods to derive the user's instructions. The back end comprises an indoor control signal transmitting module and a cloud server communication module; it converts the derived instructions into transmittable control signals and also provides the system's communication channel. The cloud end comprises a cloud server that provides computing, storage, and communication resources. The system offers a high degree of human-machine interactivity and greatly improves the convenience of controlling household appliances and acquiring information.

Description

Intelligent home natural interaction system based on audio and video
Technical Field
The invention relates to the technical field of information, in particular to an intelligent home natural interaction system based on audio and video technologies.
Background
Driven by advances in the Internet of Things and artificial intelligence, intelligent home technology has developed rapidly, and many related hardware products have appeared, such as Nest's intelligent thermostat and smoke alarm, the Philips Hue intelligent bulb, Haier's intelligent refrigerator, and August's intelligent lock. These devices go a long way toward meeting people's needs for controlling household equipment. However, they lack a uniform control standard and interface. Typically, each has its own system and matching control method, such as a mobile phone app. This incompatibility makes control cumbersome for the user, who must, for example, repeat operations across several systems. In response, Apple released its own control platform, HomeKit, Samsung developed the SmartHome platform, Quirky has the Wink and Relay platforms, and so on; these platforms or devices improve the convenience of operating intelligent devices to some extent. However, they all rely on a single interaction mode, such as voice control or smartphone control. In many cases, no single interaction mode enables natural interaction with home devices.
A prior-art search found that the system and control method of patent publication No. CN102298443 uses lip reading to assist a speech recognition system in a home environment. However, lip-reading recognition is strongly constrained by the user's angle, position, lighting, and so on, and a high recognition rate is difficult to achieve in practice, which degrades the user experience. Moreover, that system has no externally open interface or cloud service platform, which greatly limits its extensibility and range of application.
Disclosure of Invention
To overcome the shortcomings of existing intelligent home device control, the invention provides an intelligent home interaction system based on audio and video. Compared with existing home device control and interaction systems, the invention combines voice and images to achieve a more natural and robust human-computer interaction experience. It also provides a unified information analysis and fusion platform that can be readily extended to, and is compatible with, products from other intelligent home manufacturers, making user operation more natural and convenient.
The specific technical scheme adopted by the invention to solve the problems is as follows:
An intelligent home interaction control system based on audio and video mainly comprises a front end, a central processing unit, a back end, and a cloud end. The front end comprises audio and video information collection modules (a microphone system and a camera system), a third-party sensor interface, and a feedback display module. The central processing unit comprises an audio signal processing and information extraction module, a video signal processing and information extraction module, a third-party signal processing and information extraction interface module, and an information fusion module. The back end comprises a control signal transmitting module and a cloud server communication module. The cloud end is a cloud server.
The microphone system is a microphone array. It collects sound in the home environment in real time at a specified sampling frequency and encoding, and transmits the raw audio signal to the audio signal processing and information extraction module.
The audio signal processing and information extraction module preprocesses the collected sound signals (noise reduction, echo reduction, and sound source separation) and then performs sound source localization, speaker recognition, voice wake-up, speech recognition, and instruction detection.
First, a Kalman filter performs preliminary denoising on the signal of each channel, followed by endpoint detection and signal segmentation. The segmented signals may contain a mixture of several sound sources; the module separates the sources with a non-negative matrix algorithm and extracts the target source. The signal then undergoes multi-channel noise and echo reduction through a GCC-based delay-and-sum algorithm to suppress noise and echo.
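The endpoint detection and segmentation step can be illustrated with a minimal sketch. The text does not say which detector is used, so the short-time energy threshold below is an assumption, and the function name `detect_endpoints` and its parameters (`frame_len`, `hop`, `threshold_db`) are hypothetical.

```python
import numpy as np

def detect_endpoints(signal, frame_len=400, hop=160, threshold_db=-30.0):
    """Return (start, end) sample ranges whose frame energy exceeds a threshold.

    Assumption: a simple energy detector; the threshold is in dB relative
    to the loudest frame of the recording.
    """
    signal = np.asarray(signal, dtype=float)
    n_frames = 1 + max(0, (len(signal) - frame_len) // hop)
    energies = np.array([
        np.mean(signal[i * hop:i * hop + frame_len] ** 2)
        for i in range(n_frames)
    ])
    # Level relative to the loudest frame; small constants avoid log(0).
    level_db = 10.0 * np.log10(energies / (energies.max() + 1e-12) + 1e-12)
    active = level_db > threshold_db

    segments, start = [], None
    for i, is_active in enumerate(active):
        if is_active and start is None:
            start = i  # first active frame of a new segment
        elif not is_active and start is not None:
            end = min((i - 1) * hop + frame_len, len(signal))
            segments.append((start * hop, end))
            start = None
    if start is not None:  # segment still open at the end of the signal
        end = min((n_frames - 1) * hop + frame_len, len(signal))
        segments.append((start * hop, end))
    return segments
```

Each returned range could then be cut out of every channel and passed on to source separation.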
The sound source localization system uses the time difference of arrival (TDOA) of the signals received on different channels to determine the location of the sound source, while the multi-channel noise and echo suppression is applied. Once the sound source is located, the system can automatically adjust its orientation according to the speaker's position, so that the system faces the user at a suitable angle.
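As an illustration of TDOA estimation between two microphone channels, the sketch below uses generalized cross-correlation with phase transform (GCC-PHAT), a common choice for this task. Whether the patent uses the PHAT weighting is not stated, and the names `gcc_phat_delay` and `arrival_angle_deg`, the two-microphone far-field geometry, and the speed of sound value are assumptions.

```python
import numpy as np

def gcc_phat_delay(sig, ref, fs, max_tau=None):
    """Estimate the delay of `sig` relative to `ref`, in seconds.

    A positive result means `sig` arrives later than `ref`.
    """
    n = len(sig) + len(ref)              # zero-pad to avoid circular wrap-around
    spec_sig = np.fft.rfft(sig, n=n)
    spec_ref = np.fft.rfft(ref, n=n)
    cross = spec_sig * np.conj(spec_ref)
    cross /= np.abs(cross) + 1e-12       # PHAT weighting: keep phase only
    cc = np.fft.irfft(cross, n=n)

    max_shift = n // 2
    if max_tau is not None:
        max_shift = min(int(fs * max_tau), max_shift)
    # Reorder so that index 0 corresponds to lag -max_shift.
    cc = np.concatenate((cc[-max_shift:], cc[:max_shift + 1]))
    shift = int(np.argmax(np.abs(cc))) - max_shift
    return shift / fs

def arrival_angle_deg(tau, mic_distance, speed_of_sound=343.0):
    """Far-field arrival angle (degrees from broadside) for a two-microphone pair."""
    ratio = np.clip(speed_of_sound * tau / mic_distance, -1.0, 1.0)
    return float(np.degrees(np.arcsin(ratio)))
```

With more than two microphones, pairwise delays of this kind can be combined to estimate a position rather than a single angle, and the angle can drive the orientation adjustment described above.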
Then, the signal processed by noise reduction and echo reduction is input into the speaker confirmation module. The module is used for judging whether the user has the use right of the system. The module uses an i-vector algorithm to identify the speaker. An unauthorized user will not have control rights to the system.
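The verification decision can be sketched as follows. The sketch assumes fixed-length speaker vectors (for example, i-vectors) have already been extracted by a separate front end that is not shown, and it compares them by cosine similarity, a common scoring rule for i-vectors. The names `cosine_score` and `identify_speaker` and the threshold value are hypothetical.

```python
import numpy as np

def cosine_score(a, b):
    """Cosine similarity between two speaker vectors (0.0 if either is all zeros)."""
    a = np.asarray(a, dtype=float)
    b = np.asarray(b, dtype=float)
    denom = np.linalg.norm(a) * np.linalg.norm(b)
    return float(a @ b / denom) if denom > 0 else 0.0

def identify_speaker(test_vector, enrolled_vectors, threshold=0.7):
    """Return the best-matching enrolled user, or None if no score reaches the threshold.

    `enrolled_vectors` maps a user name to that user's enrollment vector.
    """
    best_name, best_score = None, -1.0
    for name, vector in enrolled_vectors.items():
        score = cosine_score(test_vector, vector)
        if score > best_score:
            best_name, best_score = name, score
    return best_name if best_score >= threshold else None
```

A `None` result corresponds to an unauthorized user, whose input is ignored.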
If the user has the use authority, the voice awakening module judges whether the detected sound contains the awakening keyword. If so, the system of the invention enters the active interactive mode from the sleep mode. The subsequently detected sound signal is directly sent to the speech recognition and natural semantic understanding module.
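The sleep and active modes described here behave like a small state machine. The sketch below is one possible reading, assuming a keyword spotter has already turned the detected sound into text; the class `WakeController`, the enum `Mode`, and the example wake phrase are hypothetical.

```python
from enum import Enum

class Mode(Enum):
    SLEEP = "sleep"
    ACTIVE = "active"

class WakeController:
    """Gate utterances on speaker authorization and a wake keyword."""

    def __init__(self, wake_words):
        self.wake_words = [w.lower() for w in wake_words]
        self.mode = Mode.SLEEP

    def handle_utterance(self, text, speaker_authorized):
        """Return text to forward to speech understanding, or None to drop it."""
        if not speaker_authorized:
            return None                      # unauthorized users have no control rights
        if self.mode is Mode.SLEEP:
            if any(w in text.lower() for w in self.wake_words):
                self.mode = Mode.ACTIVE      # wake keyword found: leave sleep mode
            return None                      # the wake phrase itself is not a command
        return text                          # active mode: forward subsequent speech
```

For example, after `WakeController(["hello home"])` receives "hello home" from an authorized speaker, the next utterance is forwarded unchanged.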
The speech recognition module converts the speech signal into text, then analyzes the text with natural language understanding to detect control or interaction instructions.
The camera system comprises a common camera and a depth camera. It is responsible for collecting the user's action and activity information. In particular, it is used to detect face, gesture, and motion information of a user.
Firstly, face detection is carried out on RGB images obtained by a common camera. Once the human face is detected, the related image is subjected to face recognition and identity verification. Here, the system compares the detected face with a pre-stored face of an authorized user (based on facial features and machine learning), and if the verification is successful, the motion recognition module is activated. The input of the module is a depth image acquired by a depth camera, and the image is firstly used for real-time skeleton tracking and acquiring information such as human joint positions. The information of the skeleton tracking can also be used for positioning the user, and the system can automatically adjust the direction according to the position of the user, so that the system and the user are in a relatively proper angle.
The body joint information is then compared to the actions in the action library in the system. Once a corresponding matching action is found, instructional information associated with the action is generated.
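Matching joint positions against an action library can be sketched as a nearest-template search. The patent does not describe its matching method, so this is an assumption: each action is represented by a single static pose, poses are normalized for position and scale, and the names `normalize_pose` and `match_action` and the distance threshold are hypothetical.

```python
import numpy as np

def normalize_pose(joints):
    """Center joints on their mean and scale to unit norm.

    This makes matching independent of where the user stands and of body size.
    """
    joints = np.asarray(joints, dtype=float)
    centered = joints - joints.mean(axis=0)
    scale = np.linalg.norm(centered)
    return centered / scale if scale > 0 else centered

def match_action(joints, action_library, max_distance=0.25):
    """Return the name of the closest library action, or None if none is close enough.

    `action_library` maps an action name to a template array of joint positions
    with the same shape as `joints`.
    """
    pose = normalize_pose(joints)
    best_name, best_dist = None, float("inf")
    for name, template in action_library.items():
        dist = np.linalg.norm(pose - normalize_pose(template))
        if dist < best_dist:
            best_name, best_dist = name, dist
    return best_name if best_dist <= max_distance else None
```

The matched name would then be looked up in an instruction table to produce the instruction associated with that action. Gestures that unfold over time would need a sequence matcher instead of a single-pose comparison.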
The third-party sensor interface and the third-party signal processing and information extraction interface module are used for function expansion and providing corresponding interfaces for other developers in the future so as to realize the customized function.
The feedback display module is used for communication and interaction between the system and the user. When instruction recognition is ambiguous or erroneous, the user can confirm or correct the instruction through the feedback display module.
The information fusion module fuses the detected voice instructions, gesture instructions, and other instruction information, and determines the user's instruction probabilistically. The mathematical description is as follows:

$$P(c) = w_a P_a(c) + w_v P_v(c) + w_o P_o(c),$$

where $w_a + w_v + w_o = 1$. Here $P(c)$ is the predicted probability of instruction $c$; $P_a(c)$, $P_v(c)$, and $P_o(c)$ are the probabilities of instruction $c$ predicted from the voice, video, and other sensor signals, respectively; and $w_a$, $w_v$, and $w_o$ are the weights of the voice, video, and other sensor signals, respectively.
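A minimal sketch of this fusion step follows. It assumes the fused probability is a weighted sum of the per-modality probabilities, which is what the weights and per-sensor probabilities defined above suggest but is not confirmed by the garbled source. The function name `fuse_instructions` and the default weights are hypothetical.

```python
def fuse_instructions(audio_probs, video_probs, other_probs=None,
                      weights=(0.5, 0.4, 0.1)):
    """Combine per-modality instruction probabilities and pick the most likely one.

    Each *_probs argument maps an instruction name to its predicted probability.
    Assumption: fusion is a weighted sum with weights (w_a, w_v, w_o).
    Returns (instruction, fused_probability), or (None, 0.0) with no candidates.
    """
    other_probs = other_probs or {}
    w_a, w_v, w_o = weights
    candidates = set(audio_probs) | set(video_probs) | set(other_probs)
    fused = {
        c: w_a * audio_probs.get(c, 0.0)
           + w_v * video_probs.get(c, 0.0)
           + w_o * other_probs.get(c, 0.0)
        for c in candidates
    }
    if not fused:
        return None, 0.0
    best = max(fused, key=fused.get)
    return best, fused[best]
```

An instruction that only one modality proposes simply receives zero probability from the others, so a single active modality can still produce a decision.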
The control signal transmitting module converts a control instruction into a signal that can actually control a household appliance, and controls the appliance over wireless links such as infrared, RF (radio frequency), Bluetooth, Wi-Fi, Zigbee, and Z-Wave.
The cloud server communication module handles communication between the information fusion module and the cloud server. The local end can send a resource request to the cloud end, and the corresponding resources are returned to the local end through this module. The cloud end can also send instructions to the local end through this module to control household appliances remotely, and the local end can transmit home information to the cloud end.
The cloud server is used to: a) provide additional computing resources for the local end; b) provide additional storage space or data backup for the local end; c) provide an information exchange platform for user terminals such as mobile phones; and d) provide other information to the user, such as query results or music.
The invention has the beneficial effects that: 1) the front end adopts a voice and gesture recognition interaction mode, so that the interaction naturalness is improved; 2) the voice interaction mode and the visual interaction mode are independent and complementary, and can work independently or cooperatively, so that the application limitation of a single interaction mode in a household is broken through, and the robustness of man-machine interaction is improved; 3) an interface of a third party is provided, and a third party developer can add signal processing and information extraction functions of other sensors as required, so that the system is well expanded; 4) the back end provides a plurality of wireless communication modes, and good compatibility is provided; 5) both local and remote modes of operation are provided. The local mode physically ensures the security and privacy of the user's system, while the remote mode may provide the user with additional information and more advanced services.
Drawings
FIG. 1 is a block diagram of an audio and video based smart home natural interaction control system according to the present invention.
FIG. 2 is a flow chart of audio signal processing and information extraction according to the present invention.
FIG. 3 is a flow chart of video signal processing and information extraction according to the present invention.
FIG. 4 is a flow chart of an information fusion module of the present invention.
Detailed Description
Aiming at the problems in the prior art, the invention provides an intelligent home interaction system which is based on an intelligent audio and video analysis processing technology, can improve the convenience, comfort level and control accuracy of human-computer interaction, and has high compatibility and expandability.
In order to make the technical solution of the present invention clearer, the following detailed description of the present invention is made with reference to the accompanying drawings and examples, and the description is to be considered as exemplary.
As shown in FIG. 1, the system comprises a front end, a central processing unit, a back end, and a cloud end. The front end is mainly responsible for collecting sound signals, image signals, and other information, and for displaying system feedback. The central processing unit is mainly responsible for processing the collected sound and visual signals and deriving useful instruction information using machine learning and pattern recognition methods. The back end is mainly responsible for converting the derived instructions into transmittable signals to control home appliances and the like; it can also obtain information from, and exchange information with, the cloud server at the cloud end.
When the device is switched on, the invention detects the sound and image signals in the home in real time.
A detailed flow chart of the audio signal processing and information extraction of the present invention is shown in FIG. 2. Suppose a user at home says, for example, "turn on the lights". The sound is detected by the microphone system (step 201); after preliminary denoising of the multi-channel audio signal (step 202), endpoint detection and segmentation are performed (step 203), and the audio segment containing "turn on the lights" is extracted. When several sound sources are active at once (e.g., several users speak at the same time, or music is playing while a user speaks), the system separates the sources (step 204) and strips off the background sound. Meanwhile, the invention analyzes where the sound comes from (step 205) so that the orientation of the system can be adjusted in time (step 206). For example, when the user is behind the system, the system may rotate 180 degrees to face the user. After further noise and echo reduction (step 207), the system verifies the speaker and ignores the input if the speaker is not an authorized member; otherwise, the user's input is processed further (step 208) and wake-up detection is performed (step 209). If the user's speech matches a wake-up keyword such as "light on", the system switches from the sleep state to the awake state; otherwise, it continues to listen for the wake-up instruction. After the system wakes up, speech recognition is performed on the user's subsequent speech (step 210). For example, when the recognition result is "please turn on the lamp", "turn up the air conditioning temperature", "play 'Blue and White Porcelain' by Jay Chou", or "view my unread mail", the system extracts the keywords through natural language understanding (step 211), such as "turn on", "the lamp", "turn up", "air conditioning", "temperature", "play", "Jay Chou", "Blue and White Porcelain", "view", "my", and "unread mail".
These keywords are sent to the information fusion module (module 15) for further processing.
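The keyword extraction step can be approximated, for illustration only, by looking up known phrases in the recognized text. A real natural language understanding component would be considerably more involved; the function name `extract_keywords` and the vocabulary below are hypothetical.

```python
def extract_keywords(utterance, vocabulary):
    """Return the vocabulary phrases found in the utterance, in order of appearance.

    Assumption: plain case-insensitive substring matching against a fixed list.
    """
    text = utterance.lower()
    hits = []
    for phrase in vocabulary:
        position = text.find(phrase.lower())
        if position >= 0:
            hits.append((position, phrase))
    # Sort by position so the keywords follow the spoken order.
    return [phrase for _, phrase in sorted(hits)]
```

For the recognition result "turn up the air conditioning temperature", a vocabulary containing "turn up", "air conditioning", and "temperature" yields those three keywords in spoken order.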
The invention detects the video signal in real time while detecting the audio signal. The detailed flow of video signal processing and information extraction is shown in fig. 3. The input of the module is a video signal, which comprises two types: a normal RGB image signal (301) and a depth image signal (302). Firstly, the module carries out face detection (303) in an RGB image in real time, and carries out face recognition and identity confirmation (304) on the image when a face is detected. Once the identity is confirmed and the identity has the corresponding usage rights, further operations are permitted, otherwise the face detection step is returned to. At the same time, the module will also perform real-time skeleton tracking (305) using the depth image, the tracking information can be used to locate the user (306), and adjust the orientation of the system in real-time to achieve the best detection effect (307). Once the user's identity is confirmed, the user's skeletal information is used for action recognition (309), and the recognizable actions are stored in an action library (308). Finally, the recognized action is translated into an instruction (311) in an instruction library (310). The instruction is sent to the information fusion module for further processing.
When the system detects a voice or gesture command signal, the information fusion module of the present invention (as shown in fig. 4) will decide the final command by the maximum probability. Some typical application scenarios are exemplified below.
1) Only the audio system is active. For example, when a user is cooking, both hands are busy. At this time, if the user wants to listen to the song, the system can be awakened through voice, and the song that the user wants to play is selected.
2) Only the video system is active. For example, at a noisy family party, the host can control home devices through gesture instructions.
3) The audio and video systems are active simultaneously. In this case, the audio and video information complement each other, which improves the accuracy of instruction recognition. For example, when a user says "turn off the light" while pointing at a particular light, the invention combines the voice and gesture instructions to turn off that particular light.
As mentioned above, the audio system and the video system of the present invention can work independently or jointly. This achieves tightly integrated human-computer interaction while improving the robustness of instruction recognition. If the maximum probability obtained by the information fusion module is below a specified threshold, or the audio and video instructions conflict, that is, the instruction recognition is uncertain, the system can ask the user for confirmation through the feedback display module (module 14). The feedback of the invention has three modes: speech, images, and text. Text feedback can be displayed directly on the feedback display module, while speech must first be synthesized and then played through the feedback module. For example, the invention can ask "Are you sure you want to turn off the lamp?" Similarly, images can be output on the feedback display module to improve the interactivity of the system. The user can confirm by voice or gesture to avoid misoperation.
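The confirmation rule described above (low fused probability, or disagreement between audio and video) can be sketched as follows. The function names `decide` and `confirmation_prompt`, the threshold value, and the prompt wording are hypothetical.

```python
def decide(fused_scores, audio_choice=None, video_choice=None, threshold=0.6):
    """Return (instruction, needs_confirmation).

    `fused_scores` maps each candidate instruction to its fused probability.
    Confirmation is requested when the best score is below the threshold or
    when the audio and video modalities each proposed a different instruction.
    """
    if not fused_scores:
        return None, False
    best = max(fused_scores, key=fused_scores.get)
    low_confidence = fused_scores[best] < threshold
    conflict = (
        audio_choice is not None
        and video_choice is not None
        and audio_choice != video_choice
    )
    return best, low_confidence or conflict

def confirmation_prompt(instruction):
    """Build the text shown or spoken by the feedback module."""
    return f"Are you sure you want to {instruction}?"
```

When `needs_confirmation` is true, the prompt can be shown as text or synthesized to speech, and the instruction is executed only after the user confirms.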
Then, the information fusion module will deliver the command to the control signal transmitting module (module 16) or to the cloud server communication module (module 17) for processing according to the command type.
Instructions related to household appliances, such as turning on the lamp, are sent to the control signal transmitting module. The module converts "turn on the lamp" into a specific signal that the lamp controller can receive, and transmits it. The signal may be infrared, RF, Bluetooth, Wi-Fi, Zigbee, Z-Wave, etc. Similarly, the user may also use motion instructions, such as moving a hand left or right to switch the music being played, and up or down to adjust the volume.
Instructions related to the internet, such as information queries, are sent to the cloud end through the cloud server communication module. For example, for "view my unread mail", the instruction is sent to the cloud server, which retrieves the unread mail and returns it to the local end; likewise, for "download 'Blue and White Porcelain' by Jay Chou", the module downloads the song from a music library on a network-connected server.
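The routing of an instruction by type, to either the control signal transmitting module or the cloud server communication module, can be sketched as a simple dispatch. The `Instruction` fields, the two type labels, and the callback parameters are hypothetical stand-ins for the two modules.

```python
from dataclasses import dataclass

@dataclass
class Instruction:
    action: str   # e.g. "turn on"
    target: str   # e.g. "lamp" or "unread mail"
    kind: str     # "appliance" or "internet" (labels assumed for this sketch)

def route(instruction, send_control_signal, send_to_cloud):
    """Hand the instruction to the module responsible for its type.

    `send_control_signal` stands in for the control signal transmitting module
    and `send_to_cloud` for the cloud server communication module.
    """
    if instruction.kind == "appliance":
        return send_control_signal(instruction)
    if instruction.kind == "internet":
        return send_to_cloud(instruction)
    raise ValueError(f"unknown instruction type: {instruction.kind!r}")
```

Passing the two modules in as callbacks keeps the dispatch independent of the wireless protocol or cloud API that actually carries the instruction.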
The cloud server mentioned above is connected to the local end. Its functions include, but are not limited to, the following examples.
1) Providing additional computing resources for the local end. The voice recognition, the face recognition and the like related by the invention can save local computing resources and improve the recognition accuracy by transferring part or all of the computing requirements to the cloud server.
2) Providing space for information backup and storage for the local end. Users can save data such as documents, pictures, and videos to the cloud as needed. The advantage is that users can access the data anywhere and at any time via the internet.
3) Providing a resource portal for a third party. For example, songs can be played by connecting a cloud server of the system with a third-party music library to obtain and return the songs, so that the entertainment requirements of users are met. For another example, through the cloud server, the user can query the online goods to provide an entrance for electronic commerce.
4) Providing an entry point for information exchange for mobile terminals (such as mobile phones and tablets). The user can connect to the cloud server through a mobile app, and the cloud server forwards the control signal to the local end to control household appliances. This meets the user's need to control household appliances remotely. As another example, the mobile terminal can check on the situation at home through the cloud server: the invention can send a request for an image or video to the local end through the cloud server.
The cloud server communicates bidirectionally with the local end. It gives users at home an entry point to the internet for acquiring external information, and it gives users away from home an entry point to the local end for checking on and monitoring conditions in the house.
In addition, the cloud server is a user selectable module. Namely, under the condition of closing the cloud server module, the invention is in a local working mode, and the communication channel with the external information is cut off. By doing so, the information security of the user can be ensured, but the function provided by the cloud server can be lost.
It will be evident to those skilled in the art that the invention is not limited to the details of the foregoing illustrative embodiments, and that the present invention may be embodied in other specific forms without departing from the spirit or essential attributes thereof. The present embodiments are therefore to be considered in all respects as illustrative and not restrictive, the scope of the invention being indicated by the appended claims rather than by the foregoing description, and all changes which come within the meaning and range of equivalency of the claims are therefore intended to be embraced therein. Any reference sign in a claim should not be construed as limiting the claim concerned.

Claims (10)

CN201510355845.7A, priority date 2015-06-24, filing date 2015-06-24: Intelligent household natural interaction system based on audios and videos (Pending), published as CN105045122A (en)

Priority Applications (1)

Application Number | Priority Date | Filing Date | Title
CN201510355845.7A, CN105045122A (en) | 2015-06-24 | 2015-06-24 | Intelligent household natural interaction system based on audios and videos

Applications Claiming Priority (1)

Application Number | Priority Date | Filing Date | Title
CN201510355845.7A, CN105045122A (en) | 2015-06-24 | 2015-06-24 | Intelligent household natural interaction system based on audios and videos

Publications (1)

Publication Number | Publication Date
CN105045122A (en) | 2015-11-11

Family

ID=54451742

Family Applications (1)

Application Number | Title | Priority Date | Filing Date
CN201510355845.7A (Pending), CN105045122A (en) | Intelligent household natural interaction system based on audios and videos | 2015-06-24 | 2015-06-24

Country Status (1)

Country | Link
CN (1) | CN105045122A (en)

TWI783344B (en)*2021-01-112022-11-11圓展科技股份有限公司Sound source tracking system and method
CN116071863A (en)*2023-03-152023-05-05潍坊职业学院Instruction recognition and transmission system

Cited By (92)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
CN106896743A (en)* | 2015-12-18 | 2017-06-27 | 北京奇虎科技有限公司 | A kind of instruction responding device, the method for control terminal equipment, server and device
CN106896743B (en)* | 2015-12-18 | 2020-12-04 | 北京奇虎科技有限公司 | An instruction response device, method, server and device for controlling terminal equipment
CN106899460A (en)* | 2015-12-18 | 2017-06-27 | 北京奇虎科技有限公司 | A kind of instruction responding device, the method for control terminal equipment, server and system
CN105653709A (en)* | 2015-12-30 | 2016-06-08 | 广东顺德中山大学卡内基梅隆大学国际联合研究院 | Intelligent home voice text control method
CN105957535A (en)* | 2016-04-15 | 2016-09-21 | 青岛克路德机器人有限公司 | Robot voice signal detecting and identifying system
CN105955040A (en)* | 2016-05-20 | 2016-09-21 | 深圳市大拿科技有限公司 | Intelligent household system according to real-time video picture visual control and control method thereof
CN106019973A (en)* | 2016-07-30 | 2016-10-12 | 杨超坤 | Smart home with emotion recognition function
CN106200396A (en)* | 2016-08-05 | 2016-12-07 | 易晓阳 | A kind of appliance control method based on Motion Recognition
CN106200395A (en)* | 2016-08-05 | 2016-12-07 | 易晓阳 | A kind of multidimensional identification appliance control method
CN106254186A (en)* | 2016-08-05 | 2016-12-21 | 易晓阳 | A kind of interactive voice control system for identifying
WO2018027504A1 (en)* | 2016-08-09 | 2018-02-15 | 曹鸿鹏 | Lighting control method
WO2018027505A1 (en)* | 2016-08-09 | 2018-02-15 | 曹鸿鹏 | Lighting control system
WO2018027507A1 (en)* | 2016-08-09 | 2018-02-15 | 曹鸿鹏 | Emotion recognition-based lighting control system
CN107734213A (en)* | 2016-08-11 | 2018-02-23 | 漳州立达信光电子科技有限公司 | Smart Home Electronic Devices and Systems
CN106445455A (en)* | 2016-09-29 | 2017-02-22 | 深圳前海弘稼科技有限公司 | Planting device and method for controlling planting device
CN106406119A (en)* | 2016-11-15 | 2017-02-15 | 福州大学 | Service robot based on voice interaction, cloud technology and integrated intelligent home monitoring
CN106507047B (en)* | 2016-11-15 | 2019-05-31 | 浙江工业大学 | A kind of audio-video terminal system towards smart home
CN106507047A (en)* | 2016-11-15 | 2017-03-15 | 浙江工业大学 | An audio and video terminal system for smart home
CN106406119B (en)* | 2016-11-15 | 2019-05-10 | 福州大学 | Service robot based on voice interaction, cloud technology and integrated smart home monitoring
CN106710594A (en)* | 2016-11-17 | 2017-05-24 | 北京中科汇联科技股份有限公司 | Intelligent speech interaction system based on cloud end
CN106444415A (en)* | 2016-12-08 | 2017-02-22 | 湖北大学 | Smart home control method and system
CN106653020A (en)* | 2016-12-13 | 2017-05-10 | 中山大学 | Multi-business control method and system for smart sound and video equipment based on deep learning
CN106531165A (en)* | 2016-12-15 | 2017-03-22 | 北京塞宾科技有限公司 | Portable smart home voice control system and control method adopting same
CN106604181A (en)* | 2016-12-15 | 2017-04-26 | 北京塞宾科技有限公司 | Distributed microphone smart home system
CN106910500A (en)* | 2016-12-23 | 2017-06-30 | 北京第九实验室科技有限公司 | The method and apparatus of Voice command is carried out to the equipment with microphone array
US10453457B2 | 2016-12-23 | 2019-10-22 | Beijing Xiaoniao Tingting Technology, Co., Ltd. | Method for performing voice control on device with microphone array, and device thereof
CN106910500B (en)* | 2016-12-23 | 2020-04-17 | 北京小鸟听听科技有限公司 | Method and device for voice control of device with microphone array
CN106647305A (en)* | 2016-12-28 | 2017-05-10 | 重庆金鑫科技产业发展有限公司 | Control method and terminal
CN106782540A (en)* | 2017-01-17 | 2017-05-31 | 联想(北京)有限公司 | Speech ciphering equipment and the voice interactive system including the speech ciphering equipment
CN106782540B (en)* | 2017-01-17 | 2021-04-13 | 联想(北京)有限公司 | Voice equipment and voice interaction system comprising same
CN107230476A (en)* | 2017-05-05 | 2017-10-03 | 众安信息技术服务有限公司 | A kind of natural man machine language's exchange method and system
CN107065586B (en)* | 2017-05-23 | 2020-02-07 | 中国科学院自动化研究所 | Interactive intelligent home service system and method
CN107065586A (en)* | 2017-05-23 | 2017-08-18 | 中国科学院自动化研究所 | Interactive intelligent home services system and method
CN107371060B (en)* | 2017-08-09 | 2023-08-08 | 北京智网时代科技有限公司 | Video image synthesis system based on television output and application method
CN107371060A (en)* | 2017-08-09 | 2017-11-21 | 北京智网时代科技有限公司 | Video image synthesis system and methods for using them based on TV output
CN107395746A (en)* | 2017-08-21 | 2017-11-24 | 时瑞科技(深圳)有限公司 | A kind of Internet of things system
CN109473095A (en)* | 2017-09-08 | 2019-03-15 | 北京君林科技股份有限公司 | A kind of intelligent home control system and control method
CN109473095B (en)* | 2017-09-08 | 2020-01-10 | 北京君林科技股份有限公司 | Intelligent household control system and control method
CN107682240A (en)* | 2017-09-27 | 2018-02-09 | 四川长虹电器股份有限公司 | A kind of distributed sound interactive system for intelligent domestic
WO2019071989A1 (en)* | 2017-10-13 | 2019-04-18 | 歌尔股份有限公司 | Smart device speech enhancement method and device and smart device
US10984816B2 | 2017-10-13 | 2021-04-20 | Goertek Inc. | Voice enhancement using depth image and beamforming
CN107993660A (en)* | 2017-12-26 | 2018-05-04 | 江苏可美智能科技股份有限公司 | Speech control system for Internet of Things intelligence control system
CN108229391A (en)* | 2018-01-02 | 2018-06-29 | 京东方科技集团股份有限公司 | Gesture identifying device and its server, gesture recognition system, gesture identification method
US10725553B2 | 2018-01-02 | 2020-07-28 | Boe Technology Group Co., Ltd. | Gesture recognition device, gesture recognition method, and gesture recognition system
CN108460329B (en)* | 2018-01-15 | 2022-02-11 | 任俊芬 | Face gesture cooperation verification method based on deep learning detection
CN108460329A (en)* | 2018-01-15 | 2018-08-28 | 任俊芬 | A kind of face gesture cooperation verification method based on deep learning detection
US10810413B2 | 2018-01-22 | 2020-10-20 | Beijing Baidu Netcom Science And Technology Co., Ltd. | Wakeup method, apparatus and device based on lip reading, and computer readable medium
CN108154140A (en)* | 2018-01-22 | 2018-06-12 | 北京百度网讯科技有限公司 | Voice awakening method, device, equipment and computer-readable medium based on lip reading
CN108388138A (en)* | 2018-02-02 | 2018-08-10 | 宁夏玲杰科技有限公司 | Apparatus control method, apparatus and system
CN108364648A (en)* | 2018-02-11 | 2018-08-03 | 北京百度网讯科技有限公司 | Method and device for obtaining audio-frequency information
CN110392021B (en)* | 2018-04-18 | 2023-05-09 | 视联动力信息技术股份有限公司 | Equipment control method, video networking server, video networking terminal and device
CN110392021A (en)* | 2018-04-18 | 2019-10-29 | 北京视联动力国际信息技术有限公司 | Method, view networked server, view networked terminals and the device of a kind of equipment control
CN108828501A (en)* | 2018-04-29 | 2018-11-16 | 桂林电子科技大学 | The method that real-time tracking positioning is carried out to moving sound in sound field environment indoors
CN108828501B (en)* | 2018-04-29 | 2020-07-28 | 桂林电子科技大学 | Method for real-time tracking and positioning of mobile sound source in indoor sound field environment
CN108563208A (en)* | 2018-06-28 | 2018-09-21 | 马雷明 | Intelligent domestic system and its control method
CN108965459A (en)* | 2018-08-02 | 2018-12-07 | 上海伟赛智能科技有限公司 | A kind of personnel activity's behavior detecting system based on radio-frequency technique
CN110808050B (en)* | 2018-08-03 | 2024-04-30 | 蔚来(安徽)控股有限公司 | Voice recognition method and intelligent device
CN110808050A (en)* | 2018-08-03 | 2020-02-18 | 蔚来汽车有限公司 | Speech recognition method and smart device
CN109085761A (en)* | 2018-08-16 | 2018-12-25 | 夏琦 | A kind of detection device and the smart home system using the device
CN110874061A (en)* | 2018-08-31 | 2020-03-10 | 格力电器(武汉)有限公司 | Intelligent household working method and device
CN109168110A (en)* | 2018-09-29 | 2019-01-08 | 芜湖星途机器人科技有限公司 | External hanging type speech packet
CN109036430A (en)* | 2018-09-29 | 2018-12-18 | 芜湖星途机器人科技有限公司 | Voice control terminal
CN111007806A (en)* | 2018-10-08 | 2020-04-14 | 珠海格力电器股份有限公司 | Smart home control method and device
CN111007806B (en)* | 2018-10-08 | 2022-04-08 | 珠海格力电器股份有限公司 | Smart home control method and device
CN109151393A (en)* | 2018-10-09 | 2019-01-04 | 深圳市亿联智能有限公司 | A kind of sound fixation and recognition method for detecting
CN109326288A (en)* | 2018-10-31 | 2019-02-12 | 四川长虹电器股份有限公司 | A kind of AI speech dialogue system
CN109545240B (en)* | 2018-11-19 | 2022-12-09 | 清华大学 | A sound separation method for human-computer interaction
CN109545240A (en)* | 2018-11-19 | 2019-03-29 | 清华大学 | A kind of method of the sound separation of human-computer interaction
CN109547771A (en)* | 2019-01-07 | 2019-03-29 | 中国人民大学 | A kind of household intelligent robot having bore hole 3D display device
CN111107407A (en)* | 2019-01-08 | 2020-05-05 | 姜鹏飞 | Audio and video playing control method, device and equipment and computer readable storage medium
CN109784867A (en)* | 2019-01-18 | 2019-05-21 | 创新奇智(北京)科技有限公司 | A kind of self feed back artificial intelligence model management system
CN109803013B (en)* | 2019-01-21 | 2020-10-23 | 浙江大学 | Weak interaction system based on artificial intelligence and control method thereof
CN109803013A (en)* | 2019-01-21 | 2019-05-24 | 浙江大学 | A kind of weak interactive system and its control method based on artificial intelligence
CN109991864A (en)* | 2019-03-13 | 2019-07-09 | 佛山市云米电器科技有限公司 | Home automation scenery control system and its control method based on image recognition
CN109884908A (en)* | 2019-03-14 | 2019-06-14 | 苏州宏裕千智能设备科技有限公司 | Cloud platform, apparatus control method and system, readable storage medium storing program for executing
CN111724786A (en)* | 2019-03-22 | 2020-09-29 | 上海博泰悦臻网络技术服务有限公司 | Lip language identification system and method
CN110020629A (en)* | 2019-04-10 | 2019-07-16 | 杨文广 | A kind of fusion intelligent video service system and method based on Internet of Things
CN110213138A (en)* | 2019-04-23 | 2019-09-06 | 深圳康佳电子科技有限公司 | Intelligent terminal user authentication method, intelligent terminal and storage medium
WO2020215966A1 (en)* | 2019-04-26 | 2020-10-29 | 北京大米科技有限公司 | Remote teaching interaction method, server, terminal and system
WO2020244573A1 (en)* | 2019-06-06 | 2020-12-10 | 阿里巴巴集团控股有限公司 | Voice instruction processing method and device, and control system
CN110147046A (en)* | 2019-06-17 | 2019-08-20 | 东莞理工学院城市学院 | Intelligent household mirror based on Internet of Things
CN110493092A (en)* | 2019-08-28 | 2019-11-22 | 深圳市云之尚网络科技有限公司 | Universal remote control and household appliance remote control method based on far field voice and IOT
CN111973222A (en)* | 2020-08-23 | 2020-11-24 | 云知声智能科技股份有限公司 | Ultrasonic detection system and ultrasonic detection method
CN114253386A (en)* | 2020-09-11 | 2022-03-29 | 成都木帆科技有限公司 | Communication system based on perception
CN112201252A (en)* | 2020-10-10 | 2021-01-08 | 南京机电职业技术学院 | Voice interaction learning and application system of express robot
TWI783344B (en)* | 2021-01-11 | 2022-11-11 | 圓展科技股份有限公司 | Sound source tracking system and method
CN113872729B (en)* | 2021-09-24 | 2022-03-25 | 上海物骐微电子有限公司 | Audio data communication method and wireless audio system
CN113872729A (en)* | 2021-09-24 | 2021-12-31 | 上海物骐微电子有限公司 | Audio data communication method and wireless audio system
CN114530151A (en)* | 2022-02-10 | 2022-05-24 | 山东企联信息技术股份有限公司 | Artificial intelligence AI voice control system and experience device thereof
CN114578705B (en)* | 2022-04-01 | 2022-12-27 | 深圳冠特家居健康系统有限公司 | Intelligent home control system based on 5G Internet of things
CN114578705A (en)* | 2022-04-01 | 2022-06-03 | 深圳冠特家居健康系统有限公司 | Intelligent home control system based on 5G Internet of things
CN116071863A (en)* | 2023-03-15 | 2023-05-05 | 潍坊职业学院 | Instruction recognition and transmission system

Similar Documents

Publication | Publication Date | Title
CN105045122A (en) |  | Intelligent household natural interaction system based on audios and videos
US11902707B1 (en) |  | Location based device grouping with voice control
CN109427333B (en) |  | Method for activating speech recognition service and electronic device for implementing said method
US11429345B2 (en) |  | Remote execution of secondary-device drivers
CN105471705B (en) |  | Intelligent control method, equipment and system based on instant messaging
JP6752870B2 (en) |  | Methods and systems for controlling artificial intelligence devices using multiple wake words
US9729821B1 (en) |  | Sensor fusion for location based device grouping
US10367652B2 (en) |  | Smart home automation systems and methods
US20200286482A1 (en) |  | Processing voice commands based on device topology
CN107703872B (en) |  | Terminal control method and device of household appliance and terminal
CN204631465U (en) |  | A Humanized Smart Home Control System with Remote Voice Control
TW201805744A (en) |  | Control system and control processing method and apparatus capable of directly controlling a device according to the collected information with a simple operation
CN206516350U (en) |  | A kind of intelligent domestic system controlled based on distributed sound
CN109093627A (en) |  | intelligent robot
CN117857237A (en) |  | Voice control method and device of equipment, electronic equipment and storage medium
WO2019180434A1 (en) |  | Processing a command
CN111539219B (en) |  | Method, equipment and system for disambiguation of natural language content titles
JP2021152928A (en) |  | Terminal device, method, and program
CN108415572B (en) |  | Module control method, device and storage medium applied to mobile terminal
CN118283339A (en) |  | Display device, server and voice instruction recognition method
WO2018023514A1 (en) |  | Home background music control system
WO2018023513A1 (en) |  | Home control method based on motion recognition
CN109616110A (en) |  | A kind of exchange method, system, electronic equipment and server
US20240404529A1 (en) |  | Methods and systems for combined voice and gesture control
KR20240126672A (en) |  | Apparatus for artificial intelligence, controlling method of operational thereof and artificial intelligence service system

Legal Events

Date | Code | Title | Description
 | C06 | Publication |
 | PB01 | Publication |
 | WD01 | Invention patent application deemed withdrawn after publication | Application publication date: 20151111

