Embodiment
Below in conjunction with accompanying drawing the present invention is described in further detail.
Fig. 1 illustrates the system topological figure that is used to realize voice control controlled plant according to one aspect of the invention.At this, voice opertaing device 1 includes but not limited to remote control equipment, controlled plant or both combinations.Remote control equipment include but not limited to any can with the user through modes such as voice carry out man-machine interaction, and through sending the electronic product of instruction with the control controlled plant, for example telepilot, operating grip etc.; Controlled plant includes but not limited to any can carrying out alternately with remote control equipment; Like the user's voice input information that sends through the reception remote control equipment; And the electronic product of the operation of execution correspondence, for example computing machine, mobile phone, PDA, palm PC PPC or IPTV etc.Those skilled in the art will be understood that above-mentioned voice opertaing device, remote control equipment and controlled plant are merely for example; Other voice opertaing devices existing or that possibly occur from now on, remote control equipment or controlled plant are as applicable to the present invention; Also should be included in the protection domain of the present invention, and be contained in this with way of reference at this.
When said voice opertaing device 1 comprises remote control equipment, this remote control equipment can carry out alternately with one or more controlled plants, to realize through the control of user's voice input information to controlled plant.Particularly; This remote control equipment obtains the speech input information of user's input; According to this speech input information; And combine said user's user related information, and confirm to control instruction with said speech input information is corresponding, control instruction and be sent to the one or more controlled plants corresponding said with this remote control equipment.At this, when a remote control equipment and a plurality of controlled plant carry out when mutual, can be through modes such as different communications or communication identifiers, to distinguish different controlled equipment.
When said voice opertaing device 1 comprises controlled plant; This controlled plant obtains the speech input information of user's input; Perhaps this controlled plant can carry out alternately with one or more remote control equipments, receive should (etc.) user that remote control equipment sent through should (etc.) speech input information of remote control equipment input, according to this speech input information; And combine said user's user related information; Confirm to control instruction with said speech input information is corresponding, will saidly control instruction provides to this controlled plant, carries out the operation of correspondence like the operation actuating unit of this remote control equipment.
When said voice opertaing device 1 comprised combining of remote control equipment and controlled plant, remote control equipment cooperates with controlled plant to be accomplished through the control of user's voice input information to controlled plant.Particularly; This remote control equipment obtains the speech input information of user's input; With this speech input information or based on the definite phonetic order information of this speech input information, send to this controlled plant, this controlled plant is according to this speech input information or based on the definite phonetic order information of this speech input information; And combine said user's user related information; Confirm to control instruction with said speech input information is corresponding, will saidly control instruction provides to this controlled plant, carries out the operation of correspondence like the operation actuating unit of this remote control equipment.At this; User related information both can be stored in the controlled plant, was directly read by controlled plant, perhaps when having a plurality of users' user related information in the controlled plant; By the similar information such as ID that controlled plant provides according to remote control equipment, confirm corresponding user related information; Also can in controlled plant, offer controlled plant at transmission speech input information or phonetic order information by remote control equipment.
Communicating by letter between remote control equipment and the controlled plant is separate, includes but not limited to such as wireless transmission protocols such as infrared, bluetooths.Those skilled in the art will be understood that above-mentioned wireless transmission protocol is merely for example, and other wireless transmission protocols existing or that possibly occur from now on also should be included in the protection domain of the present invention, and be contained in this at this with way of reference as applicable to the present invention.
Those skilled in the art also will be understood that; The quantity that is merely the disparate networks element that illustrates for simplicity among Fig. 1 maybe be less than the quantity in the real network, but this omission far and away with can not influence to the present invention carry out clear, be disclosed as prerequisite fully.
For for simplicity, remove special declaration, below be that remote control equipment is that example is described only with said voice opertaing device.Remote control equipment obtains the speech input information of user's input; According to this speech input information; And combine said user's user related information, and confirm to control instruction with said speech input information is corresponding, control instruction and be sent to the controlled plant corresponding said with this remote control equipment.Those skilled in the art will be understood that; Based on the voice opertaing device is scheme description or its variant of remote control equipment; Can be applicable to said voice opertaing device equally and comprise the scene that combines of controlled plant, remote control equipment and controlled plant, thus repeat no more, and mode is contained in this by reference.
Fig. 2 illustrates the equipment synoptic diagram that is used to realize voice control controlled plant according to one aspect of the invention; Voice opertaing device 1 comprises voice deriving means 101,operation generating apparatus 102 andoperation generator 103.
Wherein, voice deriving means 101 obtains the speech input information of user's input.Particularly; User's mutual through with the voice opertaing device; Voice opertaing device 1 or with third party device that this voice opertaing device is connected in, through speech input devices such as microphones, imported speech input information; Application programming interfaces (API) or the mode of other agreements of voice deriving means 101 through calling this voice opertaing device 1 or this third party device obtained this speech input information.For example; When said voice opertaing device comprises remote control equipment; This remote control equipment is built-in with speech input device, and the user has imported speech input information through this speech input device; The application programming interfaces (API) of voice deriving means 101 on it through calling this voice opertaing device 1 obtain this speech input information.And for example; When said voice opertaing device comprises controlled plant, this controlled plant is circumscribed with a speech input device, like microphone; The user is through this speech input device; Imported speech input information, the application programming interfaces (API) that the voice deriving means on it provides through this speech input device obtain this speech input information.Those skilled in the art will be understood that the above-mentioned mode of obtaining the user's voice input information is merely for example; Other existing or modes of obtaining the user's voice input information that possibly occur from now on are as applicable to the present invention; Also should be included in the protection domain of the present invention, and be contained in this with way of reference at this.
Operation generating apparatus 102 is according to said speech input information, and combines said user's user related information, confirms to control instruction with said speech input information is corresponding.Particularly,operation generating apparatus 102 confirms that the mode that controls instruction includes but not limited to:
1) speech input information that is obtained according to voice deriving means 101, and combine said user related information, generate and the corresponding phonetic order information of said speech input information; According to said phonetic order information, confirm the said instruction that controls.
2) speech input information that is obtained according to voice deriving means 101, and combine said user related information, generate and the corresponding phonetic order information of said speech input information; According to said phonetic order information, and combine said user related information, confirm the said instruction that controls.
3) speech input information that is obtained according to voice deriving means 101 according to said speech input information, generates and the corresponding phonetic order information of said speech input information; According to said phonetic order information, and combine said user related information, confirm the said instruction that controls.
Those skilled in the art will be understood that the above-mentioned mode of confirming to control instruction is merely for example; The mode that other existing or confirming of possibly occurring from now on control instruction is as applicable to the present invention; Also should be included in the protection domain of the present invention, and be contained in this with way of reference at this.
Operation generator 103 will be said controls to instruct to be provided to controlled plant.Particularly; When said voice opertaing device comprises remote control equipment;Operation generator 103 on it is controled instruction according to whatoperation generating apparatus 102 generated, through such as bluetooth, wireless transmission method such as infrared, will saidly control to instruct and be sent to and the corresponding controlled plant of this remote control equipment; So that this controlled plant controls instruction according to this, carry out corresponding control operation.When said voice opertaing device comprises controlled plant;Operation generator 103 on it is controled instruction according to whatoperation generating apparatus 102 generated; Provide on it such as the operation actuating unit etc.; So that follow up device such as this operation actuating unit can control instruction according to this, carry out corresponding control operation.Those skilled in the art will be understood that the above-mentioned mode that controls instruction that provides is merely for example; The mode that other are existing or providing of possibly occurring from now on controls instruction is as applicable to the present invention; Also should be included in the protection domain of the present invention, and be contained in this with way of reference at this.
Preferably, be constant work between each device of voice opertaing device 1.Particularly, voice deriving means 101 continues to obtain the speech input information of user's input;Operation generating apparatus 102 continues according to said speech input information, and combines said user's user related information, confirms to control instruction with said speech input information is corresponding;Operation generator 103 continues will saidly control to instruct to provide to controlled plant.At this; It will be understood by those skilled in the art that " continuing " is meant that each device of voice opertaing device 1 constantly obtains the speech input information of user input respectively, confirms correspondingly with it to control instruction and provide to controlled plant, stops to obtain the speech input information of user's input in a long time until this voice opertaing device 1.
At this, the present invention is according to the speech input information of user's input, in conjunction with this user's user related information; Obtain controling the instruction and provide to controlled plant; Make the control instruction of controlled plant, carry out personalized control operation, made things convenient for the control of user controlled plant according to this personalization; Control the efficient of controlled plant thereby improved the user, and promoted user's experience.
Preferably, the speech input information thatoperation generating apparatus 102 is obtained according to voice deriving means 101, and combine said user related information, generate and the corresponding phonetic order information of said speech input information; According to said phonetic order information, confirm the said instruction that controls.Particularly; The speech input information thatoperation generating apparatus 102 is obtained according tovoice deriving means 101; Through converting thereof into one or more character strings that are complementary with it, again through such as in user information database, carrying out matching inquiry, confirm this user such as user related informations such as preference setting, historical input record, historical controlling recording; In conjunction with should (etc.) character string and this user related information, generate and the corresponding phonetic order information of said speech input information; According to said phonetic order information, directly confirm to control instruction or, obtain the corresponding instruction that controls through in the operational order storehouse, carrying out matching inquiry.
For example, user's mutual through with voice opertaing device 1 imported speech input information " xianhua "; Voice deriving means 101 obtains this speech input information, converts thereof into the corresponding characters string, like " fresh flower ", " digression " etc.; Operate generatingapparatus 102 according to this speech input information, and combine this user's user related information, carried out search with " fresh flower " as search word like this user's history; Or this user's conception of history seen the video of relevant " fresh flower ", then operates generatingapparatus 102 according to this speech input information (or this speech input information corresponding characters string), in conjunction with this user related information; Generate machine recognizable phonetic order information " fresh flower "; Subsequently,operation generating apparatus 102 is confirmed the corresponding instruction that controls again according to this phonetic order information; As confirming the instruction that controls about search, this controls and comprises search sequence " fresh flower " in the instruction.
At this,operation generating apparatus 102 comprises the mode that the user's voice input information converts character string to: 1) through speech recognition, directly convert this speech input information into be complementary with it one or more input of character string; 2) through in the speech recognition library of this locality or network-side, carrying out matching inquiry, confirm the one or more input of character string that are complementary with this speech input information.At this, store the mapping relations of candidate's speech input information and candidate character strings in the speech recognition library, this speech recognition library both can be arranged in voice opertaing device 1, also can be arranged in the third party device that is connected through network with this voice opertaing device.What store the user in the user information database imports user related informations such as record, historical controlling recording such as preference setting, history; This user information database both can be arranged in voice opertaing device 1, also can be arranged in the third party device that is connected through network with this voice opertaing device.
Preferably, the speech input information thatoperation generating apparatus 102 is obtained according to voice deriving means 101, and combine said user related information, generate and the corresponding phonetic order information of said speech input information; According to said phonetic order information, and combine said user related information, confirm the said instruction that controls.Particularly; The speech input information thatoperation generating apparatus 102 is obtained according to voice deriving means 101; Convert thereof into one or more character strings that are complementary with it, again through such as in user information database, carrying out matching inquiry, confirm this user such as user related informations such as preference setting, historical input record, historical controlling recording; In conjunction with should (etc.) character string and this user related information, generate and the corresponding phonetic order information of said speech input information; According to said phonetic order information, combine above-mentioned one or more user related information again, confirm to control instruction; Perhaps; In conjunction with above-mentioned one or more user related informations,, obtain the corresponding instruction that controls through in the operational order storehouse, carrying out matching inquiry.
For example, user's mutual through with voice opertaing device 1 imported speech input information " east channel "; Voice deriving means 101 obtains this speech input information, andoperation generating apparatus 102 converts thereof into character string " east channel " according to this speech input information; And combine this user's user related information; Preference like this user is set to the television channel selection, generates corresponding phonetic order information " Dragon TV ", " east movie channel ", subsequently; The user related information that combines this user again; The historical record of watching last time like this user is " Dragon TV ", then should operate generatingapparatus 102 according to this phonetic order information and user related information, confirms that the instruction that controls of correspondence " is selected television channel ' Dragon TV ' ".
Preferably, the speech input information thatoperation generating apparatus 102 is obtained according to voice deriving means 101 according to said speech input information, generates and the corresponding phonetic order information of said speech input information; According to said phonetic order information, and combine said user related information, confirm the said instruction that controls.Particularly; The speech input information thatoperation generating apparatus 102 is obtained according to voice deriving means 101; Convert thereof into one or more character strings that are complementary with it, according to this (etc.) character string, generate and the corresponding phonetic order information of this speech input information; Then, through such as in user information database, carrying out matching inquiry, confirm this user such as user related informations such as preference setting, historical input record, historical controlling recording; According to this phonetic order information and above-mentioned one or more user related information; Confirm to control instruction, perhaps, according to this phonetic order information and above-mentioned one or more user related information; Through in the operational order storehouse, carrying out matching inquiry, obtain the corresponding instruction that controls.
For example; User's mutual through with voice opertaing device 1 imported speech input information and " heightened volume ", and voice deriving means 101 obtains this speech input information;Operation generating apparatus 102 is according to this speech input information; Through modes such as speech recognitions, convert thereof into character string and " heighten volume ", and generated machine recognizable phonetic order information and " heighten volume "; Subsequently,, combine this user's user related information again, volume is adjusted to 30, confirm that the corresponding instruction that controls is " volume is adjusted to 30 " as most of in the historical volume adjustment of this user record according to this phonetic order information; And for example,operation generating apparatus 102 is the elderly according to this user, confirms to control instruction and is " volume is adjusted to 40 ".
Those skilled in the art will be understood that the above-mentioned mode of confirming to control instruction is merely for example; The mode that other existing or confirming of possibly occurring from now on control instruction is as applicable to the present invention; Also should be included in the protection domain of the present invention, and be contained in this with way of reference at this.
More preferably, confirm in the saidoperation generating apparatus 102 that the mode that controls instruction comprises: according to said phonetic order information, in the operational order storehouse, carry out matching inquiry, to obtain the said instruction that controls.Particularly;Operation generating apparatus 102 is after generating phonetic order information; According to part or all of key word included in this phonetic order information; Through the mode of database matching inquiry, in the operational order storehouse, carry out matching inquiry, obtain to control instruction with this phonetic order information is corresponding.For example,operation generating apparatus 102 " is heightened volume " according to phonetic order information, in the operational order storehouse, carries out matching inquiry, obtains to heighten the instruction that controls of a unit volume.Preferably;Operation generating apparatus 102 is according to this phonetic order information, in conjunction with this user's user related information, through the mode of database matching inquiry; In the operational order storehouse, carry out matching inquiry, obtain to control instruction with this phonetic order information is corresponding.At this, store the mapping relations of phonetic order and operational order in the operational order storehouse, as " heightening volume " correspondence " heighten a unit volume ", " lightening screen " correspondence " lightens a unit screen intensity " etc.This operational order storehouse both can be arranged in voice opertaing device 1, also can be arranged in the third party device that is connected through network with this voice opertaing device.This operational order storehouse can generate according to the director data study of a large number of users, also can set up or renewal according to being provided with of active user.
More preferably, voice opertaing device 1 also comprises the updating device (not shown), and this updating device is according to the mapping relations of the phonetic order and the operational order of user preset, sets up or upgrades said operational order storehouse.Particularly, user's mutual through with voice opertaing device 1, the mode of for example importing through phonetic entry or literal; Be provided with the mapping relations of phonetic order and operational order in advance, this phonetic order obtains according to the user's voice input information, for example; The user is provided with when its input speech input information " sound a bit " through the mode of phonetic entry, and the pairing operational order of phonetic order that obtains according to this speech input information is " heightening five unit volumes "; Perhaps; The user is provided with when its input speech input information " brighter " through the mode of literal input, and the pairing operational order of phonetic order that obtains according to this speech input information is " lightening five unit screen intensitys "; The application programming interfaces (API) of updating device through calling this voice opertaing device 1; Obtain the set above-mentioned phonetic order of this user and the mapping relations of operational order, and deposit these mapping relations in the operational order storehouse, to realize foundation or renewal this operational order storehouse.
Those skilled in the art will be understood that the mode in above-mentioned foundation or renewal operational order storehouse is merely for example; Other foundation existing or that possibly occur from now on or the mode of upgrading the operational order storehouse are as applicable to the present invention; Also should be included in the protection domain of the present invention, and be contained in this with way of reference at this.
Preferably, voice opertaing device 1 also comprises the information acquisition device (not shown).This information acquisition device is confirmed said user related information according to the speech related information of said speech input information;
Wherein, said speech related information comprise following at least each:
The accent information of-said speech input information;
The element information of-said speech input information;
The voiceprint of-said speech input information;
Wherein, said user related information comprise following at least each:
-said user's preference setting;
-said user's history input record;
-said user's historical controlling recording.
Particularly; The speech input information that information acquisition device is obtained according to voice deriving means 101; Through this speech input information is carried out speech analysis, obtain the speech related information of this speech input information, like the accent information of this speech input information, element information, voiceprint etc.; With definite pairing user of this speech input information, and then definite this user's user related information.For example, information acquisition device is confirmed and the corresponding ID of this speech input information according to analyzing the speech related information that is obtained; Subsequently; Elder generation confirms and the corresponding user of this ID that according to this ID again through matching inquiry in user information database, that confirms this user imports user related informations such as record, historical controlling recording such as preference setting, history; Perhaps; Directly according to this ID, matching inquiry in user information database is confirmed and the corresponding user related information of this ID.
For example; When the corresponding a plurality of user of said voice opertaing device 1, can store these a plurality of users' user related information in this voice opertaing device 1, like each user's preference setting, historical input record, historical controlling recording etc.; When information acquisition device according to speech input information such as accent information, element information or voiceprint etc.; Confirm this user's ID, then can confirm this user's user related information, and then; According to the different user relevant information of different user, confirm the different voice command information or control instruction.
At this; The voice accent information of said speech input information is as having the speech of individual, place or native language characteristic; Information acquisition device for example passes through the voice accent information of analyzing speech input information; Judge the pairing user's of this speech input information regional feature information etc., and then combine this regional feature information to confirm the pairing ID of this speech input information; The phonetic feature information of said speech input information includes but not limited to the element characteristics such as pitch, loudness of a sound, the duration of a sound and tone color of this speech input information; Information acquisition device is confirmed the pairing ID of this speech input information according to wherein several kinds of a kind of or combinations in the above-mentioned phonetic feature information; The voice voiceprint of said speech input information is the sound wave spectrum that carries language message, and information acquisition device is confirmed the pairing ID of this speech input information through the analysis to voice voiceprint or sound wave spectrum.And then this information acquisition device is confirmed and the corresponding user related information of this ID according to this ID.
At this, said user's preference setting comprise this user preference input habit, preference control custom etc.; Said user's history input record comprises user's voice information history input record, historical search record, historical viewings record etc.; Said user's historical controlling recording comprises this user's historical voice controlling recording, historical manually controlling recording etc.
At this, the present invention confirms the input user that this speech input information is corresponding according to the speech related information of speech input information; And then confirm the user related information that this user is corresponding, and make when the corresponding a plurality of user of voice opertaing device 1, can accurately know user and the user related information thereof of importing speech input information; Think that it provides the personalized instruction that controls; Make the control instruction of controlled plant, carry out personalized control operation, made things convenient for the control of user controlled plant according to this personalization; Control the efficient of controlled plant thereby improved the user, and promoted user's experience.
Those skilled in the art will be understood that above-mentioned voice deriving means and information acquisition device are merely example, and in practice, they can be two independently modules, also can be integrated in the module.
Those skilled in the art will be understood that above-mentioned speech related information, user related information and definite mode thereof are merely for example; Other speech related information existing or that possibly occur from now on, user related information or its mode of confirming are as applicable to the present invention; Also should be included in the protection domain of the present invention, and be contained in this with way of reference at this.
In a preferred embodiment, saidly control instruction and comprise search operation instruction; Wherein, Saidoperation generating apparatus 102 is according to said speech input information, and combines said user related information, confirms and the corresponding search operation instruction of said speech input information; Wherein, said search operation instruction comprises and the corresponding search sequence of said speech input information.Followingly with reference to Fig. 2 the preferred embodiment is described, voice deriving means 101 obtains the speech input information of user's input;Operation generating apparatus 102 is according to said speech input information; And combine said user related information; Confirm and the corresponding search operation instruction of said speech input information that wherein, said search operation instruction comprises and the corresponding search sequence of said speech input information;Operation generator 103 will be said controls to instruct to be provided to controlled plant.Wherein, the voice deriving means 101 in the voice opertaing device 1 andoperation generator 103 respectively with the foregoing description in pairing device identical or basic identical, so locate to repeat no more, and mode by reference is contained in this.
Particularly, when comprising " search ", " inquiry " or searching key words such as " searching " in the user's voice input information, operation generating apparatus 102 is according to this speech input information; The user related information that combines this user again; Like this user's historical search record, historical viewings record etc., confirm the search operation instruction, and with the input information except that searching key word, function word etc. in this speech input information; As search sequence, write in this search operation instruction.Further; This operation generating apparatus 102 can also combine this user's user related information, and the generated query sequence is as according to included keyword in user's historical search record, the historical viewings record; The generated query sequence, and write in the search operation instruction.For example; User's mutual through with the voice opertaing device; Imported speech input information " search Dragon TV ", operation generating apparatus 102 identifies this speech input information and comprises searching key word " search " through speech recognition; And this user's historical viewings is crossed video " conference of Dragon TV dance circle "; Operation generating apparatus 102 is confirmed the search operation instruction of search video " conference of Dragon TV dance circle " in view of the above, wherein, comprises search sequence " Dragon TV " or " conference of Dragon TV dance circle " in this search operation instruction.
Preferably, when comprising " search ", " inquiry " or searching key words such as " searching " in the user's voice input information,operation generating apparatus 102 is according to this speech input information; Directly or combine this user's user related information; Generate phonetic order information, again according to this phonetic order information, in conjunction with this user related information; Confirm the search operation instruction, this search operation instruction comprises and the corresponding search sequence of said speech input information.
Those skilled in the art will be understood that the above-mentioned definite search operation instruction and the mode of search sequence are merely for example; The mode of other definite search operations instructions existing or that possibly occur from now on or search sequence is as applicable to the present invention; Also should be included in the protection domain of the present invention, and be contained in this with way of reference at this.
Preferably, voice opertaing device 1 also comprises searcher (not shown) and generator (not shown) as a result.Searcher carries out match search according to said search sequence in the search information storehouse, with the Search Results that obtains to be complementary with said search sequence; Generator provides said Search Results to said controlled plant as a result.Particularly, included search sequence during the search operation that searcher is generated according tooperation generating apparatus 102 instructs is carried out match search in the search information storehouse of this locality or network-side, with the Search Results that obtains to be complementary with this search sequence; Subsequently, generator perhaps through such as players such as flash, provides Search Results to controlled plant through such as dynamic web page techniques such as JSP, ASP or PHP as a result.At this, Search Results includes but not limited to video, channel, webpage, animation or the picture etc. of this locality or network-side.
Those skilled in the art will be understood that above-mentioned coupling and provide the mode of Search Results to be merely for example; Other couplings existing or that possibly occur from now on or mode that Search Results is provided are as applicable to the present invention; Also should be included in the protection domain of the present invention, and be contained in this with way of reference at this.
At this; The present invention confirms corresponding search operation instruction also according to specific searching key word included in the user's voice input information, makes controlled plant carry out search operation according to this search operation instruction; And Search Results offered the user, promoted user's experience.
Fig. 3 illustrates the method flow diagram that is used to realize voice control controlled plant according to a further aspect of the present invention.
In step S201, voice opertaing device 1 obtains the speech input information of user's input.Particularly; User's mutual through with the voice opertaing device, voice opertaing device 1 or with third party device that this voice opertaing device is connected in, through speech input devices such as microphones; Imported speech input information; In step S201, application programming interfaces (API) or the mode of other agreements of voice opertaing device 1 through calling this voice opertaing device 1 or this third party device obtained this speech input information.For example; When said voice opertaing device comprises remote control equipment, this remote control equipment is built-in with speech input device, and the user is through this speech input device; Imported speech input information; In step S201, the application programming interfaces (API) of voice opertaing device 1 through calling this voice opertaing device 1 obtain this speech input information.And for example, when said voice opertaing device comprises controlled plant, this controlled plant is circumscribed with a speech input device; Like microphone; The user has imported speech input information, in step S201 through this speech input device; The application programming interfaces that voice opertaing device 1 provides through this speech input device (API) obtain this speech input information.Those skilled in the art will be understood that the above-mentioned mode of obtaining the user's voice input information is merely for example; Other existing or modes of obtaining the user's voice input information that possibly occur from now on are as applicable to the present invention; Also should be included in the protection domain of the present invention, and be contained in this with way of reference at this.
In step S202, voice opertaing device 1 is according to said speech input information, and combines said user's user related information, confirms to control instruction with said speech input information is corresponding.Particularly, in step S202, voice opertaing device 1 confirms that the mode that controls instruction includes but not limited to:
1) according to the speech input information that in step S201, is obtained, and combines said user related information, generate and the corresponding phonetic order information of said speech input information; According to said phonetic order information, confirm the said instruction that controls.
2) according to the speech input information that in step S201, is obtained, and combine said user related information, generate and the corresponding phonetic order information of said speech input information; According to said phonetic order information, and combine said user related information, confirm the said instruction that controls.
3),, generate and the corresponding phonetic order information of said speech input information according to said speech input information according to the speech input information that in step S201, is obtained; According to said phonetic order information, and combine said user related information, confirm the said instruction that controls.
Those skilled in the art will be understood that the above-mentioned mode of confirming to control instruction is merely for example; The mode that other existing or confirming of possibly occurring from now on control instruction is as applicable to the present invention; Also should be included in the protection domain of the present invention, and be contained in this with way of reference at this.
In step S203, voice opertaing device 1 with said control the instruction provide to controlled plant.Particularly; When said voice opertaing device comprises remote control equipment, in step S203, voice opertaing device 1 is according to the instruction that controls that in step S202, is generated; Through such as bluetooth, wireless transmission method such as infrared; With said control the instruction be sent to and the corresponding controlled plant of this remote control equipment so that this controlled plant controls instruction according to this, carry out corresponding control operation.When said voice opertaing device comprises controlled plant, in step S203, voice opertaing device 1 is carried out corresponding control operation according to the instruction that controls that in step S202, is generated.Those skilled in the art will be understood that the above-mentioned mode that controls instruction that provides is merely for example; The mode that other are existing or providing of possibly occurring from now on controls instruction is as applicable to the present invention; Also should be included in the protection domain of the present invention, and be contained in this with way of reference at this.
Preferably, between each step of voice opertaing device 1 be constant work.Particularly, in step S201, voice opertaing device 1 continues to obtain the speech input information of user's input; In step S202, voice opertaing device 1 continues according to said speech input information, and combines said user's user related information, confirms to control instruction with said speech input information is corresponding; In step S203, voice opertaing device 1 continues will saidly control to instruct to provide to controlled plant.At this; It will be understood by those skilled in the art that " continuing " is meant that each step of voice opertaing device 1 constantly obtains the speech input information of user input respectively, confirms correspondingly with it to control instruction and provide to controlled plant, stops to obtain the speech input information of user's input in a long time until this voice opertaing device 1.
At this, the present invention is according to the speech input information of user's input, in conjunction with this user's user related information; Obtain controling the instruction and provide to controlled plant; Make the control instruction of controlled plant, carry out personalized control operation, made things convenient for the control of user controlled plant according to this personalization; Control the efficient of controlled plant thereby improved the user, and promoted user's experience.
Preferably, in step S202, the speech input information that voice opertaing device 1 basis is obtained in step S201, and combine said user related information, generate and the corresponding phonetic order information of said speech input information; According to said phonetic order information, confirm the said instruction that controls.Particularly; In step S202, voice opertaing device 1 is according to the speech input information that in step S201, is obtained, through converting thereof into one or more character strings that are complementary with it; Again through such as in user information database, carrying out matching inquiry; Confirm this user such as user related informations such as preference setting, historical input record, historical controlling recording, combine should (etc.) character string and this user related information, generation and the corresponding phonetic order information of said speech input information; According to said phonetic order information, directly confirm to control instruction or, obtain the corresponding instruction that controls through in the operational order storehouse, carrying out matching inquiry.
For example, user's mutual through with voice opertaing device 1 imported speech input information " xianhua "; In step S201, voice opertaing device 1 obtains this speech input information, converts thereof into the corresponding characters string; Like " fresh flower ", " digression " etc., in step S202, voice opertaing device 1 is according to this speech input information; And combine this user's user related information, carried out search with " fresh flower " as search word like this user's history, or this user's conception of history has been seen the video of relevant " fresh flower "; Then voice opertaing device 1 in conjunction with this user related information, generates machine recognizable phonetic order information " fresh flower " according to this speech input information (or this speech input information corresponding characters string); Subsequently, voice opertaing device 1 is confirmed the corresponding instruction that controls again according to this phonetic order information; As confirming the instruction that controls about search, this controls and comprises search sequence " fresh flower " in the instruction.
At this, in step S202, voice opertaing device 1 comprises the mode that the user's voice input information converts character string to: 1) through speech recognition, directly convert this speech input information into be complementary with it one or more input of character string; 2) through in the speech recognition library of this locality or network-side, carrying out matching inquiry, confirm the one or more input of character string that are complementary with this speech input information.At this, store the mapping relations of candidate's speech input information and candidate character strings in the speech recognition library, this speech recognition library both can be arranged in voice opertaing device 1, also can be arranged in the third party device that is connected through network with this voice opertaing device.What store the user in the user information database imports user related informations such as record, historical controlling recording such as preference setting, history; This user information database both can be arranged in voice opertaing device 1, also can be arranged in the third party device that is connected through network with this voice opertaing device.
Preferably, in step S202, the speech input information that voice opertaing device 1 basis is obtained in step S201, and combine said user related information, generate and the corresponding phonetic order information of said speech input information; According to said phonetic order information, and combine said user related information, confirm the said instruction that controls.Particularly; In step S202, voice opertaing device 1 converts thereof into one or more character strings that are complementary with it according to the speech input information that in step S201, is obtained; Again through such as in user information database, carrying out matching inquiry; Confirm this user such as user related informations such as preference setting, historical input record, historical controlling recording, combine should (etc.) character string and this user related information, generation and the corresponding phonetic order information of said speech input information; According to said phonetic order information, combine above-mentioned one or more user related information again, confirm to control instruction; Perhaps; In conjunction with above-mentioned one or more user related informations,, obtain the corresponding instruction that controls through in the operational order storehouse, carrying out matching inquiry.
For example, user's mutual through with voice opertaing device 1 imported speech input information " east channel "; In step S201, voice opertaing device 1 obtains this speech input information, in step S202; Voice opertaing device 1 converts thereof into character string " east channel " according to this speech input information, and combines this user's user related information; Preference like this user is set to the television channel selection, generates corresponding phonetic order information " Dragon TV ", " east movie channel ", subsequently; The user related information that combines this user again; The historical record of watching last time like this user is " Dragon TV ", and then voice opertaing device 1 confirms that according to this phonetic order information and user related information the instruction that controls of correspondence " is selected television channel ' Dragon TV ' ".
Preferably, in step S202, voice opertaing device 1, generates and the corresponding phonetic order information of said speech input information according to said speech input information according to the speech input information that in step S201, is obtained; According to said phonetic order information, and combine said user related information, confirm the said instruction that controls.Particularly; In step S202, voice opertaing device 1 converts thereof into one or more character strings that are complementary with it according to the speech input information that in step S201, is obtained; According to this (etc.) character string, generate and the corresponding phonetic order information of this speech input information; Then, through such as in user information database, carrying out matching inquiry, confirm this user such as user related informations such as preference setting, historical input record, historical controlling recording; According to this phonetic order information and above-mentioned one or more user related information; Confirm to control instruction, perhaps, according to this phonetic order information and above-mentioned one or more user related information; Through in the operational order storehouse, carrying out matching inquiry, obtain the corresponding instruction that controls.
For example, user's mutual through with voice opertaing device 1 imported speech input information and " heightened volume "; In step S201, voice opertaing device 1 obtains this speech input information, in step S202; Voice opertaing device 1 is according to this speech input information; Through modes such as speech recognitions, convert thereof into character string and " heighten volume ", and generated machine recognizable phonetic order information and " heighten volume "; Subsequently,, combine this user's user related information again, volume is adjusted to 30, confirm that the corresponding instruction that controls is " volume is adjusted to 30 " as most of in the historical volume adjustment of this user record according to this phonetic order information; And for example, in step S202, voice opertaing device 1 is the elderly according to this user, confirms to control instruction and is " volume is adjusted to 40 ".
Those skilled in the art will be understood that the above-mentioned mode of confirming to control instruction is merely for example; The mode that other existing or confirming of possibly occurring from now on control instruction is as applicable to the present invention; Also should be included in the protection domain of the present invention, and be contained in this with way of reference at this.
More preferably, in step S202, confirm in the voice opertaing device 1 that the mode that controls instruction comprises: according to said phonetic order information, in the operational order storehouse, carry out matching inquiry, to obtain the said instruction that controls.Particularly; In step S202; Voice opertaing device 1 is after generating phonetic order information, according to part or all of key word included in this phonetic order information, through the mode of database matching inquiry; In the operational order storehouse, carry out matching inquiry, obtain to control instruction with this phonetic order information is corresponding.For example, in step S202, voice opertaing device 1 " is heightened volume " according to phonetic order information, in the operational order storehouse, carries out matching inquiry, obtains to heighten the instruction that controls of a unit volume.Preferably, in step S202, voice opertaing device 1 is according to this phonetic order information; User related information in conjunction with this user; Through the mode of database matching inquiry, in the operational order storehouse, carry out matching inquiry, obtain to control instruction with this phonetic order information is corresponding.At this, store the mapping relations of phonetic order and operational order in the operational order storehouse, as " heightening volume " correspondence " heighten a unit volume ", " lightening screen " correspondence " lightens a unit screen intensity " etc.This operational order storehouse both can be arranged in voice opertaing device 1, also can be arranged in the third party device that is connected through network with this voice opertaing device.This operational order storehouse can generate according to the director data study of a large number of users, also can set up or renewal according to being provided with of active user.
More preferably, in step S204 (not shown), voice opertaing device 1 is according to the mapping relations of the phonetic order and the operational order of user preset, sets up or upgrades said operational order storehouse.Particularly, user's mutual through with voice opertaing device 1, the mode of for example importing through phonetic entry or literal; Be provided with the mapping relations of phonetic order and operational order in advance, this phonetic order obtains according to the user's voice input information, for example; The user is through the mode of phonetic entry; Be provided with when its input speech input information " sound a bit ", the pairing operational order of phonetic order that obtains according to this speech input information is " heightening five unit volumes ", perhaps; The user is through the mode of literal input; Be provided with when its input speech input information " brighter ", the pairing operational order of phonetic order that obtains according to this speech input information is " lightening five unit screen intensitys ", in step S204; The application programming interfaces (API) of voice opertaing device 1 through calling this voice opertaing device 1; Obtain the set above-mentioned phonetic order of this user and the mapping relations of operational order, and deposit these mapping relations in the operational order storehouse, to realize foundation or renewal this operational order storehouse.
Those skilled in the art will be understood that the mode in above-mentioned foundation or renewal operational order storehouse is merely for example; Other foundation existing or that possibly occur from now on or the mode of upgrading the operational order storehouse are as applicable to the present invention; Also should be included in the protection domain of the present invention, and be contained in this with way of reference at this.
Preferably, in step S205 (not shown), voice opertaing device 1 is confirmed said user related information according to the speech related information of said speech input information;
Wherein, said speech related information comprise following at least each:
The accent information of-said speech input information;
The element information of-said speech input information;
The voiceprint of-said speech input information;
Wherein, said user related information comprise following at least each:
-said user's preference setting;
-said user's history input record;
-said user's historical controlling recording.
Particularly; In step S205, voice opertaing device 1 is according to the speech input information that in step S201, is obtained, through this speech input information is carried out speech analysis; Obtain the speech related information of this speech input information; Like the accent information of this speech input information, element information, voiceprint etc., confirming the pairing user of this speech input information, and then confirm this user's user related information.For example, in step S205, voice opertaing device 1 is confirmed and the corresponding ID of this speech input information according to analyzing the speech related information that is obtained; Subsequently; Elder generation confirms and the corresponding user of this ID that according to this ID again through matching inquiry in user information database, that confirms this user imports user related informations such as record, historical controlling recording such as preference setting, history; Perhaps; Directly according to this ID, matching inquiry in user information database is confirmed and the corresponding user related information of this ID.
For example, when the corresponding a plurality of user of said voice opertaing device 1, can store these a plurality of users' user related information in this voice opertaing device 1; Like each user's preference setting, historical input record, historical controlling recording etc.; When in step S205, voice opertaing device 1 according to speech input information such as accent information, element information or voiceprint etc., confirm this user's ID; Then can confirm this user's user related information; And then, according to the different user relevant information of different user, confirm the different voice command information or control instruction.
At this; The voice accent information of said speech input information is as having the speech of individual, place or native language characteristic; In step S205; Voice opertaing device 1 for example through the voice accent information of analyzing speech input information, is judged the pairing user's of this speech input information regional feature information etc., and then is combined this regional feature information to confirm the pairing ID of this speech input information; The phonetic feature information of said speech input information includes but not limited to the element characteristics such as pitch, loudness of a sound, the duration of a sound and tone color of this speech input information; In step S205; Voice opertaing device 1 is confirmed the pairing ID of this speech input information according to wherein several kinds of a kind of or combinations in the above-mentioned phonetic feature information; The voice voiceprint of said speech input information is the sound wave spectrum that carries language message, and in step S205, voice opertaing device 1 is confirmed the pairing ID of this speech input information through the analysis to voice voiceprint or sound wave spectrum.And then voice opertaing device 1 is confirmed and the corresponding user related information of this ID according to this ID.
At this, said user's preference setting comprise this user preference input habit, preference control custom etc.; Said user's history input record comprises user's voice information history input record, historical search record, historical viewings record etc.; Said user's historical controlling recording comprises this user's historical voice controlling recording, historical manually controlling recording etc.
At this, the present invention confirms the input user that this speech input information is corresponding according to the speech related information of speech input information; And then confirm the user related information that this user is corresponding, and make when the corresponding a plurality of user of voice opertaing device 1, can accurately know user and the user related information thereof of importing speech input information; Think that it provides the personalized instruction that controls; Make the control instruction of controlled plant, carry out personalized control operation, made things convenient for the control of user controlled plant according to this personalization; Control the efficient of controlled plant thereby improved the user, and promoted user's experience.
Those skilled in the art will be understood that above-mentioned speech related information, user related information and definite mode thereof are merely for example; Other speech related information existing or that possibly occur from now on, user related information or its mode of confirming are as applicable to the present invention; Also should be included in the protection domain of the present invention, and be contained in this with way of reference at this.
In a preferred embodiment, saidly control instruction and comprise search operation instruction; Wherein, in step S202, voice opertaing device 1 is according to said speech input information; And combine said user related information; Confirm and the corresponding search operation instruction of said speech input information that wherein, said search operation instruction comprises and the corresponding search sequence of said speech input information.Followingly with reference to Fig. 3 the preferred embodiment is described, in step S201, voice opertaing device 1 obtains the speech input information of user's input; In step S202; Voice opertaing device 1 is according to said speech input information, and combines said user related information, confirms and the corresponding search operation instruction of said speech input information; Wherein, said search operation instruction comprises and the corresponding search sequence of said speech input information; In step S203, voice opertaing device 1 with said control the instruction provide to controlled plant.Wherein, step S201 and S203 respectively with the foregoing description in pairing step identical or basic identical, so locate to repeat no more, and mode by reference is contained in this.
Particularly, when comprising " search ", " inquiry " or searching key words such as " searching " in the user's voice input information, in step S202; Voice opertaing device 1 combines this user's user related information again according to this speech input information, like this user's historical search record, historical viewings record etc.; Confirm the search operation instruction; And,, write in this search operation instruction as search sequence with the input information except that searching key word, function word etc. in this speech input information.Further, in step S202, voice opertaing device 1 can also combine this user's user related information; The generated query sequence; As according to included keyword in user's historical search record, the historical viewings record, the generated query sequence, and write in the search operation instruction.For example, user's mutual through with the voice opertaing device imported speech input information " search Dragon TV "; In step S202; Voice opertaing device 1 identifies this speech input information and comprise searching key word " search ", and this user's historical viewings is crossed video " conference of Dragon TV dance circle " through speech recognition; In step S202; Voice opertaing device 1 is confirmed the search operation instruction of search video " conference of Dragon TV dance circle " in view of the above, wherein, comprises search sequence " Dragon TV " or " conference of Dragon TV dance circle " in this search operation instruction.
Preferably, when comprising " search ", " inquiry " or searching key words such as " searching " in the user's voice input information, in step S202; Voice opertaing device 1 is according to this speech input information, directly or combine this user's user related information, generates phonetic order information; Again according to this phonetic order information; In conjunction with this user related information, confirm the search operation instruction, this search operation instruction comprises and the corresponding search sequence of said speech input information.
Those skilled in the art will be understood that the above-mentioned definite search operation instruction and the mode of search sequence are merely for example; The mode of other definite search operations instructions existing or that possibly occur from now on or search sequence is as applicable to the present invention; Also should be included in the protection domain of the present invention, and be contained in this with way of reference at this.
Preferably, the method for voice control controlled plant also comprises step S206 (not shown) and step S207 (not shown).In step S206, voice opertaing device 1 carries out match search according to said search sequence in the search information storehouse, with the Search Results that obtains to be complementary with said search sequence; In step S207, voice opertaing device 1 provides said Search Results to said controlled plant.Particularly; In step S206; Voice opertaing device 1 carries out match search, with the Search Results that obtains to be complementary with this search sequence according to included search sequence in the search operation instruction that in step S202, is generated in the search information storehouse of this locality or network-side; Subsequently, in step S207, voice opertaing device 1 perhaps through such as players such as flash, provides Search Results to controlled plant through such as dynamic web page techniques such as JSP, ASP or PHP.At this, Search Results includes but not limited to video, channel, webpage, animation or the picture etc. of this locality or network-side.
Those skilled in the art will be understood that above-mentioned coupling and provide the mode of Search Results to be merely for example; Other couplings existing or that possibly occur from now on or mode that Search Results is provided are as applicable to the present invention; Also should be included in the protection domain of the present invention, and be contained in this with way of reference at this.
At this; The present invention confirms corresponding search operation instruction also according to specific searching key word included in the user's voice input information, makes controlled plant carry out search operation according to this search operation instruction; And Search Results offered the user, promoted user's experience.
To those skilled in the art, obviously the invention is not restricted to the details of above-mentioned example embodiment, and under the situation that does not deviate from spirit of the present invention or essential characteristic, can realize the present invention with other concrete form.Therefore; No matter from which point; All should regard embodiment as exemplary; And be nonrestrictive, scope of the present invention is limited accompanying claims rather than above-mentioned explanation, therefore is intended to the implication of the equivalents that drops on claim and all changes in the scope are included in the present invention.Should any Reference numeral in the claim be regarded as limit related claim.In addition, obviously other unit or step do not got rid of in " comprising " speech, and odd number is not got rid of plural number.A plurality of unit of stating in the device claim or device also can be realized through software or hardware by a unit or device.The first, the second word such as grade is used for representing title, and does not represent any specific order.