Disclosure of Invention
The present invention is directed to solving at least one of the problems of the prior art or the related art.
Therefore, an object of the present invention is to provide a voice processing method of an air conditioner.
Another object of the present invention is to provide a voice processing apparatus of an air conditioner.
Another object of the present invention is to provide an air conditioner.
It is another object of the present invention to provide a computer-readable storage medium.
In order to achieve the above object, according to an embodiment of a first aspect of the present invention, there is provided a voice processing method of an air conditioner, including: responding to a received control instruction, and generating a voice broadcast text according to the control instruction; determining a language type identifier contained in the control instruction; and determining voice parameters for broadcasting the voice broadcast text according to the voice type identifier.
In the technical scheme, the voice broadcast text is generated according to the control instruction by responding to the received control instruction, and the voice broadcast text is used for reflecting the response of the air conditioner to the control instruction and broadcasting the action executed by the air conditioner, so that a user can know the running state of the air conditioner more intuitively through the voice broadcast text.
In addition, by determining the language type identifier included in the control command, the language that the user is accustomed to using or the language or language type set by the user, such as, but not limited to, mandarin chinese, dialect chinese, english, french, german, japanese, and korean, can be determined.
Finally, the voice parameters of the voice broadcast text are determined and broadcasted according to the voice type identification, so that the playing mode of the voice broadcast text can be enriched, for example, multiple timbres, tones or languages are set for one voice broadcast text, or the volume or timbre is set for the appointed voice broadcast text, and the voice broadcast text is displayed in multiple colors on an interface for displaying the voice broadcast text in a fire disaster.
In any one of the above technical solutions, preferably, in response to a received control instruction, generating a voice broadcast text according to the control instruction, specifically including: in response to a received control instruction, determining text content contained in the control instruction; determining a target operation parameter and/or a target operation mode of the air conditioner contained in the text content; and generating the voice broadcast text according to the target operation parameters and/or the target operation mode.
In the technical scheme, the text content contained in the received control instruction is determined in response to the received control instruction, so that the target operation parameter and/or the target operation mode of the air conditioner are determined, the voice broadcast text is generated according to the target operation parameter and/or the target operation mode, so that the action and the operation state of the air conditioner are visually prompted to a user, and the language type identifier contained in the control instruction is determined while the text content is generated by analyzing the control instruction, so that the identifiability of the voice broadcast text is improved.
In any of the above technical solutions, preferably, determining the text content included in the control instruction in response to the received control instruction specifically includes: locally analyzing the control instruction to determine the text content contained in the control instruction; and/or reporting the control instruction to a server, and receiving the text content fed back by the server.
In this embodiment, the control command is analyzed locally to determine the text content included in the control command, so as to improve the response rate of the voice recognition scheme, for example, extracting a keyword or a wakeup word in the control command can determine the operation requirement of the user on the air conditioner, such as "increase", "wind speed", "decrease", "temperature", "heating", "dehumidifying", and "cooling", but not limited thereto.
In addition, the control instruction can also be reported to a server, the text content fed back by the server is received, and the accuracy and reliability of identifying the control instruction can be improved based on a huge word database and a sentence segmentation algorithm of the server.
Finally, it is also possible to combine the local parsing and the server parsing, for example, to preferentially perform semantic recognition locally, and if the local recognition rate is lower than the preset recognition rate, or no keyword or wake word is extracted, send the result to the server for semantic recognition, where the preset recognition rate may be 30%, 50%, 70%, and so on, but is not limited thereto.
In any one of the above technical solutions, preferably, determining the voice parameter of the broadcast voice broadcast text according to the voice type identifier specifically includes: adding a voice identifier and a display identifier to the voice broadcast text locally according to the voice type identifier; and/or receiving the voice broadcast text fed back by the server, and the voice identifier and the display identifier added to the voice broadcast text by the server; and determining the voice parameters according to the voice identification and/or the display identification.
In the technical scheme, the voice identifier and the display identifier are added to the voice broadcast text locally according to the voice type identifier, and/or the voice broadcast text fed back by the server is received, and the voice identifier and the display identifier added to the voice broadcast text by the server can further enrich the broadcast form of the voice broadcast text, for example, a dialect is adopted for broadcasting the running state contained in the voice broadcast text, and a mandarin or another dialect is adopted for broadcasting the numerical value of the transportation parameter in the voice broadcast text.
In addition, the voice parameters are determined according to the voice identification and/or the display identification, and in fact, the voice parameters are equivalent to configuration files of voice broadcast texts, and can be played on an air conditioner body, or broadcast on sound equipment associated with the air conditioner, or broadcast on a terminal associated with the air conditioner.
In any one of the above technical solutions, preferably, determining the language type identifier included in the control instruction specifically includes: determining that the received control instruction is a voice instruction; and determining the language type identification corresponding to the voice instruction.
In the technical scheme, the corresponding voice type identifier is determined through the voice instruction, and the habitual language of the user sending the voice instruction is determined, so that the user can understand the content of the voice broadcast text more easily.
In any one of the above technical solutions, preferably, determining a language type identifier included in the control instruction further includes: determining that the received control instruction is a terminal communication instruction; and determining user identity information and/or user usage records corresponding to the terminal communication instruction, and determining the language type identifier according to the user identity information and/or the user usage records.
In the technical scheme, the language type identifier is determined according to the user identity information and/or the user usage record, and further, the user can easily understand the content of the voice broadcast text.
In any one of the above technical solutions, preferably, determining a language type identifier included in the control instruction further includes: determining that the received control instruction is a language setting instruction; and determining the language type identifier set in the language setting instruction.
In the technical scheme, the received control instruction is determined to be a language setting instruction, and the language type identifier set in the language setting instruction is determined, that is, the subsequent voice broadcast text is completed according to the voice type identifier set by the user, so that the user can understand the voice broadcast text conveniently, and the voice use experience of the user is improved.
According to a second aspect of the present invention, there is provided a voice processing device for an air conditioner, comprising: a memory and a processor, the memory being configured to store a computer program, the computer program, when executed by the processor, being capable of implementing the steps of the voice processing method of the air conditioner according to any one of the above-mentioned technical solutions.
According to a third aspect of the present invention, there is provided an air conditioner comprising: a memory and a processor, the memory being configured to store a computer program, the computer program, when executed by the processor, being capable of implementing the steps of the voice processing method of the air conditioner according to any one of the above-mentioned technical solutions.
According to an aspect of the fourth aspect of the present invention, there is provided a computer-readable storage medium having a computer program stored thereon, the computer program, when executed, implementing the steps of the voice processing method of the air conditioner as defined in any one of the above aspects.
Additional aspects and advantages of the invention will be set forth in part in the description which follows and, in part, will be obvious from the description, or may be learned by practice of the invention.
Detailed Description
In order that the above objects, features and advantages of the present invention can be more clearly understood, a more particular description of the invention will be rendered by reference to the appended drawings. It should be noted that the embodiments and features of the embodiments of the present application may be combined with each other without conflict.
In the following description, numerous specific details are set forth in order to provide a thorough understanding of the present invention, however, the present invention may be practiced in other ways than those specifically described herein, and therefore the scope of the present invention is not limited by the specific embodiments disclosed below.
Embodiments of a voice processing method and apparatus for an air conditioner, and a computer-readable storage medium according to embodiments of the present invention are specifically described below with reference to fig. 1 to 5.
As shown in fig. 1, a voice processing method of an air conditioner according to an embodiment of the present invention includes: step S102, responding to a received control instruction, and generating a voice broadcast text according to the control instruction; step S104, determining a language type identifier contained in the control instruction; and step S106, determining the voice parameters of the voice broadcast text according to the voice type identification.
In the technical scheme, the voice broadcast text is generated according to the control instruction by responding to the received control instruction, and the voice broadcast text is used for reflecting the response of the air conditioner to the control instruction and broadcasting the action executed by the air conditioner, so that a user can know the running state of the air conditioner more intuitively through the voice broadcast text.
In addition, by determining the language type identifier included in the control command, the language that the user is accustomed to using or the language or language type set by the user, such as, but not limited to, mandarin chinese, dialect chinese, english, french, german, japanese, and korean, can be determined.
Finally, the voice parameters of the voice broadcast text are determined and broadcasted according to the voice type identification, so that the playing mode of the voice broadcast text can be enriched, for example, multiple timbres, tones or languages are set for one voice broadcast text, or the volume or timbre is set for the appointed voice broadcast text, and the voice broadcast text is displayed in multiple colors on an interface for displaying the voice broadcast text in a fire disaster.
In any one of the above technical solutions, preferably, in response to a received control instruction, generating a voice broadcast text according to the control instruction, specifically including: in response to a received control instruction, determining text content contained in the control instruction; determining a target operation parameter and/or a target operation mode of the air conditioner contained in the text content; and generating the voice broadcast text according to the target operation parameters and/or the target operation mode.
In the technical scheme, the text content contained in the received control instruction is determined in response to the received control instruction, so that the target operation parameter and/or the target operation mode of the air conditioner are determined, the voice broadcast text is generated according to the target operation parameter and/or the target operation mode, so that the action and the operation state of the air conditioner are visually prompted to a user, and the language type identifier contained in the control instruction is determined while the text content is generated by analyzing the control instruction, so that the identifiability of the voice broadcast text is improved.
In any of the above technical solutions, preferably, determining the text content included in the control instruction in response to the received control instruction specifically includes: locally analyzing the control instruction to determine the text content contained in the control instruction; and/or reporting the control instruction to a server, and receiving the text content fed back by the server.
In this embodiment, the control command is analyzed locally to determine the text content included in the control command, so as to improve the response rate of the voice recognition scheme, for example, extracting a keyword or a wakeup word in the control command can determine the operation requirement of the user on the air conditioner, such as "increase", "wind speed", "decrease", "temperature", "heating", "dehumidifying", and "cooling", but not limited thereto.
In addition, the control instruction can also be reported to a server, the text content fed back by the server is received, and the accuracy and reliability of identifying the control instruction can be improved based on a huge word database and a sentence segmentation algorithm of the server.
Finally, it is also possible to combine the local parsing and the server parsing, for example, to preferentially perform semantic recognition locally, and if the local recognition rate is lower than the preset recognition rate, or no keyword or wake word is extracted, send the result to the server for semantic recognition, where the preset recognition rate may be 30%, 50%, 70%, and so on, but is not limited thereto.
In any one of the above technical solutions, preferably, determining the voice parameter of the broadcast voice broadcast text according to the voice type identifier specifically includes: adding a voice identifier and a display identifier to the voice broadcast text locally according to the voice type identifier; and/or receiving the voice broadcast text fed back by the server, and the voice identifier and the display identifier added to the voice broadcast text by the server; and determining the voice parameters according to the voice identification and/or the display identification.
In the technical scheme, the voice identifier and the display identifier are added to the voice broadcast text locally according to the voice type identifier, and/or the voice broadcast text fed back by the server is received, and the voice identifier and the display identifier added to the voice broadcast text by the server can further enrich the broadcast form of the voice broadcast text, for example, a dialect is adopted for broadcasting the running state contained in the voice broadcast text, and a mandarin or another dialect is adopted for broadcasting the numerical value of the transportation parameter in the voice broadcast text.
In addition, the voice parameters are determined according to the voice identification and/or the display identification, and in fact, the voice parameters are equivalent to configuration files of voice broadcast texts, and can be played on an air conditioner body, or broadcast on sound equipment associated with the air conditioner, or broadcast on a terminal associated with the air conditioner.
In any one of the above technical solutions, preferably, determining the language type identifier included in the control instruction specifically includes: determining that the received control instruction is a voice instruction; and determining the language type identification corresponding to the voice instruction.
In the technical scheme, the corresponding voice type identifier is determined through the voice instruction, and the habitual language of the user sending the voice instruction is determined, so that the user can understand the content of the voice broadcast text more easily.
In any one of the above technical solutions, preferably, determining a language type identifier included in the control instruction further includes: determining that the received control instruction is a terminal communication instruction; and determining user identity information and/or user usage records corresponding to the terminal communication instruction, and determining the language type identifier according to the user identity information and/or the user usage records.
In the technical scheme, the language type identifier is determined according to the user identity information and/or the user usage record, and further, the user can easily understand the content of the voice broadcast text.
In any one of the above technical solutions, preferably, determining a language type identifier included in the control instruction further includes: determining that the received control instruction is a language setting instruction; and determining the language type identifier set in the language setting instruction.
In the technical scheme, the received control instruction is determined to be a language setting instruction, and the language type identifier set in the language setting instruction is determined, that is, the subsequent voice broadcast text is completed according to the voice type identifier set by the user, so that the user can understand the voice broadcast text conveniently, and the voice use experience of the user is improved.
As shown in fig. 2, the voice processing method of an air conditioner according to an embodiment of the present invention includes:
1. after the air conditioner is distributed with the network, a user can self-define the combination of broadcast sounds in the app device interface, wherein the combination comprises single Mandarin broadcast, single dialect (cantonese, Sichuan, Shandongdong and Shanghai) broadcast, and Mandarin + dialect broadcast.
2. And setting a corresponding self-defined broadcast combination by the AI cloud and the voice module terminal.
3. The voice module end recognizes the voice and text input of the user, converts the voice and text input into voice text and adds language identification.
4. And judging whether the local word list at the voice module end has corresponding text input and language identification corresponding to the text, if not, entering the step 5, and if so, entering the step 9.
5. And the voice module end uploads the language and the text of the input end to the AI cloud.
6. And synthesizing an broadcast text on line by the AI cloud.
7. And the AI cloud divides the broadcast text into sentences by punctuation marks.
8. The AI cloud will cut apart the statement and add broadcast pronunciation or colour sign to will report data transmission to voice module end, include the following embodiment at least:
①, language identifiers such as, but not limited to, cantonese, Sichuan, Shandong, and Shanghai.
②, color identification, different colors represent different languages, such as red for cantonese, yellow for Sichuan, blue for Shandong, green for Shanghai, and mandarin, but not limited thereto, and sends the broadcast data to the voice module end, and step 12, if the broadcast data is adjusted to +26 °, the Sichuan + mandarin is adopted for broadcast.
③, the broadcast can be in the form of control means (one dialect) + value (Mandarin or another dialect), for example, is good, the air conditioner wind speed is immediately adjusted to 98 percent, and the broadcast is in the form of Guangdong language + Mandarin.
9. And the voice module end locally broadcasts text synthesis.
10. And the voice module end divides the local broadcast text into sentences by punctuation marks.
11. The voice module end adds a broadcast language identifier or a color identifier to the segmented sentence to generate broadcast data, the language identifier comprises a cantonese language, a Sichuan language, a Shandong language, a Shanghai language and the like, but not limited to the above, different colors represent different languages, for example, red represents a cantonese language, yellow represents a Sichuan language, blue represents a Shandong language, green represents a Shanghai language, and the others are mandarins, but not limited to the above, for example, if the broadcast data is adjusted to +26 degrees, the Shanghai + mandarina form is adopted for broadcast. And generating broadcast data and entering the step 12.
12. And the voice module end carries out voice broadcast according to the broadcast data.
In summary, the technical solution of the present invention at least includes the following beneficial effects:
(1) when the dialect is identified, the user can freely combine and select to broadcast the control command with the dialect and broadcast the sound of other skills with the Mandarin, and an interesting broadcasting effect is realized.
(2) The user can customize the combination of broadcast tones.
As shown in fig. 3, a voice processing apparatus 300 of an air conditioner according to another embodiment of the present invention includes: a memory 302 and a processor 304, wherein the memory 302 is configured to store a computer program, and the computer program can realize the steps of the voice processing method of the air conditioner according to any one of the above technical solutions when being executed by the processor 304.
As shown in fig. 4, an air conditioner 400 according to another embodiment of the present invention includes: a memory 402 and a processor 404, wherein the memory 402 is configured to store a computer program, and the computer program can realize the steps of the voice processing method of the air conditioner according to any one of the above technical solutions when being executed by the processor 404.
As shown in fig. 5, according to the computer-readable storage medium 500 of the embodiment of the present invention, the computer-readable storage medium stores thereon a computer program, and when the computer program is executed by the air conditioner 400, the method for processing the voice of the air conditioner as defined in any one of the above technical solutions is implemented, and specifically includes the following steps: responding to a received control instruction, and generating a voice broadcast text according to the control instruction; determining a language type identifier contained in the control instruction; and determining voice parameters for broadcasting the voice broadcast text according to the voice type identifier.
In the technical scheme, the voice broadcast text is generated according to the control instruction by responding to the received control instruction, and the voice broadcast text is used for reflecting the response of the air conditioner to the control instruction and broadcasting the action executed by the air conditioner, so that a user can know the running state of the air conditioner more intuitively through the voice broadcast text.
In addition, by determining the language type identifier included in the control command, the language that the user is accustomed to using or the language or language type set by the user, such as, but not limited to, mandarin chinese, dialect chinese, english, french, german, japanese, and korean, can be determined.
Finally, the voice parameters of the voice broadcast text are determined and broadcasted according to the voice type identification, so that the playing mode of the voice broadcast text can be enriched, for example, multiple timbres, tones or languages are set for one voice broadcast text, or the volume or timbre is set for the appointed voice broadcast text, and the voice broadcast text is displayed in multiple colors on an interface for displaying the voice broadcast text in a fire disaster.
In any one of the above technical solutions, preferably, in response to a received control instruction, generating a voice broadcast text according to the control instruction, specifically including: in response to a received control instruction, determining text content contained in the control instruction; determining a target operation parameter and/or a target operation mode of the air conditioner contained in the text content; and generating the voice broadcast text according to the target operation parameters and/or the target operation mode.
In the technical scheme, the text content contained in the received control instruction is determined in response to the received control instruction, so that the target operation parameter and/or the target operation mode of the air conditioner are determined, the voice broadcast text is generated according to the target operation parameter and/or the target operation mode, so that the action and the operation state of the air conditioner are visually prompted to a user, and the language type identifier contained in the control instruction is determined while the text content is generated by analyzing the control instruction, so that the identifiability of the voice broadcast text is improved.
In any of the above technical solutions, preferably, determining the text content included in the control instruction in response to the received control instruction specifically includes: locally analyzing the control instruction to determine the text content contained in the control instruction; and/or reporting the control instruction to a server, and receiving the text content fed back by the server.
In this embodiment, the control command is analyzed locally to determine the text content included in the control command, so as to improve the response rate of the voice recognition scheme, for example, extracting a keyword or a wakeup word in the control command can determine the operation requirement of the user on the air conditioner, such as "increase", "wind speed", "decrease", "temperature", "heating", "dehumidifying", and "cooling", but not limited thereto.
In addition, the control instruction can also be reported to a server, the text content fed back by the server is received, and the accuracy and reliability of identifying the control instruction can be improved based on a huge word database and a sentence segmentation algorithm of the server.
Finally, it is also possible to combine the local parsing and the server parsing, for example, to preferentially perform semantic recognition locally, and if the local recognition rate is lower than the preset recognition rate, or no keyword or wake word is extracted, send the result to the server for semantic recognition, where the preset recognition rate may be 30%, 50%, 70%, and so on, but is not limited thereto.
In any one of the above technical solutions, preferably, determining the voice parameter of the broadcast voice broadcast text according to the voice type identifier specifically includes: adding a voice identifier and a display identifier to the voice broadcast text locally according to the voice type identifier; and/or receiving the voice broadcast text fed back by the server, and the voice identifier and the display identifier added to the voice broadcast text by the server; and determining the voice parameters according to the voice identification and/or the display identification.
In the technical scheme, the voice identifier and the display identifier are added to the voice broadcast text locally according to the voice type identifier, and/or the voice broadcast text fed back by the server is received, and the voice identifier and the display identifier added to the voice broadcast text by the server can further enrich the broadcast form of the voice broadcast text, for example, a dialect is adopted for broadcasting the running state contained in the voice broadcast text, and a mandarin or another dialect is adopted for broadcasting the numerical value of the transportation parameter in the voice broadcast text.
In addition, the voice parameters are determined according to the voice identification and/or the display identification, and in fact, the voice parameters are equivalent to configuration files of voice broadcast texts, and can be played on an air conditioner body, or broadcast on sound equipment associated with the air conditioner, or broadcast on a terminal associated with the air conditioner.
In any one of the above technical solutions, preferably, determining the language type identifier included in the control instruction specifically includes: determining that the received control instruction is a voice instruction; and determining the language type identification corresponding to the voice instruction.
In the technical scheme, the corresponding voice type identifier is determined through the voice instruction, and the habitual language of the user sending the voice instruction is determined, so that the user can understand the content of the voice broadcast text more easily.
In any one of the above technical solutions, preferably, determining a language type identifier included in the control instruction further includes: determining that the received control instruction is a terminal communication instruction; and determining user identity information and/or user usage records corresponding to the terminal communication instruction, and determining the language type identifier according to the user identity information and/or the user usage records.
In the technical scheme, the language type identifier is determined according to the user identity information and/or the user usage record, and further, the user can easily understand the content of the voice broadcast text.
In any one of the above technical solutions, preferably, determining a language type identifier included in the control instruction further includes: determining that the received control instruction is a language setting instruction; and determining the language type identifier set in the language setting instruction.
In the technical scheme, the received control instruction is determined to be a language setting instruction, and the language type identifier set in the language setting instruction is determined, that is, the subsequent voice broadcast text is completed according to the voice type identifier set by the user, so that the user can understand the voice broadcast text conveniently, and the voice use experience of the user is improved.
The technical scheme of the invention is described in detail above with reference to the accompanying drawings, and the invention provides a voice processing method and device of an air conditioner, the air conditioner and a computer readable storage medium.
In addition, by determining the language type identifier included in the control command, the language that the user is accustomed to using or the language or language type set by the user, such as, but not limited to, mandarin chinese, dialect chinese, english, french, german, japanese, and korean, can be determined.
Finally, the voice parameters of the voice broadcast text are determined and broadcasted according to the voice type identification, so that the playing mode of the voice broadcast text can be enriched, for example, multiple timbres, tones or languages are set for one voice broadcast text, or the volume or timbre is set for the appointed voice broadcast text, and the voice broadcast text is displayed in multiple colors on an interface for displaying the voice broadcast text in a fire disaster.
The steps in the method of the invention can be sequentially adjusted, combined and deleted according to actual needs.
The units in the device of the invention can be merged, divided and deleted according to actual needs.
It will be understood by those skilled in the art that all or part of the steps in the methods of the embodiments described above may be implemented by instructions associated with a program, which may be stored in a computer-readable storage medium, where the storage medium includes Read-Only Memory (ROM), Random Access Memory (RAM), Programmable Read-Only Memory (PROM), Erasable Programmable Read-Only Memory (EPROM), One-time Programmable Read-Only Memory (OTPROM), Electrically Erasable Programmable Read-Only Memory (EEPROM), compact disc-Read-Only Memory (CD-ROM), or other Memory, magnetic disk, magnetic tape, or magnetic tape, Or any other medium which can be used to carry or store data and which can be read by a computer.
The above is only a preferred embodiment of the present invention, and is not intended to limit the present invention, and various modifications and changes will occur to those skilled in the art. Any modification, equivalent replacement, or improvement made within the spirit and principle of the present invention should be included in the protection scope of the present invention.