Printing control system and method
Technical Field
The invention relates to the technical field of printing and imaging, in particular to a printing control system and a method for identifying and printing out voice information.
Background
As conventional electronic apparatuses, image forming apparatuses (hereinafter also referred to as imaging devices), such as printers and multifunction machines with a printing function, are widely used in daily life. However, as the forms of information diversify, the functional limitations of conventional image forming apparatuses have become apparent. For example, a conventional image forming apparatus can print only conventional information carriers such as documents and photographs; it cannot print voice information, which is increasingly used in daily communication. Voice information here refers to information transmitted between electronic devices such as smartphones as voice-format files, and to chat information such as WeChat voice and QQ voice messages transmitted between smartphones.
Consequently, printers that can recognize voice information and print the text it contains have appeared on the market. Such a printer runs a voice information recognition program on the printer's processor; the program recognizes the voice information and converts it into text data, and the printer then prints the text data.
However, such a printer has two shortcomings. First, it recognizes voice information only through the recognition program on its own processor, so its recognition capability is quite limited and much voice information cannot be recognized. Second, when the voice information is recognized and converted into text data, the user cannot find and correct recognition errors in time. The errors become visible only after the text data has been printed, at which point the user can only input the voice information again and print again, which is very inconvenient.
Therefore, there is a need for a print control system and method that can correct errors occurring during voice recognition.
Disclosure of Invention
One of the purposes of the invention is to overcome the technical defect that existing printing systems cannot correct errors in the voice recognition process in time.
An embodiment of the present invention first provides a print control system including:
a voice information receiving unit for receiving voice information;
a voice information recognition unit for recognizing the received voice information and converting the voice information into text data;
a text data processing unit for identifying errors in the text data and correcting the erroneous content;
a format conversion unit for converting the corrected text data into printable data that can be recognized and printed by the image forming apparatus;
a printing unit for printing the printable data on a recording medium.
In one embodiment, the voice information receiving unit comprises a voice acquisition module arranged in the mobile terminal, a voice auxiliary transmission system and a voice receiving module arranged in the imaging device; wherein,
the voice acquisition module acquires voice information and transmits the voice information to the voice receiving module through the voice auxiliary transmission system.
In one embodiment, the voice information receiving unit comprises a voice input module arranged on the operation terminal, and the voice input module receives voice information and forwards the voice information to an imaging device in data communication with the operation terminal; or
The voice information receiving unit comprises a voice input module arranged on the imaging device and used for receiving voice information.
In one embodiment, the text data processing unit is arranged on the server and used for automatically identifying errors in the text data based on a corpus data model in the server and automatically correcting the content of the errors; or
The text data processing unit is arranged on the mobile terminal or the operation terminal or the imaging device and is used for displaying the text data on the display screen, receiving a modification instruction of a user on the text data and correcting wrong contents, or receiving a deletion instruction of the user on the wrong text data and receiving updated text data input by the user; or
The text data processing unit comprises an automatic correction subunit and a manual correction subunit, wherein the automatic correction subunit is arranged on the server, the manual correction subunit is arranged on the mobile terminal or the operating terminal or the imaging equipment, and the automatic correction subunit and the manual correction subunit simultaneously identify and correct errors in the text data.
In one embodiment, the print control system further comprises:
a text data translation unit for translating the text data generated by the voice information recognition unit into text data of another language.
In one embodiment, the print control system further comprises:
a formatting instruction determining unit for determining a formatting instruction from the corrected text data according to a preset formatting instruction list, so that the image forming apparatus executes the formatting instruction.
The embodiment of the invention also provides a printing control method, which comprises the following steps:
receiving voice information;
recognizing the received voice information and converting the voice information into text data;
identifying errors in the text data and correcting the content of the errors;
converting the corrected text data into printable data which can be identified and printed by the imaging equipment;
printing the printable data on a recording medium.
In one embodiment, the step of receiving the voice information comprises:
acquiring voice information by using a voice acquisition module arranged in the mobile terminal, and transmitting the voice information to a voice receiving module arranged in the imaging equipment through a voice auxiliary transmission system; or,
receiving voice information by using a voice input module arranged on an operation terminal and forwarding the voice information to an imaging device in data communication with the operation terminal; or
receiving voice information by using a voice input module arranged on the imaging device.
In one embodiment, the step of identifying errors in the text data and correcting the contents of the errors comprises:
automatically identifying errors in the text data by adopting a text data processing unit arranged on a server based on a corpus data model, and automatically correcting the content of the errors; or
Displaying the text data on a display screen by adopting a text data processing unit arranged on the mobile terminal or the operation terminal or the imaging equipment, receiving a modification instruction of a user on the text data and correcting wrong contents, or receiving a deletion instruction of the user on the wrong text data and receiving updated text data input by the user; or
simultaneously identifying and correcting errors in the text data by using an automatic correction subunit arranged on the server and a manual correction subunit arranged on a mobile terminal, an operation terminal, or an imaging device.
In one embodiment, the step of automatically identifying the error in the text data and automatically correcting the content of the error specifically includes:
dividing the voice information into a plurality of voice information sections;
comparing the voice information segment with the corpus data model base, and searching a corpus data model corresponding to the voice information segment;
acquiring text data segments obtained by converting each voice information segment;
comparing the text data segment with the corpus data model base, and searching a corpus data model corresponding to the text data segment;
and if the corpus data model corresponding to the voice information segment is different from the corpus data model corresponding to the text data segment, determining that an error exists in the text data segment obtained by converting the voice information segment, and replacing the text data segment obtained by converting the voice information segment with the text data in the corpus data model corresponding to the voice information segment.
In one embodiment, after the step of converting to text data, the method further comprises:
translating the text data generated by the voice information recognition unit into text data of another language.
In one embodiment, the step of correcting the error content further comprises:
determining a formatting instruction from the corrected text data according to a preset formatting instruction list, so that the imaging device executes the formatting instruction.
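The determination of formatting instructions from a preset list can be illustrated with a minimal sketch. The phrases and instruction names in `PRESET_INSTRUCTIONS` are hypothetical placeholders; the embodiment does not specify the contents of the list or how a phrase is matched, so simple case-insensitive substring matching is assumed here.

```python
from typing import List

# Hypothetical preset list: spoken phrase -> formatting instruction name.
# The actual contents of the list are not given in this document.
PRESET_INSTRUCTIONS = {
    "new paragraph": "PARAGRAPH_BREAK",
    "bold on": "BOLD_ON",
    "bold off": "BOLD_OFF",
}


def find_formatting_instructions(corrected_text: str) -> List[str]:
    """Return the preset instructions found in the text, in spoken order."""
    lowered = corrected_text.lower()
    hits = []
    for phrase, instruction in PRESET_INSTRUCTIONS.items():
        start = lowered.find(phrase)
        while start != -1:
            hits.append((start, instruction))
            start = lowered.find(phrase, start + len(phrase))
    # Sort by position so the instructions follow the order of the speech.
    return [instruction for _, instruction in sorted(hits)]
```

The search runs on the corrected text rather than the raw recognition result, so a misrecognized phrase has a chance to be fixed before it is interpreted as an instruction.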
The printing control system of the embodiment of the invention performs voice information recognition at the server end and has strong voice information recognition capability. Because the error recognition and correction are carried out on the text data after the voice information is converted into the text data, the error can be corrected in time when the voice information recognition is wrong, and great convenience is brought to the user.
The printing control system of the embodiment of the invention can also reserve the basic structure of the existing imaging device, further improve the function of the operation terminal to realize the correction of the errors of voice recognition, and can reduce the manufacturing cost of the control system.
The printing control system of the embodiment of the invention can also utilize the mobile terminal to collect voice information, and the voice information sent by the user at a position far away from the imaging equipment can also be printed by the imaging equipment. In practical application, different application software is installed on the mobile terminal and/or the imaging device, and remote printing of voice information can be achieved flexibly by using different communication modes. Moreover, the mobile terminal can also correct the text data obtained by converting the voice information.
The printing control system of the embodiment of the invention can also translate the user's voice information into a different language, and allows the user to control the operation of the imaging device by voice.
Additional features and advantages of the invention will be set forth in the description which follows, and in part will be obvious from the description, or may be learned by practice of the invention. The objectives and other advantages of the invention will be realized and attained by the structure particularly pointed out in the written description and claims hereof as well as the appended drawings.
Drawings
The accompanying drawings, which are included to provide a further understanding of the invention and are incorporated in and constitute a part of this specification, illustrate embodiments of the invention and together with the description serve to explain the principles of the invention and not to limit the invention. In the drawings:
FIG. 1 is a schematic structural diagram of a printing control system according to a first embodiment of the present invention;
FIG. 2 is a schematic structural diagram of a print control system according to a second embodiment of the present invention;
FIG. 3a is a schematic structural diagram of a print control system according to a third embodiment of the present invention;
FIG. 3b is a schematic diagram of another embodiment of a print control system according to a third embodiment of the present invention;
FIG. 4a is a schematic structural diagram of a printing control system according to a fourth embodiment of the present invention;
FIG. 4b is a schematic diagram of another embodiment of a printing control system according to the fourth embodiment of the present invention;
FIG. 4c is a schematic diagram of another configuration of a print control system according to a fourth embodiment of the present invention;
FIG. 5 is a flowchart illustrating steps of a print control method according to a fifth embodiment of the present invention;
FIG. 6 is a flowchart of steps of a print control method according to a sixth embodiment of the present invention.
Detailed Description
In order to make the objects, technical solutions and advantages of the present invention more apparent, the present invention is further described in detail below with reference to the accompanying drawings.
The embodiments of the present invention are described below with reference to the drawings of the specification. It should be understood that the preferred embodiments described herein are for purposes of illustration and explanation only and are not intended to limit the invention. Features of the embodiments of the invention may be combined with each other provided they do not conflict.
The embodiment of the invention provides a printing control system and a control method, which can accurately identify all voice information in daily communication, and can correct in time when errors occur in voice information identification, so that the inconvenience that a user inputs voice information again and prints the voice information again is avoided.
Example one
Fig. 1 is a schematic configuration diagram of the print control system of the present embodiment. The system mainly includes a voice information receiving unit 10, a voice information recognizing unit 11, a text data processing unit 12, a format converting unit 13, and a printing unit 14.
The voice information receiving unit 10 is used for receiving voice information. As shown in fig. 1, the voice information receiving unit 10 may include a voice input module 115 provided on the imaging device to allow a user to input voice information when operating the imaging device in use.
The voice information recognition unit 11 is used to recognize the received voice information and convert it into text data. In a preferred embodiment, the voice information recognition unit 11 is provided on a voice processing server that performs data communication with the imaging apparatus.
The text data processing unit 12 is used for identifying errors in the text data and correcting the content of the errors. As shown in fig. 1, the text data processing unit 12 may be provided on a voice processing server. In a preferred example, a corpus data model library is further integrated on the voice processing server, and the corpus data models in the corpus data model library comprise voice information segments with certain lengths and text data segments with the same meaning as the voice information segments. The text data processing unit 12 automatically recognizes an error in the text data based on the corpus data model, and automatically corrects the content of the error.
The following describes a process of automatically recognizing an error in text data and automatically correcting the content of the error. The text data processing unit 12 first divides the speech information received by the speech information receiving unit 10 into a plurality of speech information segments according to the length of the corpus data model, and then sequentially processes each speech information segment.
Take the speech information segment a as an example. In the text data that the voice information recognition unit 11 produces from the received voice information, the text corresponding to the speech information segment a is the text data segment b. The text data segment b may contain a conversion error, which the text data processing unit 12 corrects as follows.
The text data processing unit 12 compares the speech information segment a with the data in the corpus data model library to find the corpus data model A that matches the speech information segment a. It then compares the text data segment b with the data in the corpus data model library to find the corpus data model B that matches the text data segment b.
The corpus data model A is then compared with the corpus data model B. If the two are the same, it is determined that the text data segment b, which the voice information recognition unit 11 produced for the speech information segment a, contains no recognition error, and the text data segment b is kept. Otherwise, it is determined that the text data segment b contains a recognition error, and the text data segment b is replaced with the text data in the corpus data model A.
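The comparison described above can be sketched as follows. This is a minimal illustration only: the `voice_key` string is a hypothetical stand-in for whatever acoustic representation is used to match a speech information segment, and the exact-match lookups are an assumption. The case where no model matches the speech segment is not specified above; the sketch keeps the recognized text in that case.

```python
from dataclasses import dataclass
from typing import List, Optional, Sequence


@dataclass(frozen=True)
class CorpusModel:
    """One corpus data model: a speech segment paired with its text."""
    voice_key: str  # hypothetical stand-in for an acoustic fingerprint
    text: str


class CorpusLibrary:
    def __init__(self, models: Sequence[CorpusModel]) -> None:
        self._by_voice = {m.voice_key: m for m in models}
        self._by_text = {m.text: m for m in models}

    def find_by_voice(self, voice_key: str) -> Optional[CorpusModel]:
        return self._by_voice.get(voice_key)

    def find_by_text(self, text: str) -> Optional[CorpusModel]:
        return self._by_text.get(text)


def correct_segment(library: CorpusLibrary, voice_key: str, text_segment: str) -> str:
    """Correct text segment b, recognized from speech segment a."""
    model_a = library.find_by_voice(voice_key)    # model A, matched by speech
    model_b = library.find_by_text(text_segment)  # model B, matched by text
    if model_a is None:
        # Assumption: without a model for the speech segment, no judgment
        # is possible, so the recognized text is kept unchanged.
        return text_segment
    if model_a == model_b:
        return text_segment  # same model: no recognition error
    return model_a.text      # different: replace b with the text of model A


def correct_text(library: CorpusLibrary, voice_keys: Sequence[str],
                 text_segments: Sequence[str]) -> List[str]:
    """Process each speech segment and its text segment in order."""
    return [correct_segment(library, v, t) for v, t in zip(voice_keys, text_segments)]
```

For example, with a library holding models for "hello" and "yellow", a segment whose speech matches the "hello" model but whose text was recognized as "yellow" is replaced by "hello".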
However, the corpus data model library is limited and must be continuously updated and refined to keep up with rapidly changing usage. For example, current internet buzzwords such as "eyebrow beauty", " silk", and "decay" are in common use. If corpus data models for such terms are not stored in the corpus data model library, errors in the text data converted by the voice information recognition unit 11 cannot be accurately recognized and corrected.
In another manner, the text data processing unit 12 may also be provided on the imaging device for displaying text data on a display screen of the imaging device. And the user checks whether the text data has errors, and if so, receives a modification instruction of the user on the text data and corrects the error content. Or receiving a deleting instruction sent by the user, deleting the text data with errors, and receiving updated text data input by the user. The text data input after the user corrects the wrong content or deletes the wrong text data can be simultaneously returned to the corpus data model base on the voice processing server for storage so as to supplement and update the corpus data models in the corpus data model base, so that the influence of the limitation of the corpus data model base on the accuracy of voice information recognition can be gradually reduced, and the times of manual correction of the user in the text data after the voice information recognition and conversion are reduced.
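The feedback of manual corrections into the corpus data model library can be sketched as follows. The class and function names are hypothetical, and the library is reduced to a mapping from a speech segment key (a stand-in for the acoustic data) to its text; the storage format used on the voice processing server is not specified in this document.

```python
from typing import Dict, Optional


class CorpusStore:
    """Toy corpus data model library kept on the voice processing server."""

    def __init__(self) -> None:
        self._text_by_voice: Dict[str, str] = {}

    def lookup(self, voice_key: str) -> Optional[str]:
        return self._text_by_voice.get(voice_key)

    def record_correction(self, voice_key: str, corrected_text: str) -> None:
        # Add a new model, or update the existing one for this segment.
        self._text_by_voice[voice_key] = corrected_text


def apply_user_edit(store: CorpusStore, voice_key: str,
                    recognized_text: str, user_text: str) -> str:
    """Return the text to print; feed real corrections back to the store."""
    if user_text != recognized_text:
        store.record_correction(voice_key, user_text)
    return user_text
```

Only text that the user actually changed is written back, so unchanged recognition results do not alter the library.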
In still another manner, the text data processing unit 12 may include an automatic correction subunit 121 and a manual correction subunit 122 (neither is shown in the drawings). The automatic correction subunit 121 is disposed on the voice processing server and is configured to automatically recognize errors in the text data based on the corpus data models in the server and to automatically correct the erroneous content. The manual correction subunit 122 is disposed on the mobile terminal, the operation terminal, or the imaging device. It displays the automatically corrected text data on the display screen of that device, and either receives the user's modification instruction for the text data and corrects any remaining errors, or receives the user's instruction to delete the erroneous text data and then receives updated text data input by the user.
Obviously, such an arrangement can improve the accuracy of speech recognition. Moreover, the user can also supplement and update the corpus data model in the corpus data model library of the server by manually correcting the text data, and the accuracy can be improved when the text data processing unit 12 arranged in the server automatically identifies errors based on the corpus data model next time.
The format conversion unit 13 is configured to convert the text data subjected to the correction into printable data that can be recognized and printed by the image forming apparatus, and the printing unit 14 is configured to print the print data on a recording medium, thereby printing out the text data subjected to the correction.
Only a case where both the format conversion unit 13 and the printing unit 14 are provided inside the image forming apparatus is shown in the example of fig. 1. Without being limited thereto, the format conversion unit 13 may also be provided on the voice processing server.
The example of fig. 1 shows only the case where the voice processing server is provided separately from the imaging device. Without being limited thereto, the voice processing server may also be a server built into the imaging device.
The technical scheme of voice information recognition at the server side is adopted in the embodiment, so that the printing control system has strong voice information recognition capability; furthermore, after the voice information is converted into the text data, the technical scheme of carrying out error recognition and correction on the text data is adopted, so that the error can be corrected in time when the voice information is recognized in error, and great convenience is brought to the user.
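The flow through the five units of this embodiment can be summarized in a short sketch. All parameter names are hypothetical; each unit is passed in as a plain callable so that it can live on the server, the terminal, or the imaging device, as in the placements described above.

```python
from typing import Callable, Iterable


def run_print_job(
    voice: bytes,
    recognize: Callable[[bytes], str],            # voice information recognition unit
    correctors: Iterable[Callable[[str], str]],   # text data processing unit(s)
    to_printable: Callable[[str], bytes],         # format conversion unit
    send_to_printer: Callable[[bytes], None],     # printing unit
) -> str:
    """Run one received voice message through the units in order."""
    text = recognize(voice)
    # Corrections are applied in sequence, e.g. automatic first, then manual.
    for correct in correctors:
        text = correct(text)
    send_to_printer(to_printable(text))
    return text
```

Passing the correction stages as a list covers the automatic-only, manual-only, and combined arrangements without changing the rest of the flow.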
Example two
Fig. 2 is a schematic structural diagram of the print control system of the present embodiment. Unlike the first embodiment, the image forming apparatus is connected to an external operation terminal for controlling the image forming operation of the image forming apparatus.
The voice information receiving unit 10 includes a voice input module 116 provided on the external operation terminal. When the user uses the operation terminal, voice is input to the voice input module, and the voice input module receives voice information and forwards the voice information to the voice processing server. The external operation terminal can also store the voice information to be printed which is recorded in advance, and the voice information to be printed is forwarded to the voice processing server.
The voice information recognition unit 11 is provided in the voice processing server, and recognizes and converts the received voice information into text data.
The text data processing unit 12 is provided on the operation terminal, and displays the text data generated by conversion by the voice information recognition unit 11 on the display screen of the operation terminal. The user can check the text data and modify the text data by using the operating terminal to complete the correction of the wrong content or update the wrong content into new text data.
In the example of fig. 2, a format conversion unit 13 is also provided on the operation terminal, and converts the text data for which the correction is completed into printable data that can be recognized and printed by the image forming apparatus. The printing unit 14 is provided in the image forming apparatus, and prints print data on a recording medium, thereby printing out text data for which correction has been completed.
Thus, only the printing unit 14 is integrally provided in the image forming apparatus, and the voice information receiving unit 10, the text data processing unit 12, and the format conversion unit 13 are all provided on an operation terminal that performs data communication with the image forming apparatus. The system is therefore compatible with existing imaging devices, and only the operation terminal needs to be configured. The printing control system of this embodiment retains the basic structure of the existing imaging device and extends the function of the operation terminal to correct voice recognition errors, which can reduce the manufacturing cost of the control system.
Further, since manufacturers and standards of image forming apparatuses are different, formats of print data required for the apparatuses are different. The format conversion unit 13 may also be integrally provided inside the image forming apparatus, and the interface between the operation terminal and the image forming apparatus performs only the transmission of the standard text data, and the format conversion of the text data is performed by the built-in format conversion unit 13.
Example three
Fig. 3a and 3b are schematic structural views of the print control system of the present embodiment. Unlike the first two embodiments, this embodiment collects voice information with a mobile terminal, and the mobile terminal communicates with the imaging device through different network connection modes.
Mode one
In the example of fig. 3a, a voice collecting module 117 is provided within the mobile terminal, a voice receiving module 119 is provided within the imaging device, and a voice auxiliary transmission system 118 is provided between the mobile terminal and the imaging device. The voice information receiving unit 10 includes the voice collecting module 117, the voice auxiliary transmission system 118, and the voice receiving module 119. The voice collecting module 117 collects voice information and transmits it to the voice receiving module 119 through the voice auxiliary transmission system 118.
Preferably, an instant messenger can be used to implement the voice information transmission from the mobile terminal to the imaging device. These instant messaging tools include, but are not limited to WeChat, QQ of Tencent, MSN, Skype of Microsoft, ICQ of AOL, Wangwang of Taobao, Paoyi, Michata of Setaria, Feixin of China Mobile, etc. Hereinafter, details will be described by taking the WeChat as an example.
Firstly, the mobile terminal and the imaging device are both provided with WeChat clients and log in respectively. Installing a WeChat client on a mobile terminal and logging in is a well-known operation, and the process of logging in WeChat on an imaging device side is described below and can be implemented by at least one of the following two methods.
In the first method, a WeChat account is built into the imaging device at the factory, or an interface is provided so that the user can configure a WeChat account for the imaging device. In this case, a WeChat client must be installed in the imaging device. To log in to the WeChat account, the user enters the corresponding WeChat ID and WeChat login password, which can be entered through a web page embedded in the imaging device and/or through the operation panel of the imaging device.
In the second method, login is performed through the web version of WeChat. The imaging device has a built-in browser. The user starts the web version of the WeChat client through the browser's user interface, and the client presents the two-dimensional code required for login (the code can be displayed on the user interface or printed out directly). The user scans the two-dimensional code with the mobile terminal and confirms, whereupon the imaging device is logged in to the web version of WeChat.
Therefore, the mobile terminal and the imaging device are connected to the WeChat system, and the WeChat server can be used for providing a voice auxiliary transmission system.
For example, a friend relationship is established between the WeChat account of the mobile terminal and the WeChat account of the imaging device. After login is completed, the user can send voice information to the imaging device from WeChat on the mobile terminal.
As another example, a WeChat public number platform is formed by the WeChat server and the imaging device vendor's server (which may be one or more servers), and the WeChat account of the mobile terminal sends voice information to the imaging device through the WeChat public number platform. The specific implementation is as follows:
The user logs in to the WeChat account on the mobile terminal and follows the WeChat public number of the imaging device manufacturer. To do so, the user can scan a two-dimensional code provided by the manufacturer (on the imaging device, in a user manual or publicity brochure, or in any other reasonable manner), search for the manufacturer's public number by name on the "add friend" interface of the WeChat client, or use a public number name card shared by someone else. Since following a WeChat public number is a well-known operation, it is not described in detail here, and the ways of following the public number in the embodiment of the present invention are not limited to the above.
After following the manufacturer's public number, the user needs to bind the imaging device to the user's WeChat account, where binding means establishing a correspondence between the WeChat account and the imaging device. In the above step of following the manufacturer's public number, the two-dimensional code provided on the imaging device may contain specific information such as an imaging device code (a unique identifier that distinguishes the device from other devices). When the user follows the public number by scanning the two-dimensional code on the imaging device, the imaging device code and other specific information can be associated with the user's WeChat account at the same time, which binds the imaging device. If the user follows the public number by scanning a two-dimensional code provided elsewhere or in some other way, the user must bind the imaging device by manually entering the imaging device code through a menu command on the manufacturer's WeChat public number interface. After binding, the user can view a list of bound imaging devices and their status information (including but not limited to online/offline, standby/busy, and normal/fault).
It should be noted that one WeChat account can be bound to multiple imaging devices, and one imaging device can be bound to multiple WeChat accounts.
After the imaging device is bound, the user can send voice information to it through the WeChat public number platform. The user inputs voice information on the public number interface of the mobile terminal and sends it to the WeChat public number server. Before or after this sending step, the user needs to select the imaging device to be used, i.e., the target imaging device. The WeChat public number server sends the received voice information to the vendor server, and the vendor server transmits it to the target imaging device over its communication link with the imaging device.
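The binding relationship and the forwarding to a selected target device can be sketched on the vendor server side as follows. The class and method names are hypothetical, the per-device inbox is a stand-in for the communication link to the imaging device, and rejecting a device that is not bound to the sending account is an assumption not stated above.

```python
from collections import defaultdict
from typing import Dict, List, Set


class VendorServer:
    """Toy stand-in for the imaging device vendor's server."""

    def __init__(self) -> None:
        # Many-to-many: an account may bind several devices and vice versa.
        self._devices_by_account: Dict[str, Set[str]] = defaultdict(set)
        self._inbox_by_device: Dict[str, List[bytes]] = defaultdict(list)

    def bind(self, account: str, device_code: str) -> None:
        self._devices_by_account[account].add(device_code)

    def bound_devices(self, account: str) -> List[str]:
        """List the imaging devices bound to an account."""
        return sorted(self._devices_by_account.get(account, set()))

    def forward_voice(self, account: str, device_code: str, voice: bytes) -> None:
        """Deliver voice information to the target device chosen by the user."""
        if device_code not in self._devices_by_account.get(account, set()):
            # Assumption: only bound devices may be selected as the target.
            raise PermissionError(f"{device_code} is not bound to {account}")
        self._inbox_by_device[device_code].append(voice)

    def pending(self, device_code: str) -> List[bytes]:
        """Voice information waiting for delivery to one device."""
        return list(self._inbox_by_device.get(device_code, []))
```

Keying the bindings by account makes the device list shown to the user a single lookup, while the same device code may still appear under several accounts.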
Mode two
In the example of fig. 3b, a voice collecting module is provided within the mobile terminal, a voice receiving module is provided within the imaging device, and a communication connection is established between the mobile terminal and the imaging device. For example, a cloud print app, i.e., an application program that performs remote print jobs through a cloud print server, is installed on the mobile terminal. A wireless link is established between the mobile terminal and the imaging device through WiFi. The user logs in to the cloud print app with a pre-registered account, the voice collecting module on the mobile terminal then collects the voice information, and the voice information is transmitted to the voice receiving module through the WiFi wireless link.
Note that, in fig. 3a and 3b, the voice information recognition unit 11 is provided on a voice processing server in data communication with the imaging device and is configured to recognize the received voice information and convert it into text data. The text data processing unit 12 is provided in the mobile terminal and is used for recognizing errors in the text data and correcting the erroneous content. The format conversion unit 13 is also provided on the mobile terminal and converts the corrected text data into printable data that can be recognized and printed by the imaging device. The printing unit 14 is provided in the imaging device and prints the printable data on a recording medium, thereby printing out the corrected text data. For example, after a piece of voice information is entered through the cloud print app on the mobile terminal, it is sent to the voice receiving module of the imaging device via a wireless network (e.g., WiFi). The imaging device sends the voice information to the voice information recognition unit 11 on the server for recognition and conversion into text data. The server sends the text data back to the imaging device, the imaging device forwards it to the mobile terminal, and the content of the text data is displayed in the app interface of the mobile terminal. If the user finds an error in the text data, the user can select the erroneous content by long-pressing the screen, input the corrected content, and then send the corrected content to the imaging device in the form of text data.
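The manual correction step on the mobile terminal, in which the user selects erroneous content and types a replacement, can be sketched as a simple span replacement. The function name and the use of character offsets for the selection are assumptions made for illustration; how the app reports the selected range is not specified above.

```python
def apply_selection_edit(text: str, start: int, end: int, replacement: str) -> str:
    """Replace the selected characters text[start:end] with the user's input."""
    if not 0 <= start <= end <= len(text):
        raise ValueError("selection is outside the text")
    return text[:start] + replacement + text[end:]
```

For example, selecting the characters at offsets 6 to 9 of "print too copies" and entering "two" yields "print two copies", which is then sent to the imaging device as text data.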
In this embodiment, the voice information input by the user is collected by the mobile terminal, so that voice information sent by a user located far away from the imaging device can also be printed by the imaging device. In practical applications, different application software may be installed on the mobile terminal and/or the imaging device, and remote printing of voice information can be achieved flexibly by using different communication modes. Moreover, the mobile terminal can also correct the text data obtained by converting the voice information.
Not limited to the structural forms of fig. 3a and 3b, the text data processing unit 12 may be provided on a voice processing server, and the format conversion unit 13 may be provided in an image forming apparatus.
Alternatively, without being limited to the structural forms of fig. 3a and 3b, both the text data processing unit 12 and the format conversion unit 13 may be provided on the voice processing server.
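The unit placements described for fig. 3a and 3b and their two alternatives can be summarized as a small lookup table. The following Python sketch is illustrative only: the names `Host`, `PLACEMENTS`, `hosts_in_order`, and the variant labels are hypothetical and do not appear in the embodiment; only the unit numerals and their locations are taken from the description above.

```python
from enum import Enum
from typing import Dict, List


class Host(Enum):
    MOBILE_TERMINAL = "mobile terminal"
    IMAGING_DEVICE = "imaging device"
    VOICE_SERVER = "voice processing server"


# Unit numerals follow the description: 11 recognition, 12 text data
# processing, 13 format conversion, 14 printing.
# The variant labels are hypothetical names chosen for this sketch.
PLACEMENTS: Dict[str, Dict[int, Host]] = {
    "fig_3a_3b": {
        11: Host.VOICE_SERVER,
        12: Host.MOBILE_TERMINAL,
        13: Host.MOBILE_TERMINAL,
        14: Host.IMAGING_DEVICE,
    },
    "server_correction_device_conversion": {
        11: Host.VOICE_SERVER,
        12: Host.VOICE_SERVER,
        13: Host.IMAGING_DEVICE,
        14: Host.IMAGING_DEVICE,
    },
    "server_correction_and_conversion": {
        11: Host.VOICE_SERVER,
        12: Host.VOICE_SERVER,
        13: Host.VOICE_SERVER,
        14: Host.IMAGING_DEVICE,
    },
}


def hosts_in_order(variant: str) -> List[Host]:
    """Return the host of each unit in processing order 11, 12, 13, 14."""
    placement = PLACEMENTS[variant]
    return [placement[unit] for unit in sorted(placement)]
```

In every variant the recognition unit 11 stays on the voice processing server and the printing unit 14 stays in the imaging device; only units 12 and 13 move.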
Example four
Fig. 4a to 4c are schematic structural diagrams of the print control system of the present embodiment. The print control system provided in this embodiment mainly includes a voice information receiving unit 10, a voice information recognition unit 11, a text data processing unit 12, a format conversion unit 13, and a printing unit 14, and unlike the first embodiment, further includes a text data translation unit 15 and a formatting instruction determination unit 16.
The functions and the setting manners of the voice information receiving unit 10, the voice information identifying unit 11, the text data processing unit 12, the format converting unit 13, and the printing unit 14 are the same as those in the first embodiment, and are not described again.
The control system shown in fig. 4a can translate voice information input by the user in one language into another language and print it out.
In fig. 4a, a text data translation unit 15 is provided on the voice processing server and translates the text data converted by the voice information recognition unit 11 into text data of another language. For example, English text data obtained by recognizing English voice information is translated into Chinese text data. Specifically, the text data translation unit 15 compares the text data converted by the voice information recognition unit 11 with data in a database, and converts the text data into text data of another language through a series of calculations.
The text data processing unit 12 is provided on the voice processing server, and is configured to recognize an error in the translated text data and correct the content of the error.
Without being limited to the configuration shown in fig. 4a, the text data processing unit 12 may first perform error correction on the text data converted by the voice information recognition unit 11, and the text data translation unit 15 may then translate the corrected text data.
The text data translation unit 15 may also be provided in the image forming apparatus without being limited to the structural form shown in fig. 4 a.
In fig. 4a, a formatting instruction determining unit 16 is disposed in the image forming apparatus, and is configured to determine whether the received text data is a formatting instruction according to a preset formatting instruction list, and if so, the image forming apparatus may execute the formatting instruction determined by the formatting instruction determining unit 16. The formatting instruction is a command for performing specific operation control on the image forming apparatus, for example, "set paper" indicates setting of a default paper type, paper size, and the like of the image forming apparatus, and "save printing" indicates performing a print job in the save mode. Therefore, the voice control instruction of the user can be converted into the text data control instruction and then sent to the imaging device, so that the user can directly control the imaging device to execute corresponding operation by using the voice information, and great convenience and good experience are brought to the user.
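The determination made by the formatting instruction determining unit 16 can be illustrated as a lookup against a preset list. The Python sketch below is a minimal illustration under stated assumptions: the list contains only the two instructions named above, the function names are hypothetical, and the normalization (lowercasing, collapsing whitespace, dropping trailing punctuation) is an assumption, since the description does not specify how recognized text is compared with the list.

```python
from typing import Iterable, Optional

# Hypothetical preset list; the description names only these two examples.
PRESET_FORMATTING_INSTRUCTIONS = ("set paper", "save printing")


def _normalize(text: str) -> str:
    """Lowercase, collapse whitespace, drop trailing punctuation (assumed rules)."""
    return " ".join(text.lower().split()).rstrip(".!?")


def match_formatting_instruction(
    text: str,
    instructions: Iterable[str] = PRESET_FORMATTING_INSTRUCTIONS,
) -> Optional[str]:
    """Return the matching preset instruction, or None if the text is ordinary content."""
    normalized = _normalize(text)
    for instruction in instructions:
        if normalized == _normalize(instruction):
            return instruction
    return None
```

Text that matches returns the preset instruction to be executed by the image forming apparatus; any other text returns `None` and would be treated as content to print.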
Fig. 4b is another schematic structural diagram of the print control system of the present embodiment. Similarly to the second embodiment, the image forming apparatus performs data communication with an external operation terminal for controlling the image forming operation of the image forming apparatus. The voice information receiving unit 10, the text data processing unit 12, the format converting unit 13, the text data translating unit 15, and the formatting instruction determining unit 16 are all provided on the operation terminal.
The text data translation unit 15 translates the text data converted by the voice information recognition unit 11 into text data of another language. The text data processing unit 12 is configured to recognize an error in the translated text data and correct the erroneous content.
Alternatively, without being limited to the configuration shown in fig. 4b, the text data processing unit 12 may first perform error correction on the text data converted by the voice information recognition unit 11, and the text data translation unit 15 may then translate the corrected text data.
Fig. 4c is another schematic structural diagram of the print control system of this embodiment, which is mainly used to print out voice information exchanged between users who communicate in different languages. Similar to the third embodiment, the voice information is collected by using the mobile terminal, and the mobile terminal communicates with the imaging device through different network connection modes. The text data processing unit 12, the format conversion unit 13, the text data translation unit 15, and the formatting instruction determination unit 16 are all provided on the mobile terminal.
For example, when users communicate in English through instant messaging software, an English voice message can be printed remotely as Chinese text. The user can also use the mobile terminal to send English voice control instructions to remotely control the imaging device to execute corresponding operations.
EXAMPLE five
Fig. 5 is a flowchart of steps of a printing control method according to the present embodiment.
Voice information is received in step S501. Depending on which of the structures described above is used, the voice information may be received in ways including, but not limited to, the following.
The voice information is received by a voice information receiving unit 10 provided at the image forming apparatus; by a voice information receiving unit 10 provided on an external operation terminal that performs data communication with the image forming apparatus; or by a mobile terminal in data communication with the image forming apparatus.
In step S502, the received voice information is recognized and converted into text data. After the voice information receiving unit 10 receives the voice information, the voice information is forwarded to the voice processing server. A voice information recognition unit 11 provided on the voice processing server parses and compares the voice information with data in a database to recognize the voice information, and converts the voice information into text data through a series of calculations.
The voice processing server may be a web server located on a network. Without being limited thereto, it may also be a server built into the image forming apparatus.
In step S503, an error in the text data is recognized, and the content of the error is corrected. In this step, different implementation methods are provided based on different structural forms.
After the voice information is converted into text data by the voice information recognition unit 11 arranged on the voice processing server, the text data is sent to the text data processing unit 12 located in the imaging device, the operation terminal or the mobile terminal, the text data content is displayed on the webpage of the imaging device, the operation panel of the imaging device, the application program interface of the operation terminal or the app interface of the mobile terminal, and the user can correct the content of the recognition error in the text data or input the voice information again.
After the voice information is converted into text data by the voice information recognition unit 11 provided in the voice processing server, the text data may be sent to the text data processing unit 12 located in the voice processing server, and the server may perform automatic correction by analysis and calculation.
The following describes a process of performing error recognition and correction on text data by way of example, but the method of the present embodiment is not limited thereto.
For example, after a piece of voice information is input through a mobile printing app on a mobile terminal, the voice information is sent to an imaging device through a wireless network (for example, WiFi), and the imaging device forwards the voice information to the voice information recognition unit on the voice processing server side, which recognizes the voice information and converts it into text data. The voice processing server then sends the text data back to the imaging device, the imaging device sends the text data back to the mobile terminal, and the content of the text data is displayed on an app interface of the mobile terminal. If the user finds that the text data contains an error, the user can select the erroneous content by long-pressing the screen, enter the corrected content, and then send the corrected text data to the imaging device.
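The manual correction in this example amounts to replacing a selected span of the recognized text. The following Python sketch shows one possible way to model it; the `Correction` type, the function name, and the use of character offsets to represent the user's selection are assumptions made for illustration and are not specified by the embodiment.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Correction:
    start: int        # index of the first selected character
    end: int          # index one past the last selected character
    replacement: str  # corrected content entered by the user


def apply_correction(text: str, correction: Correction) -> str:
    """Replace the selected span of the recognized text with the user's input."""
    if not 0 <= correction.start <= correction.end <= len(text):
        raise ValueError("selection is outside the text")
    return text[:correction.start] + correction.replacement + text[correction.end:]


# Usage: the user selects the misrecognized word and types the intended one.
recognized = "print the weather report"
wrong = "weather"
start = recognized.index(wrong)
corrected = apply_correction(
    recognized, Correction(start, start + len(wrong), "quarterly")
)
```

The corrected string, rather than the original recognition result, is what would then be sent on as text data for format conversion.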
In step S504, the corrected text data is converted into printable data that can be recognized and printed by the image forming apparatus. After the text data processing unit 12 on the voice processing server corrects the text data, the corrected text data is sent to the format conversion unit 13 on the server side or the imaging device side. The format conversion unit 13 converts the text data into printable data that can be recognized and printed by the printing unit 14 of the image forming apparatus.
Further, when the voice information received by the voice information receiving unit in step S501 is from a mobile terminal or an operation terminal, the format converting unit 13 may also be provided in the mobile terminal or the operation terminal. At this time, after the text data processing unit 12 on the voice processing server corrects the text data, the corrected text data is transmitted to the format conversion unit 13 of the mobile terminal or the operation terminal for format conversion.
The print data is printed out in step S505. The printing unit 14 of the image forming apparatus prints the format-converted data on a recording medium, including, but not limited to, paper and film.
In the print control method in this embodiment, a technical scheme of performing voice information recognition at the server side is adopted, so that the print control system has a strong voice information recognition capability. Furthermore, after the voice information is converted into the text data, the technical scheme of carrying out error recognition and correction on the text data is adopted, so that the error can be corrected in time when the voice information is recognized in error, and great convenience is brought to the user.
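Steps S501 to S505 form a linear pipeline, which can be sketched in Python as follows. This is only an outline under stated assumptions: `run_print_job` and its parameter names are hypothetical, and each stage is passed in as a callable because the embodiment allows the corresponding unit to reside on the server, the imaging device, an operation terminal, or a mobile terminal.

```python
from typing import Callable


def run_print_job(
    voice: bytes,                          # S501: received voice information
    recognize: Callable[[bytes], str],     # S502: recognition unit 11 on the server
    correct: Callable[[str], str],         # S503: text data processing unit 12
    to_printable: Callable[[str], bytes],  # S504: format conversion unit 13
    print_out: Callable[[bytes], None],    # S505: printing unit 14
) -> str:
    """Run the five steps in order and return the corrected text that was printed."""
    text = recognize(voice)
    corrected = correct(text)
    print_out(to_printable(corrected))
    return corrected
```

Because correction (S503) runs before format conversion (S504), a recognition error is fixed before anything reaches the printing unit, which is the benefit stated above.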
EXAMPLE six
Fig. 6 is a flowchart of steps of a printing control method according to the present embodiment.
The voice information is received in step S601, and the received voice information is recognized and converted into text data in step S602. Specifically, the method described in steps S501 and S502 in the fifth embodiment may be adopted, and details are not described again.
The text data translation unit 15 translates the text data generated by the voice information recognition unit into text data of another language in step S603. In step S604, an error in the text data is recognized, and the content of the error is corrected.
In the example of fig. 6, the steps are performed in the order of step S603 and step S604, but of course, the steps may be performed in the order of step S604 and step S603.
In step S605, it is determined whether the text data is a formatting instruction. The formatting instruction determining unit 16 compares the text data with a preset formatting instruction list to make this determination. If the text data is determined to be a formatting instruction, step S606 is executed, and the processor of the imaging device executes the operation corresponding to the formatting instruction. If the text data is determined not to be a formatting instruction, step S607 is executed to convert the text data into printable data that can be recognized and printed by the image forming apparatus.
The print data is printed out in step S608.
Steps S607 and S608 are substantially the same as steps S504 and S505 in the fifth embodiment, and will not be described again.
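The branch at step S605 and the interchangeable order of steps S603 and S604 can be sketched in Python as follows. The function name, the `translate_first` flag, and the `"execute"` and `"print"` labels are hypothetical, and testing for exact membership in the instruction list is an assumption; the embodiment states only that the text data is compared with a preset list.

```python
from typing import Callable, Collection, Tuple


def process_text(
    text: str,
    translate: Callable[[str], str],  # S603: text data translation unit 15
    correct: Callable[[str], str],    # S604: text data processing unit 12
    instructions: Collection[str],    # preset formatting instruction list
    translate_first: bool = True,     # False runs S604 before S603
) -> Tuple[str, str]:
    """Return ("execute", instruction) for S606 or ("print", text) for S607 and S608."""
    steps = (translate, correct) if translate_first else (correct, translate)
    for step in steps:
        text = step(text)
    # S605: compare the resulting text with the preset list.
    if text in instructions:
        return ("execute", text)
    return ("print", text)
```

The caller would dispatch on the first element of the result: an `"execute"` result goes to the processor of the imaging device, while a `"print"` result continues to format conversion and printing.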
By applying the print control method in this embodiment, since the text data is further translated into text data of another language after the voice information is converted into the text data, voice information between users who communicate using different languages can be printed out; furthermore, whether the text data is a formatting command or not can be judged, and the voice information control command of the user can be converted into the text data control command and then sent to the imaging device, so that the user can directly control the imaging device to execute corresponding operation by using the voice information, and great convenience and good experience are brought to the user.
Although the embodiments of the present invention have been described above, the above description is only for the convenience of understanding the present invention, and is not intended to limit the present invention. It will be understood by those skilled in the art that various changes in form and details may be made therein without departing from the spirit and scope of the invention as defined by the appended claims.