JP6050171B2

Movatterモバイル変換

Info

Publication number: JP6050171B2
Application number: JP2013067907A
Authority: JP
Inventors: 英司高田; 淳内村
Original assignee: NEC Corp
Current assignee: NEC Corp
Priority date: 2013-03-28
Filing date: 2013-03-28
Publication date: 2016-12-21
Anticipated expiration: 2033-03-28
Also published as: JP2014191246A

Description

本発明は、認識処理性能を向上させる認識処理制御装置、認識処理制御方法および認識処理制御プログラムに関する。 The present invention relates to a recognition processing control device, a recognition processing control method, and a recognition processing control program for improving recognition processing performance.

近年、情報通信インフラの発展にともない、スマートフォンやタブレット端末などの情報端末の普及が進んでいる。さらに、これら情報端末を利用したクラウドサービスが始まっている。例えば、音声認識を用いたＷｅｂ検索システムや会話型応答システムがクラウドサービスとして利用可能となってきた。 In recent years, with the development of information communication infrastructure, information terminals such as smartphones and tablet terminals have been popularized. In addition, cloud services using these information terminals have begun. For example, Web search systems and conversational response systems using voice recognition have become available as cloud services.

特許文献１には、音声認識を使用した機械翻訳の技術が開示されている。特許文献１の技術は、原言語の用例に基づく音声認識結果と、原言語の文字列として音声認識結果の類似度を算定する。特許文献１の技術は、算定した類似度が所定の閾値を超えている場合に、用例に基づく音声認識結果を用いて翻訳を行うものである。 Patent Document 1 discloses a machine translation technique using speech recognition. The technique ofPatent Document 1 calculates the similarity between a speech recognition result based on a source language example and a speech recognition result as a source language character string. The technique ofPatent Document 1 performs translation using a speech recognition result based on an example when the calculated similarity exceeds a predetermined threshold.

特許文献２には、特定の文法を定義し、定義された文法に適合する音声認識を行う技術が開示されている。「文法」とは、特許文献２では、単語または文の集合との意味である。特許文献２の技術は、音声認識対象として限定された単語または文の集合を与える特定の文法を定義する。その際、その文法は、音声認識対象となる単語または文を効率よく追加、削除できる記述方法で定義される。 Patent Document 2 discloses a technique for defining a specific grammar and performing speech recognition conforming to the defined grammar. The term “grammar” means a word or a set of sentences inPatent Document 2. The technique ofPatent Document 2 defines a specific grammar that gives a set of words or sentences limited as speech recognition targets. At that time, the grammar is defined by a description method that can efficiently add and delete words or sentences to be speech-recognized.

特開２００８−０７６８６５号公報（第６〜１３頁、第１図）JP 2008-0776865 A (pages 6 to 13, FIG. 1)特開２００９−２２９５２９号公報（第６〜１４頁、第１図）JP 2009-229529 A (pages 6 to 14, FIG. 1)

特許文献１、２の技術は、特定の用例や特定の文法に基づいて音声認識を行う技術である。すなわち、特許文献１、２の技術では、特定の用例や特定の文法を用いて音声認識を行うため、音声認識を用いるサービスは、特定の用例や特定の文法に基づいた特定の分野やサービスに限定される。また、特許文献１、２の技術は、音声認識対象が特定の用例や特定の文法に基づいて限定されるため、認識不可能な音声が存在するという点で、音声認識の性能にも限界がある。 The techniques ofPatent Documents 1 and 2 are techniques for performing speech recognition based on specific examples and specific grammars. That is, in the techniques ofPatent Documents 1 and 2, since speech recognition is performed using a specific example or a specific grammar, a service using speech recognition is a specific field or service based on a specific example or a specific grammar. Limited. In addition, since the techniques ofPatent Documents 1 and 2 are limited based on specific examples and specific grammars, the voice recognition performance is limited in that there are unrecognizable voices. is there.

したがって、特許文献１、２の技術は、特定の用例や特定の文法を定義しているために、用例や文法によって特定の分野やサービスに限定され、認識対象が限定されてしまい、認識性能を向上させられないという課題がある。 Therefore, since the techniques ofPatent Documents 1 and 2 define specific examples and specific grammars, they are limited to specific fields and services depending on the examples and grammars, and the recognition targets are limited. There is a problem that it cannot be improved.

この課題は、音声認識にとどまらず、一般に、ある入力情報に対する認識処理を行う場合にも当てはまる。例えば、文字認識を行う場合に、文字認識の対象となるイメージ画像には、どの言語の文字が記載されているかが不明であるときに、日本語の文字を認識する機能しかない文字認識エンジンでは対応ができない。 This problem is not limited to voice recognition, and is generally applicable to recognition processing for certain input information. For example, when performing character recognition, a character recognition engine that has only a function of recognizing Japanese characters when it is unclear what language characters are included in the image image that is the target of character recognition. Cannot handle.

本発明の目的は、上記の課題に鑑みてなされたもので、効率的に認識処理を行って認識性能を向上させる認識処理制御装置、認識処理制御方法および認識処理制御プログラムを提供することにある。 An object of the present invention is to provide a recognition processing control device, a recognition processing control method, and a recognition processing control program that efficiently perform recognition processing to improve recognition performance. .

本発明にかかる認識処理制御装置は、認識処理対象の処理属性にしたがって認識処理を行う認識エンジンを指定するエンジン指定情報に基づいて、認識処理対象を認識エンジンに送信する送信手段と、認識エンジンから認識処理の結果情報を受信する受信手段と、予め設定された前記処理属性に対応する認識エンジンを選択する際の優先度の高さに基づいて結果情報を評価し、評価結果を出力する出力手段を備えることを特徴とする。
The recognition processing control device according to the present invention includes a transmission unit that transmits a recognition processing target to the recognition engine based on engine designation information that specifies a recognition engine that performs the recognition processing according to the processing attribute of the recognition processing target, and a recognition engine. receiving means for receiving the result information of the recognition processing, and evaluation result information basedon the height ofpriority in selecting a recognition engine corresponding to the processing attribute that is set in advance, an output means for outputting the evaluation result It is characterized by providing.

本発明にかかる認識処理制御方法は、認識処理対象の処理属性にしたがって認識処理を行う認識エンジンを指定するエンジン指定情報に基づいて、認識処理対象を認識エンジンに送信し、認識エンジンから認識処理の結果情報を受信し、予め設定された前記処理属性に対応する認識エンジンを選択する際の優先度の高さに基づいて結果情報を評価し、評価結果を出力することを特徴とする。
The recognition processing control method according to the present invention transmits a recognition processing target to a recognition engine based on engine designation information for specifying a recognition engine that performs the recognition processing according to the processing attribute of the recognition processing target. results receives information to evaluate the result information basedon the height ofpriority in selecting a recognition engine corresponding to the processing attribute set in advance, and outputs the evaluation result.

本発明にかかる認識処理制御プログラムは、認識処理制御装置が備えるコンピュータを、認識処理対象の処理属性にしたがって認識処理を行う認識エンジンを指定するエンジン指定情報に基づいて、認識処理対象を認識エンジンに送信する送信手段と、認識エンジンから認識処理の結果情報を受信する受信手段と、予め設定された前記処理属性に対応する認識エンジンを選択する際の優先度の高さに基づいて結果情報を評価し、評価結果を出力する出力手段として動作させることを特徴とする。

The recognition processing control program according to the present invention allows a computer included in a recognition processing control device to recognize a recognition processing target as a recognition engine based on engine designation information that specifies a recognition engine that performs recognition processing according to a processing attribute of the recognition processing target. evaluation and sending means for sending, receiving means for receiving the result information of the recognition processing from the recognition engine, the result information basedon the height ofpriority in selecting a recognition engine corresponding to the processing attribute set in advance And operating as output means for outputting the evaluation result.

本発明によれば、効率的に認識処理を行って認識性能を向上させることが可能となる。 According to the present invention, it is possible to improve recognition performance by efficiently performing recognition processing.

本発明の第１の実施形態の認識処理システムの構成例を示す全体構成図である。It is a whole lineblock diagram showing an example of composition of a recognition processing system of a 1st embodiment of the present invention.本発明の第１の実施形態の認識処理システムの属性情報を例示したものである。It illustrates the attribute information of the recognition processing system of the first exemplary embodiment of the present invention.本発明の第１の実施形態の認識処理システムの属性情報を例示したものである。It illustrates the attribute information of the recognition processing system of the first exemplary embodiment of the present invention.本発明に第１の実施形態の認識処理システムの属性情報を例示したものである。The attribute information of the recognition processing system of the first embodiment is illustrated in the present invention.本発明の第１の実施形態の認識処理システムの属性情報を例示したものである。It illustrates the attribute information of the recognition processing system of the first exemplary embodiment of the present invention.本発明の第１の実施形態の認識処理システムの属性情報を例示したものである。It illustrates the attribute information of the recognition processing system of the first exemplary embodiment of the present invention.本発明の第２の実施形態の認識処理制御装置の構成例を示すブロック図である。It is a block diagram which shows the structural example of the recognition process control apparatus of the 2nd Embodiment of this invention.本発明の第３の実施形態の認識処理システムの構成例を示す全体構成図である。It is a whole block diagram which shows the structural example of the recognition processing system of the 3rd Embodiment of this invention.本発明の第３の実施形態の認識処理システムの属性情報を例示したものである。It illustrates the attribute information of the recognition processing system of the third exemplary embodiment of the present invention.本発明の第３の実施形態の認識処理システムの属性情報を例示したものである。It illustrates the attribute information of the recognition processing system of the third exemplary embodiment of the present invention.本発明の第３の実施形態の認識処理システムの属性情報を例示したものである。It illustrates the attribute information of the recognition processing system of the third exemplary embodiment of the present invention.本発明の第４の実施形態の認識処理システムの構成例を示す全体構成図である。It is a whole block diagram which shows the structural example of the recognition processing system of the 4th Embodiment of this invention.本発明に第４の実施形態の認識処理制御装置の動作例を示すフローチャートである。It is a flowchart which shows the operation example of the recognition process control apparatus of 4th Embodiment to this invention.本発明の第４の実施形態の通信処理装置の動作例を示すフローチャートである。It is a flowchart which shows the operation example of the communication processing apparatus of the 4th Embodiment of this invention.

以下に、本発明の実施形態について図面を参照して詳細に説明する。
（第１の実施形態）
本発明の第１の実施形態について説明する。図１は、本発明の第１の実施形態の認識処理システムの構成例を示す全体構成図である。Embodiments of the present invention will be described below in detail with reference to the drawings.
(First embodiment)
A first embodiment of the present invention will be described. FIG. 1 is an overall configuration diagram illustrating a configuration example of a recognition processing system according to a first embodiment of this invention.

図１を参照して、本実施形態の認識処理システム１０の構成について説明する。認識処理システム１０は、入力手段１、出力手段２、属性記憶手段３、配信手段４と認識制御手段５を備える。認識処理手段９は、配信手段４と接続された認識エンジン９−１〜ｎ（ｎは、１以上の整数）を備える。 With reference to FIG. 1, the structure of therecognition processing system 10 of this embodiment is demonstrated. Therecognition processing system 10 includes aninput unit 1, anoutput unit 2, anattribute storage unit 3, adistribution unit 4 and arecognition control unit 5. Therecognition processing unit 9 includes recognition engines 9-1 to n (n is an integer of 1 or more) connected to thedistribution unit 4.

入力手段１は、認識処理を行うための認識処理対象信号の入力と、登録および操作設定を受け付ける。出力手段２は、認識処理結果情報の出力と、登録および操作設定に関わる表示および出力を行う。属性記憶手段３は、認識処理対象信号を判断するための処理属性情報、認識処理結果情報を評価するための評価情報と、認識処理手段９が備える認識エンジン９−１〜ｎに関わる配信先情報を含む属性情報を記憶する。配信手段４は、認識処理手段９が備える認識エンジン９−１〜ｎに認識処理対象情報を送信し、認識エンジン９−１〜ｎから認識処理結果情報を受信する。認識制御手段５は、入力手段１、出力手段２、属性記憶手段３、配信手段４の制御を行う。 Theinput unit 1 accepts input of a recognition processing target signal for performing recognition processing, registration, and operation settings. The output means 2 outputs recognition process result information, and displays and outputs related to registration and operation settings. Theattribute storage unit 3 includes processing attribute information for determining the recognition processing target signal, evaluation information for evaluating the recognition processing result information, and distribution destination information related to the recognition engines 9-1 to n included in therecognition processing unit 9. Attribute information including is stored. Thedistribution unit 4 transmits the recognition process target information to the recognition engines 9-1 to n included in therecognition processing unit 9, and receives the recognition process result information from the recognition engines 9-1 to n. Therecognition control unit 5 controls theinput unit 1, theoutput unit 2, theattribute storage unit 3, and thedistribution unit 4.

図１を参照して、本実施形態の認識処理システム１０の動作について説明する。入力手段１は、認識処理対象信号を入力し、認識処理対象情報に変換して認識制御手段５に送る。認識制御手段５は、入力手段１の種類と属性記憶手段３に予め記憶される属性情報に基づいて、認識処理対象情報の処理属性を判断する。「処理属性」とは、認識処理対象情報にどの認識処理を行うかを示すものである。認識制御手段５は、認識処理対象情報に対する処理の内容、例えば、音声認識あるいは画像認識等、を示す処理属性を判断する。 With reference to FIG. 1, the operation of therecognition processing system 10 of this embodiment will be described. Theinput unit 1 inputs a recognition processing target signal, converts it into recognition processing target information, and sends it to therecognition control unit 5. Therecognition control unit 5 determines the processing attribute of the recognition processing target information based on the type of theinput unit 1 and the attribute information stored in advance in theattribute storage unit 3. The “processing attribute” indicates which recognition processing is performed on the recognition processing target information. The recognition control means 5 determines a processing attribute indicating the content of processing for the recognition processing target information, for example, voice recognition or image recognition.

具体的には、認識制御手段５は、例えば、入力手段１がマイクであるとき、マイク入力に関わる処理属性を属性記憶手段３から取得する。属性記憶手段３には、マイク入力に関わる処理属性として、音声認識が記憶されているものとする。認識制御手段５は、属性記憶手段３のマイク入力に関わる処理属性に基づいて、処理属性を音声認識と判断する。すなわち、認識制御手段５は、入力手段１からの認識処理対象情報の処理属性を音声認識と判断する。 Specifically, for example, when theinput unit 1 is a microphone, therecognition control unit 5 acquires a processing attribute related to microphone input from theattribute storage unit 3. It is assumed that voice recognition is stored in theattribute storage unit 3 as a processing attribute related to microphone input. Therecognition control unit 5 determines that the processing attribute is speech recognition based on the processing attribute related to the microphone input of theattribute storage unit 3. That is, therecognition control unit 5 determines that the processing attribute of the recognition processing target information from theinput unit 1 is voice recognition.

認識制御手段５は、認識処理対象情報の処理属性に基づいて、属性記憶手段３に記憶される配信先情報を取得する。 Therecognition control unit 5 acquires distribution destination information stored in theattribute storage unit 3 based on the processing attribute of the recognition processing target information.

具体的には、認識制御手段５は、認識処理対象情報の処理属性が音声認識であることから、音声認識に関わる配信先情報を、属性記憶手段３から取得する。そして、認識制御手段５は、音声認識に関わる配信先情報に基づいて、認識処理対象情報の配信先である認識エンジンを、認識エンジン９−１〜ｎの中から選定する。例えば、認識制御手段５は、音声認識の処理属性に対応する認識エンジン９−１〜ｍ（ｍは１以上で、ｎ以下の整数）を選定する。 Specifically, since the processing attribute of the recognition processing target information is voice recognition, therecognition control unit 5 acquires distribution destination information related to voice recognition from theattribute storage unit 3. And the recognition control means 5 selects the recognition engine which is a delivery destination of recognition process target information from the recognition engines 9-1 to n based on the delivery destination information related to speech recognition. For example, the recognition control means 5 selects recognition engines 9-1 to m (m is an integer equal to or larger than 1 and equal to or smaller than n) corresponding to the processing attribute of speech recognition.

認識制御手段５は、認識処理対象情報から、選定された認識エンジン９−１〜ｍに対応する配信先認識処理対象情報＃１〜＃ｍを作成する。すなわち、認識制御手段５は、属性記憶手段３に記憶される配信先情報に基づいて、認識処理対象情報が認識エンジン９−１〜ｍに対応するように、それぞれ、配信先認識処理対象情報＃１〜＃ｍに変換する。 The recognition control means 5 creates distribution destination recognition processingtarget information # 1 to #m corresponding to the selected recognition engines 9-1 to m from the recognition processing target information. That is, the recognition control means 5 is based on the delivery destination information stored in the attribute storage means 3 so that the recognition process target information corresponds to the recognition engines 9-1 to m, respectively. 1 to #m.

認識制御手段５は、各配信先向けに変換された配信先認識処理対象情報＃１〜＃ｍを、配信手段４を経由して認識エンジン９−１〜ｍに送信する。すなわち、認識制御手段５は、配信手段４を制御して、配信先認識処理対象情報＃１〜＃ｍを、それぞれ、認識エンジン９−１〜ｍに送信する。 Therecognition control unit 5 transmits the distribution destination recognition processtarget information # 1 to #m converted for each distribution destination to the recognition engines 9-1 to m through thedistribution unit 4. That is, therecognition control unit 5 controls thedistribution unit 4 to transmit the distribution destination recognition processtarget information # 1 to #m to the recognition engines 9-1 to m, respectively.

このように、認識処理システム１０の入力手段１に入力された認識処理対象信号は、認識制御手段５によって配信先認識処理対象情報＃１〜ｍに変換されて、認識エンジン９−１〜ｍに送信される。 As described above, the recognition process target signal input to theinput unit 1 of therecognition processing system 10 is converted into the delivery destination recognition processtarget information # 1 to #m by therecognition control unit 5 and is sent to the recognition engines 9-1 to m. Sent.

認識処理手段９の認識エンジン９−１〜ｍは、送信された配信先認識処理対象情報＃１〜＃ｍを受信して認識処理を行う。認識エンジン９−１〜ｍは、認識処理結果を認識処理結果情報＃１〜＃ｍとして、配信手段４に送付する。 The recognition engines 9-1 to m of therecognition processing unit 9 receive the transmitted delivery destination recognition processtarget information # 1 to #m and perform recognition processing. The recognition engines 9-1 to m send the recognition processing results to thedistribution unit 4 as recognition processingresult information # 1 to #m.

配信手段４は、送付された認識処理結果情報＃１〜＃ｍを認識制御手段５に送る。認識制御手段５は、送られた認識処理結果情報＃１〜＃ｍを、属性記憶手段３に記憶される属性情報に基づいて評価する。例えば、認識制御手段５は、音声認識に関わる属性情報に基づいて、認識処理結果情報＃１〜＃ｍを評価する。 Thedistribution unit 4 sends the sent recognition processingresult information # 1 to #m to therecognition control unit 5. Therecognition control unit 5 evaluates the sent recognition processingresult information # 1 to #m based on the attribute information stored in theattribute storage unit 3. For example, therecognition control unit 5 evaluates the recognition process resultinformation # 1 to #m based on attribute information related to voice recognition.

そして、認識制御手段５は、評価に基づいて、認識処理結果情報＃１〜＃ｍの内、最も高い評価の認識処理結果情報、例えば認識処理結果情報＃５を出力手段２に出力する。 Based on the evaluation, therecognition control unit 5 outputs the recognition process result information with the highest evaluation among the recognition process resultinformation # 1 to #m, for example, the recognition process resultinformation # 5, to theoutput unit 2.

このように、本実施形態の認識処理システム１０は、入力手段１が受け付けた認識処理対象信号の処理属性を判断し、判断された処理属性に対応する認識エンジンを選択して、選択された認識エンジン９−１〜ｍに認識処理対象情報を送信して認識処理を行う。すなわち、本実施形態では、認識処理対象信号を予め記憶される処理属性に対応する認識エンジンを選択して認識処理を行うことができるので、認識処理対象の属性、分野やサービスが限定されない。 As described above, therecognition processing system 10 according to the present embodiment determines the processing attribute of the recognition processing target signal received by theinput unit 1, selects the recognition engine corresponding to the determined processing attribute, and selects the selected recognition. The recognition process target information is transmitted to the engines 9-1 to m to perform the recognition process. That is, in the present embodiment, the recognition process target signal can be selected and the recognition process corresponding to the processing attribute stored in advance can be selected, so that the recognition process target attribute, field, and service are not limited.

認識処理システム１０は、認識エンジン９−１〜ｍから認識処理結果情報を受信して、属性記憶手段３に記憶される属性情報に基づいて評価を行い、最も高い評価の認識処理結果情報を出力手段２に出力する。したがって、本実施形態では、認識処理対象信号の処理属性に対応する認識エンジンを複数選択して認識処理を行い、認識処理結果情報を予め記憶される属性情報に基づいて評価するので、認識処理を効率的に行い、認識性能を向上させることができる。 Therecognition processing system 10 receives the recognition processing result information from the recognition engines 9-1 to m, performs the evaluation based on the attribute information stored in the attribute storage means 3, and outputs the recognition processing result information with the highest evaluation. Output tomeans 2. Therefore, in this embodiment, a plurality of recognition engines corresponding to the processing attributes of the recognition processing target signal are selected to perform recognition processing, and the recognition processing result information is evaluated based on pre-stored attribute information. It can be performed efficiently and recognition performance can be improved.

したがって、本実施形態は、効率的に認識処理を行って認識性能を向上することが可能となる。 Therefore, this embodiment can improve recognition performance by efficiently performing recognition processing.

図２〜６は、本実施形態の認識処理システムの属性情報を例示したものである。図２〜６を参照して、本実施形態の動作について説明する。 2 to 6 exemplify attribute information of the recognition processing system according to the present embodiment. The operation of this embodiment will be described with reference to FIGS.

図２は、本実施形態の認識処理システムの属性記憶手段３に記憶される認識処理に関わる処理属性情報１００を例示したものである。 FIG. 2 illustratesprocessing attribute information 100 related to recognition processing stored in theattribute storage unit 3 of the recognition processing system of this embodiment.

認識制御手段５は、入力手段１に入力された認識処理対象信号の処理属性を判断する。具体的には、認識制御手段５は、入力手段１に入力された認識処理対象信号が、例えば、マイク入力であるとき、マイク入力についての処理属性を属性記憶手段３から取得する。認識制御手段５は、属性記憶手段３に予め記憶される認識処理の処理属性情報１００を参照する。認識制御手段５は、入力デバイス欄１０１の「マイク」に関わる処理属性欄１０２に記載されている「音声認識」が、処理属性であると判断する。例えば、スキャナ入力であれば、「スキャナ」に関わる処理属性は、「画像認識」、「顔認識」、「文字認識」、「バーコード認識」と判断される。 Therecognition control unit 5 determines the processing attribute of the recognition processing target signal input to theinput unit 1. Specifically, therecognition control unit 5 acquires the processing attribute for the microphone input from theattribute storage unit 3 when the recognition processing target signal input to theinput unit 1 is, for example, a microphone input. Therecognition control unit 5 refers to theprocessing attribute information 100 of the recognition process stored in advance in theattribute storage unit 3. Therecognition control unit 5 determines that “speech recognition” described in theprocessing attribute column 102 related to “microphone” in theinput device column 101 is a processing attribute. For example, in the case of scanner input, the processing attributes related to “scanner” are determined as “image recognition”, “face recognition”, “character recognition”, and “barcode recognition”.

図３は、本実施形態の認識処理システムの属性記憶手段３に記憶される認識処理に関わる配信先情報１１０を例示したものである。音声認識エンジン＃１、画像認識エンジン＃１、標識認識エンジン＃１等は、認識エンジン９−１〜ｎのいずれかである。認識制御手段５は、認識エンジン９−１〜ｎの中から認識エンジンを指定するエンジン指定情報として、配信先情報を使用する。 FIG. 3 exemplifies thedelivery destination information 110 related to the recognition process stored in theattribute storage unit 3 of the recognition processing system of this embodiment. The speechrecognition engine # 1, the imagerecognition engine # 1, the signrecognition engine # 1, and the like are any of the recognition engines 9-1 to n. The recognition control means 5 uses the delivery destination information as engine designation information for designating a recognition engine from among the recognition engines 9-1 to n.

認識制御手段５は、入力手段１に入力された認識処理対象信号の処理属性の判断に基づいて、属性記憶手段３に記憶される配信先情報を取得する。具体的には、認識制御手段５は、認識処理対象信号の処理属性が、例えば、「音声認識」であるとき、「音声認識」に関わる配信先情報を、属性記憶手段３から取得する。認識制御手段５は、属性記憶手段３に予め記憶される認識処理の配信先情報１１０を参照して、処理属性欄１１１の「音声認識」に関わる配信先欄１１２、入力条件欄１１３などに記憶されている配信先情報を取得する。 Therecognition control unit 5 acquires the delivery destination information stored in theattribute storage unit 3 based on the determination of the processing attribute of the recognition processing target signal input to theinput unit 1. Specifically, therecognition control unit 5 acquires distribution destination information related to “voice recognition” from theattribute storage unit 3 when the processing attribute of the recognition processing target signal is “voice recognition”, for example. Therecognition control unit 5 refers to thedistribution destination information 110 of the recognition process stored in advance in theattribute storage unit 3 and stores it in thedistribution destination column 112, theinput condition column 113, and the like related to “voice recognition” in theprocessing attribute column 111. Get distribution destination information.

認識制御手段５は、配信先情報に基づいて、認識処理対象情報の配信先を選定する。具体的には、認識制御手段５は、例えば、処理属性が「音声認識」であれば、属性記憶手段３に予め記憶される認識処理の配信先情報１１０を参照して、処理属性欄１１１の「音声認識」に関わる配信先欄１１２、入力条件欄１１３などの配信先情報に基づいて、音声認識エンジンを選択する。例えば、処理属性が「画像認識」であれば、認識制御手段５は、「画像認識」に関わる配信先情報に基づいて、画像認識エンジンを選択すればよい。 Therecognition control unit 5 selects a distribution destination of recognition processing target information based on the distribution destination information. Specifically, for example, if the process attribute is “voice recognition”, therecognition control unit 5 refers to thedistribution destination information 110 of the recognition process stored in advance in theattribute storage unit 3, and A speech recognition engine is selected based on distribution destination information such as adistribution destination column 112 and aninput condition column 113 related to “voice recognition”. For example, if the processing attribute is “image recognition”, therecognition control unit 5 may select an image recognition engine based on distribution destination information related to “image recognition”.

認識制御手段５は、選択された認識処理エンジンに対応する配信先情報に基づいて、認識処理対象情報から、配信先認識処理対象情報を作成する。具体的には、認識制御手段５は、例えば、選択された音声認識エンジン＃１であれば、音声認識エンジン＃１の入力条件欄１１３などに基づいて、認識処理対象情報を、音声認識エンジン＃１の入力条件に合致するように変換する。すなわち、認識制御手段５は、認識処理対象情報を、音声認識エンジン＃１に入力できるように、例えば、情報のデータ形式やデータ長などを変換して、配信先認識処理対象情報＃１を作成する。 The recognition control means 5 creates delivery destination recognition processing target information from the recognition processing target information based on the delivery destination information corresponding to the selected recognition processing engine. Specifically, for example, if the selected speechrecognition engine # 1, therecognition control unit 5 converts the recognition processing target information into the speechrecognition engine # 1 based on theinput condition column 113 of the speechrecognition engine # 1. 1 so as to meet the input condition of 1. That is, the recognition control means 5 creates the destination recognition processingtarget information # 1 by converting the data format, data length, etc. of the information, for example, so that the recognition processing target information can be input to the voicerecognition engine # 1. To do.

認識制御手段５は、選択された認識処理エンジンに対応する配信先欄１１２に基づいて、作成された配信先認識対象情報を、配信先である認識エンジンに配信手段４を経由して送信する。すなわち、認識制御手段５は、例えば、配信先認識処理情報＃１を、配信先である音声認識エンジン＃１に、配信手段４を経由して送信する。 Therecognition control unit 5 transmits the created distribution destination recognition target information to the recognition engine that is the distribution destination via thedistribution unit 4 based on thedistribution destination column 112 corresponding to the selected recognition processing engine. That is, therecognition control unit 5 transmits, for example, distribution destination recognitionprocessing information # 1 to the voicerecognition engine # 1 that is the distribution destination via thedistribution unit 4.

認識エンジンは、送信された配信先認識処理対象情報を受け付けて、認識処理を行った認識処理結果情報を配信手段４に送付する。すなわち、本実施形態では、認識処理対象信号を予め記憶される処理属性に対応する認識エンジンを選択して認識処理を行うことができるので、認識処理対象の属性、分野やサービスが限定されない。 The recognition engine receives the transmitted delivery destination recognition process target information and sends the recognition process result information obtained by the recognition process to the delivery means 4. That is, in the present embodiment, the recognition process target signal can be selected and the recognition process corresponding to the processing attribute stored in advance can be selected, so that the recognition process target attribute, field, and service are not limited.

配信手段４は、送付された認識処理結果情報を認識制御手段５に送る。認識制御手段５は、送られた認識処理結果情報を、属性記憶手段３に記憶される属性情報に基づいて評価する。すなわち、認識制御手段５は、例えば、音声認識エンジン＃１からの認識処理結果情報＃１を、音声認識に関わる属性情報に基づいて評価する。 The distribution means 4 sends the sent recognition processing result information to the recognition control means 5. Therecognition control unit 5 evaluates the sent recognition processing result information based on the attribute information stored in theattribute storage unit 3. That is, the recognition control means 5 evaluates the recognition processingresult information # 1 from the speechrecognition engine # 1 based on attribute information related to speech recognition, for example.

認識処理結果情報の評価については、音声認識を例として説明する。図４、５、６は、本実施形態の認識処理システムの属性記憶手段３に記憶される音声認識の評価に関わる属性情報を例示したものである。図４、５、６を参照して、各図の構成について説明する。 The evaluation of the recognition processing result information will be described using voice recognition as an example. 4, 5, and 6 exemplify attribute information related to speech recognition evaluation stored in theattribute storage unit 3 of the recognition processing system according to the present embodiment. The configuration of each figure will be described with reference to FIGS.

図４は、音声認識に関わる文法番号と文法の評価情報２００を例示する。「文法」とは、特定の単語、または複数の単語群を組み合わせたものである。図４の文法欄２０２は、単語群を表すルール名、例えば、［ＭＯＮＴＨ］や［ＤＡＹ］および特定の単語、例えば「を」や「再生」などを組み合わせた文法の具体例を、文法番号欄２０１と対応付けて示す。 FIG. 4 illustrates a grammar number andgrammar evaluation information 200 related to speech recognition. “Grammar” is a specific word or a combination of a plurality of words. Thegrammar column 202 in FIG. 4 includes a rule name representing a word group, for example, [MONTH] or [DAY], and a specific example of a grammar combining a specific word such as “O” or “Replay”. 201 in association with each other.

図５は、音声認識に関わる単語群の評価情報２１０を例示する。図５では、ルール名欄２１１に単語群を表す名称がルール名として示され、表記欄２１２に各ルール名に対応する複数の単語が示される。図５の同音語・同意語欄２１３には、表記欄２１２に示された単語に対応する同音語（対応する単語の発声音が同じ単語）や同意語（対応する単語と発声音が異なるが意味が同じ単語）が記載される。 FIG. 5 illustrates theevaluation information 210 of the word group related to speech recognition. In FIG. 5, a name representing a word group is shown as a rule name in therule name column 211, and a plurality of words corresponding to each rule name are shown in thenotation column 212. In the homophone /synonymous word column 213 in FIG. 5, the homophones corresponding to the words shown in the notation column 212 (words corresponding to the same utterance) and synonyms (corresponding words and utterances are different). Words with the same meaning).

図６は、音声認識が利用される利用目的に対応する、文法の評価情報２２０を例示する。「利用目的」とは、入力手段１に認識処理対象信号が入力されたときの、認識処理システム１０のユーザ（以下、ユーザという。）が利用していたサービス、アプリケーションや出力手段２で出力していた表示内容が示す、認識処理結果情報を利用する利用目的である。例えば、ユーザがカーナビゲーションシステムを操作して、目的地を入力している場合は、「利用目的」は、図６の利用目的欄２２１の「目的地入力」となる。利用目的が「目的地入力」であるとき、対応する文法として、文法番号欄２２２の「２」が設定される。 FIG. 6 illustratesgrammar evaluation information 220 corresponding to the purpose of use for which speech recognition is used. The “purpose of use” is output by the service, application, or output means 2 used by the user of the recognition processing system 10 (hereinafter referred to as the user) when the recognition processing target signal is input to the input means 1. This is a purpose of using the recognition processing result information indicated by the displayed display content. For example, when the user inputs a destination by operating the car navigation system, the “use purpose” is “use destination” in theuse purpose column 221 of FIG. When the purpose of use is “destination input”, “2” in thegrammar number column 222 is set as the corresponding grammar.

次に、図４、５、６を参照して、認識制御手段５が行う認識処理結果情報の評価について説明する。認識制御手段５は、入力手段１に認識処理対象信号が入力されたときの利用目的を確認する。例えば、利用目的が「目的地入力」であれば、認識制御手段５は、図６の音声認識に関わる属性情報の利用目的に基づいて、認識処理結果情報の評価に適用すべき文法番号「２」を選択する。認識制御手段５は、図４の音声認識に関わる文法に基づいて、文法番号「２」に対応する文法定義を選択する。認識制御手段５は、図５の音声認識に関わる単語群に基づいて、文法番号「２」に関わる単語群を選択する。 Next, with reference to FIGS. 4, 5, and 6, evaluation of recognition processing result information performed by therecognition control unit 5 will be described. Therecognition control unit 5 confirms the purpose of use when the recognition process target signal is input to theinput unit 1. For example, if the purpose of use is “destination input”, the recognition control means 5 uses the grammar number “2” to be applied to the evaluation of the recognition processing result information based on the purpose of use of the attribute information related to speech recognition in FIG. ”Is selected. The recognition control means 5 selects a grammar definition corresponding to the grammar number “2” based on the grammar related to speech recognition in FIG. The recognition control means 5 selects a word group related to the grammar number “2” based on the word group related to the speech recognition of FIG.

このように、認識制御手段５は、利用目的に基づいて、認識処理対象信号の認識結果の評価に用いる評価情報を選択する。そして、認識制御手段５は、選択された評価情報に基づいて、認識処理結果情報を評価する。すなわち、認識制御手段５は、音声認識エンジン＃１の認識処理結果情報＃１を、「目的地入力」に関わる文法番号「２」の文法定義と、文法定義に対応した特定の単語および単語群との適合度を評価すればよい。具体的には、認識制御手段５は、認識処理結果情報＃１が、選択された文法定義、特定の単語および単語群と、完全一致するか否か、または含んでいるか否かを評価すればよい。 As described above, the recognition control means 5 selects evaluation information used for evaluating the recognition result of the recognition processing target signal based on the purpose of use. And the recognition control means 5 evaluates recognition process result information based on the selected evaluation information. That is, the recognition control means 5 uses the recognition processingresult information # 1 of the speechrecognition engine # 1 as the grammar definition of the grammar number “2” related to “destination input” and a specific word and word group corresponding to the grammar definition. The degree of conformity with Specifically, the recognition control means 5 evaluates whether or not the recognition processingresult information # 1 completely matches or includes the selected grammar definition, the specific word and the word group. Good.

このとき、例えば、特定の分野や単語などの文法を定義しない音声認識エンジンの認識処理結果情報は、選択された評価情報に基づいて評価される。また、特定の分野や単語などの文法を定義して認識結果を出力する音声認識エンジンの認識結果情報も、選択された評価情報に基づいて、同様に評価される。 At this time, for example, recognition processing result information of a speech recognition engine that does not define a grammar such as a specific field or word is evaluated based on the selected evaluation information. The recognition result information of the speech recognition engine that defines the grammar of a specific field or word and outputs the recognition result is similarly evaluated based on the selected evaluation information.

したがって、認識制御手段５は、利用目的に基づいて選択された評価情報を使用して評価を行う。すなわち、特定の分野や単語などの文法を定義しない音声認識エンジンの認識処理結果情報は、利用目的に基づいて選択された評価情報を使用して効率的に評価される。また、特定の分野や単語などの文法を定義して認識結果を出力する音声認識エンジンの認識結果情報は、例えば、利用目的に基づいて選択された評価情報が、音声認識エンジンの文法定義と合致していなければ、低い評価となり、合致していれば、高い評価を得る可能性がある。 Therefore, the recognition control means 5 performs evaluation using the evaluation information selected based on the purpose of use. That is, recognition processing result information of a speech recognition engine that does not define a grammar such as a specific field or word is efficiently evaluated using evaluation information selected based on the purpose of use. The recognition result information of the speech recognition engine that defines the grammar of a specific field or word and outputs the recognition result is, for example, the evaluation information selected based on the purpose of use matches the grammar definition of the speech recognition engine. If it is not done, the evaluation is low, and if it matches, there is a possibility of obtaining a high evaluation.

そして、認識制御手段５は、認識処理結果情報の内、最も高い評価の認識処理結果情報を、出力手段２に出力する。あるいは、認識制御手段５は、認識処理結果情報の内、評価の上位から所定の順位までの認識処理結果情報を、出力手段２に出力してもよい。また、認識制御手段５は、認識処理結果情報の評価が所定の閾値に至らない場合や、所定の時間内で認識処理結果情報が送付されない場合には、認識処理結果なしとして、出力手段２に出力してもよい。 Then, therecognition control unit 5 outputs the recognition process result information with the highest evaluation among the recognition process result information to theoutput unit 2. Alternatively, therecognition control unit 5 may output the recognition process result information from the higher rank of the evaluation to a predetermined rank among the recognition process result information to theoutput unit 2. Further, the recognition control means 5 determines that there is no recognition processing result and outputs it to the output means 2 when the evaluation of the recognition processing result information does not reach a predetermined threshold value or when the recognition processing result information is not sent within a predetermined time. It may be output.

このように、本実施形態の認識処理システム１０は、入力手段１から入力された認識処理対象信号の処理属性を判断し、判断された処理属性に対応する認識エンジンを選択して、選択された認識エンジンに認識処理対象情報を送信して認識処理を行う。すなわち、本実施形態では、認識処理対象信号を予め記憶される処理属性に対応する認識エンジンを選択して認識処理を行うことができるので、認識処理対象の属性、分野やサービスが限定されない。 As described above, therecognition processing system 10 of the present embodiment determines the processing attribute of the recognition processing target signal input from theinput unit 1, selects the recognition engine corresponding to the determined processing attribute, and is selected. The recognition process target information is transmitted to the recognition engine to perform the recognition process. That is, in the present embodiment, the recognition process target signal can be selected and the recognition process corresponding to the processing attribute stored in advance can be selected, so that the recognition process target attribute, field, and service are not limited.

そして、認識処理システム１０は、認識エンジンの認識処理結果情報を受信して、属性記憶手段３に記憶される属性情報に基づいて評価を行い、最も高い評価の認識処理結果情報を出力手段２に出力する。したがって、本実施形態では、例えば、特定の分野や単語などの文法を定義しない汎用の音声認識エンジンと、特定の分野や単語などの文法を定義して認識結果を出力する専用の音声認識エンジンの両方を選択して使用することができる。すなわち、本実施形態では、認識処理対象信号の処理属性に対応する認識エンジンを複数選択して認識処理を行い、認識処理結果を予め記憶される属性情報に基づいて評価するので、認識処理を効率的に行い、認識性能を向上することができる。 Then, therecognition processing system 10 receives the recognition process result information of the recognition engine, performs the evaluation based on the attribute information stored in theattribute storage unit 3, and outputs the recognition process result information with the highest evaluation to theoutput unit 2. Output. Therefore, in this embodiment, for example, a general-purpose speech recognition engine that does not define a grammar of a specific field or word, and a dedicated speech recognition engine that defines a grammar of a specific field or word and outputs a recognition result. Both can be selected and used. That is, in this embodiment, since recognition processing is performed by selecting a plurality of recognition engines corresponding to the processing attribute of the recognition processing target signal and the recognition processing result is evaluated based on pre-stored attribute information, the recognition processing is efficiently performed. Recognition performance can be improved.

したがって、本実施形態は、認識処理の属性、対象分野やサービスが限定されず、効率的に認識処理を行って認識性能を向上することが可能となる。 Therefore, in the present embodiment, the recognition process attributes, target fields, and services are not limited, and the recognition performance can be improved by efficiently performing the recognition process.

なお、本実施形態では、認識処理対象信号はマイク入力によるものとして説明したが、マイク入力に限定されない。例えば、カメラ入力でもよいし、スキャナ入力でもよいし、振動センサ入力でもよいし、タッチセンサでもよい。すなわち、認識処理対象信号は、認識処理システムの入力手段が備える入力デバイスから入力されるものであって、認識処理対象情報として取り扱えるものであればよい。 In the present embodiment, the recognition processing target signal is described as being based on microphone input, but is not limited to microphone input. For example, camera input, scanner input, vibration sensor input, or touch sensor may be used. That is, the recognition processing target signal may be input from an input device provided in the input unit of the recognition processing system and can be handled as recognition processing target information.

また、本実施形態では、マイクから入力された認識処理対象情報の処理属性は、音声認識として説明したが、マイクに対応する処理属性は音声認識に限定されない。例えば、マイクに対応する処理属性は、話者識別（発声した話者が特定の話者であるか否かを識別する）でもよいし、楽曲識別（発声された音、例えばハミングから楽曲を識別する）でもよいし、ＤＴＭＦ認識（ＤｕａｌＴｏｎｅＭｕｔｉＦｒｅｑｕｅｎｃｙ：プッシュ式電話などのプッシュボタンを押すたびに発信される音から、押されたプッシュボタンを識別する）でもよい。すなわち、認識処理対象情報は、認識処理システムの入力手段が備えるマイクから入力されたものであって、入力された認識処理対象信号を認識処理対象情報として取り扱えるものであればよい。 In the present embodiment, the processing attribute of the recognition processing target information input from the microphone has been described as speech recognition. However, the processing attribute corresponding to the microphone is not limited to speech recognition. For example, the processing attribute corresponding to the microphone may be speaker identification (identifying whether or not the speaker who uttered is a specific speaker) or music identification (identifying music from uttered sound, for example, humming). Or DTMF recognition (Dual Tone Muti Frequency: identifying a pushed push button from a sound transmitted each time a push button such as a push-type telephone is pushed). That is, the recognition processing target information may be input from a microphone included in the input unit of the recognition processing system and can handle the input recognition processing target signal as recognition processing target information.

また、本実施形態では、認識処理結果情報の評価は、音声認識のみの認識結果を評価するものとして説明したが、認識結果の評価は単一の認識結果の評価に限定されない。例えば、音声認識と話者識別の認識結果を併用して評価してもよい。すなわち、音声指令を受け付ける場合には、予め登録された話者の発声であるか否かを識別する話者識別の認識結果と、発声された指令を認識する音声認識の認識結果に基づいて、両者の認識結果を併用して評価してもよい。
また、認識処理対象情報がカメラによる画像である場合には、画像認識の認識結果として文字部分の位置が特定され、特定された部分に対応する文字認識の認識結果が評価してもよい。すなわち、認識結果の評価は、認識処理対象情報を認識処理した結果を併用して評価してもよい。In the present embodiment, the evaluation of the recognition processing result information has been described as evaluating the recognition result of only speech recognition. However, the evaluation of the recognition result is not limited to the evaluation of a single recognition result. For example, speech recognition and speaker identification recognition results may be used in combination for evaluation. That is, when accepting a voice command, based on a recognition result of speaker identification for identifying whether or not the voice of a speaker registered in advance and a recognition result of voice recognition for recognizing the spoken command, You may evaluate using both recognition results together.
Further, when the recognition processing target information is an image by a camera, the position of the character part may be specified as the recognition result of the image recognition, and the recognition result of the character recognition corresponding to the specified part may be evaluated. In other words, the recognition result may be evaluated by using the result of recognition processing of the recognition processing target information.

また、本実施形態では、認識処理対象情報の属性の異なる認識処理結果情報が併用され、評価されてもよい。例えば、予め記憶された個人を識別する際には、音声識別と画像識別が併用されてよい。すなわち、マイクまたは通話回線からの音声信号が音声識別された識別結果と、カメラからの映像信号が画像識別された識別結果が併用されて、個人識別の評価が行われてもよい。 In the present embodiment, recognition process result information having different attributes of the recognition process target information may be used together and evaluated. For example, when identifying an individual stored in advance, voice identification and image identification may be used in combination. In other words, the identification result obtained by voice identification of the audio signal from the microphone or the telephone line and the identification result obtained by image identification of the video signal from the camera may be used in combination to evaluate personal identification.

また、本実施形態では、認識処理結果情報の評価に関わる属性情報は、音声認識に関わる評価に関わる属性情報を固定的なものとして説明したが、認識処理の評価に関わる属性情報は固定的なものに限定されない。 In the present embodiment, the attribute information related to the evaluation of the recognition processing result information has been described as the fixed attribute information related to the evaluation related to speech recognition. However, the attribute information related to the evaluation of the recognition processing is fixed. It is not limited to things.

例えば、ユーザが利用しているサービスやアプリケーションなどに基づいて、評価に関わる属性情報は取得されて、更新されてもよい。つまり、ユーザがカーナビゲーションシステムを操作して「目的地入力」しているときに表示されている地図情報に基づいて、表示されている地図に関わる地名情報やランドマーク情報などが取得されて、［ＬＯＣＡＴＩＯＮ］（図５参照）に対応する単語群として評価情報に追加されてもよい。 For example, the attribute information related to the evaluation may be acquired and updated based on the service or application used by the user. In other words, based on the map information displayed when the user operates the car navigation system and performs “destination input”, the place name information and landmark information related to the displayed map are acquired, It may be added to the evaluation information as a word group corresponding to [LOCATION] (see FIG. 5).

また、カーナビゲーションシステムがＣＤ（登録商標、Compact Disk:ディスク規格の１つ）プレーヤ機能を備える場合であれば、ユーザがＣＤプレーヤを操作しているときに、ＣＤに記憶された楽曲名やアーティスト名が取得されて、［ＭＵＳＩＣ］（図５参照）に対応する単語群として評価情報に追加されてもよい。すなわち、認識処理結果情報の評価に関わる属性情報は、予め登録された固定的なものでもよく、ユーザが使用するサービスやアプリケーションなどが認識処理結果情報を利用する利用目的に基づいて取得される情報であって、評価に関わる属性情報として追加できる情報でもよい。 Also, if the car navigation system has a CD (registered trademark, one of the disc standards) player function, when the user operates the CD player, the music title and artist stored on the CD are displayed. A name may be acquired and added to the evaluation information as a word group corresponding to [MUSIC] (see FIG. 5). That is, the attribute information related to the evaluation of the recognition processing result information may be fixed information registered in advance, and is acquired based on the purpose of use of the recognition processing result information by the service or application used by the user. However, it may be information that can be added as attribute information related to evaluation.

また、本実施形態では、認識処理結果情報の評価結果は、評価に基づいて、最も高い評価の認識処理結果情報を出力手段に出力すると説明したが、認識処理結果情報は最も高い評価結果には限定されない。例えば、認識制御手段は、認識処理結果情報の内、評価の上位から所定の順位までの認識処理結果情報を出力してもよい。また、認識制御手段は、認識処理結果情報の評価が所定の閾値以上の認識処理結果情報を出力してもよい。また、認識制御手段は、予め配布先情報に設定された優先度に基づいて、優先度の高い認識エンジンの認識処理結果情報を出力してもよい。また、ユーザが使用するサービスやアプリケーションに基づいて、認識処理結果情報の評価結果の出力方法は適宜変更されてもよい。 In the present embodiment, the evaluation result of the recognition process result information has been described as outputting the highest evaluation recognition process result information to the output unit based on the evaluation. However, the recognition process result information is the highest evaluation result. It is not limited. For example, the recognition control means may output the recognition process result information from the top of the evaluation to a predetermined rank in the recognition process result information. The recognition control means may output recognition process result information whose evaluation of the recognition process result information is equal to or greater than a predetermined threshold. Further, the recognition control means may output recognition process result information of a recognition engine having a high priority based on a priority set in advance in the distribution destination information. Moreover, based on the service and application which a user uses, the output method of the evaluation result of recognition process result information may be changed suitably.

また、本実施形態では、認識処理手段が備える認識エンジンは、固定的ではない。例えば、認識処理結果情報の評価結果に基づいて、ユーザは認識エンジンの評価や優先度付けを行って配信先情報に記憶させてもよい。認識制御手段は、予め設定された評価や優先度に基づいて認識エンジンを適宜選択すればよい。すなわち、予め設定された評価値が所定の閾値に至らない認識エンジンは認識処理の選択対象から外されてもよい。認識エンジンは予め設定された優先度に基づいて選択されてもよい。認識エンジンは新たに追加されてもよい。また、認識制御手段は、認識エンジンとの接続状態または認識エンジンの稼働状態の確認に基づいて、認識エンジンを選択してもよい。すなわち、認識エンジンは、認識処理の処理属性と配信先情報が明示され、認識処理対象情報を入力して認識処理結果情報を出力できるものであればよい。
（第２の実施形態）
本発明において、図１に示された本実施形態の認識処理システム１０における入力手段１、出力手段２、配信手段４は必須ではない。すなわち、認識制御手段５が、認識処理対象情報を入力されて、外部の属性記憶手段３に記憶される属性情報に基づいて、外部の認識処理手段９が備える認識エンジン９−１〜ｎに送信して認識処理結果情報を取得して評価を行えればよい。したがって、本発明に必須の構成は、認識制御手段５のみである。In the present embodiment, the recognition engine included in the recognition processing unit is not fixed. For example, based on the evaluation result of the recognition processing result information, the user may perform recognition engine evaluation and prioritization and store the information in the distribution destination information. The recognition control means may select a recognition engine as appropriate based on a preset evaluation or priority. That is, a recognition engine in which a preset evaluation value does not reach a predetermined threshold value may be excluded from selection targets for recognition processing. The recognition engine may be selected based on a preset priority. A recognition engine may be newly added. Further, the recognition control means may select the recognition engine based on confirmation of a connection state with the recognition engine or an operation state of the recognition engine. That is, the recognition engine only needs to be able to output the recognition process result information by inputting the recognition process target information by specifying the process attribute of the recognition process and the delivery destination information.
(Second Embodiment)
In the present invention, the input means 1, the output means 2, and the distribution means 4 in therecognition processing system 10 of the present embodiment shown in FIG. 1 are not essential. That is, therecognition control unit 5 receives the recognition process target information and transmits it to the recognition engines 9-1 to n included in the externalrecognition processing unit 9 based on the attribute information stored in the externalattribute storage unit 3. Then, it is only necessary to obtain recognition processing result information and perform evaluation. Accordingly, only the recognition control means 5 is essential for the present invention.

図７は、本発明の最小の構成を備える、第２の実施形態の認識処理制御装置２０の構成例を示すブロック図である。 FIG. 7 is a block diagram illustrating a configuration example of the recognitionprocessing control device 20 according to the second embodiment having the minimum configuration of the present invention.

認識処理制御装置２０は、認識制御手段２１を備える。認識制御手段２１は、情報送信部２２と、結果受信部２３と評価結果出力部２４を備える。情報送信部２２は、認識処理を行う認識エンジンを指定するエンジン指定情報に基づいて、認識処理対象情報を送信する。結果受信部２３は、認識エンジンから認識処理結果情報を受信する。評価結果出力部２４は、所定の評価情報に基づいて認識処理結果情報を評価し、評価結果を出力する。 The recognitionprocessing control device 20 includes arecognition control unit 21. Therecognition control unit 21 includes aninformation transmission unit 22, aresult reception unit 23, and an evaluationresult output unit 24. Theinformation transmission unit 22 transmits the recognition process target information based on the engine designation information that designates the recognition engine that performs the recognition process. Theresult receiving unit 23 receives recognition processing result information from the recognition engine. The evaluationresult output unit 24 evaluates the recognition processing result information based on predetermined evaluation information, and outputs the evaluation result.

なお、本実施形態では、認識処理結果情報の評価に基づいて、評価結果出力部は、最も高い評価の評価結果を出力すればよい。あるいは、評価結果出力部は、予め設定された条件を満足する評価結果を出力してもよい。例えば、予め設定された閾値を超える評価の評価結果が出力されてもよい。 In the present embodiment, the evaluation result output unit may output the evaluation result with the highest evaluation based on the evaluation of the recognition processing result information. Alternatively, the evaluation result output unit may output an evaluation result that satisfies a preset condition. For example, an evaluation result of evaluation exceeding a preset threshold value may be output.

また、本実施形態では、認識処理対象情報の属性は、１種類であってもよいし、複数種類であってもよい。すなわち、本実施形態は、認識処理対象情報の属性に基づいて、認識エンジンは選択されてもよい。本実施形態は、認識処理対象情報の属性に基づいて、評価情報は選択されてもよい。本実施形態は、複数の属性の異なる認識エンジンの結果情報を併用して評価してもよいし、同一の属性の認識エンジンの結果情報を併用して評価してもよい。例えば、話者識別エンジンの結果情報と顔認証の結果情報を併用して個人識別が評価されてもよいし、日本語音声認識エンジンの結果情報と英語音声認識エンジンの結果情報を併用して数字認識が評価されてもよい。 In the present embodiment, the recognition process target information may have one or more attributes. That is, in this embodiment, the recognition engine may be selected based on the attribute of the recognition processing target information. In the present embodiment, the evaluation information may be selected based on the attribute of the recognition process target information. In the present embodiment, the result information of recognition engines having different attributes may be used together for evaluation, or the result information of recognition engines having the same attribute may be used for evaluation. For example, personal identification may be evaluated using both the result information of the speaker identification engine and the result information of the face recognition, or a number using the result information of the Japanese speech recognition engine and the result information of the English speech recognition engine. Recognition may be evaluated.

また、本実施形態では、評価結果の利用目的に基づいて、評価情報は選択されてもよい。すなわち、例えば、ユーザが利用するサービスやアプリケーションなどが評価結果を利用する利用目的に対応して評価情報が選択され、選択された評価情報に基づいて、結果情報が評価されてもよい。
（第３の実施形態）
本発明の第３の実施形態について説明する。図８は、本発明の第３の実施形態の認識処理システムの構成例を示す全体構成図である。図８の第３の実施形態は、認識処理システム１１に履歴記憶手段６が追加されているところが、第１の実施形態と相違する。よって、第１の実施形態と同一の構成要素には同一の参照数字を付記して、その説明を省略する。In the present embodiment, the evaluation information may be selected based on the use purpose of the evaluation result. That is, for example, evaluation information may be selected corresponding to the purpose of use of the evaluation result by a service or application used by the user, and the result information may be evaluated based on the selected evaluation information.
(Third embodiment)
A third embodiment of the present invention will be described. FIG. 8 is an overall configuration diagram showing a configuration example of a recognition processing system according to the third embodiment of the present invention. The third embodiment of FIG. 8 is different from the first embodiment in that ahistory storage unit 6 is added to therecognition processing system 11. Therefore, the same components as those in the first embodiment are denoted by the same reference numerals, and the description thereof is omitted.

図８を参照して、本実施形態の認識処理システム１１の構成について説明する。履歴記憶手段６は、認識処理に関わる履歴情報を記憶するとともに、認識処理対象情報、配信先認識対象情報と評価結果を記憶する。 With reference to FIG. 8, the structure of therecognition processing system 11 of this embodiment is demonstrated. Thehistory storage unit 6 stores history information related to recognition processing, and stores recognition processing target information, distribution destination recognition target information, and evaluation results.

図８を参照して、本実施形態の認識処理システム１１の動作について説明する。入力手段１は、認識処理対象信号を入力して、認識処理対象情報して認識制御手段５に送る。認識制御手段５は、入力手段１の種類と属性記憶手段３に予め記憶される属性情報に基づいて、認識処理対象情報の処理属性を判断する。 With reference to FIG. 8, the operation of therecognition processing system 11 of this embodiment will be described. Theinput unit 1 inputs a recognition processing target signal, sends it to therecognition control unit 5 as recognition processing target information. Therecognition control unit 5 determines the processing attribute of the recognition processing target information based on the type of theinput unit 1 and the attribute information stored in advance in theattribute storage unit 3.

具体的には、認識制御手段５は、例えば、入力手段１がカメラであるとき、カメラ入力に関わる処理属性を属性記憶手段３から取得する。属性記憶手段３には、カメラ入力に関わる処理属性として、標識認識が記憶されているものとする。認識制御手段５は、属性記憶手段３のカメラ入力に関わる処理属性に基づいて、処理属性を標識認識と判断する。すなわち、認識制御手段５は、入力手段１からの認識処理対象情報の処理属性を標識認識と判断する。 Specifically, for example, when theinput unit 1 is a camera, therecognition control unit 5 acquires a processing attribute related to camera input from theattribute storage unit 3. The attribute storage means 3 is assumed to store sign recognition as a processing attribute related to camera input. Therecognition control unit 5 determines that the processing attribute is sign recognition based on the processing attribute related to the camera input of theattribute storage unit 3. That is, therecognition control unit 5 determines that the processing attribute of the recognition processing target information from theinput unit 1 is sign recognition.

具体的には、認識制御手段５は、認識処理対象情報の処理属性が標識認識であることから、標識認識に関わる配信先情報を、属性記憶手段３から取得する。そして、認識制御手段５は、標識認識に関わる配信先情報に基づいて、認識処理対象情報の配信先である認識エンジンを、認識エンジン９−１〜ｎの中から選定する。例えば、認識制御手段５は、標識認識の処理属性に対応する認識エンジン９−ｄ〜ｈ（ｄは１以上で、ｎ以下の整数、ｈはｄ以上で、ｎ以下の整数）を選定する。 Specifically, since the processing attribute of the recognition process target information is sign recognition, the recognition control means 5 acquires distribution destination information related to sign recognition from the attribute storage means 3. And the recognition control means 5 selects the recognition engine which is a delivery destination of recognition process target information from the recognition engines 9-1 to n based on the delivery destination information related to the sign recognition. For example, the recognition control means 5 selects the recognition engines 9-d to h (d is 1 or more and an integer of n or less, h is an integer of d or more and n or less) corresponding to the processing attribute of the sign recognition.

認識制御手段５は、認識処理対象情報から、選定された認識エンジン９−ｄ〜ｈに対応する配信先認識処理対象情報＃ｄ〜＃ｈを作成する。すなわち、認識制御手段５は、属性記憶手段３に記憶される配信先情報に基づいて、認識処理対象情報が認識エンジン９−ｄ〜ｈに対応するように、それぞれ、配信先認識処理対象情報＃ｄ〜＃ｈに変換する。 The recognition control means 5 creates distribution destination recognition process target information #d to #h corresponding to the selected recognition engine 9-d to h from the recognition process target information. That is, the recognition control means 5 is based on the delivery destination information stored in the attribute storage means 3 so that the recognition process target information corresponds to the recognition engines 9-d to h, respectively. Convert to d to #h.

認識制御手段５は、各配信先向けに変換された配信先認識処理対象情報＃ｄ〜＃ｈを、配信手段４を経由して認識エンジン９−ｄ〜ｈに送信する。すなわち、認識制御手段５は、配信手段４を制御して、配信先認識処理対象情報＃ｄ〜＃ｈを、それぞれ、認識エンジン９−ｄ〜ｈに送信する。 Therecognition control unit 5 transmits the distribution destination recognition process target information #d to #h converted for each distribution destination to the recognition engines 9-d to h via thedistribution unit 4. That is, therecognition control unit 5 controls thedistribution unit 4 to transmit the distribution destination recognition process target information #d to #h to the recognition engines 9-d to h, respectively.

このように、認識処理システム１１の入力手段１に入力された認識処理対象信号は、配信先認識処理対象情報＃ｄ〜＃ｈに変換されて、認識エンジン９−ｄ〜ｈに送信される。 As described above, the recognition processing target signal input to theinput unit 1 of therecognition processing system 11 is converted into the delivery destination recognition processing target information #d to #h and transmitted to the recognition engines 9-d to h.

認識エンジン９−ｄ〜ｈは、送信された配信先認識処理対象情報＃ｄ〜＃ｈを受信して認識処理を行う。認識エンジン９−ｄ〜ｈは、認識処理結果を認識処理結果情報＃ｄ〜＃ｈとして、配信手段４に送付する。 The recognition engines 9-d to h receive the transmitted delivery destination recognition process target information #d to #h and perform a recognition process. The recognition engines 9-d to h send the recognition process results to thedistribution unit 4 as the recognition process result information #d to #h.

配信手段４は、送付された認識処理結果情報＃ｄ〜＃ｈを認識制御手段５に送る。認識制御手段５は、送られた認識処理結果情報＃ｄ〜＃ｈを、属性記憶手段３に記憶される属性情報に基づいて評価する。例えば、認識制御手段５は、標識認識に関わる属性情報に基づいて、認識処理結果情報＃ｄ〜＃ｈを評価する。 Thedistribution unit 4 sends the sent recognition processing result information #d to #h to therecognition control unit 5. The recognition control means 5 evaluates the sent recognition processing result information #d to #h based on the attribute information stored in the attribute storage means 3. For example, therecognition control unit 5 evaluates the recognition process result information #d to #h based on attribute information related to the sign recognition.

そして、認識制御手段５は、評価に基づいて、認識処理結果情報＃ｄ〜＃ｈの内、最も高い評価の認識処理結果情報、例えば認識処理結果情報＃ｆを出力手段２に出力する。 Based on the evaluation, therecognition control unit 5 outputs the recognition process result information with the highest evaluation among the recognition process result information #d to #h, for example, the recognition process result information #f, to theoutput unit 2.

このように、本実施形態の認識システム１１は、入力手段１に入力された認識処理対象信号の処理属性を判断し、判断された処理属性に対応する認識エンジンを選択して、選択された認識エンジン９−ｄ〜ｈに認識処理対象情報を送信して認識処理を行う。すなわち、本実施形態では、認識処理対象信号を予め記憶される処理属性に対応する認識エンジンを選択して認識処理を行うことができるので、認識処理対象の属性、分野やサービスが限定されない。 As described above, therecognition system 11 of the present embodiment determines the processing attribute of the recognition processing target signal input to theinput unit 1, selects the recognition engine corresponding to the determined processing attribute, and selects the selected recognition Recognition processing target information is transmitted to the engines 9-d to h to perform recognition processing. That is, in the present embodiment, the recognition process target signal can be selected and the recognition process corresponding to the processing attribute stored in advance can be selected, so that the recognition process target attribute, field, and service are not limited.

認識処理システム１１は、認識エンジン９−ｄ〜ｈの認識処理結果情報を受信して、属性記憶手段３に記憶される属性情報に基づいて評価し、最も高い評価の認識処理結果情報を出力手段２に出力する。 Therecognition processing system 11 receives the recognition processing result information of the recognition engines 9-d to h, evaluates based on the attribute information stored in the attribute storage means 3, and outputs the recognition processing result information with the highest evaluation. Output to 2.

履歴記憶手段６は、認識処理に関わる履歴情報を記憶する。例えば、履歴記憶手段６は、認識処理対象情報、認識処理結果情報とその評価結果を履歴情報として記憶する。認識制御手段５は、評価結果の履歴情報に基づいて、各処理属性に対応する認識エンジンを選定する際の優先度を設定する。例えば、認識制御手段５は、音声認識に関わる認識エンジンの「目的地入力」に対する履歴情報に基づいて優先度を設定する。認識制御手段５は、設定された優先度を、属性記憶手段３の配信先情報として記憶させる。認識制御手段５は、属性記憶手段３に記憶される配信先情報の優先度に基づいて、認識処理対象情報の配信先選定を行ってもよい。また、認識制御手段５は、配信先情報に記憶される優先度に基づいて認識処理結果情報の評価を行ってもよい。 The history storage means 6 stores history information related to recognition processing. For example, thehistory storage unit 6 stores the recognition process target information, the recognition process result information, and the evaluation result as history information. The recognition control means 5 sets a priority when selecting a recognition engine corresponding to each processing attribute based on the history information of the evaluation result. For example, the recognition control means 5 sets the priority based on the history information for the “destination input” of the recognition engine related to voice recognition. Therecognition control unit 5 stores the set priority as distribution destination information in theattribute storage unit 3. Therecognition control unit 5 may select the distribution destination of the recognition processing target information based on the priority of the distribution destination information stored in theattribute storage unit 3. Further, therecognition control unit 5 may evaluate the recognition processing result information based on the priority stored in the distribution destination information.

また、ユーザは、履歴記憶手段６に記憶される認識処理対象情報、認識処理結果情報および評価結果を出力手段２に出力させて、認識処理対象情報に対する認識処理結果情報とその評価結果の妥当性を確認することができる。そして、ユーザは、評価結果の妥当性の確認に基づいて、認識エンジンの優先度を変更することができる。本実施形態は、認識処理に関わる履歴情報に基づいて、認識エンジンの評価に対応する優先度を用いて認識処理を行うことで、認識処理の効率化と精度の向上を図ることができる。 Further, the user causes theoutput unit 2 to output the recognition process target information, the recognition process result information, and the evaluation result stored in thehistory storage unit 6, and the recognition process result information for the recognition process target information and the validity of the evaluation result Can be confirmed. And the user can change the priority of a recognition engine based on confirmation of the validity of an evaluation result. In the present embodiment, the recognition process is performed using the priority corresponding to the evaluation of the recognition engine based on the history information related to the recognition process, so that the efficiency and accuracy of the recognition process can be improved.

認識処理結果情報の評価については、標識認識を例として説明する。図９〜１１は、本実施形態の認識処理システム１１の属性情報を例示したものである。図２、３と図９〜１１を参照して、本実施形態の動作について説明する。 The evaluation of the recognition processing result information will be described by taking the sign recognition as an example. 9 to 11 illustrate the attribute information of therecognition processing system 11 of the present embodiment. The operation of the present embodiment will be described with reference to FIGS.

認識制御手段５は、入力手段１に入力された認識処理対象信号の処理属性を判断する。具体的には、認識制御手段５は、属性記憶手段３に記憶される認識処理に関わる処理属性情報１００（図２参照）から、入力手段１に入力された認識処理対象信号の処理属性を判断する。例えば、認識制御手段５は、入力手段１に入力された認識処理対象信号が、カメラ入力であるとき、カメラ入力についての処理属性を、属性記憶手段３から取得する。認識制御手段５は、属性記憶手段３に予め記憶される認識処理の処理属性情報１００を参照する。認識制御手段５は、入力デバイス欄１０１の「カメラ」に関わる処理属性欄１０２に記載されている「標識認識」が処理属性であると判断する。カメラ入力に関わる処理属性は、標識認識に限定されない。カメラ入力に関わる処理属性は、「画像認識」、「顔認識」、「文字認識」、「バーコード認識」などがあげられるが、ここでは、標識認識として説明する。 Therecognition control unit 5 determines the processing attribute of the recognition processing target signal input to theinput unit 1. Specifically, therecognition control unit 5 determines the processing attribute of the recognition processing target signal input to theinput unit 1 from the processing attribute information 100 (see FIG. 2) related to the recognition processing stored in theattribute storage unit 3. To do. For example, therecognition control unit 5 acquires the processing attribute for the camera input from theattribute storage unit 3 when the recognition processing target signal input to theinput unit 1 is a camera input. Therecognition control unit 5 refers to theprocessing attribute information 100 of the recognition process stored in advance in theattribute storage unit 3. The recognition control means 5 determines that “label recognition” described in theprocessing attribute column 102 related to “camera” in theinput device column 101 is a processing attribute. Processing attributes related to camera input are not limited to sign recognition. Processing attributes related to camera input include “image recognition”, “face recognition”, “character recognition”, “barcode recognition”, and the like. Here, description will be given as sign recognition.

認識制御手段５は、入力手段１に入力された認識処理対象信号の処理属性の判断に基づいて、属性記憶手段３に記憶される配信先情報を取得する。具体的には、認識制御手段５は、認識処理対象信号の処理属性が、例えば、「標識認識」であるとき、「標識認識」に関わる配信先情報を、属性記憶手段３から取得する。認識制御手段５は、属性記憶手段３に予め記憶される認識処理の配信先情報１１０（図３参照）を参照して、処理属性欄１１１の「標識認識」に関わる配信先欄１１２、入力条件欄１１３などに記憶される配信先情報を取得する。 Therecognition control unit 5 acquires the delivery destination information stored in theattribute storage unit 3 based on the determination of the processing attribute of the recognition processing target signal input to theinput unit 1. Specifically, therecognition control unit 5 acquires distribution destination information related to “label recognition” from theattribute storage unit 3 when the processing attribute of the recognition processing target signal is “sign recognition”, for example. Therecognition control unit 5 refers to the distribution destination information 110 (see FIG. 3) of recognition processing stored in advance in theattribute storage unit 3, thedistribution destination column 112 related to “label recognition” in theprocessing attribute column 111, input conditions The distribution destination information stored in thecolumn 113 is acquired.

認識制御手段５は、配信先情報に基づいて、認識処理対象情報の配信先を選定する。具体的には、認識制御手段５は、例えば、処理属性が「標識認識」であれば、属性記憶手段３に予め記憶される認識処理の配信先情報１１０（図３参照）を参照して、処理属性欄１１１の「標識認識」に関わる配信先欄１１２、入力条件欄１１３などの配信先情報に基づいて、標識認識エンジンを選択する。例えば、処理属性が「音声認識」であれば、認識制御手段５は、「音声認識」に関わる配信先情報に基づいて、音声認識エンジンを選択すればよい。 Therecognition control unit 5 selects a distribution destination of recognition processing target information based on the distribution destination information. Specifically, for example, if the processing attribute is “sign recognition”, therecognition control unit 5 refers to the distribution destination information 110 (see FIG. 3) of the recognition process stored in advance in theattribute storage unit 3. A sign recognition engine is selected based on distribution destination information such as adistribution destination field 112 and aninput condition field 113 related to “sign recognition” in theprocessing attribute field 111. For example, if the processing attribute is “voice recognition”, the recognition control means 5 may select a voice recognition engine based on the distribution destination information related to “voice recognition”.

認識制御手段５は、選択された認識処理エンジンに対応する配信先情報に基づいて、認識処理対象情報から、配信先認識処理対象情報を作成する。具体的には、認識制御手段５は、例えば、選択された標識認識エンジン＃１であれば、標識認識エンジン＃１の入力条件欄１１３などに基づいて、認識処理対象情報を、標識認識エンジン＃１の入力条件に合致するように変換する。すなわち、認識制御手段５は、認識処理対象情報を、標識認識エンジン＃１が入力を受け付けられるように、例えば、情報のデータ形式やデータ長などを変換して、配信先認識処理対象情報＃１を作成する。 The recognition control means 5 creates delivery destination recognition processing target information from the recognition processing target information based on the delivery destination information corresponding to the selected recognition processing engine. Specifically, for example, if the selected signrecognition engine # 1, therecognition control unit 5 sets the recognition processing target information to the signrecognition engine # 1 based on theinput condition column 113 of the signrecognition engine # 1. 1 so as to meet the input condition of 1. In other words, the recognition control means 5 converts the data format and data length of the information, for example, so that the signrecognition engine # 1 can accept the input of the recognition processing target information, and the distribution destination recognition processingtarget information # 1. Create

認識制御手段５は、選択された認識エンジンに対応する配信先情報に基づいて、変換された配信先認識対象情報を、配信先である認識エンジンに配信手段４を経由して送信する。すなわち、認識制御手段５は、例えば、配信先認識処理情報＃１を、配信先である標識認識エンジン＃１に配信手段４を経由して送信する。 The recognition control means 5 transmits the converted delivery destination recognition target information to the recognition engine that is the delivery destination via the delivery means 4 based on the delivery destination information corresponding to the selected recognition engine. That is, therecognition control unit 5 transmits, for example, distribution destination recognitionprocessing information # 1 to the labelrecognition engine # 1 that is a distribution destination via thedistribution unit 4.

認識エンジンは、配信された配信先認識処理対象情報を受け付けて、認識処理を行った認識処理結果情報を配信手段４に送付する。すなわち、本実施形態では、認識処理対象信号を予め記憶される処理属性に対応する認識エンジンを選択して認識処理を行うことができるので、認識処理対象の属性、対象分野やサービスは限定されない。 The recognition engine receives the distributed destination recognition processing target information and sends the recognition processing result information obtained by the recognition processing to the distribution means 4. That is, in the present embodiment, the recognition process target signal can be selected and the recognition process corresponding to the processing attribute stored in advance can be selected, so that the recognition process target attribute, target field, and service are not limited.

配信手段４は、送付された認識処理結果情報を認識制御手段５に送る。認識制御手段５は、送られた認識処理結果情報を、属性記憶手段３に記憶される属性情報に基づいて評価する。すなわち、認識制御手段５は、例えば、標識認識エンジン＃１からの認識処理結果情報＃１を、標識認識に関わる属性情報に基づいて評価する。 The distribution means 4 sends the sent recognition processing result information to the recognition control means 5. Therecognition control unit 5 evaluates the sent recognition processing result information based on the attribute information stored in theattribute storage unit 3. That is, the recognition control means 5 evaluates the recognition process resultinformation # 1 from the signrecognition engine # 1 based on attribute information related to sign recognition, for example.

認識処理結果情報の評価については、標識認識を例として説明する。図９、１０、１１は、本実施形態の認識処理システムの属性記憶手段３に記憶される標識認識の評価に関わる属性情報を例示したものである。図９、１０、１１を参照して、各図の構成について説明する。 The evaluation of the recognition processing result information will be described by taking the sign recognition as an example. 9, 10, and 11 illustrate attribute information related to the evaluation of the marker recognition stored in theattribute storage unit 3 of the recognition processing system according to the present embodiment. The configuration of each figure will be described with reference to FIGS.

図９は、標識認識に関わる文法番号と文法の評価情報３００を例示する。図９の文法欄３０２は、単語群を表すルール名、例えば、［ＫＩＳＥＩ］や［ＨＯＪＯ］などの組み合わせた文法の具体例を、文法番号欄３０１と対応付けて示す。［ＫＩＳＥＩ］とは、速度や重量、高さなどを規制する規制標識群を示すルール名である。［ＨＯＪＯ］とは、規制標識や案内標識、注意標識を補助する補助標識群を示すルール名である。例えば、補助標識は、規制区間の始まりや解除、時間規制の時刻などを示すものである。 FIG. 9 illustrates a grammar number andgrammar evaluation information 300 related to marker recognition. Thegrammar column 302 in FIG. 9 shows a specific example of a combined grammar such as a rule name representing a word group, for example, [KISEI] or [HOJO], in association with thegrammar number column 301. [KISEI] is a rule name indicating a group of restriction signs that regulate speed, weight, height, and the like. [HOJO] is a rule name indicating an auxiliary sign group that assists a restriction sign, a guide sign, and a caution sign. For example, the auxiliary sign indicates the start or release of the restriction section, time restriction time, and the like.

図１０は、標識認識に関わる単語群の評価情報３１０を例示する。図１０では、ルール名欄３１１に単語群を表す名称がルール名として示され、表記欄３１２に各ルール名に対応する複数の単語が示される。そして、図１０の同音語・同意語欄３１３には、表記欄３１２に示された単語に対応する同音語（対応する単語の発声音が同じ単語）や同意語（対応する単語と発声音が異なるが意味が同じ単語）が記載される。 FIG. 10 illustrates theevaluation information 310 of the word group related to the sign recognition. In FIG. 10, a name representing a word group is shown as a rule name in therule name column 311, and a plurality of words corresponding to each rule name are shown in thenotation column 312. In the homophone /synonymous word column 313 of FIG. 10, the homophones corresponding to the words shown in the notation column 312 (words corresponding to the same utterance) and synonyms (corresponding words and utterances are displayed). Are different but have the same meaning).

図１１は、標識認識が利用される利用目的に対応する、文法の評価情報３２０を例示する。「利用目的」とは、入力手段１に認識処理対象信号が入力されたときに、認識処理システム１１のユーザが利用していたサービスや出力手段２で出力していた内容を示す。例えば、ユーザがカーナビゲーションシステムを操作して、道路の規制確認をしている場合は、図１１の利用目的欄３２１が、「規制確認」となる。利用目的が「規制確認」であるとき、対応する文法は、文法番号欄３２２の「５」、「６」、「７」が設定される。 FIG. 11 illustratesgrammar evaluation information 320 corresponding to the purpose of use for which sign recognition is used. The “purpose of use” indicates the service used by the user of therecognition processing system 11 and the content output by the output means 2 when the recognition process target signal is input to the input means 1. For example, when the user operates the car navigation system and confirms the regulation of the road, theusage purpose column 321 in FIG. 11 is “regulation confirmation”. When the usage purpose is “regulation confirmation”, “5”, “6”, and “7” in thegrammar number column 322 are set as the corresponding grammar.

次に、図９、１０、１１を参照して、認識制御手段５が行う認識処理結果情報の評価について説明する。認識制御手段５は、入力手段１に認識処理対象信号が入力されたときの利用目的を確認する。例えば、利用目的が「規制確認」であれば、認識制御手段５は、図１１の標識認識に関わる属性情報の利用目的に基づいて、認識処理結果情報の評価に適用すべき文法番号「５」、「６」、「７」を選択する。認識制御手段５は、図９の標識認識に関わる文法に基づいて、文法番号「５」、「６」、「７」に対応する文法定義を選択する。認識制御手段５は、図１０の標識認識に関わる単語群に基づいて、文法番号「５」、「６」、「７」に関わる単語群を選択する。 Next, the evaluation of the recognition process result information performed by therecognition control unit 5 will be described with reference to FIGS. Therecognition control unit 5 confirms the purpose of use when the recognition process target signal is input to theinput unit 1. For example, if the usage purpose is “regulation confirmation”, therecognition control unit 5 uses the grammar number “5” to be applied to the evaluation of the recognition processing result information based on the usage purpose of the attribute information related to the sign recognition in FIG. , “6”, “7” are selected. The recognition control means 5 selects the grammar definition corresponding to the grammar numbers “5”, “6”, and “7” based on the grammar related to the sign recognition in FIG. The recognition control means 5 selects a word group related to the grammar numbers “5”, “6”, and “7” based on the word group related to the sign recognition of FIG.

このように、認識制御手段５は、利用目的に基づいて、認識処理対象信号の認識結果の評価に用いる評価情報を選択する。そして、認識制御手段５は、選択された評価情報に基づいて、認識処理結果情報を評価する。すなわち、認識制御手段５は、標識認識エンジン＃１の認識処理結果情報＃１を、「規制確認」に関わる文法番号「５」、「６」、「７」の文法定義と、文法定義に対応した特定の単語および単語群との適合度を評価すればよい。例えば、標識認識の結果である認識処理結果情報＃１は、道路規制に関わる標識であるか否か、規制標識の中での種別と規制、補助標識の有無と補助標識の中での種別と規制などを示す単語および単語群などが内容として含まれる。よって、認識制御手段５は、認識処理結果情報＃１が、選択された「規制確認」に関わる文法定義、特定の単語および単語群と、完全一致するか否か、または含んでいるか否かを評価すればよい。 As described above, the recognition control means 5 selects evaluation information used for evaluating the recognition result of the recognition processing target signal based on the purpose of use. And the recognition control means 5 evaluates recognition process result information based on the selected evaluation information. That is, the recognition control means 5 corresponds to the recognition processingresult information # 1 of the signrecognition engine # 1 to the grammar definitions of the grammar numbers “5”, “6”, “7” related to “regulation confirmation” and the grammar definition. What is necessary is just to evaluate the fitness with a specific word and a group of words. For example, the recognition processingresult information # 1 that is the result of the sign recognition is whether or not the sign is related to road regulation, the type and restriction in the restriction sign, the presence or absence of the auxiliary sign, and the type in the auxiliary sign. A word indicating a restriction and a word group are included as contents. Therefore, the recognition control means 5 determines whether or not the recognition processingresult information # 1 completely matches or includes the selected grammar definition related to “regulation confirmation”, specific words and word groups. You should evaluate.

このように、本実施形態の認識処理システム１１は、入力手段１から入力された認識処理対象信号の処理属性を判断し、判断された処理属性に対応する認識エンジンをその優先度に基づいて選択して、選択された認識エンジンに認識処理対象情報を送信して認識処理を行う。すなわち、本実施形態では、認識処理対象信号を予め記憶される処理属性に対応する認識エンジンを選択して認識処理を行うことができるので、認識処理対象の属性、分野やサービスが限定されない。 As described above, therecognition processing system 11 according to the present embodiment determines the processing attribute of the recognition processing target signal input from theinput unit 1 and selects a recognition engine corresponding to the determined processing attribute based on the priority. Then, the recognition process target information is transmitted to the selected recognition engine to perform the recognition process. That is, in the present embodiment, the recognition process target signal can be selected and the recognition process corresponding to the processing attribute stored in advance can be selected, so that the recognition process target attribute, field, and service are not limited.

そして、認識処理システム１１は、認識エンジンの認識処理結果情報を受信して、属性記憶手段３に記憶される属性情報に基づいて評価を行い、最も高い評価の認識処理結果情報を出力手段２に出力する。認識制御手段５は、認識処理対象情報と認識処理結果情報および評価結果を履歴記憶手段６に記憶させる。さら、認識制御手段５は、履歴記憶手段６の履歴情報に基づいて、認識エンジンの優先度を設定する。したがって、本実施形態では、認識処理対象信号の処理属性に対応する認識エンジンを優先度に基づいて複数選択して認識処理を行い、認識処理結果を予め記憶される属性情報に基づいて評価するので、認識処理を効率的に行い、認識性能を向上することができる。 Therecognition processing system 11 receives the recognition process result information of the recognition engine, performs an evaluation based on the attribute information stored in theattribute storage unit 3, and outputs the recognition process result information with the highest evaluation to theoutput unit 2. Output. Therecognition control unit 5 stores the recognition process target information, the recognition process result information, and the evaluation result in thehistory storage unit 6. Further, the recognition control means 5 sets the priority of the recognition engine based on the history information in the history storage means 6. Accordingly, in the present embodiment, a plurality of recognition engines corresponding to the processing attributes of the recognition processing target signal are selected based on the priority, the recognition processing is performed, and the recognition processing result is evaluated based on the attribute information stored in advance. The recognition processing can be performed efficiently and the recognition performance can be improved.

したがって、本実施形態は、第１の実施形態と同様の効果を得るとともに、認識処理に関わる認識エンジンを効果的に選択することができる効果が得られる。 Therefore, this embodiment can obtain the same effect as the first embodiment and the effect that the recognition engine related to the recognition process can be effectively selected.

なお、本実施形態では、認識制御手段は、履歴記憶手段に記憶される履歴情報に基づいて優先度を設定する。例えば、認識制御手段は、認識処理結果情報の評価が高い認識エンジンの優先度を高く設定し、認識処理結果情報の評価が低い認識エンジンの優先度を低く設定する。認識処理システムのユーザは、履歴記憶手段に記憶される履歴情報に基づいて、評価結果の妥当性を確認し、設定された優先度を変更することができる。したがって、本実施形態では、設定された優先度に基づいて、認識処理対象情報に適合した認識エンジンを効率的に選択して、認識処理の精度の向上を図ることができる。 In the present embodiment, the recognition control unit sets the priority based on the history information stored in the history storage unit. For example, the recognition control unit sets the priority of a recognition engine having a high evaluation of recognition processing result information to a high priority, and sets the priority of a recognition engine having a low evaluation of recognition processing result information to a low. The user of the recognition processing system can confirm the validity of the evaluation result based on the history information stored in the history storage means, and can change the set priority. Therefore, in the present embodiment, it is possible to efficiently select a recognition engine that is suitable for the recognition processing target information based on the set priority and improve the accuracy of the recognition processing.

なお、本実施形態では、認識処理対象信号はカメラ入力によるものとして説明したが、カメラ入力に限定されない。例えば、マイク入力でもよいし、スキャナ入力でもよいし、振動センサ入力でもよいし、タッチセンサでもよい。すなわち、認識処理対象信号は、認識処理システムの入力手段が備える入力デバイスから入力されるものであって、認識処理対象情報として取り扱えるものであればよい。 In the present embodiment, the recognition process target signal is described as being based on camera input, but is not limited to camera input. For example, a microphone input, a scanner input, a vibration sensor input, or a touch sensor may be used. That is, the recognition processing target signal may be input from an input device provided in the input unit of the recognition processing system and can be handled as recognition processing target information.

また、本実施形態では、カメラから入力された認識処理対象信号の処理属性は標識認識として説明したが、カメラに対応する処理属性は標識認識に限定されない。例えば、カメラに対応する処理属性は、路面表示識別でもよいし、前方車両の車両番号認識でもよい。すなわち、認識処理システムの入力手段が備えるカメラから入力されるものであって、認識処理対象情報として取り扱えるものであればよい。 In the present embodiment, the processing attribute of the recognition processing target signal input from the camera has been described as the sign recognition, but the processing attribute corresponding to the camera is not limited to the sign recognition. For example, the processing attribute corresponding to the camera may be road surface display identification or vehicle number recognition of the preceding vehicle. In other words, it may be input from a camera included in the input unit of the recognition processing system and can be handled as recognition processing target information.

また、本実施形態では、認識処理結果情報の評価は、標識認識のみの認識結果を評価するものとして説明したが、認識結果の評価は単一の認識結果の評価に限定されない。例えば、標識認識と文字認識の認識結果を併用して評価してもよい。すなわち、案内板の標識認識の場合には、案内板であるか否かを識別する標識認識の認識結果と、案内板に記載された文字を識別する文字認識の認識結果に基づいて、両者の認識結果を併用して評価してもよい。また、認識処理対象情報がカメラによる画像である場合には、画像処理の処理結果として車両番号部分の位置が特定され、特定された車両番号部分に対応する文字認識の認識結果が評価されてもよい。すなわち、認識結果の評価は、認識処理対象情報を認識処理した結果を併用して評価してもよい。 In this embodiment, the evaluation of the recognition processing result information has been described as evaluating the recognition result of only the label recognition, but the evaluation of the recognition result is not limited to the evaluation of a single recognition result. For example, the recognition result of the marker recognition and the character recognition may be evaluated in combination. That is, in the case of the sign recognition of the guide plate, based on the recognition result of the sign recognition for identifying whether it is a guide plate and the recognition result of the character recognition for identifying the character described on the guide plate, You may evaluate using a recognition result together. Further, when the recognition processing target information is an image from a camera, the position of the vehicle number portion is specified as the processing result of the image processing, and the recognition result of character recognition corresponding to the specified vehicle number portion is evaluated. Good. In other words, the recognition result may be evaluated by using the result of recognition processing of the recognition processing target information.

また、本実施形態では、認識処理結果情報の評価に関わる属性情報は、標識認識に関わる評価に関わる属性情報を固定的なものとして説明したが、認識処理の評価に関わる属性情報は固定的なものに限定されない。例えば、ユーザが利用しているサービスやアプリケーションなどが認識処理結果情報を使用する利用目的に基づいて、評価に関わる属性情報は取得されて、更新されてもよい。つまり、ユーザがカーナビゲーションシステムを操作して「目的地入力」しているときに表示されている地図情報に基づいて、表示されている地図に関わる地名情報やランドマーク情報、道路名称などが取得されて、案内標識情報の［ＡＮＮＡＩ］（図１０参照）に対応する単語群として評価情報に追加され選択されてもよい。すなわち、認識処理結果情報の評価に関わる属性情報は、予め登録された固定的なものでもよく、ユーザが使用するサービスやアプリケーションなどに基づいて取得される情報であって、評価に関わる属性情報として追加され選択できる情報でもよい。 In the present embodiment, the attribute information related to the evaluation of the recognition process result information is described as fixed attribute information related to the evaluation related to the marker recognition. However, the attribute information related to the evaluation of the recognition process is fixed. It is not limited to things. For example, the attribute information related to the evaluation may be acquired and updated based on the purpose of use in which the service or application used by the user uses the recognition processing result information. In other words, based on the map information that is displayed when the user operates the car navigation system and performs “input destination”, the location name information, landmark information, road name, etc. related to the displayed map are acquired. Then, it may be selected by being added to the evaluation information as a word group corresponding to [ANNNAI] (see FIG. 10) of the guide sign information. In other words, the attribute information related to the evaluation of the recognition processing result information may be fixed information registered in advance, and is acquired based on the service or application used by the user, and is attribute information related to the evaluation. It may be information that can be added and selected.

また、本実施形態では、認識処理結果情報の評価結果は、評価に基づいて、最も高い評価の認識処理結果情報を出力手段に出力すると説明したが、認識処理結果情報は最も高い評価結果には限定されない。例えば、予め設定された閾値以上の評価の情報処理結果情報を出力手段に出力してもよい。また、予め設定された数の情報処理結果情報を評価の高い順から選定して出力手段に出力してもよい。また、認識制御手段は、予め配布先情報に記憶された優先度に基づいて、優先度の高い認識エンジンの認識処理結果情報を出力してもよい。ユーザが使用するアプリケーションやサービスに基づいて、認識処理結果情報の評価結果の出力方法は適宜変更されてもよい。 In the present embodiment, the evaluation result of the recognition process result information has been described as outputting the highest evaluation recognition process result information to the output unit based on the evaluation. However, the recognition process result information is the highest evaluation result. It is not limited. For example, information processing result information of an evaluation equal to or higher than a preset threshold value may be output to the output unit. Alternatively, a preset number of information processing result information may be selected from the highest evaluation order and output to the output means. Further, the recognition control means may output recognition process result information of a recognition engine having a high priority based on the priority stored in advance in the distribution destination information. Based on the application or service used by the user, the output method of the evaluation result of the recognition processing result information may be changed as appropriate.

また、本実施形態では、認識処理手段が備える認識エンジンは、固定的ではない。例えば、認識処理結果情報の評価結果に基づいて、ユーザは認識エンジンの評価を行って配信先情報に記憶させてもよい。認識制御手段は、配信先情報に基づいて予め設定された評価の閾値に至らない認識エンジンを認識処理の選択対象から外してもよい。認識エンジンは新たに追加されてもよい。また、認識制御手段は、認識エンジンとの接続状態または認識エンジンの稼働状態の確認に基づいて、認識エンジンを選択してもよい。すなわち、認識エンジンは、認識処理の処理属性と配信先情報が明示され、認識処理対象情報を入力して認識処理結果情報を出力できるものであればよい。
（第４の実施形態）
本発明の第４の実施形態について説明する。図１２は、本発明の第４の実施形態の認識処理システム１２の構成例を示す全体構成図である。図１２を参照して、本実施形態の認識処理システム１２の構成について説明する。In the present embodiment, the recognition engine included in the recognition processing unit is not fixed. For example, based on the evaluation result of the recognition process result information, the user may evaluate the recognition engine and store it in the delivery destination information. The recognition control means may exclude a recognition engine that does not reach a preset evaluation threshold based on the distribution destination information from the selection target of the recognition process. A recognition engine may be newly added. Further, the recognition control means may select the recognition engine based on confirmation of a connection state with the recognition engine or an operation state of the recognition engine. That is, the recognition engine only needs to be able to output the recognition process result information by inputting the recognition process target information by specifying the process attribute of the recognition process and the delivery destination information.
(Fourth embodiment)
A fourth embodiment of the present invention will be described. FIG. 12 is an overall configuration diagram illustrating a configuration example of therecognition processing system 12 according to the fourth exemplary embodiment of the present invention. With reference to FIG. 12, the configuration of therecognition processing system 12 of the present embodiment will be described.

認識処理システム１２は、認識処理制御装置３０と通信処理装置４０を備える。認識処理システム１２は、ネットワークを介して、認識処理手段９が備える認識エンジン９−１〜ｎと接続される。認識処理制御装置３０は、入力部３１、出力部３２、属性記憶部３３、通信部３４、認識制御部３５と認識エンジン９１を備える。通信処理装置４０は、通信部４１、４３と通信制御部４２を備える。
入力部３１は、認識処理を行うための認識処理対象信号の入力と、登録および操作設定を受け付ける。出力部３２は、認識処理結果の出力と、登録および操作設定に関わる表示および出力を行う。属性記憶部３３は、認識処理対象信号を判断するための処理属性情報、認識処理結果情報を評価するための評価情報と、認識処理手段９が備える認識エンジン９−１〜ｎ、９１に関わる配信先情報を含む属性情報を記憶する。通信部３４は、通信処理装置４０を経由して認識処理手段９の備える認識エンジン９−１〜ｎに接続される。また、通信部３４は、認識処理制御装置３０の内部の認識エンジン９１に接続される。Therecognition processing system 12 includes a recognitionprocessing control device 30 and acommunication processing device 40. Therecognition processing system 12 is connected to recognition engines 9-1 to n included in the recognition processing means 9 via a network. The recognitionprocessing control device 30 includes aninput unit 31, anoutput unit 32, anattribute storage unit 33, acommunication unit 34, arecognition control unit 35, and arecognition engine 91. Thecommunication processing device 40 includescommunication units 41 and 43 and acommunication control unit 42.
Theinput unit 31 receives input of a recognition processing target signal for performing recognition processing, registration, and operation settings. Theoutput unit 32 performs output of recognition processing results, and display and output related to registration and operation settings. Theattribute storage unit 33 includes processing attribute information for determining the recognition processing target signal, evaluation information for evaluating the recognition processing result information, and distribution related to the recognition engines 9-1 to n and 91 included in therecognition processing unit 9. The attribute information including the destination information is stored. Thecommunication unit 34 is connected to the recognition engines 9-1 to n included in therecognition processing unit 9 via thecommunication processing device 40. Thecommunication unit 34 is connected to arecognition engine 91 inside the recognitionprocessing control device 30.

通信処理装置４０は、認識処理制御装置３０と認識処理手段９の間の通信を中継する。すなわち、通信部４１は、認識処理制御装置３０の通信部３４と、例えば、Ｂｌｕｅｔｏｏｔｈ（登録商標）通信や無線ＬＡＮ（Local Area Network）通信などの無線、または有線で接続される。通信部４３は、認識処理手段９が備える認識エンジン９−１〜ｎに、公衆網や移動無線網など、または有線で接続される。通信制御部４２は、通信部４１と４３の制御を行って、認識処理制御装置３０と認識処理手段９の間の通信を中継する。 Thecommunication processing device 40 relays communication between the recognitionprocessing control device 30 and the recognition processing means 9. That is, thecommunication unit 41 is connected to thecommunication unit 34 of the recognitionprocessing control device 30 by wireless or wired such as Bluetooth (registered trademark) communication or wireless LAN (Local Area Network) communication. Thecommunication unit 43 is connected to the recognition engines 9-1 to n included in therecognition processing unit 9 by a public network, a mobile wireless network, or the like. Thecommunication control unit 42 controls thecommunication units 41 and 43 to relay communication between the recognitionprocessing control device 30 and therecognition processing unit 9.

認識処理制御装置３０の入力部３１、出力部３２、属性記憶部３３と認識制御部３５は、図８で示した第３の実施形態の入力手段１、出力手段２、属性記憶手段３と認識制御手段５に相当するので説明を省略する。

Input unit 31 of the recognitionprocessing control unit 30,output unit 32, theattribute storage unit 33and therecognition control unit35,input unit 1 of the third embodiment shown in FIG. 8, theoutput unit 2, the attribute storage means 3 Sinceit corresponds to the recognition control means5, description thereof is omitted.

また、認識処理制御装置３０の通信部３４と通信処理装置４０は、図８で示した配信手段４に相当するので説明を省略する。認識処理制御装置３０は、ローカルに認識処理を行う認識エンジンとして、認識エンジン９１を備えていてもよいが、なくてもよい。 Moreover, since thecommunication part 34 and thecommunication processing apparatus 40 of the recognitionprocess control apparatus 30 are equivalent to the delivery means 4 shown in FIG. 8, description is abbreviate | omitted. The recognitionprocessing control device 30 may or may not include therecognition engine 91 as a recognition engine that performs recognition processing locally.

図１３は、本実施形態の認識処理制御装置３０の動作例を示すフローチャートである。図１４は、本実施形態の通信処理装置４０の動作例を示すフローチャートである。 FIG. 13 is a flowchart illustrating an operation example of the recognitionprocessing control apparatus 30 of the present embodiment. FIG. 14 is a flowchart illustrating an operation example of thecommunication processing device 40 according to the present embodiment.

図１３を参照して、認識処理制御装置３０の認識処理の動作について説明する。 With reference to FIG. 13, the operation of the recognition process of the recognitionprocess control device 30 will be described.

入力部３１に認識処理対象信号が入力されると、入力部３１は、入力された認識処理対象信号を認識処理対象情報に変換して認識制御部３５に送る。認識制御部３５は、属性記憶部３３に記憶される属性情報に基づいて選択された配信先に、変換された認識処理対象情報を、通信部３４と通信処理装置４０を経由して送信する（Ａ１０１）。ここで、配信先とは、認識制御部３５が属性記憶部３３に記憶される配信先情報に基づいて選択された認識エンジンである。配信先として選択された認識エンジンは、認識処理対象情報を受信して認識処理を行った認識処理結果情報を、通信処理装置４０と通信部３４を経由して認識処理制御装置３０に送信する。 When the recognition processing target signal is input to theinput unit 31, theinput unit 31 converts the input recognition processing target signal into recognition processing target information and sends it to therecognition control unit 35. Therecognition control unit 35 transmits the converted recognition processing target information to the distribution destination selected based on the attribute information stored in theattribute storage unit 33 via thecommunication unit 34 and the communication processing device 40 ( A101). Here, the distribution destination is a recognition engine selected by therecognition control unit 35 based on the distribution destination information stored in theattribute storage unit 33. The recognition engine selected as the distribution destination transmits the recognition process result information obtained by receiving the recognition process target information and performing the recognition process to the recognitionprocess control apparatus 30 via thecommunication processing apparatus 40 and thecommunication unit 34.

認識制御部３５は、配信先から認識処理結果情報を受信する（Ａ１０２）。そして、認識制御部３５は、配信先からの認識処理結果情報の有無を確認する（Ａ１０３）。 Therecognition control unit 35 receives the recognition process result information from the distribution destination (A102). And therecognition control part 35 confirms the presence or absence of the recognition process result information from a delivery destination (A103).

配信先からの認識処理結果情報がなければ（Ａ１０３：ＮＯ）、認識制御部３５は、処理結果なしと、出力部３２に出力して処理を終了する（Ａ１１１）。 If there is no recognition process result information from the delivery destination (A103: NO), therecognition control unit 35 outputs to theoutput unit 32 that there is no process result and ends the process (A111).

配信先からの認識処理結果情報があれば（Ａ１０３：ＹＥＳ）、認識制御部３５は、現在の利用目的に対応する属性情報、例えば音声認識の「目的地入力」に対応する文法を選択する（Ａ１０４）。 If there is recognition processing result information from the delivery destination (A103: YES), therecognition control unit 35 selects attribute information corresponding to the current use purpose, for example, a grammar corresponding to “destination input” of voice recognition ( A104).

認識制御部３５は、選択された属性情報に沿った認識処理結果情報があるかを確認する（Ａ１０５）。具体的には、音声認識の「目的地入力」に対応する文法番号「２」（図４、５参照）の示す単語群および特定の単語が、認識処理結果情報に含まれているか否かを、認識制御部３５は評価する。 Therecognition control unit 35 checks whether there is recognition processing result information along the selected attribute information (A105). Specifically, it is determined whether or not the word group indicated by the grammar number “2” (see FIGS. 4 and 5) corresponding to the “destination input” of speech recognition and the specific word are included in the recognition processing result information. Therecognition control unit 35 evaluates.

そして、認識制御部３５は、選択された属性情報に沿った認識処理結果情報があれば（Ａ１０５：ＹＥＳ）、認識処理結果情報を処理結果として、出力部２に出力して処理を終了する（Ａ１２１）。 Then, if there is the recognition processing result information along the selected attribute information (A105: YES), therecognition control unit 35 outputs the recognition processing result information to theoutput unit 2 as the processing result and ends the processing ( A121).

一方、認識制御部３５は、選択された属性情報に沿った認識処理結果情報がなければ（Ａ１０５：ＮＯ）、現在の利用目的に対応するＡ１０４において未だ選択されていない未選択の属性情報があるかを確認する（Ａ１０６）。認識制御部３５は、未選択の属性情報がなければ（Ａ１０６：ＮＯ）、処理結果なしと、出力部３２に出力して処理を終了する（Ａ１１１）。 On the other hand, if there is no recognition processing result information according to the selected attribute information (A105: NO), therecognition control unit 35 has unselected attribute information that has not yet been selected in A104 corresponding to the current use purpose. (A106). If there is no unselected attribute information (A106: NO), therecognition control unit 35 outputs to theoutput unit 32 that there is no processing result and ends the processing (A111).

認識制御部３５は、未選択の属性情報があれば、次の属性情報を選択（Ａ１０７）して、ステップＡ１０５の処理に戻る。 If there is unselected attribute information, therecognition control unit 35 selects the next attribute information (A107), and returns to the process of step A105.

このように、本実施形態の認識処理装置３０は、入力部３１に入力された認識処理対象信号の認識処理を行うことができる。 As described above, therecognition processing device 30 according to the present embodiment can perform the recognition processing of the recognition processing target signal input to theinput unit 31.

図１４を参照して、通信処理装置４０の動作について説明する。 The operation of thecommunication processing device 40 will be described with reference to FIG.

通信制御部４２は、認識処理制御装置３０の通信部３４から認識処理対象情報を、通信部４１で受信し、配信先である認識処理手段９の認識エンジン９−１〜ｎに、通信部４３から送信する（Ｂ１０１）。そして、通信制御部４２は、配信先の認識エンジン９−１〜ｎからの認識処理結果情報を、通信部４３を経由して受信する（Ｂ１０２）。 Thecommunication control unit 42 receives the recognition processing target information from thecommunication unit 34 of the recognitionprocessing control device 30 by thecommunication unit 41, and sends the information to the recognition engines 9-1 to n of the recognition processing means 9 that is the distribution destination. (B101). And thecommunication control part 42 receives the recognition process result information from the recognition engines 9-1 to n of a delivery destination via the communication part 43 (B102).

通信制御部４２は、認識処理結果情報を、通信部４１を経由して認識処理制御装置３０に送信して処理を終了する（Ｂ１０４）。 Thecommunication control unit 42 transmits the recognition process result information to the recognitionprocess control device 30 via thecommunication unit 41 and ends the process (B104).

こうして、本実施形態の通信処理装置４０は、認識処理制御装置３０と認識処理手段９の間の通信を中継する。 In this way, thecommunication processing device 40 of this embodiment relays communication between the recognitionprocessing control device 30 and the recognition processing means 9.

このように、本実施形態の認識処理システム１２は、情報処理制御装置３０の入力部３１に入力された認識処理対象情報を、認識制御部３５は、属性記憶部３３に記憶される属性情報に基づいて、認識エンジン９−１〜ｎの中から選択された認識エンジンに送信する。そして、選択された認識エンジンからの認識処理結果情報を受信して、認識制御部３５は、属性記憶部３３に記憶される属性情報に基づいて評価する。認識制御部３５は、評価結果に基づいて、認識処理結果情報を出力部３２に出力する。 As described above, therecognition processing system 12 according to the present embodiment converts the recognition processing target information input to theinput unit 31 of the informationprocessing control device 30 into the attribute information stored in theattribute storage unit 33. Based on the recognition engine 9-1 to n, it transmits to the recognition engine selected. And the recognition process result information from the selected recognition engine is received, and therecognition control part 35 evaluates based on the attribute information memorize | stored in the attribute memory |storage part 33. FIG. Therecognition control unit 35 outputs recognition process result information to theoutput unit 32 based on the evaluation result.

なお、本実施形態では、選択された属性情報に沿っている認識処理結果情報があれば、認識処理結果とすると説明したが、選択された属性情報に沿っていることに限定されない。例えば、認識制御部は、属性情報に基づいて、受信された認識処理結果情報を評価して、最も高い評価の認識処理結果情報を出力手段に出力してもよいし、予め設定された閾値以上の評価を得た情報処理結果情報を出力手段に出力してもよいし、予め設定された数の情報処理結果情報を評価の高い順から選定して出力手段に出力してもよい。また、ユーザが使用するサービスやアプリケーションなどに基づいて、認識処理結果情報の評価結果の出力方法は適宜変更されてもよい。 In the present embodiment, it has been described that there is recognition processing result information if there is recognition processing result information along the selected attribute information. However, it is not limited to being along the selected attribute information. For example, the recognition control unit may evaluate the received recognition process result information based on the attribute information, and may output the highest-recognized recognition process result information to the output unit, or more than a preset threshold value The information processing result information obtained from the evaluation may be output to the output unit, or a preset number of information processing result information may be selected in descending order of evaluation and output to the output unit. Moreover, the output method of the evaluation result of the recognition processing result information may be appropriately changed based on the service or application used by the user.

また、本実施形態では、認識制御部は、予め配信先情報に設定された優先度に基づいて、認識エンジンを選定してもよいし、または認識処理結果情報の評価を行ってもよい。 In this embodiment, the recognition control unit may select a recognition engine based on the priority set in advance in the distribution destination information, or may evaluate the recognition processing result information.

また、本実施形態では、認識処理制御装置と通信処理装置を分離して説明したが、認識処理制御装置と通信処理装置は必ずしも分離される必要はない。ここで、認識処理制御装置は、車載システムであれば、例えばカーナビシステムやクルージングシステムなどに適用される。カーナビシステムやクルージングシステムなどの車載装置が公衆網や移動無線網などの広域通信機能を備えていれば、本実施形態の通信処理装置は不要となる。また、カーナビシステムやクルージングシステムなどの車載装置が広域通信機能を備えていなければ、例えば広域通信機能を備えた移動端末や情報端末が、車載装置と近接通信機能または有線で通信する。すなわち、移動端末や情報端末が本実施形態の通信処理装置として機能し、認識処理制御装置として機能するカーナビシステムやクルージングシステムなどの車載装置と連携することで、本実施形態が実現されればよい。このとき、カーナビシステムやクルージングシステムなどの車載装置が備えるマイク入力、各種センサ情報やカメラ映像、移動端末のマイク入力、センサ情報やカメラ映像を、本実施形態は、認識処理対象情報に適用する。また、認識処理制御装置は、移動端末や情報端末であってもよい。移動端末や情報端末が備えるマイク入力、カメラ映像や各種センサ情報を認識処理対象情報として、本実施形態は適用される。すなわち、本実施形態は、センサを備え、センサ情報を認識処理対象情報として送信し、認識処理結果情報を受信できるものであれば適用される。 In this embodiment, the recognition processing control device and the communication processing device are described separately. However, the recognition processing control device and the communication processing device do not necessarily have to be separated. Here, if the recognition processing control device is an in-vehicle system, it is applied to, for example, a car navigation system or a cruising system. If an in-vehicle device such as a car navigation system or a cruising system has a wide-area communication function such as a public network or a mobile wireless network, the communication processing device of this embodiment is not necessary. In addition, if an in-vehicle device such as a car navigation system or a cruising system does not have a wide area communication function, for example, a mobile terminal or an information terminal having a wide area communication function communicates with the in-vehicle device by a proximity communication function or by wire. That is, the present embodiment may be realized by a mobile terminal or an information terminal functioning as a communication processing device of the present embodiment and in cooperation with an in-vehicle device such as a car navigation system or a cruising system that functions as a recognition processing control device. . At this time, in this embodiment, the microphone input, various sensor information and camera video, the microphone input of the mobile terminal, the sensor information and the camera video included in the in-vehicle device such as the car navigation system and the cruising system are applied to the recognition processing target information. The recognition processing control device may be a mobile terminal or an information terminal. The present embodiment is applied using the microphone input, camera video, and various sensor information included in the mobile terminal and the information terminal as the recognition process target information. That is, this embodiment is applied as long as it includes a sensor, can transmit sensor information as recognition process target information, and can receive recognition process result information.

また、本実施形態は、センサを備え、センサ情報を認識処理対象情報として、認識処理制御装置に入力する入力装置と、認識処理制御装置が組み合わされてもよい。 In addition, the present embodiment may include a sensor, and an input device that inputs sensor information as recognition processing target information to the recognition processing control device may be combined with the recognition processing control device.

また、本実施形態では、認識処理制御装置または通信処理装置が認識エンジンを備え、認識処理対象信号の処理属性の判断に基づいて、その認識エンジンが選択されてもよいし、選択されなくてもよい。 In the present embodiment, the recognition processing control device or the communication processing device includes a recognition engine, and the recognition engine may or may not be selected based on the determination of the processing attribute of the recognition processing target signal. Good.

なお、本願発明は、上述の各実施形態に限定されるものではなく、本願発明の要旨を逸脱しない範囲で種々変更、変形して実施することができる。 Note that the present invention is not limited to the above-described embodiments, and various modifications and changes can be made without departing from the spirit of the present invention.

例えば、第３の実施形態で追加された履歴記憶手段は、第１の実施形態や第４の実施形態に追加されてもよい。 For example, the history storage unit added in the third embodiment may be added to the first embodiment or the fourth embodiment.

また、本願発明では、音声認識または標識認識を用いて認識処理を説明したが、認識処理は、音声認識と標識認識に限定されない。例えば、認識処理は、音を使用するものであれば、話者識別、楽曲識別、ＤＴＭＦ認識などがあげられる。また、画像を使用するものであれば、画像認識、顔認識、文字認識、バーコード認識、指紋認識、路面表示識別、車両番号認識などがあげられる。本願発明は、これらの認識処理の認識処理結果情報を組み合わせて評価されてもよい。すなわち、本願発明は、認識処理対象信号が入力されて、認識処理対象情報に変換されて送信でき、認識処理結果情報と属性情報が得られて、評価できるものであれば適用できる。そして、本願発明は、各種の処理属性に対応する認識エンジンを認識処理制御装置に備えていなくても、認識処理結果情報を受信して評価し取り扱うことができる効果がある。また、本願発明は、認識処理システムとして認識処理制御装置と接続できる認識エンジンであれば、既存の認識エンジンや最新のアルゴリズムを使用する認識エンジンを使用することができる効果が得られる。さらに、本願発明は、認識処理システムとしての構築コストを軽減できる効果があるとともに、認識処理の性能の向上を期待できる効果がある。 In the present invention, the recognition process has been described using voice recognition or sign recognition. However, the recognition process is not limited to voice recognition and sign recognition. For example, if the recognition process uses sound, speaker identification, music identification, DTMF recognition, and the like can be given. In the case of using an image, image recognition, face recognition, character recognition, barcode recognition, fingerprint recognition, road surface identification, vehicle number recognition, and the like can be given. The present invention may be evaluated by combining the recognition process result information of these recognition processes. That is, the present invention can be applied as long as a recognition processing target signal is input, converted into recognition processing target information and transmitted, and recognition processing result information and attribute information can be obtained and evaluated. The present invention has an effect that the recognition process result information can be received, evaluated and handled even if the recognition process control apparatus does not include a recognition engine corresponding to various process attributes. In addition, the present invention can obtain an effect that an existing recognition engine or a recognition engine using the latest algorithm can be used as long as it is a recognition engine that can be connected to a recognition processing control device as a recognition processing system. Furthermore, the present invention has an effect of reducing the construction cost as a recognition processing system and an effect of expecting an improvement in recognition processing performance.

また、各実施形態で説明した属性記憶手段、属性記憶部、履歴記憶手段は、認識処理に関わる属性情報を記憶するハードディスク装置であってもよいし、ネットワークで接続されたサーバ装置であってもよいし、ＲＯＭ（Read Only Memory）、ＲＡＭ（Random Access Memory）、フラッシュメモリ等の半導体記憶装置、光ディスク、磁気ディスク、光磁気ディスク等、非一時的な媒体でもよい。すなわち、属性記憶手段、属性記憶部は、認識制御手段または認識制御部の制御を受けて、認識処理に関わる属性情報を記憶できるものであればよい。また、履歴記憶手段は、認識制御手段の制御を受けて、認識処理に関わる履歴情報を記憶できるものであればよい。 In addition, the attribute storage unit, attribute storage unit, and history storage unit described in each embodiment may be a hard disk device that stores attribute information related to recognition processing, or may be a server device connected via a network. Alternatively, a non-transitory medium such as a ROM (Read Only Memory), a RAM (Random Access Memory), a semiconductor memory device such as a flash memory, an optical disk, a magnetic disk, or a magneto-optical disk may be used. That is, the attribute storage unit and the attribute storage unit only need to be able to store attribute information related to the recognition process under the control of the recognition control unit or the recognition control unit. Further, the history storage means may be anything that can store history information related to recognition processing under the control of the recognition control means.

また、上述の各実施形態の認識制御手段または認識制御部による認識制御処理は、認識制御手段または認識処理制御装置が備えるコンピュータ（図示なし）を用いたソフトウェアによって実行されてもよい。すなわち、認識制御処理を行うコンピュータプログラムが、ＣＰＵ（Central Processing Unit）によって読み込まれ、実行されることによって実現されてもよい。 In addition, the recognition control processing by the recognition control unit or the recognition control unit in each of the above embodiments may be executed by software using a computer (not shown) provided in the recognition control unit or the recognition processing control device. That is, a computer program that performs recognition control processing may be realized by being read and executed by a CPU (Central Processing Unit).

なお、プログラムは、ＲＯＭ、ＲＡＭ、フラッシュメモリ等の半導体記憶装置、光ディスク、磁気ディスク、光磁気ディスク等、非一時的な媒体に格納されてもよい。 Note that the program may be stored in a non-transitory medium such as a semiconductor storage device such as a ROM, a RAM, or a flash memory, an optical disk, a magnetic disk, or a magneto-optical disk.

１入力手段
２出力手段
３属性記憶手段
４配信手段
５、２１認識制御手段
６履歴記憶手段
９認識処理手段
９−１〜ｎ、９１認識エンジン
１０、１１、１２認識処理システム
２０、３０認識処理制御装置
２２情報送信部
２３結果受信部
２４評価結果出力部
３１入力部
３２出力部
３３属性記憶部
３４、４１、４３通信部
３５認識制御部
４０通信処理装置
４２通信制御部DESCRIPTION OFSYMBOLS 1 Input means 2 Output means 3 Attribute storage means 4 Distribution means 5, 21 Recognition control means 6 History storage means 9 Recognition processing means 9-1 to n, 91Recognition engine 10, 11, 12Recognition processing system 20, 30 Recognitionprocessing control Device 22Information transmitting unit 23Result receiving unit 24 Evaluationresult output unit 31Input unit 32Output unit 33Attribute storage unit 34, 41, 43Communication unit 35Recognition control unit 40Communication processing device 42 Communication control unit

Claims

Translated fromJapanese

認識処理対象の処理属性にしたがって認識処理を行う認識エンジンを選択するエンジン指定情報に基づいて、前記認識処理対象を前記認識エンジンに送信する送信手段と、
前記認識エンジンから前記認識処理の結果情報を受信する受信手段と、
前記処理属性に対応する認識エンジンを選択する際、予め設定された優先度の高さに基づいて複数選択し、前記選択された認識エンジンからの認識処理結果情報を、ユーザの利用目的に基づいて選択された評価情報との適合度によって評価し、評価結果を出力する出力手段を備えることを特徴とする認識処理制御装置。Transmission means for transmitting the recognition processing target to the recognition engine based on engine designation information for selecting a recognition engine that performs recognition processing according to the processing attribute of the recognition processing target;
Receiving means for receiving the result information of the recognition process from the recognition engine;
When selecting a recognition engine corresponding to theprevious SL processing attributes, and multiple selection based on the height of the preset priority, the recognition result information from the selected recognition engine,based on the user's intended use A recognition processing control device comprising: an output unit that performsevaluation based on a degree of matching with the evaluation information selected and outputs an evaluation result.

前記認識処理対象、前記結果情報または前記評価結果を履歴情報として記憶する履歴記憶手段を備え、
前記エンジン指定情報は、前記評価結果の履歴情報に基づいて設定された、認識エンジンを選択するときの優先度を示す優先度情報を含み、前記優先度情報を前記予め設定された優先度とする
ことを特徴とする請求項１に記載の認識処理制御装置。A history storage means for storing the recognition processing target, the result information or the evaluation result as history information;
The engine designation information includes priority information indicating priority when selecting a recognition engine, which is set based on history information of the evaluation result, and sets the priority information as the preset priority. The recognition processing control apparatus according to claim 1.

前記出力手段が、前記エンジン指定情報に基づいて選択された複数の前記認識エンジンの前記結果情報を併用して評価する、
ことを特徴とする請求項１または２に記載の認識処理制御装置。The output means evaluates the result information of the plurality of recognition engines selected based on the engine designation information together;
The recognition processing control apparatus according to claim 1, wherein

車載装置、移動端末または情報端末であることを特徴とする請求項１から３のいずれか１項に記載の認識処理制御装置。 The recognition processing control device according to any one of claims 1 to 3, wherein the recognition processing control device is an in-vehicle device, a mobile terminal, or an information terminal.

認識処理対象の処理属性にしたがって認識処理を行う認識エンジンを選択するエンジン指定情報に基づいて、前記認識処理対象を前記認識エンジンに送信し、
前記認識エンジンから前記認識処理の結果情報を受信し、
前記処理属性に対応する認識エンジンを選択する際、予め設定された優先度の高さに基づいて複数選択し、前記選択された認識エンジンからの認識処理結果情報を、ユーザの利用目的に基づいて選択された評価情報との適合度によって評価し、評価結果を出力する
ことを特徴とする認識処理制御方法。Based on engine designation information for selecting a recognition engine that performs recognition processing according to the processing attribute of the recognition processing target, the recognition processing target is transmitted to the recognition engine,
Receiving the recognition process result information from the recognition engine;
When selecting a recognition engine corresponding to the processing attribute, a plurality of selections are made based on a preset priority level, and recognition processing result information from the selected recognition engine is selectedbased on a user's purpose of use. A recognition processing control method, characterized in thatevaluation is performed based on a degree of matching with selected evaluation information, and an evaluation result is output.

認識処理制御装置が備えるコンピュータを、
認識処理対象の処理属性にしたがって認識処理を行う認識エンジンを選択するエンジン指定情報に基づいて、前記認識処理対象を前記認識エンジンに送信する送信手段と、
前記認識エンジンから前記認識処理の結果情報を受信する受信手段と、
前記処理属性に対応する認識エンジンを選択する際、予め設定された優先度の高さに基づいて複数選択し、前記選択された認識エンジンからの認識処理結果情報を、ユーザの利用目的に基づいて選択された評価情報との適合度によって評価し、評価結果を出力する出力手段
として動作させることを特徴とする認識処理制御プログラム。A computer included in the recognition processing control device,
Transmission means for transmitting the recognition processing target to the recognition engine based on engine designation information for selecting a recognition engine that performs recognition processing according to the processing attribute of the recognition processing target;
Receiving means for receiving the result information of the recognition process from the recognition engine;
When selecting a recognition engine corresponding to the processing attribute, a plurality of selections are made based on a preset priority level, and recognition processing result information from the selected recognition engine is selectedbased on a user's purpose of use. A recognition process control program characterizedby being evaluated as a degree of matching with selected evaluation information and operating as output means for outputting an evaluation result.