JP7608076B2

Movatterモバイル変換

Info

Publication number: JP7608076B2
Application number: JP2020103339A
Authority: JP
Inventors: 剛徳月川
Original assignee: Canon Inc
Current assignee: Canon Inc
Priority date: 2020-06-15
Filing date: 2020-06-15
Publication date: 2025-01-06
Anticipated expiration: 2040-06-15
Also published as: JP2021196905A

Description

本発明は、情報処理技術に関する。The present invention relates to information processing technology.

ある対象にアノテーションやタグ付けなどを行うことで分類することは一般に広く行われている。最近では、アノテーション付与やタグ付けを整理のための分類に用いるだけではなく、機械学習の教師データとして利用するということも増えている。Classifying objects by annotating or tagging them is a common practice. Recently, annotations and tags are not only used for classification purposes, but are increasingly being used as training data for machine learning.

特に機械学習の教師データとして分類を行う場合、付与対象は大量になるため、アノテーションやタグの基準を共有した上で、複数の作業者で作業を実施する場合がある。In particular, when classifying data as training data for machine learning, the number of objects to be tagged can be large, so annotation and tagging standards may be shared and the work may be carried out by multiple workers.

複数の作業者でアノテーションを付与する技術として、特許文献１に示すような方法がある。この方法は、複数人が付与したアノテーションの結果から、各作業者の信頼性を判断し、信頼性に基づいて、作業者に割り振る付与対象を決定している。One technique for annotation by multiple workers is the method shown inPatent Document 1. This method judges the reliability of each worker from the results of annotations made by multiple people, and determines which annotations to assign to each worker based on their reliability.

特開２０１５－１６６９７５号公報JP 2015-166975 A

複数の作業者でアノテーションを付与する場合、付与基準が作業者間でぶれていると、同じ対象に対して異なるアノテーションが付与されてしまう恐れが生じる。同じ対象に異なるアノテーションが付与されると、これらを教師データとして機械学習に用いた場合に、精度が高い学習済モデルを生成できない可能性がある。When multiple workers annotate, if the annotation standards vary between workers, there is a risk that different annotations will be given to the same object. If different annotations are given to the same object, when these are used as training data for machine learning, it may not be possible to generate a highly accurate trained model.

特許文献１に記載された方法は、複数の作業者間でアノテーションに対する基準にぶれ・ずれが生じていることは検出できない。The method described inPatent Document 1 cannot detect inconsistencies or discrepancies in annotation standards between multiple workers.

本発明は上述した問題を解決するためになされたものであり、アノテーションの付与作業における複数の作業者間の付与基準のずれを検出し、付与基準のずれに対する警告を可能とすることを目的とする。The present invention has been made to solve the above-mentioned problems, and aims to detect discrepancies in annotation standards between multiple workers in annotation work and to enable warnings regarding discrepancies in annotation standards.

上記目的を達成するための一手段として、本発明の情報処理装置は以下の構成を備える。As one means for achieving the above object, the information processing device of the present invention has the following configuration.

すなわち、
同一の付与対象への複数の作業者からの入力に基づく複数のアノテーションを取得する取得手段と、
前記取得手段により取得された前記複数のアノテーションを比較する比較手段と、
前記比較手段による比較に基づいて、前記複数の作業者間にアノテーションの付与基準のずれがあるかを判定する判定手段と、
前記複数の作業者間に前記付与基準のずれが前記判定手段により判定された場合、警告を行う警告手段と、を有し、前記判定手段は、前記比較手段による比較結果が前記複数のアノテーションのエントロピーが所定値以上であるとの結果である場合、前記複数の作業者間に前記付与基準のずれがあると判定することを特徴とする。
That is,
An acquisition means for acquiring a plurality of annotations based on inputs from a plurality of workers to a same annotation target;
A comparison means for comparing the plurality of annotations acquired by the acquisition means;
a determination means for determining whether there is a discrepancy in annotation standards between the plurality of workers based on the comparison by the comparison means;
The method further comprises a warning means for issuing a warning when the judgment means judges that there is a discrepancy in the assignment criteria between the multiple workers, and the judgment means judges that there is a discrepancy in the assignment criteria between the multiple workers when the comparison result by the comparison means indicates that the entropy of the multiple annotations is equal to or greater than a predetermined value .

本発明によれば、アノテーションの付与作業における複数の作業者間の付与基準のずれを検出し、付与基準のずれに対する警告が可能となる。The present invention makes it possible to detect discrepancies in annotation standards between multiple workers in annotation work and to issue warnings regarding discrepancies in annotation standards.

情報処理装置のハードウェア構成を示す図である。FIG. 2 is a diagram illustrating a hardware configuration of an information processing device.情報処理装置の機能構成を示す図である。FIG. 2 is a diagram illustrating a functional configuration of an information processing device.情報処理装置が出力する画面例を示す図である。FIG. 11 is a diagram showing an example of a screen output by an information processing device.情報処理装置の実施形態１における全体処理を表すフローチャートである。4 is a flowchart showing an overall process of the information processing device according to the first embodiment.付与対象を管理するテーブルの例を示す図である。FIG. 13 is a diagram illustrating an example of a table for managing grant targets.作業者が作業をする際に利用する付与対象が格納された作業キューの例を示す図である。11 is a diagram showing an example of a work queue in which targets to be assigned that are used when a worker performs a task are stored. FIG.作業者が付与した付与データを格納したテーブルの例を示す図である。FIG. 13 is a diagram illustrating an example of a table storing added data added by a worker.複数人が付与した付与データのばらつきを判定する処理を表すフローチャートである。11 is a flowchart showing a process for determining the variability of assigned data assigned by a plurality of people.実施形態３における情報処理装置の機能構成を示す図である。FIG. 11 is a diagram illustrating a functional configuration of an information processing device according to a third embodiment.実施形態３における付与対象に付与するべき付与データを推定した結果を格納したテーブルの例を示す図である。FIG. 13 is a diagram showing an example of a table storing the results of estimating the assignment data to be assigned to the assignment target in the third embodiment.実施形態３における基準確認用付与対象を判断する処理を表すフローチャートである。13 is a flowchart showing a process for determining a reference confirmation assignment target inembodiment 3.実施形態３における付与対象を管理しているテーブルの例を示す図である。FIG. 13 is a diagram illustrating an example of a table for managing targets to be granted in the third embodiment.実施形態３における同一の付与対象を作業する作業者数を判断する処理を表すフローチャートである13 is a flowchart showing a process for determining the number of workers working on the same target to be assigned in the third embodiment.実施形態３における作業者が作業をする際に利用する付与対象が格納された作業キューの例を示す図である。13 is a diagram showing an example of a work queue in which targets to be assigned that are used when a worker performs work inembodiment 3 are stored. FIG.

以下、添付の図面を参照して、本発明をその好適な実施形態に基づいて詳細に説明する。なお、以下の実施形態において示す構成は一例に過ぎず、本発明は図示された構成に限定されるものではない。The present invention will now be described in detail based on preferred embodiments thereof with reference to the accompanying drawings. Note that the configurations shown in the following embodiments are merely examples, and the present invention is not limited to the configurations shown in the drawings.

＜実施形態１＞
図１は、本実施形態における情報処理装置１００のハードウェア構成を示す図である。以下の実施形態で説明される情報処理装置１００は、図１に示すブロック図の構成を持つコンピュータ装置である。また、以降では、情報処理装置１００を単一の装置として説明を行うが、これに限らない。すなわち、各実施形態における方法はそれぞれ単一のコンピュータ装置で実現してもよいし、必要に応じた複数のコンピュータ装置に各機能を分散して実現するようにしてもよい。複数のコンピュータ装置で構成される場合は、互いに通信可能なようにＬｏｃａｌＡｒｅａＮｅｔｗｏｒｋ（ＬＡＮ）などで接続されている。 <Embodiment 1>
FIG. 1 is a diagram showing a hardware configuration of aninformation processing device 100 in this embodiment. Theinformation processing device 100 described in the following embodiment is a computer device having the configuration shown in the block diagram in FIG. 1. In addition, theinformation processing device 100 will be described as a single device hereinafter, but is not limited to this. That is, the methods in each embodiment may be realized by a single computer device, or each function may be distributed and realized in multiple computer devices as necessary. When configured with multiple computer devices, they are connected by a local area network (LAN) or the like so as to be able to communicate with each other.

図１において、１０１は情報処理装置１００全体を制御するＣｅｎｔｒａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ（ＣＰＵ）である。１０２は変更を必要としないプログラムやパラメータを格納するＲｅａｄＯｎｌｙＭｅｍｏｒｙ（ＲＯＭ）である。１０３は外部装置などから供給されるプログラムやデータを一時記憶するＲａｎｄｏｍＡｃｃｅｓｓＭｅｍｏｒｙ（ＲＡＭ）である。In FIG. 1, 101 is a Central Processing Unit (CPU) that controls the entireinformation processing device 100. 102 is a Read Only Memory (ROM) that stores programs and parameters that do not require modification. 103 is a Random Access Memory (RAM) that temporarily stores programs and data supplied from an external device, etc.

１０４は情報処理装置１００に設置されたハードディスクやメモリカードなどの外部記憶装置である。ただし、固定あるいは着脱可能な状態でも構わない。例えば、フレキシブルディスク（ＦＤ）やＣｏｍｐａｃｔＤｉｓｋ（ＣＤ）等の光ディスク、磁気や光カード、ＩＣカード、メモリカードなども含む。１０５はユーザーの操作を受け、データを入力するポインティングデバイスやキーボード１０９などの入力デバイスとのインタフェースである。１０６は情報処理装置１００の保持するデータや供給されたデータを表示するためのモニタ１１０とのディスプレイインターフェイスである。１０７はインターネット１１１などのネットワーク回線に接続するためのネットワークインタフェイスである。１０８は１０１～１０７の各ユニットを通信可能に接続するシステムバスである。104 is an external storage device such as a hard disk or memory card installed in theinformation processing device 100. However, it may be fixed or removable. For example, it includes optical disks such as flexible disks (FD) and compact disks (CD), magnetic or optical cards, IC cards, memory cards, etc. 105 is an interface with input devices such as a pointing device and akeyboard 109 that accepts user operations and inputs data. 106 is a display interface with amonitor 110 that displays data held by theinformation processing device 100 and data supplied. 107 is a network interface for connecting to a network line such as the Internet 111. 108 is a system bus that connects each of theunits 101 to 107 so that they can communicate with each other.

本実施形態の情報処理装置１００は、付与対象（例えば、画像や文章などのコンテンツ）に対してそれに含まれるオブジェクトの名称や意味合いなどのアノテーションを付与する装置である。また、情報処理装置１００は、アノテーションの付与作業を実施する複数の作業者間の付与基準のぶれ・ずれを検出し、付与基準のぶれ・ずれを検出した場合、警告・注意喚起を行う。Theinformation processing device 100 of this embodiment is a device that adds annotations such as the names and meanings of objects contained in an annotation target (e.g., content such as an image or text). In addition, theinformation processing device 100 detects deviations in annotation standards between multiple workers who perform annotation work, and issues a warning or alert when deviations in annotation standards are detected.

本実施形態の情報処理装置１００の機能構成について図２を用いて説明する。以降に示す機能構成は、ＣＰＵ１０１がＲＯＭ１０２や外部記憶装置１０４に記憶されるプログラムを読み込み実行することにより情報の演算および加工、各ハードウェアの制御を行うことで実現される。なお、各機能構成をＡＳＩＣやＦＰＧＡなどのハードウェアにより実現してもよい。The functional configuration of theinformation processing device 100 of this embodiment will be described with reference to FIG. 2. The functional configuration described below is realized by theCPU 101 reading and executing programs stored in theROM 102 and theexternal storage device 104 to calculate and process information and control each piece of hardware. Note that each functional configuration may also be realized by hardware such as an ASIC or FPGA.

本実施形態における情報処理装置１００は、作業者に対して提示する付与対象を扱う手段として、付与対象管理部２０１と付与対象提示部２０２を持つ。また、情報処理装置１００は、作業者が付与対象に対して情報を付与する作業および作業結果を処理する手段として、付与データ入力部２０３と作業結果保管部２０４と基準不整合判断部２０５を持つ。加えて、情報処理装置１００は、作業者間で付与基準がぶれていることを警告する手段として、警告表示部２０６を持つ。In this embodiment, theinformation processing device 100 has an assignmenttarget management unit 201 and an assignmenttarget presentation unit 202 as means for handling assignment targets to be presented to the worker. Theinformation processing device 100 also has an assignmentdata input unit 203, a workresult storage unit 204, and a criteriainconsistency determination unit 205 as means for processing the work of the worker assigning information to the assignment target and the results of that work. In addition, theinformation processing device 100 has awarning display unit 206 as means for warning that the assignment criteria differ between workers.

付与対象管理部２０１は、作業者が情報を付与する対象となる付与対象を管理する。なお付与対象は、具体的には画像、動画、文などの自然言語を構成する文字列、ファイルなどが挙げられる。以降では付与対象として画像を例に説明する。Thetarget management unit 201 manages targets to which a worker can assign information. Specific examples of targets include images, videos, character strings that constitute natural language such as sentences, and files. In the following, an image will be used as an example of a target.

付与対象提示部２０２は、付与対象管理部２０１が管理している付与対象を作業者に提示する。The granttarget presentation unit 202 presents the grant targets managed by the granttarget management unit 201 to the worker.

付与データ入力部２０３は、付与対象提示部２０２で提示した付与対象に対して付与する付与データの入力を受け付ける。図３（ａ）に作業員に提示するユーザーインタフェースの例を示す。付与対象提示ウィンドウ３００は、付与対象表示領域３１０、付与対象変更ボタン３２０、付与データ入力領域３３０を持つ。なお、付与対象提示部２０２は、付与対象表示領域３１０と付与対象変更ボタン３２０が該当し、付与データ入力部２０３は付与データ入力領域３３０が該当する。The attachmentdata input unit 203 accepts input of attachment data to be attached to the attachment target presented by the attachmenttarget presentation unit 202. FIG. 3(a) shows an example of a user interface presented to the worker. The attachmenttarget presentation window 300 has an attachmenttarget display area 310, an attachmenttarget change button 320, and an attachmentdata input area 330. The attachmenttarget presentation unit 202 corresponds to the attachmenttarget display area 310 and the attachmenttarget change button 320, and the attachmentdata input unit 203 corresponds to the attachmentdata input area 330.

作業者が１つの付与対象について付与データを付与する付与作業は、付与対象表示領域３１０に表示された付与対象を確認し、その付与対象に付与するべきアノテーション情報を付与データ入力領域３３０に入力するものである。なお、入力が完了した場合は、付与対象変更ボタン３２０を使い、付与対象表示領域３１０に表示される付与対象を変更し、後述する作業キューに付与対象がなくなるまで繰り返す。The task of assigning data to one target involves the worker checking the target displayed in thetarget display area 310 and inputting the annotation information to be assigned to the target in thedata input area 330. Once the input is complete, the worker uses thetarget change button 320 to change the target displayed in thetarget display area 310, and repeats this process until there are no more targets in the work queue, which will be described later.

また、複数の作業者に対してユーザーインタフェースを提示する例を図３（ｂ）に示す。図３（ｂ）の例では、６人の作業者が付与作業をしている例となっている。各作業者はクライアント３５１、クライアント３５２、クライアント３５３、クライアント３５４、クライアント３５５、クライアント３５６をそれぞれ利用して、上述の付与対象提示ウィンドウ３００を操作する。６つクライアントはサーバー３４０に接続され、サーバー３４０から付与対象を受信して表示する。なお、クライアント３５１～３５４およびサーバー３４０は、情報処理装置１００と同様の構成を有する。Figure 3(b) shows an example of presenting a user interface to multiple workers. In the example of Figure 3(b), six workers are performing the assignment work. Each worker usesclient 351,client 352,client 353,client 354,client 355, andclient 356 to operate the above-mentioned assignmenttarget presentation window 300. The six clients are connected toserver 340, and receive and display the assignment target fromserver 340. Note thatclients 351 to 354 andserver 340 have the same configuration asinformation processing device 100.

作業結果保管部２０４は、付与対象提示部２０２で提示した付与対象と付与データ入力部２０３で入力された付与データを関連付けて保管する。The workresult storage unit 204 associates the target presented by thetarget presentation unit 202 with the data input by thedata input unit 203 and stores them.

基準不整合判断部２０５は、複数人が付与した付与データのばらつきを判断して、作業者間で基準に不整合が生じているかどうかを判断する。The criteriainconsistency determination unit 205 determines the variation in the assigned data assigned by multiple people and judges whether there is an inconsistency in the criteria between workers.

警告表示部２０６は、基準不整合判断部２０５によって作業者間で基準に不整合があると判断された場合に、作業者や作業者の付与結果を確認する検収者、作業者を監督する監督者などに対して警告を表示する。When the standardinconsistency determination unit 205 determines that there is an inconsistency in standards between workers, thewarning display unit 206 displays a warning to the worker, the inspector who checks the worker's assignment results, the supervisor who supervises the worker, etc.

次に、実施形態１の全体の流れについて図４を用いて説明する。図４には図４（ａ）と図４（ｂ）の２つのフローチャートがある。図４（ａ）はＳ４０１からＳ４０４までで、作業者に対して付与対象を提示し、その結果を受け取る処理を示したものである。また、図４（ｂ）はＳ４１１からＳ４１５までで、作業者が入力した付与データを受け付けた際に、イベント駆動する基準不整合判断の処理を示したものである。Next, the overall flow of the first embodiment will be explained with reference to FIG. 4. FIG. 4 has two flow charts, FIG. 4(a) and FIG. 4(b). FIG. 4(a) shows steps S401 to S404, which show the process of presenting the target to be assigned to the worker and receiving the result. FIG. 4(b) shows steps S411 to S415, which show the event-driven process of determining whether or not there is a criterion mismatch when the assignment data entered by the worker is accepted.

なお、図４および以降で示すフローチャートにおける各ステップの処理は、ＣＰＵ１０１がＲＯＭ１０２や外部記憶装置１０４に記憶されるプログラムを読み込み実行することにより情報の演算や加工および各ハードウェアの制御を行うことで実現される。なお、図４および以降に示すフローチャートにおける各ステップの処理の一部ないし全部を一つまたは複数のＡＳＩＣやＦＰＧＡなどのハードウェアにより実現する構成としてもよい。The processing of each step in the flowcharts shown in FIG. 4 and the following is realized byCPU 101 reading and executing programs stored inROM 102 orexternal storage device 104 to calculate and process information and control each piece of hardware. A part or all of the processing of each step in the flowcharts shown in FIG. 4 and the following may be realized by hardware such as one or more ASICs or FPGAs.

最初に図４（ａ）に示す処理について詳細を説明する。なお、以降の説明において、情報処理装置１００を図３（ｂ）に示したサーバー３４０として説明を行う。First, the process shown in FIG. 4(a) will be described in detail. In the following description, theinformation processing device 100 will be described as theserver 340 shown in FIG. 3(b).

Ｓ４０１において、情報処理装置１００は、付与対象管理部２０１が管理している付与対象を作業者ごとに作業キューを作成する。付与対象管理部２０１が管理している付与対象は図５に示すように、付与対象格納テーブル５００で管理されている。付与対象格納テーブル５００は、２つの列で構成されている。また１つの付与対象は付与対象格納テーブル５００の１行で表される。列５０１では付与対象を識別するＩＤを、列５０２では付与対象が基準確認用に利用できるかどうかを表す基準確認用フラグを、それぞれ格納している。列５０２で管理されている基準確認用フラグは、ランダムに決定してもいいし、検収者などのユーザーにより指定を受付け、当該指定により特定された付与対象を決定してもよい。In S401, theinformation processing device 100 creates a work queue for each worker for the grant targets managed by the granttarget management unit 201. The grant targets managed by the granttarget management unit 201 are managed in a grant target storage table 500 as shown in FIG. 5. The grant target storage table 500 is composed of two columns. Also, one grant target is represented by one row in the grant target storage table 500.Column 501 stores an ID that identifies the grant target, andcolumn 502 stores a criterion confirmation flag that indicates whether the grant target can be used for criterion confirmation. The criterion confirmation flag managed incolumn 502 may be determined randomly, or a designation may be accepted from a user such as an inspector, and the grant target identified by the designation may be determined.

図６に作業者が６人の場合の作業キューの例を示す。キュー６１１は作業者Ａ用の、キュー６１２は作業者Ｂ用の、キュー６１３は作業者Ｃ用の、キュー６１４は作業者Ｄ用の、キュー６１５は作業者Ｅ用の、キュー６１６は作業者Ｆ用を表す。６１１から６１６までの６つのキューは上から順番に利用されていくものとする。図５で示した付与対象格納テーブル５００の情報を使って、それぞれのキューに付与対象を割り当てていく。付与対象ＩＤが００１から００６までの付与対象は基準確認用フラグがＦａｌｓｅのため、それぞれ作業者のキューに別々にセットされる。次に付与対象ＩＤが００７の付与対象は基準確認用フラグがＴｒｕｅのため、すべての作業者のキューにセットされる。その後、付与対象ＩＤが００８から０１３までの付与対象も基準確認用フラグがＦａｌｓｅのため、それぞれ作業者のキューに別々にセットされる。このようなことを繰り返し、作業キューに付与対象を割り振っていく。Figure 6 shows an example of a work queue when there are six workers.Queue 611 is for worker A,queue 612 is for worker B,queue 613 is for worker C,queue 614 is for worker D,queue 615 is for worker E, andqueue 616 is for worker F. The sixqueues 611 to 616 are used in order from the top. Using the information in the target storage table 500 shown in Figure 5, targets are assigned to each queue. Targets withtarget IDs 001 to 006 have their criteria confirmation flag set to False, so they are set separately to the queues of the respective workers. Next, target withtarget ID 007 has its criteria confirmation flag set to True, so it is set to the queues of all the workers. After that, targets withtarget IDs 008 to 013 also have their criteria confirmation flag set to False, so they are set separately to the queues of the respective workers. This process is repeated to assign targets to work queues.

Ｓ４０２において、情報処理装置１００は、Ｓ４０１で生成した各作業キューがすべて空なのかどうかを付与対象管理部２０１が確認する。すべてが空であった場合は、処理を終了する。作業キューのうち少なくとも１つにまだ付与対象が存在している場合は、Ｓ４０３へ処理を進める。In S402, theinformation processing device 100 has the assignmenttarget management unit 201 check whether all of the work queues generated in S401 are empty. If all are empty, the process ends. If there are still assignment targets in at least one of the work queues, the process proceeds to S403.

Ｓ４０３において、情報処理装置１００は、Ｓ４０１で生成した各作業キューから付与対象を取得し、図３（ｂ）で示したように作業者それぞれの付与対象提示部２０２に、付与対象を表示させる。すなわち、サーバー３４０は、クライアント３５１～３５６に付与対象を表示させる。In S403, theinformation processing device 100 acquires the assignment targets from each work queue generated in S401, and displays the assignment targets on the assignmenttarget presentation unit 202 of each worker as shown in FIG. 3(b). That is, theserver 340 displays the assignment targets on theclients 351 to 356.

Ｓ４０４において、情報処理装置１００は、Ｓ４０３で提示した付与対象に対して、ユーザーが入力した付与データを、付与データ入力部２０３が受信する。受信した後、付与データ入力部２０３が、作業結果保管部２０４に受信した付与データを保管する。図３（ｂ）に示す例では、サーバー２３０は、クライアント３５１～３５６において付与された付与データを各クライアントから受信する。なお、付与データは、付与対象に含まれるオブジェクトの名称や意味合いを示すアノテーション情報を含むデータである。その後、付与データ入力部２０３が付与データ受信イベントを開始する。In S404, theinformation processing device 100 receives the attachment data input by the user for the attachment target presented in S403 via the attachmentdata input unit 203. After receiving the data, the attachmentdata input unit 203 stores the received attachment data in the workresult storage unit 204. In the example shown in FIG. 3(b), the server 230 receives the attachment data that was attached in theclients 351 to 356 from each of the clients. The attachment data is data that includes annotation information indicating the name and meaning of the object included in the attachment target. After that, the attachmentdata input unit 203 starts an attachment data reception event.

次に図４（ｂ）に示す処理について詳細を説明する。Next, the process shown in Figure 4(b) will be explained in detail.

Ｓ４１１において、情報処理装置１００は、Ｓ４０４で開始した付与データ受信イベントを受けて、基準不整合判断部２０５が、付与データが付与された当該付与対象の基準確認用フラグがＴｒｕｅかどうかを判定する。Ｔｒｕｅの場合はＳ４１２へ処理を進め、Ｆａｌｓｅの場合は処理を終了する。例えば、付与対象管理部２０１が管理している付与対象が図５に示す例の場合、ＩＤが００７の付与対象は基準確認用フラグがＴｒｕｅとなっていることが確認できる。In S411, in response to the attachment data reception event started in S404, theinformation processing device 100 causes the criteriainconsistency determination unit 205 to determine whether the criteria confirmation flag of the attachment target to which the attachment data has been attached is True. If it is True, the process proceeds to S412, and if it is False, the process ends. For example, in the case where the attachment targets managed by the attachmenttarget management unit 201 are the example shown in FIG. 5, it can be confirmed that the criteria confirmation flag of the attachment target withID 007 is True.

Ｓ４１２において、情報処理装置１００は、基準不整合判断部２０５が、基準確認用フラグがＴｒｕｅとなっている付与対象に対して付与された付与データを作業結果保管部２０４からすべて取得する。すなわち、情報処理装置１００は、複数の作業者が同一の付与対象に付与したアノテーションをそれぞれ取得する。In S412, theinformation processing device 100 acquires all annotation data that has been assigned to the annotation target for which the criteria confirmation flag is set to True from the workresult storage unit 204 by the criteriainconsistency determination unit 205. In other words, theinformation processing device 100 acquires annotations that have been assigned to the same annotation target by multiple workers.

ここで、作業結果保管部２０４が管理している付与データの例を図７に示す。付与データは付与データ格納テーブル７００で管理されている。付与データ格納テーブル７００は、４つの列で構成されている。また１つの付与データは付与データ格納テーブル７００の１行で表される。列７０１は付与データを識別するＩＤを格納している。列７０２は作業者が入力した付与データを格納している。列７０３は付与データが付与された付与対象のＩＤを格納している。なお、図７の例では付与対象を画像としているため、イメージＩＤとなっているが、画像に限定されるものではなく、動画、文などの自然言語を構成する文字列、ファイルなどでもよい。列７０４は付与データを入力した作業者のＩＤが格納されている。Here, an example of the attached data managed by the workresult storage unit 204 is shown in FIG. 7. The attached data is managed in an attached data storage table 700. The attached data storage table 700 is composed of four columns. One piece of attached data is represented by one row in the attached data storage table 700.Column 701 stores an ID that identifies the attached data.Column 702 stores the attached data input by the worker.Column 703 stores the ID of the attachment target to which the attached data is attached. In the example of FIG. 7, the attachment target is an image, so the ID is an image ID, but it is not limited to an image and may be a video, a character string constituting natural language such as a sentence, a file, or the like.Column 704 stores the ID of the worker who inputted the attached data.

図６に示した作業キューにおいて、各作業者が作業している途中までの結果が図７に示されている。例えばＳ４１１で説明した基準確認用フラグがＴｒｕｅである付与対象のＩＤが００７とすると、図７に示す付与データ格納テーブル７００を使って、列７０３が００７の付与データをＳ４１２において取得することになる。具体的には、Ｓ４１２において、情報処理装置１００は、列７０１に格納された付与データのＩＤが００７、００９、０１０、０１１、０１４、０１５の６つの付与データを取得することとなる。In the work queue shown in FIG. 6, the results up to the point where each worker is working are shown in FIG. 7. For example, if the ID of the assignment target for which the reference confirmation flag described in S411 is True is 007, then in S412, the assignment data storage table 700 shown in FIG. 7 is used to obtain the assignment data with 007 incolumn 703. Specifically, in S412, theinformation processing device 100 obtains six pieces of assignment data with IDs of 007, 009, 010, 011, 014, and 015 stored incolumn 701.

Ｓ４１３において、情報処理装置１００は、基準不整合判断部２０５がＳ４１２で取得した付与データの数と、作業者の数が等しいかを判断する。等しい場合は、Ｓ４１４へ処理を進め、等しくない場合は処理を終了する。In S413, theinformation processing device 100 determines whether the number of assigned data acquired by the criterioninconsistency determination unit 205 in S412 is equal to the number of workers. If they are equal, the process proceeds to S414, and if they are not equal, the process ends.

Ｓ４１４において、情報処理装置１００は、基準不整合判断部２０５がＳ４１２で取得した複数の付与データのばらつきを確認する。基準が正しく共有されているならば、付与データは一致するはずなので、複数の付与データに複数種類の付与データが存在していればばらついていると判断する。即ち、情報処理装置１００は、同一の付与対象への複数の作業者からのアノテーションの比較結果が複数の作業者間で統一されないとの結果である場合、複数の作業者間に付与基準のずれがあると判定する。なお、ばらつきを判断する方法はこれに限定されるものではなく、複数の付与データのエントロピーを算出してエントロピーが所定値以上であれればばらついていると判断してもよい。ばらついていると判断した場合はＳ４１５へ処理を進め、ばらついていない場合は処理を終了する。In S414, theinformation processing device 100 checks the variation in the multiple pieces of attached data acquired by the criteriainconsistency determination unit 205 in S412. If the criteria are shared correctly, the attached data should match, so if multiple types of attached data exist in the multiple pieces of attached data, it is determined that there is variation. In other words, if the comparison result of annotations from multiple workers to the same annotated target shows that the annotations are not consistent between the multiple workers, theinformation processing device 100 determines that there is a discrepancy in the attachment criteria between the multiple workers. Note that the method of determining the variation is not limited to this, and it is also possible to calculate the entropy of the multiple pieces of attached data and determine that there is variation if the entropy is equal to or greater than a predetermined value. If it is determined that there is variation, the process proceeds to S415, and if there is no variation, the process ends.

Ｓ４１５において、情報処理装置１００は、警告表示部２０６が作業者、検収者または／および監督者に警告を表示する。情報処理装置１００は、モニタ１１０にメッセージにより、注意喚起や警告を表示してもよいし、音を出力することで注意喚起や警告を行ってもよい。また、情報処理装置１００は、作業者、検収者または／および監督者に対応する装置に警告を出力させるための情報を送信してもよい。また、情報処理装置１００は、作業者、検収者または／および監督者に対応する装置に警告を表示させてもよい。In S415, theinformation processing device 100 causes thewarning display unit 206 to display a warning to the worker, inspector, and/or supervisor. Theinformation processing device 100 may display a message on themonitor 110 to call attention or warn, or may issue a sound to call attention or warn. Theinformation processing device 100 may also transmit information for outputting a warning to a device corresponding to the worker, inspector, and/or supervisor. Theinformation processing device 100 may also cause a warning to be displayed on a device corresponding to the worker, inspector, and/or supervisor.

情報処理装置１００は、注意喚起や警告として、複数の作業者間で付与基準にぶれやばらつきが生じている可能性があることを示す情報を作業者、検収者または／および監督者に提示してもよい。また、情報処理装置１００は、注意喚起や警告として、付与基準を統一させるための施策を促す情報を作業者、検収者または／および監督者に提示してもよい。この場合、情報処理装置１００は、電子メールなどで警告を示す情報を対応するアドレスに送信する構成としてもよい。Theinformation processing device 100 may present information to the worker, inspector, and/or supervisor as a warning or attention to the possibility that there may be inconsistencies or variations in the assignment criteria among multiple workers. Theinformation processing device 100 may also present information to the worker, inspector, and/or supervisor as a warning or attention to encourage measures to be taken to standardize the assignment criteria. In this case, theinformation processing device 100 may be configured to send information indicating the warning to a corresponding address by email or the like.

また、情報処理装置１００は、一部の作業者のみが他の作業者と異なる内容のアノテーションを付与している場合、この一部の作業者のみに警告を提示してもよい。この場合、情報処理装置１００は、付与基準を確認することを促す情報、他の作業者と異なる内容のアノテーションを付与していることを示す情報または／および他の作業者が付与した正解と思われるアノテーションを示す情報を提示することで警告を行ってもよい。また、情報処理装置１００は、システムを構成するすべての装置に警告を発する構成としてもよい。また、情報処理装置１００は、例示した上記の注意喚起や警告を複数組み合わせて行ってもよい。In addition, when only some workers have added annotations that are different from those of other workers, theinformation processing device 100 may present a warning only to these some workers. In this case, theinformation processing device 100 may issue a warning by presenting information that prompts the workers to check the annotation criteria, information indicating that the workers have added annotations that are different from those of other workers, and/or information indicating annotations added by other workers that are believed to be correct. Theinformation processing device 100 may also be configured to issue a warning to all devices that make up the system. Theinformation processing device 100 may also issue a combination of multiple of the above-mentioned example reminders and warnings.

また、図３において、情報処理装置１００がサーバー３４０として動作する場合、情報処理装置１００は、警告対象とする作業者、検収者または／および監督者に対応するクライアント３５１～３５６に警告を出力させるための情報を送信させてもよい。また、情報処理装置１００がサーバー３４０として動作する場合、情報処理装置１００は、内蔵または接続するモニタ１１０に警告を出力させてもよい。また、情報処理装置１００がクライアント３５１～３５６として動作する場合、情報処理装置１００は、サーバー３４０からの警告を出力せるための情報の受信に応じて、警告を出力する構成としてもよい。また、情報処理装置１００がクライアント３５１～３５６として動作する場合、情報処理装置１００は自装置に対応する作業者の付与基準が他の作業者と異なっているかを判定する構成としてもよい。この場合、サーバー３４０または他のクライアントからの他の作業者が付与したアノテーションを受信する。そして、情報処理装置１００は自装置に対応する作業者の付与基準が他の作業者と異なっているかを判定し、自装置に対応する作業者の付与基準が他の作業者と異なっている警告を出力する構成としてもよい。3, when theinformation processing device 100 operates as theserver 340, theinformation processing device 100 may transmit information for outputting a warning to theclients 351 to 356 corresponding to the worker, inspector, and/or supervisor to be warned. When theinformation processing device 100 operates as theserver 340, theinformation processing device 100 may output a warning to the built-in or connectedmonitor 110. When theinformation processing device 100 operates as theclient 351 to 356, theinformation processing device 100 may be configured to output a warning in response to receiving information for outputting a warning from theserver 340. When theinformation processing device 100 operates as theclient 351 to 356, theinformation processing device 100 may be configured to determine whether the assignment criteria of the worker corresponding to the device itself are different from those of other workers. In this case, theinformation processing device 100 receives annotations assigned by other workers from theserver 340 or other clients. Theinformation processing device 100 may then determine whether the assignment criteria of the worker corresponding to the device itself are different from those of other workers, and output a warning that the assignment criteria of the worker corresponding to the device itself are different from those of other workers.

これにより、警告を確認した作業者、検収者または監督者は、作業者間でアノテーションに対する基準がぶれていることを確認することができる。基準がぶれていることが確認できた場合、作業者は自身の認識を再確認したり、検収者や監督者は基準を誤解して誤ったアノテーションを付与した作業者に対して直接指導したりすることが可能となる。確認や指導の結果、基準に対する認識が改められ、作業場全体でアノテーション付与の精度を高めることが可能となる。This allows the worker, inspector, or supervisor who sees the warning to confirm that there is a deviation in the annotation standards between workers. If it is confirmed that the standards are deviating, the worker can reconfirm his or her understanding, and the inspector or supervisor can provide direct guidance to the worker who misunderstood the standards and made the incorrect annotation. As a result of the confirmation and guidance, perception of the standards will change, making it possible to improve the accuracy of annotation throughout the entire workplace.

＜実施形態２＞
他の実施形態として、作業者間の付与基準のずれを判定し、警告を行う実施形態２について以下に説明する。実施形態２における情報処理装置１００のハードウェア構成は、実施形態１と同様である。また、実施形態２における情報処理装置１００の機能構成は、図２に示した実施形態１の構成と同じであるが、基準不整合判断部２０５が判断する処理が実施形態１と異なるため、その点のみを説明する。 <Embodiment 2>
As another embodiment, a second embodiment in which a discrepancy in the assignment criteria between workers is determined and a warning is issued will be described below. The hardware configuration of theinformation processing device 100 in the second embodiment is the same as that in the first embodiment. Also, the functional configuration of theinformation processing device 100 in the second embodiment is the same as that in the first embodiment shown in Fig. 2, but the process determined by the criteriainconsistency determination unit 205 is different from that in the first embodiment, and therefore only this point will be described.

実施形態２では複数の付与データをグループ化することでばらつきを判定する。実施形態２におけるばらつきの判定処理について、図８に示すフローチャートを用いて説明する。In the second embodiment, the variation is determined by grouping multiple pieces of attached data. The variation determination process in the second embodiment is described with reference to the flowchart shown in FIG.

Ｓ８０１において、情報処理装置１００は、基準不整合判断部２０５が複数の付与データをグループ分けする。例として、実施形態１でも説明した付与対象のＩＤが００７の場合で説明する。図７に示す例では、ＩＤが００７の付与対象に対して、６人の作業者が付与データを付与している。具体的には、具体的には、付与データのＩＤが００７、００９、０１０、０１１、０１４、０１５の６つになる。この６つのデータの列７０２を確認すると、００７、００９、０１１の３つは付与データが「ＳＵＶ」、０１０、０１４、０１５の３つは付与データが「ピックアップトラック」になっている。この場合は、ＳＵＶグループとピックアップトラックグループの２つのグループに分けることとなる。このように、本実施形態におけるグループ分けは、アノテーションの内容ごとに区別することをいう。In S801, theinformation processing device 100 has the criterionmismatch determination unit 205 group a plurality of pieces of assigned data. As an example, the case where the ID of the assigned target is 007, which was also described in the first embodiment, will be described. In the example shown in FIG. 7, six workers assigned assigned data to the assigned target withID 007. Specifically, the assigned data has six IDs of 007, 009, 010, 011, 014, and 015. When checking thecolumn 702 of these six pieces of data, the assigned data for three pieces of data, 007, 009, and 011, is "SUV," and the assigned data for three pieces of data, 010, 014, and 015, is "pickup truck." In this case, the data is divided into two groups, an SUV group and a pickup truck group. In this way, grouping in this embodiment refers to distinguishing according to the contents of the annotation.

Ｓ８０２において、情報処理装置１００は、基準不整合判断部２０５が、Ｓ８０１でグループ分けした結果としてグループが複数できたかどうかを判定する。判定の結果、複数のグループができていた場合はＳ８０３へ、複数のグループができていないかった場合はＳ８０５へ処理を進める。上述の例ではＳＵＶグループとピックアップトラックグループの２つのグループができているため、Ｓ８０３へ処理を進める。In S802, theinformation processing device 100 determines whether the criteriamismatch determination unit 205 has created multiple groups as a result of the grouping in S801. If the determination result shows that multiple groups have been created, the process proceeds to S803, and if not, the process proceeds to S805. In the above example, two groups have been created, an SUV group and a pickup truck group, so the process proceeds to S803.

Ｓ８０３において、情報処理装置１００は、基準不整合判断部２０５が、グループ内の付与データの数は２以上のグループが複数存在しているかどうかを判定する。これはグループを構成する付与データが２未満、つまり１つの場合は、作業者が単に間違えただけである可能性を考慮するためである。判定の結果、存在していた場合はＳ８０４へ、存在していなかった場合はＳ８０５へ処理を進める。上述の例では、ＳＵＶグループもピックアップトラックグループもそれぞれ３つの付与データが存在しているため、Ｓ８０４へ処理を進める。なお、本実施形態ではグループ内に存在している付与データの数が２以上で判定を行っていたが、２に限定されるものではない。作業者が単に間違えただけだと判断する数を大きくしたければ、２以上の値を設定すればよい。In S803, theinformation processing device 100 has the criterioninconsistency determination unit 205 determine whether there are multiple groups with two or more assigned data in the group. This is because if the number of assigned data constituting a group is less than two, i.e., one, it is possible that the worker simply made a mistake. If the result of the determination is that there is assigned data, the process proceeds to S804, and if there is no assigned data, the process proceeds to S805. In the above example, there are three assigned data in both the SUV group and the pickup truck group, so the process proceeds to S804. Note that in this embodiment, the determination is made when the number of assigned data present in a group is two or more, but this is not limited to two. If you want to increase the number at which you determine that the worker simply made a mistake, you can set a value of two or more.

Ｓ８０４において、情報処理装置１００は、基準不整合判断部２０５が付与データはばらついていると判断する。In S804, theinformation processing device 100 determines that the criteriainconsistency determination unit 205 determines that the assigned data is variable.

Ｓ８０５において、情報処理装置１００は、基準不整合判断部２０５が付与データはばらついていないと判断する。In S805, theinformation processing device 100 determines that the criteriainconsistency determination unit 205 does not vary the assigned data.

これにより、作業者が単にミスしただけの可能性を排除できるようになることから、基準がぶれていることをより確度高く検出することが可能になる。This makes it possible to eliminate the possibility that the operator simply made a mistake, and therefore makes it possible to more accurately detect deviations from the standard.

＜実施形態３＞
他の実施形態として、作業者間の付与基準のずれを判定し、警告を行う実施形態３について以下に説明する。実施形態３における情報処理装置１００のハードウェア構成は、実施形態１と同様である。また、実施形態３における情報処理装置１００の機能構成を図９に示す。なお、図２に示した実施形態１の構成と同じものについては図２で使用していたものと同じ番号が記載し、その説明は省略する。 <Embodiment 3>
As another embodiment, a third embodiment in which a discrepancy in the assignment criteria between workers is determined and a warning is issued will be described below. The hardware configuration of theinformation processing device 100 in the third embodiment is the same as that in the first embodiment. Also, the functional configuration of theinformation processing device 100 in the third embodiment is shown in Fig. 9. Note that the same numbers as those used in Fig. 2 are used for the same components as those in the first embodiment shown in Fig. 2, and the description thereof will be omitted.

付与データ推定部９０１は、付与対象管理部２０１で管理している付与対象に付与すべき付与データを事前に推定する。推定をするために、付与対象管理部２０１で管理している付与対象と同種類の付与対象と、その同種類の付与対象に付与するべき付与データの関係を機械学習することで、事前学習モデルを生成する。そして、その事前学習モデルを用いて、付与対象管理部２０１で管理されているそれぞれの付与対象に対して、付与するべき付与データを推定する。推定された付与データは推定付与データとして、付与対象と関連付けて作業結果保管部２０４に保管する。The attachmentdata estimation unit 901 estimates in advance the attachment data to be assigned to the attachment targets managed by the attachmenttarget management unit 201. To make the estimation, a pre-learning model is generated by machine learning the relationship between the attachment targets of the same type as the attachment targets managed by the attachmenttarget management unit 201 and the attachment data to be assigned to the attachment targets of the same type. Then, the pre-learning model is used to estimate the attachment data to be assigned to each of the attachment targets managed by the attachmenttarget management unit 201. The estimated attachment data is stored in the workresult storage unit 204 as estimated attachment data in association with the attachment targets.

付与データ推定部９０１が用いる事前学習モデルは、情報処理装置１００が機械学習を実行してもよいし、機械学習された学習済モデルを外部から取得してもよい。情報処理装置１００が機械学習を実行する場合、情報処理装置１００は、ＧＰＵ（ＧｒａｐｈｉｃａｌＰｒｏｃｅｓｓｉｎｇＵｎｉｔ）をさらに有し、ＣＰＵ１０１に加え、ＧＰＵを用いて機械学習を行ってもよい。ＧＰＵは、データをより多く並列処理することで効率的な演算を行うことができるので、ディープラーニングのよう複数回に渡り学習を行う場合に有効である。The pre-learning model used by the attachmentdata estimation unit 901 may be machine-learned by theinformation processing device 100, or may be acquired from an external device as a machine-learned model. When theinformation processing device 100 performs machine learning, theinformation processing device 100 may further include a GPU (Graphical Processing Unit) and perform machine learning using the GPU in addition to theCPU 101. The GPU can perform efficient calculations by processing more data in parallel, and is therefore effective when learning is performed multiple times, such as in deep learning.

機械学習の具体的なアルゴリズムとしては、最近傍法、ナイーブベイズ法、決定木、サポートベクターマシンなどが挙げられる。また、ニューラルネットワークを利用して、学習するための特徴量、結合重み付け係数を自ら生成する深層学習（ディープラーニング）も挙げられる。適宜、上記アルゴリズムのうち利用できるものを用いて本実施形態に適用することができる。Specific examples of machine learning algorithms include nearest neighbor methods, naive Bayes methods, decision trees, and support vector machines. In addition, deep learning, which uses a neural network to generate features and connection weighting coefficients for learning, can also be used. Any of the above algorithms that can be used can be used as appropriate and applied to this embodiment.

学習済モデルを作成する場合、入力データと出力データとの組合せを学習データ（教師データ）として複数個準備する。それらから機械学習によって知識を獲得し、獲得した知識に基づいて入力データに対する出力データを結果として出力する学習済みモデルを生成する。なお、学習済みモデルは、必要に応じて一定の処理後に更新しても良い。When creating a trained model, multiple combinations of input data and output data are prepared as training data (teacher data). Knowledge is acquired from these through machine learning, and a trained model is generated that outputs output data for the input data based on the acquired knowledge. Note that the trained model may be updated after certain processing as necessary.

なお、同種類の付与対象とは、例えば、作業者が画像中に写る車を見てその車の種類を付与データとして付与する場合は、付与対象管理部２０１で管理されている画像とは別の画像でかつ車が写る画像となる。また、推定付与データについては、事前学習モデルを使って推定した結果と、その確率によって構成される。推定付与データの例について図１０を用いて説明する。なお図１０に示す例では付与対象を画像としている。Note that, for example, when an operator sees a car in an image and assigns the type of car as the assignment data, the same type of assignment target is an image that is different from the images managed by the assignmenttarget management unit 201 and contains a car. The estimated assignment data is composed of the results estimated using the pre-learning model and their probabilities. An example of estimated assignment data will be explained using FIG. 10. Note that in the example shown in FIG. 10, the assignment target is an image.

図１０（ａ）に示した推定付与データ格納テーブル１０００は、５つの列から構成される。また１つの推定付与データは推定付与データ格納テーブル１０００の１行で表される。列１００１は推定結果である推定付与データを識別するＩＤを格納する列である。列１００２は推定結果を格納する列である。列１００３は推定結果の確度情報を格納する列である。列１００４は推定した対象である画像のＩＤを格納する列である。列１００５は推定した対象である画像内に含まれる領域のＩＤを格納する列である。なお領域は後述する領域テーブル１０１０で管理されている。なお、付与対象に対して複数の付与データが付与される場合もあるため、領域の情報を保持している。例えば、画像中にある車の種別をアノテーションとして付ける場合、画像中に複数の車があった場合は、画像内の領域を指定して、その領域に対してアノテーションを付与することになる。図１０（ａ）に示した例では、ＩＤが００８と００９の２つの推定付与データはどちらも列１００４のイメージＩＤが００８の付与対象に対して付与されたものであることを示している。The estimated attachment data storage table 1000 shown in FIG. 10(a) is composed of five columns. One piece of estimated attachment data is represented by one row in the estimated attachment data storage table 1000.Column 1001 is a column that stores an ID that identifies the estimated attachment data, which is the estimation result.Column 1002 is a column that stores the estimation result.Column 1003 is a column that stores the accuracy information of the estimation result.Column 1004 is a column that stores the ID of the image that is the estimated target.Column 1005 is a column that stores the ID of the area contained in the image that is the estimated target. The area is managed in the area table 1010 described later. Note that, since multiple attachment data may be assigned to the attachment target, area information is stored. For example, when annotating the type of car in an image, if there are multiple cars in the image, an area in the image is specified and annotation is assigned to that area. In the example shown in FIG. 10(a), the two estimated assignment data withIDs 008 and 009 both indicate that they were assigned to the assignment target withimage ID 008 incolumn 1004.

図１０（ｂ）に示した領域テーブル１０１０は、５つの列から構成される。また付与対象内の１つの領域は領域テーブル１０１０の１行で表される。列１０１１は領域を識別するＩＤを格納する列である。列１０１２は領域の左上の点を起点とし、その起点のＸ座標の値を格納する列である。列１０１３は領域の左上の点を起点とし、その起点のＹ座標の値を格納する列である。列１０１４は領域の幅の値を格納する列である。列１０１５は領域の高さの値を格納する列である。The region table 1010 shown in FIG. 10(b) is composed of five columns. Furthermore, one region within the assignment target is represented by one row in the region table 1010.Column 1011 is a column that stores an ID that identifies the region.Column 1012 is a column that stores the X coordinate value of the starting point, starting from the top left point of the region.Column 1013 is a column that stores the Y coordinate value of the starting point, starting from the top left point of the region.Column 1014 is a column that stores the width value of the region.Column 1015 is a column that stores the height value of the region.

基準確認用付与対象判定部９０２は、基準がぶれているかを判定するのに適した基準確認用付与対象を判定する。基準確認用付与対象を判定する処理の流れについて図１１のフローチャートを用いて説明する。The reference confirmationtarget determination unit 902 determines a reference confirmation target suitable for determining whether the reference is blurred. The process flow for determining the reference confirmation target is described with reference to the flowchart in FIG. 11.

Ｓ１１０１において、情報処理装置１００は、基準確認用付与対象判定部９０２が、付与対象管理部２０１から付与対象を１つ取得する。In S1101, theinformation processing device 100 causes the reference confirmation granttarget determination unit 902 to obtain one grant target from the granttarget management unit 201.

Ｓ１１０２において、情報処理装置１００は、基準確認用付与対象判定部９０２が、作業結果保管部２０４から、Ｓ１１０１で取得した付与対象に対して推定された推定付与データをすべて取得する。In S1102, theinformation processing device 100 causes the reference confirmation assignmenttarget determination unit 902 to acquire all of the estimated assignment data estimated for the assignment target acquired in S1101 from the workresult storage unit 204.

Ｓ１１０３において、情報処理装置１００は、Ｓ１１０２で取得した１または複数の推定付与データから、付与対象を作業するときの難易度を判断する。本実施例では１または複数の推定付与データに含まれる確率が１つでも６０％以下になっているかどうかを判定する。判定の結果、推定付与データに含まれる確率が６０％以下となる推定付与データが存在する場合はＳ１１０４へ進み、推定付与データに含まれる確率がすべて６０％より大きい場合は処理を終了する。なお、６０％は例であってこれに限定されるものではなく、に任意の所定の閾値を設定する構成としてもよい。In S1103, theinformation processing device 100 determines the difficulty of working with the assignment target from the one or more estimated assignment data acquired in S1102. In this embodiment, it is determined whether the probability included in the one or more estimated assignment data is 60% or less. If the result of the determination is that there is estimated assignment data with a probability of being included in the estimated assignment data of 60% or less, the process proceeds to S1104, and if all of the probabilities included in the estimated assignment data are greater than 60%, the process ends. Note that 60% is an example and is not limited to this, and any predetermined threshold value may be set.

また、推定付与データの確度を用いる場合は、ユーザーが事前に設定した異なる値を扱ってもいいし、作業結果保管部２０４で管理する推定付与データ群の確率の傾向をつかってもよい。確率の傾向とは、例えば、確率の平均を用いるなどがある。なお、推定付与データの確度情報を用いずに他の情報を用いても構わない。例えば、１つの付与対象の中に含まれる推定付与データの数を用いてもよいし、図１０で示したように推定付与データが領域を持つような場合は、領域が重なっている面積などを用いてもよい。When using the accuracy of the estimated attachment data, different values preset by the user may be used, or the trend in the probability of the group of estimated attachment data managed by the workresult storage unit 204 may be used. The trend in probability may be, for example, the average probability. Note that other information may be used instead of using the accuracy information of the estimated attachment data. For example, the number of estimated attachment data included in one attachment target may be used, or, in the case where the estimated attachment data has an area as shown in FIG. 10, the area of overlap of the areas may be used.

Ｓ１１０４において、情報処理装置１００は、基準確認用付与対象判定部９０２が、該当する付与対象に対して、集中力判定用フラグを設定する。なお集中力判定用フラグは、付与対象管理部２０１で管理しているデータベースの列として設定しもよいし、付与対象がファイルであった場合はファイルのメタデータとして付与してもよい。図１２に付与対象管理部２０１で管理されている付与対象の例を示す。In S1104, theinformation processing device 100 has the reference confirmationtarget determination unit 902 set a concentration determination flag for the corresponding target. The concentration determination flag may be set as a column in a database managed by thetarget management unit 201, or may be assigned as metadata for the file if the target is a file. FIG. 12 shows an example of a target managed by thetarget management unit 201.

図１２に示す付与対象格納テーブル１２００は３つの列から構成される。また１つの付与対象は付与対象格納テーブル１２００の１行で表される。列１２０１は付与対象を識別するＩＤを格納する列である。列１２０２は付与対象が基準確認用付与対象として適しているかどうかを示す情報を格納する列である。列１２０３は付与対象が基準確認用付与対象であった場合、付与対象を提示する作業者の人数を格納する列である。なお、列１２０３の値は列１２０２の値がＦａｌｓｅの場合は１となり、列１２０２の値がＴｒｕｅの場合は１以外の値を格納する。列１２０３の値が１よりも大きい場合はその人数の作業者に対して付与対象を提示することを意味し、負の値の場合は作業者全員に提示することを意味する。The grant target storage table 1200 shown in FIG. 12 is composed of three columns. One grant target is represented by one row in the grant target storage table 1200.Column 1201 is a column that stores an ID that identifies the grant target.Column 1202 is a column that stores information indicating whether the grant target is suitable as a grant target for reference confirmation.Column 1203 is a column that stores the number of workers to whom the grant target is presented if the grant target is a grant target for reference confirmation. Note that the value ofcolumn 1203 is 1 if the value ofcolumn 1202 is False, and a value other than 1 is stored if the value ofcolumn 1202 is True. A value greater than 1 incolumn 1203 means that the grant target is presented to that number of workers, and a negative value means that the grant target is presented to all workers.

Ｓ１１０５において、情報処理装置１００は、基準確認用付与対象判定部９０２が、基準確認用付与対象を何人の作業者に対して提示するのかを決定する。この処理は図１３を用いて後述する。なお、Ｓ１１０５の処理を実施せず、基準確認用付与対象は常に作業者全員に提示するとしてもよい。In S1105, theinformation processing device 100 has the reference confirmationtarget determination unit 902 determine how many workers the reference confirmation target is to be presented to. This process will be described later with reference to FIG. 13. Note that the reference confirmation target may be presented to all workers at all times without performing the process of S1105.

Ｓ１１０１からＳ１１０５の処理は付与対象１つに対して行う処理であり、基準確認用付与対象判定部９０２では、付与対象管理部２０１で管理されている付与対象すべてについて、この処理を実施する。The processes from S1101 to S1105 are performed for one grant target, and the criteria confirmation granttarget determination unit 902 performs this process for all grant targets managed by the granttarget management unit 201.

基準確認用付与対象を提示する作業者の人数を決定する処理の流れについて図１３を用いて説明する。The process flow for determining the number of workers to whom the criteria confirmation assignment targets are to be presented is explained using Figure 13.

Ｓ１３０１において、情報処理装置１００は、基準確認用付与対象判定部９０２が、処理対象の付与対象に対する推定付与データのうち、最も確率低い推定付与データの確率が、６０％未満かつ５０％以上かどうかを判定する。判定の結果、条件を満たす場合はＳ１３０２へ、満たさない場合はＳ１３０３へ処理を進める。In S1301, theinformation processing device 100 determines whether the probability of the lowest probability of the estimated assignment data among the estimated assignment data for the assignment target being processed is less than 60% and 50% or more. If the result of the determination satisfies the condition, the process proceeds to S1302, and if not, the process proceeds to S1303.

Ｓ１３０２において、情報処理装置１００は、基準確認用付与対象判定部９０２が、基準確認用付与対象を提示する作業者数を３人に決定し、付与対象管理部２０１に設定する。具体的には図１２に示す付与対象格納テーブル１２００の列１２０３に３を設定する。In S1302, theinformation processing device 100 determines the number of workers to be presented with the reference confirmation grant target by the reference confirmation granttarget determination unit 902 as three, and sets this in the granttarget management unit 201. Specifically, thenumber 3 is set incolumn 1203 of the grant target storage table 1200 shown in FIG. 12.

Ｓ１３０３において、情報処理装置１００は、基準確認用付与対象判定部９０２が、ある付与対象に対する推定付与データのうち、最も確率低い推定付与データの確率が、５０％未満かつ４０％以上かどうかを判定する。判定の結果、条件を満たす場合はＳ１３０４へ、満たさない場合はＳ１３０５へ処理を進める。In S1303, theinformation processing device 100 determines whether the probability of the least likely estimated assignment data among the estimated assignment data for a certain assignment target is less than 50% and equal to or greater than 40%. If the result of the determination satisfies the condition, the process proceeds to S1304, and if not, the process proceeds to S1305.

Ｓ１３０４において、情報処理装置１００は、基準確認用付与対象判定部９０２が、基準確認用付与対象を提示する作業者数を５人に決定し、付与対象管理部２０１に設定する。具体的には図１２に示す付与対象格納テーブル１２００の列１２０３に５を設定する。In S1304, theinformation processing device 100 determines the number of workers to be presented with the reference confirmation grant target by the reference confirmation granttarget determination unit 902 as 5, and sets this in the granttarget management unit 201. Specifically, 5 is set incolumn 1203 of the grant target storage table 1200 shown in FIG. 12.

Ｓ１３０５において、情報処理装置１００は、基準確認用付与対象判定部９０２が、基準確認用付与対象を提示する作業者数を全員に決定し、付与対象管理部２０１に設定する。具体的には図１２に示す付与対象格納テーブル１２００の列１２０３に－１を設定する。In S1305, theinformation processing device 100 determines the number of workers to whom the reference confirmation granttarget determination unit 902 presents the reference confirmation grant target for all workers, and sets this in the granttarget management unit 201. Specifically, it sets -1 incolumn 1203 of the grant target storage table 1200 shown in FIG. 12.

なお、図１３に示した処理において指定した確率および判断の分岐数はこれに限定されるものではない。本実施形態では６０％未満かつ５０％以上、５０％未満かつ４０％以上、４０％未満の３つに分岐させているが、これを６０％から４０％までを５％刻みを４つの範囲、加えて４０％未満で合計５つに分割してもよい。もしくは６０％未満かつ４０％以上、４０％未満の２つに分割してもよい。また、Ｓ１１０３での説明と同様に、推定付与データの確度情報を用いずに他の情報を用いても構わない。例えば、１つの付与対象の中に含まれる推定付与データの数の範囲を用いてもよいし、図１０で示したように推定付与データが領域を持つような場合は、領域が重なっている面積の大きさの範囲や数の範囲などを用いてもよい。加えて、付与対象を提示する作業者数を管理する際にデータベースを使う方法は一例であり、これに限定されるものではない。例えば、付与対象がファイルの場合はファイルのメタデータとして設定しても構わない。The probability and the number of branching judgments specified in the process shown in FIG. 13 are not limited to the above. In this embodiment, the probability is divided into three: less than 60% and 50% or more, less than 50% and 40% or more, and less than 40%. However, this may be divided into four ranges of 5% increments from 60% to 40%, and a total of five ranges of less than 40%. Or it may be divided into two ranges: less than 60% and 40% or more, and less than 40%. Also, as in the explanation of S1103, other information may be used instead of using the accuracy information of the estimated assignment data. For example, the range of the number of estimated assignment data included in one assignment target may be used, or, in the case where the estimated assignment data has an area as shown in FIG. 10, the range of the size of the area where the areas overlap or the range of the number may be used. In addition, the method of using a database when managing the number of workers who present the assignment target is one example, and is not limited to this. For example, if the assignment target is a file, it may be set as metadata of the file.

また、付与基準の確認対象とする作業者の数を付与対象のアノテーションの推定確度に応じて変化させる構成としたが、これに限らず、推定確度に関わらず、一定（例えば、全員）としてもよい。In addition, the number of workers to be checked for the annotation criteria is changed depending on the estimated accuracy of the annotation to be added, but this is not limited to the above, and the number may be fixed (for example, everyone) regardless of the estimated accuracy.

次に、推定付与データを利用して基準判定用付与対象を特定し、またその基準判定用付与対象を提示する作業者数を決定した場合に、付与対象管理部２０１で作業者ごとの作業キューを設定した例を図１４に示す。Next, FIG. 14 shows an example in which the estimated assignment data is used to identify the assignment targets for reference judgment, and the number of workers to whom the assignment targets for reference judgment are to be presented is determined, and a work queue for each worker is set in the assignmenttarget management unit 201.

図１４で示す例は、図６で示したものと同様に、作業者が６人の場合の作業キューの例となっている。キュー１４０１は作業者Ａ用の、キュー１４０２は作業者Ｂ用の、キュー１４０３は作業者Ｃ用の、キュー１４０４は作業者Ｄ用の、キュー１４０５は作業者Ｅ用の、キュー１４０６は作業者Ｆ用を表す。今１４０１から１４０６までの６つのキューは上から順番に利用されていくものとする。The example shown in Figure 14 is an example of a work queue for six workers, similar to that shown in Figure 6.Queue 1401 is for worker A,queue 1402 is for worker B,queue 1403 is for worker C,queue 1404 is for worker D,queue 1405 is for worker E, andqueue 1406 is for worker F. Now, let's assume that the sixqueues 1401 to 1406 are used in order from the top.

付与対象管理部２０１では、図１２の付与対象格納テーブル１２００の列１２０２に格納された基準確認用フラグと列１２０３に格納された作業者人数を基に、キューへ付与対象を設定している。The assignmenttarget management unit 201 sets the assignment target to the queue based on the criteria confirmation flag stored incolumn 1202 of the assignment target storage table 1200 in FIG. 12 and the number of workers stored incolumn 1203.

図１２に例示した付与対象を、付与対象管理部２０１がどのように各キューに振り分けるのかを説明する。ここでは列１２０１のＩＤが小さい順に説明する。ＩＤが００１の付与対象は、基準確認用フラグがＦａｌｓｅのため、１人の作業者に振り分ける。具体的にはキュー１４０１にＩＤが００１の付与対象を振り分ける。続いてＩＤが００２の付与対象も同様に、キュー１４０２に振り分ける。ＩＤが００３の付与対象は基準確認用フラグがＴｒｕｅであり、かつ提示する作業者数は３人なので、キュー１４０３、キュー１４０４、キュー１４０５に振り分ける。ＩＤが００４から００７までの４つの付与対象は基準確認用フラグがＦａｌｓｅであるため、それぞれ順にキュー１４０６、キュー１４０１、キュー１４０２、キュー１４０３に振り分ける。続いて、ＩＤが００８の付与対象は、付与対象は基準確認用フラグがＴｒｕｅであり、かつ提示する作業者数は５人なので、キュー１４０４、キュー１４０５、キュー１４０６、キュー１４０１、キュー１４０２に振り分ける。ＩＤが００９から０１２までの４つの付与対象は基準確認用フラグがＦａｌｓｅであるため、それぞれ順にキュー１４０３、キュー１４０４、キュー１４０５、キュー１４０６に振り分ける。途中省略し、ＩＤがＮの付与対象は基準確認用フラグがＴｒｕｅであり、かつ作業者数が－１であるため、作業者全員に振り分ける。具体的にはキュー１４０１、キュー１４０２、キュー１４０３、キュー１４０４、キュー１４０５、キュー１４０６のすべてに振り分ける。The following describes how the granttarget management unit 201 allocates the grant targets shown in FIG. 12 to each queue. Here, the explanation will be given in ascending order of ID incolumn 1201. The grant target withID 001 is allocated to one worker because the criterion confirmation flag is False. Specifically, the grant target withID 001 is allocated toqueue 1401. Next, the grant target withID 002 is similarly allocated toqueue 1402. The grant target withID 003 has the criterion confirmation flag True and presents three workers, so it is allocated toqueues 1403, 1404, and 1405. The four grant targets withIDs 004 to 007 have the criterion confirmation flag False, so they are allocated toqueues 1406, 1401, 1402, and 1403, respectively. Next, the grant target withID 008 has its reference confirmation flag set to True and the number of workers presented is 5, so it is assigned toqueues 1404, 1405, 1406, 1401, and 1402. The four grant targets withIDs 009 to 012 have their reference confirmation flag set to False, so they are assigned toqueues 1403, 1404, 1405, and 1406, respectively. Omitting the middle part, the grant target with ID N has its reference confirmation flag set to True and the number of workers is -1, so it is assigned to all workers. Specifically, it is assigned to all ofqueues 1401, 1402, 1403, 1404, 1405, and 1406.

これにより、基準がぶれやすいと思われる付与対象に限定して基準がぶれているかどうかの判定が可能となる。その結果、より効果的に基準がぶれているかどうかを検出することが可能となり、効率的に作業場全体でアノテーション付与の精度を高めることが可能となる。This makes it possible to determine whether the criteria are inconsistent by limiting annotation targets to those that are likely to be inconsistent. As a result, it becomes possible to more effectively detect whether the criteria are inconsistent, and efficiently improve the accuracy of annotation across the entire workplace.

以上の構成からなる本実施形態によれば、複数人でアノテーションを付与した結果のばらつきから警告を出すことが可能となる。これにより警告を確認した作業者、監督者もしくは付与結果を検証する検証者は、どのような付与対象のときに基準がぶれやすくなるのか、どの作業者の認識がずれているのかが即座に判別可能になる。その結果、付与基準の認識がずれている作業者に対して、是正措置を取ることが可能となり、全体のアノテーション付与の精度を向上することが可能となる。According to this embodiment configured as described above, it is possible to issue a warning based on the variation in the results of annotations made by multiple people. This allows the worker, supervisor, or verifier verifying the annotation results who confirms the warning to immediately determine what annotation targets are likely to cause deviations in the criteria and which worker's understanding is off. As a result, it becomes possible to take corrective measures against workers whose understanding of the annotation criteria is off, making it possible to improve the overall accuracy of annotation.

＜その他の実施形態＞
上述の複数の実施形態を適宜組み合わせてもよい。例えば、付与基準の判定処理を判定する処理を所定回数ごとに各実施形態の方法を切り替える構成としてもよい。または、ランダムに各実施形態の方法に切り替えてもよい。 <Other embodiments>
The above-described embodiments may be appropriately combined. For example, the method of each embodiment may be switched every predetermined number of times to determine the determination process of the grant criteria. Alternatively, the method of each embodiment may be switched randomly.

本発明は、上述の実施例の１以上の機能を実現するプログラムを、ネットワーク又は記憶媒体を介してシステム又は装置に供給し、そのシステム又は装置のコンピュータにおける１つ以上のプロセッサーがプログラムを読出し実行する処理でも実現可能である。また、１以上の機能を実現する回路（例えば、ＡＳＩＣ）によっても実現可能である。The present invention can also be realized by supplying a program that realizes one or more of the functions of the above-mentioned embodiments to a system or device via a network or storage medium, and having one or more processors in the computer of the system or device read and execute the program. It can also be realized by a circuit (e.g., an ASIC) that realizes one or more functions.

３００付与対象提示ウィンドウ
３１０付与対象表示領域
３２０付与対象変更ボタン
３３０付与データ入力領域
３４０複数人によるアノテーション付与作業を管理するサーバー
３５１～３５６作業者がアノテーション付与作業で利用するクライアント 300 Annotationtarget presentation window 310 Annotationtarget display area 320 Annotationtarget change button 330 Annotationdata input area 340 Server that manages annotation work bymultiple people 351 to 356 Clients used by workers in annotation work