JPS6043556B2

Info

Publication number: JPS6043556B2
Application number: JP55177472A
Authority: JP
Inventors: 正人須田; 好勝中村
Original assignee: Tokyo Shibaura Electric Co Ltd
Current assignee: Toshiba Corp
Priority date: 1980-12-16
Filing date: 1980-12-16
Publication date: 1985-09-28
Also published as: JPS57100567A

Description

Translated from Japanese

DETAILED DESCRIPTION OF THE INVENTION

The present invention relates to a highly reliable character reading device that can easily recover from characters it could not recognize and correct reading errors.

In recent years, character reading devices that automatically read and recognize characters and marks printed on forms and similar documents have come into wide use.

In devices of this type, a recovery measure is generally provided for characters that cannot be determined: the character is read again. However, recovery measures such as re-reading are almost never applied to reading errors or misjudgments, so an erroneous recognition result is output as it is. Various countermeasures have therefore been tried: quantizing the character reading information at multiple levels, increasing the number of bits per character pattern to raise processing accuracy, analyzing the character pattern structurally, and using error correction codes in combination. Such measures can reduce reading errors significantly, but they undeniably make the processing more sophisticated and complex. The device configuration in turn becomes complicated and expensive, so that the device lacks versatility. Furthermore, even when the same character is re-read as a recovery measure for an undeterminable character, the re-reading control is cumbersome, and recognition processing is interrupted during that period, which lowers reading and recognition efficiency.

The present invention has been made in view of these circumstances. Its object is to provide a highly reliable and practical character reading device that can simply and effectively recover from undeterminable characters and correct reading errors.

That is, in the present invention the same character information is read a plurality of times in advance, and preprocessing recognition of each reading yields candidate characters and their similarity information, which are stored in a memory. Using this stored information, comprehensive recognition processing of the character information is then carried out step by step, for example by comparing the similarity information and applying majority logic to the recognized candidate character data, and the above object is thereby effectively achieved. An embodiment of the present invention is described below with reference to the drawings.

FIG. 1 is a block diagram showing the schematic configuration of the apparatus of the embodiment.

Character information printed on a recording medium such as a form is optically detected by the photoelectric conversion unit 1, photoelectrically converted, and output as an electrical signal. This electrical signal has a level corresponding to the density of the character information, as shown for example in FIG. 2a, and is supplied to the quantization circuit 2. The quantization circuit 2 sets a quantization level for the image signal, indicated by level TH in FIG. 2a, and binarizes the image signal at this level TH as shown in FIG. 2b. This quantization is performed over the entire pattern of the character information, whereby the character pattern is extracted.

The character pattern of one character, detected by the photoelectric conversion unit 1 and quantized by the quantization circuit 2, is led to the preprocessing circuit 3 for preprocessing recognition. The preprocessing circuit 3 obtains candidate character data for the read character by pattern matching, pattern analysis, or the like, and may be constituted by the preprocessing circuit of any of the various recognition devices proposed to date. The preprocessing circuit 3 calculates the similarity between the character pattern and each of various candidate characters, obtains first candidate character data and second candidate character data from the similarity results, and writes them into the memory 4 together with their similarity data.

This preprocessing recognition and the writing of its results into the memory 4 are performed sequentially for one line of character information printed on the recording medium. When reading of the line is finished, the line is read again with a different quantization level in the quantization circuit 2, so that the same character is read a plurality of times. The data of each recognition result are written in sequence into, and held in, predetermined address areas of the memory 4 that are set in correspondence with the characters of the line. The recognition result data stored in the memory 4 are then read out character by character and supplied to the edit determination unit 5.
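The repeated reading at different quantization levels can be sketched as follows. This is a minimal illustration in Python, not the circuit of the embodiment: the function names, the representation of a character signal as a list of density levels, and the sample threshold values are all assumptions.

```python
def binarize(signal, threshold):
    """Binarize a density signal: 1 where the level reaches the threshold, else 0."""
    return [1 if level >= threshold else 0 for level in signal]


def read_line(line_signals, thresholds):
    """Return patterns[r][c]: the binary pattern of character c in reading r.

    Each reading r re-scans the whole line with its own quantization level,
    so every character ends up with one pattern per threshold.
    """
    return [[binarize(sig, th) for sig in line_signals] for th in thresholds]


# Hypothetical density samples for a two-character line.
line = [[10, 60, 90, 40], [5, 55, 45, 80]]
patterns = read_line(line, thresholds=[50, 70])
```

Indexing the result by reading first and character position second mirrors the line-by-line scanning order described above; each pattern would then go to the preprocessing recognition stage.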

In the edit determination unit 5, the primary determination unit 6 performs primary determination processing, consisting mainly of a comparison of the similarity information, on the recognition result data for the same character, and the recognition result thus obtained (an intermediate result) is temporarily stored in the buffer 7. The secondary determination unit 8 then refers to the data stored in the buffer 7 and performs further recognition processing on these result data, for example by majority logic. The determination result of the secondary determination unit 8 is supplied to the synthesis circuit 9 and also to the tertiary determination unit 10, which performs recognition determination once more. The synthesis circuit 9 selectively extracts either the determination result of the tertiary determination unit 10 or that of the secondary determination unit 8 and stores it in the buffer 11 as the final determination result. That is, when the secondary determination yields a sufficiently reliable result, that result is stored in the buffer 11; when its reliability is not sufficiently ensured, the result obtained by the tertiary determination unit 10 is stored in the buffer 11 as the final result.

Next, a specific example of character reading and recognition by the above apparatus is described.

FIG. 3 shows a form 12 used for character reading. Character marks 14 are printed in a predetermined print area 13 of the form 12. One line of character marks 14 is illustrated here, but there may be a plurality of lines. The character marks 14 are photoelectrically converted and detected in sequence, for example from the leftmost character, and the recognition data obtained for them by the preprocessing recognition circuit 3 are stored in sequence in predetermined data areas of the memory 4. As explained above, this reading is repeated a plurality of times, for example four times, over the whole line of character marks 14.

FIG. 4 shows the recognition data stored in the memory 4 and the procedure of recognition determination using these data. The photoelectrically detected and quantized character reading data are subjected to preprocessing recognition, by pattern matching or the like, in the preprocessing recognition circuit 3. This preprocessing recognition calculates the similarity of the read character pattern to each of a plurality of similar candidate characters. On the basis of this similarity information, a first candidate character and a second candidate character for the read character pattern are selected as recognition data: the first candidate character is the one with the maximum similarity, and the second candidate character is the one with the next-largest similarity.

In this specific example, for the first character read, "A", the first candidate character "A" is selected from the maximum similarity value (3563) and the second candidate character "B" from the next-largest similarity value (2331). These candidate characters are stored, together with their similarity values, at predetermined addresses of the memory 4. For the second character read, "B", the first candidate character "B" is selected from the maximum similarity value (4691) and the second candidate character "8" from the next-largest similarity value (3021). Each character of the line is preprocessed in this way; for the last character of the line, "Z", the first candidate character "Z" is selected from the maximum similarity value (3316) and the second candidate character "2" from the next-largest similarity value (261E). These candidate characters are stored in regular sequence, together with their similarity values, at predetermined addresses of the memory 4, so that the predetermined address area of the memory 4 holds each candidate character and its similarity value as shown at (1) in FIG. 4. The character marks 14 are then read again with a different quantization level in the quantization circuit 2.
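As a rough sketch of this candidate selection and storage, the following Python fragment picks the two most similar candidates and files them under the reading number and character position. The function names and the dictionary standing in for memory 4 are assumptions, as are reading the first character's candidates as "A" and "B" and treating the quoted similarity values as hexadecimal; the third candidate "H" is invented purely for illustration.

```python
def top_two(similarities):
    """Return the (character, similarity) pairs with the largest and next-largest similarity."""
    ranked = sorted(similarities.items(), key=lambda item: item[1], reverse=True)
    return ranked[0], ranked[1]


def store(memory, reading, position, similarities):
    """Record the first and second candidates of one character for one reading."""
    memory[(reading, position)] = top_two(similarities)


memory = {}  # stands in for memory 4; keyed by (reading index, character position)
# First reading, first character: "A" and "B" use the values quoted in the text;
# the entry for "H" is a made-up third candidate.
store(memory, 0, 0, {"A": 0x3563, "B": 0x2331, "H": 0x1200})
```

Keying by reading and position keeps the results of later readings of the same character at predictable locations, which is what the later determination stages rely on.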

In this second reading, for the first character "A", the first candidate character "A" may be obtained with the maximum similarity value (3F76) and the second candidate character "7" with the next-largest similarity value (1706), so that the result can differ from that of the first reading. Regardless of such differences, every character of the second reading is preprocessed and the results are stored in sequence in a predetermined address area of the memory 4, as shown in the corresponding section of FIG. 4. This address area corresponds to the earlier one, and a fixed address relationship is of course established for each read character. The reading and recognition are repeated in the same way for the remaining readings, and each candidate character and its similarity value are stored in sequence in the address-corresponding areas of the memory 4, as shown in the respective sections of FIG. 4.

Next, on the basis of the candidate characters and similarity values stored in the memory 4 in this way, the read characters are recognized through primary, secondary, and tertiary determination.

First, the primary determination unit 6 refers to the similarity values of the first and second candidate characters for each character and determines whether to extract the first candidate character. For the first character "A" in the first reading, the difference between the similarity value (3563) of the first candidate character "A" and the similarity value (2331) of the second candidate character "B" is large, so the two candidates are judged not to resemble each other, and the first candidate character "A" is extracted and stored at a predetermined address of the buffer 7, as shown in FIG. 4. When, as for the final character "Z" in the second reading, the similarity value (35F1) of the first candidate character "Z" and the similarity value (3596) of the second candidate character "2" are very close, the character is treated as undeterminable and an undeterminable mark "?" is stored at the predetermined address of the buffer 7. This primary determination is performed for all candidate characters, and the extracted primary candidate characters are stored in sequence in the buffer 7 in address correspondence. Information such as the similarity values and similarity differences may be stored together with them. The secondary determination unit 8 then performs secondary determination on the basis of the primary candidate characters held in the buffer 7 as intermediate results.
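The primary determination amounts to a margin test between the two best candidates. The sketch below is one way to express it in Python; the function name, the threshold `MIN_MARGIN`, and the hexadecimal reading of the similarity values are assumptions, since the source says only that the difference must be "large" and gives no numeric criterion.

```python
UNDETERMINED = "?"


def primary_determination(first, first_sim, second, second_sim, min_margin):
    """Extract the first candidate only if it beats the second by at least min_margin."""
    if first_sim - second_sim >= min_margin:
        return first
    return UNDETERMINED


MIN_MARGIN = 0x0400  # hypothetical threshold; the source gives no value

# First reading, first character: a wide gap, so the first candidate is extracted.
clear = primary_determination("A", 0x3563, "B", 0x2331, MIN_MARGIN)
# Second reading, last character: the two similarities are nearly equal.
close = primary_determination("Z", 0x35F1, "2", 0x3596, MIN_MARGIN)
```

Under these assumed values the first case yields the candidate itself and the second yields the undeterminable mark, matching the two outcomes described in the text.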

The secondary determination is made by majority logic over the candidate characters obtained repeatedly for the same character. That is, it depends on how many of the primary candidate characters extracted for the same character are identical. In this example, when all four candidate characters are the same, or when three are the same and only one differs, the candidate character in the majority is stored in the buffer 11 as the secondary candidate character, which in this case is the finally determined character. Conversely, when the four candidate characters all differ from one another, or when all four are undeterminable marks, an undeterminable mark (?) is stored in the buffer 11. Likewise, when the first candidate characters are split two against two, or when an undeterminable mark is included, an undeterminable mark (?) is temporarily stored in the buffer 11. The determination results are thus stored in the buffer 11 as shown in FIG. 4.

The tertiary determination unit 10 performs tertiary determination on the read characters marked as undeterminable (?). The tertiary determination basically follows the same logic as the secondary determination, but makes the similarity values contribute more strongly to the decision. That is, the undeterminable test is applied more strictly to the candidate character information stored in the memory 4. Specifically, only candidates for which the difference between the maximum and next-largest similarity values is still larger are taken as candidate determination characters, and the others are treated as undeterminable. As a result, for the second character in the third reading, for example, where "D" had been extracted as the first candidate character, the reading is now judged undeterminable. The second character in the fourth reading is likewise judged undeterminable. From two candidate characters "B" and two undeterminable marks, the tertiary determination unit 10 therefore concludes that the final candidate character for that character is "B". This result is supplied to the buffer 11 and stored in place of the earlier undeterminable information. The tertiary determination is performed for each character judged undeterminable by the secondary determination, and the buffer 11 finally holds the final determined characters as shown in FIG. 4.

The tertiary determination may still fail to resolve an undeterminable character. In that case the tertiary determination may be repeated with a different level for the undeterminable test, or the character may be output as undeterminable and an alarm issued. Because the intermediate results in the buffer are corrected by the tertiary determination, highly reliable character recognition results can be obtained.

As described above, the same character is read a plurality of times in advance, candidate characters are recognized from each set of quantized data, and the recognition results are stored in the memory; recognition of the character then proceeds in multiple stages (primary, secondary, and tertiary) on the basis of these results, so that undeterminable characters can be recovered very effectively.
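The secondary and tertiary stages, and the choice between their results, might be sketched as below. This is an illustrative Python reading of the description, not the patented circuit. The function names, both margin values, and the similarity values of the second to fourth readings are assumptions, and the rule that the tertiary stage votes only among the candidates surviving the stricter test is one interpretation of the "two B, two undeterminable" example.

```python
from collections import Counter

UNDETERMINED = "?"


def primary(reading, min_margin):
    """reading = (first, first_sim, second, second_sim); apply the margin test."""
    first, first_sim, _second, second_sim = reading
    return first if first_sim - second_sim >= min_margin else UNDETERMINED


def secondary(votes):
    """Accept a character only if no vote is undeterminable and it holds a strict majority."""
    if UNDETERMINED in votes:
        return UNDETERMINED
    char, count = Counter(votes).most_common(1)[0]
    return char if count > len(votes) / 2 else UNDETERMINED


def tertiary(readings, strict_margin):
    """Re-run the primary test with a stricter margin, then vote among the survivors."""
    votes = [primary(r, strict_margin) for r in readings]
    survivors = Counter(v for v in votes if v != UNDETERMINED).most_common(2)
    if not survivors:
        return UNDETERMINED
    if len(survivors) == 2 and survivors[0][1] == survivors[1][1]:
        return UNDETERMINED  # tie among the surviving candidates
    return survivors[0][0]


def recognize(readings, margin, strict_margin):
    """Use the secondary result when it is decisive, otherwise the tertiary result."""
    result = secondary([primary(r, margin) for r in readings])
    return result if result != UNDETERMINED else tertiary(readings, strict_margin)


# Four readings of the second character. Only the first tuple uses values
# quoted in the text; the other three are invented for illustration.
readings = [
    ("B", 0x4691, "8", 0x3021),
    ("B", 0x4500, "8", 0x3000),
    ("D", 0x3800, "B", 0x3000),
    ("8", 0x3900, "B", 0x3200),
]
final = recognize(readings, margin=0x0400, strict_margin=0x1000)
```

With these assumed numbers the loose margin lets "B", "B", "D", and "8" through, so the secondary vote has no strict majority; the stricter margin discards the two weak readings and the tertiary stage settles on "B".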

Moreover, a misread character can also be judged undeterminable or restored to its correct recognition state, so information processing that uses the read character information can be carried out effectively. In addition, the recognition processing runs smoothly, without the cumbersome control previously needed to pick out only the undeterminable characters and re-read them. Because the intermediate results of the candidate character information are put to use at each step, more evidence is available for the overall decision, and characters can be read on the basis of highly reliable recognition processing. The system configuration of the device does not become much more complicated, and the control system is simple, which are considerable practical advantages.

The present invention is not limited to the above embodiment. For example, the number of repeated readings of the same character, the characters in a line, and the like may be set according to the specifications, and the similarity value may be computed by an algorithm suited to whichever similarity method is used. It is convenient to set the quantization level with the background level taken into account. The algorithm for recognition determination from the candidate characters may also be chosen according to the specifications, and the majority logic and the like can be modified in various ways. In short, the present invention can be carried out with various modifications without departing from its gist.

BRIEF DESCRIPTION OF THE DRAWINGS

The drawings show an embodiment of the present invention. FIG. 1 is a schematic configuration diagram of the apparatus, FIG. 2 shows a photoelectric conversion signal and its quantized output, FIG. 3 shows an example of a character recording medium, and FIG. 4 shows an example of the recognition processing.

DESCRIPTION OF SYMBOLS

1: photoelectric conversion unit; 2: quantization circuit; 3: preprocessing recognition circuit; 4: memory; 5: edit determination unit; 6: primary determination unit; 7: buffer (intermediate results); 8: secondary determination unit; 9: synthesis circuit; 10: buffer (final results); 11: tertiary determination unit.

Claims

1. A character reading device comprising: photoelectric conversion means for optically detecting printed character information and outputting it as an electrical signal; quantization means for binarizing the electrical signal output from the photoelectric conversion means according to each of a plurality of different quantization levels, thereby outputting a plurality of character patterns for the same printed character; preprocessing recognition means for calculating the similarity between each of the character patterns output from the quantization means and various predetermined candidate characters, thereby outputting, for each character pattern, recognition data consisting of at least a first candidate character, a second candidate character, and their similarity values; primary determination means for comparing the similarity values of the first and second candidate characters in each set of recognition data output from the preprocessing recognition means, thereby determining, for each of the plurality of character patterns, whether to extract the first candidate character; and secondary determination means for determining the first candidate character corresponding to the printed character by majority logic over the first candidate characters extracted by the primary determination means, according to how many of the plurality are the same first candidate character.

2. The character reading device according to claim 1, wherein, when the majority logic of the secondary determination means does not determine a first candidate character corresponding to the printed character, the primary determination means determines whether to extract the first candidate character under a different condition for comparing the similarity values of the first and second candidate characters, and the secondary determination means repeats the majority logic on the first candidate characters extracted by the primary determination means, thereby determining the first candidate character corresponding to the printed character.