JP2007251518A

Movatterモバイル変換

Info

Publication number: JP2007251518A
Application number: JP2006071044A
Authority: JP
Inventors: Hirofumi Nishida; 広文西田
Original assignee: Ricoh Co Ltd
Current assignee: Ricoh Co Ltd
Priority date: 2006-03-15
Filing date: 2006-03-15
Publication date: 2007-09-27
Anticipated expiration: 2026-03-15
Also published as: JP4615462B2

Abstract

【課題】画像データの画像タイプに適した正規化処理を適用して、画像の歪みをもたらすことなく、画像データを理想的な表現に変換した形で送信または蓄積することができる画像処理装置を提供する。
【解決手段】レイアウトの概略（文字や写真・絵の大体の空間的配置や分布など）に基づいて計算された画像データの画像特徴量を用いて当該画像データの画像タイプを分類識別した後、分類結果及び画像タイプと画像正規化処理の対応規則を対応付けた情報に基づいて画像正規化処理方法を選択し、選択された画像正規化処理方法に基づいて画像データに対して正規化処理を施し、送信または蓄積する。これにより、レイアウトの概略に従うことで画像のタイプを特徴付ける画像特徴量を高速に計算することができるとともに、画像データの画像タイプに適した正規化処理を適用して、画像データを理想的な表現に変換した形で送信または蓄積することができる。
【選択図】図３
An image processing apparatus capable of transmitting or storing image data converted into an ideal representation without applying image distortion by applying a normalization process suitable for the image type of the image data. provide.
After classifying and identifying the image type of the image data using the image feature quantity of the image data calculated based on the outline of the layout (such as the approximate spatial arrangement and distribution of characters, photographs and pictures), The image normalization processing method is selected based on the information that associates the classification result and the correspondence type of the image type with the image normalization processing, and the normalization processing is performed on the image data based on the selected image normalization processing method. Apply, send or accumulate. This makes it possible to calculate image feature quantities that characterize the type of image at high speed by following the outline of the layout, and apply normalization processing suitable for the image type of the image data to represent the image data as an ideal representation. Can be sent or stored in a converted form.
[Selection] Figure 3

Description

Translated fromJapanese

本発明は、受け付けた画像データを蓄積または送信する際に、画像データを理想的な形に変換するための正規化処理を行なう画像処理装置、画像形成装置、プログラムおよび画像処理方法に関する。 The present invention relates to an image processing apparatus, an image forming apparatus, a program, and an image processing method that perform normalization processing for converting image data into an ideal form when storing or transmitting received image data.

従来、スキャナやデジタルカメラなどの様々な端末機器から画像データを含む文書データを取得し、この文書データを適切に配信するための文書管理システムが知られている。また、配信先としては、利用者の個人利用環境や共同蓄積管理環境がある。利用者の個人利用環境としては、例えば電子メールがあり、上記文書管理システムは設定された利用者の電子メールアドレスへ端末機器から受信した文書データを配信する。利用者は画像データを含む文書データをＰＣ（パーソナルコンピュータ）のディスプレイで閲覧したり、紙に出力したりして利用する。 Conventionally, document management systems for acquiring document data including image data from various terminal devices such as a scanner and a digital camera and appropriately distributing the document data are known. Distribution destinations include a user's personal use environment and a shared storage management environment. The user's personal use environment is e-mail, for example, and the document management system distributes the document data received from the terminal device to the set e-mail address of the user. A user browses document data including image data on a display of a PC (personal computer) or outputs it on paper.

共同蓄積管理環境としては、ＰＣ上にて文書データの蓄積管理を実現する文書管理サーバ等があり、サーバは設定された蓄積管理ソフトウェアへ文書データを配信する。利用者は文書管理サーバにアクセスすることで、画像データを含む文書データをＰＣのディスプレイで閲覧したり、紙に出力したりして利用する。 As a joint storage management environment, there is a document management server or the like that realizes storage management of document data on a PC, and the server distributes the document data to the set storage management software. By accessing the document management server, the user browses document data including image data on the display of the PC or outputs it on paper.

ところで、一般的に、印刷物はディザ法などによりドットパターンとして擬似中間調表現がなされ、したがって本来連続階調であるべき部分がこのドットパターンの持つ周波数特性を持つことになる。このような特性をもつ印刷物を、光学的にスキャンし、閲覧あるいは紙出力する場合、読み取り装置／閲覧装置／出力装置の持つそれぞれの周波数特性と干渉してモアレを生じさせる。 By the way, generally, a printed matter is expressed as a pseudo halftone as a dot pattern by a dither method or the like, and therefore, a portion that should originally be a continuous tone has the frequency characteristics of this dot pattern. When a printed matter having such characteristics is optically scanned and browsed or output on paper, it causes moiré by interfering with the respective frequency characteristics of the reading device / browsing device / output device.

また、スキャナやデジタルカメラなどの端末機器から印刷物をデジタル画像データとして入力する場合において、端末機器の設定が適切でない場合には、黒文字が薄かったり、文字と地肌のコントラストが貧弱であったりするために、文字の視認性が低下するといった、階調性の問題が生じることがある。 In addition, when inputting printed materials as digital image data from a terminal device such as a scanner or digital camera, if the settings of the terminal device are not appropriate, black characters may be thin or the contrast between characters and background may be poor. In addition, there may be a problem of gradation such that the visibility of characters is reduced.

さらに、印刷物の各ページの枠は、デジタル画像データにおいては直立長方形として入力されるべきであるが、実際には傾いて入力されることがある。このような画像を印刷・表示すると、ページ枠に平行であるべき文字行が傾いて再現されるために、読み手に不快感を与えたり、あるいは、文書画像を、文字領域、写真領域、図領域、表領域、罫線領域などに分割するレイアウト解析処理に弊害をもたらすことがある。 Further, the frame of each page of the printed material should be input as an upright rectangle in the digital image data, but may be input in an inclined manner in practice. When such an image is printed / displayed, the character lines that should be parallel to the page frame are reproduced in an inclined manner, so that the reader is uncomfortable or the document image is displayed in the character area, photo area, figure area. In some cases, the layout analysis process for dividing the table area, the ruled line area, or the like may be harmful.

ここで、文字情報が主体である印刷文書の場合における理想的な入力デジタル画像データとは、中間調部分が連続階調で表現されており、黒文字が十分に濃く、かつ、文字と地肌のコントラストが高く、さらに、各ページの枠が直立長方形となっているようなものである。 Here, the ideal input digital image data in the case of a printed document mainly consisting of character information means that the halftone part is expressed by continuous tone, the black character is sufficiently dark, and the contrast between the character and the background In addition, the frame of each page is like an upright rectangle.

そこで、従来においては、入力したデジタル文書画像データを蓄積したり、あるいは、ネットワークを通して遠隔地に送信する場合、あるいは、ネットワークを通して配信されてきたデジタル文書画像データを受信して、蓄積したり、さらに、遠隔地に送信する場合には、デジタル文書画像データを理想的な形に変換するための「正規化処理」を行なうようにしたものが提案されている。具体的には、デジタル文書画像データの蓄積・送信の際に、中間調ドットパターンの連続階調への補正処理（例えば、特許文献１参照）、文字と地肌間のコントラスト強調処理（例えば、特許文献２参照）、スキュー補正処理（例えば、特許文献３参照）、などを施すようにしている。これにより、蓄積・送信されたデジタル文書画像データは、印刷・閲覧・編集などの様々な用途について、容易に再利用することができるようになる。そうでなければ、蓄積・送信されたデジタル文書画像データを利用するたびに、補正処理を施さなければならなくなるという煩わしさが生じる。 Therefore, conventionally, the input digital document image data is accumulated, or when transmitted to a remote place through a network, or the digital document image data distributed through the network is received and accumulated, In the case of transmission to a remote place, there has been proposed one that performs “normalization processing” for converting digital document image data into an ideal form. Specifically, when digital document image data is stored / transmitted, a halftone dot pattern is corrected to a continuous tone (see, for example, Patent Document 1), and a contrast enhancement process between characters and background (for example, a patent). Reference 2), skew correction processing (see, for example, Patent Document 3), and the like are performed. As a result, the stored and transmitted digital document image data can be easily reused for various uses such as printing, browsing, and editing. Otherwise, every time the stored / transmitted digital document image data is used, there is a trouble that correction processing must be performed.

特に、ネットワークを通して配信されてきた画像を受信して処理する場合には、入力機器の特性や機器のパラメータ設定などの入力条件がわからないことが多いので、受信した画像から得られる統計量だけをもとに処理を行わなければならない。この実現手段として、特許文献４には、印刷原稿を画像入力機器でスキャンして得られたデジタル画像データを受信して蓄積または送信する文書画像伝達装置において、デジタル画像データに対して、デジタル画像データの持つ統計量に基づき該デジタル画像データの正規化処理を施す画像処理手段を備え、該画像処理手段による処理を施した後のデジタル画像データを蓄積または送信することを特徴とする発明が開示されている。 In particular, when receiving and processing images distributed over a network, input conditions such as input device characteristics and device parameter settings are often unknown, so only statistics obtained from received images can be obtained. And processing must be done. As a means for realizing this, Japanese Patent Application Laid-Open No. H10-228561 discloses a digital image for a digital image data in a document image transmission apparatus that receives and stores or transmits digital image data obtained by scanning a printed document with an image input device. Disclosed is an invention characterized by comprising image processing means for performing normalization processing of the digital image data based on statistics possessed by data, and storing or transmitting the digital image data after processing by the image processing means. Has been.

特開２００３−２８１５２６号公報JP 2003-281526 A特開２００５−１１０１８４号公報JP-A-2005-110184特許第３３０８０３２号公報Japanese Patent No. 33008032特開２００４−２９７７８６号公報JP 2004-297786 A

ところで、一口に印刷文書と言っても様々なものがある。新聞記事のように文字が主体のものもあれば、広告のように絵や写真が主体で、文字が少ないものもある。中には写真だけのページもある。 By the way, there are various types of printed documents. Some articles are mainly texts, such as newspaper articles, while others are mainly pictures and photos and few letters, like advertisements. Some pages are just photos.

上述した特許文献３に記載されているようなスキュー補正処理を行うには、補正パラメータの計算のために、統計的に十分な数の文字が画像中に存在することが必要である。ところが、写真が主体で、文字が少ないような画像にスキュー補正を施そうとすると、補正パラメータの計算を適切に行うことができないために、スキュー補正によって、かえって画像の歪みを生ずることがあり得る。 In order to perform the skew correction processing described inPatent Document 3 described above, it is necessary that a statistically sufficient number of characters exist in the image for calculation of correction parameters. However, if skew correction is performed on an image that is mainly a photograph and has a small number of characters, correction parameters cannot be calculated appropriately, and skew correction may cause image distortion. .

また、文字と地肌間のコントラスト強調処理は、文字が殆どないような画像においては意味をなさない。さらに、特許文献３に記載されているように、階調変換関数を決定するために文字が画像中に存在することが必要であるが、文字が殆どないような画像からは階調変換関数を適切に算出することができないために、コントラスト強調処理によってかえって画像の歪みを生ずることがあり得る。 Further, the contrast enhancement process between the character and the background does not make sense in an image having few characters. Further, as described inPatent Document 3, it is necessary that characters exist in the image in order to determine the gradation conversion function. However, the gradation conversion function is determined from an image having few characters. Since the image cannot be calculated appropriately, the contrast enhancement process may cause image distortion.

本発明は、上記に鑑みてなされたものであって、画像データの画像タイプに適した正規化処理を適用して、画像の歪みをもたらすことなく、画像データを理想的な表現に変換した形で送信または蓄積することができる画像処理装置、画像形成装置、プログラムおよび画像処理方法を提供することを目的とする。 The present invention has been made in view of the above, and is a form in which image data is converted into an ideal representation without applying image distortion by applying a normalization process suitable for the image type of the image data. It is an object of the present invention to provide an image processing apparatus, an image forming apparatus, a program, and an image processing method that can be transmitted or stored in a computer.

上述した課題を解決し、目的を達成するために、請求項１にかかる発明は、受け付けた画像データを蓄積または送信する際に、前記画像データを理想的な形に変換するための正規化処理を行なう画像処理装置において、前記画像データの画像特徴量を、レイアウトの概略に基づいて計算する画像特徴量計算手段と、この画像特徴量計算手段により計算された前記画像特徴量を用い、前記画像データの画像タイプを分類識別する画像タイプ識別手段と、画像タイプと画像正規化処理の対応規則を対応付けた情報を記憶する記憶手段と、前記画像タイプ識別手段による分類結果及び画像タイプと画像正規化処理の対応規則を対応付けた情報に基づいて、画像正規化処理方法を選択する選択手段と、この選択手段で選択された画像正規化処理方法に基づいて、前記画像データに対して正規化処理を施す正規化手段と、を備える。 In order to solve the above-described problems and achieve the object, the invention according toclaim 1 is a normalization process for converting the image data into an ideal form when storing or transmitting the received image data. In the image processing apparatus, the image feature amount calculating means for calculating the image feature amount of the image data based on the outline of the layout, and the image feature amount calculated by the image feature amount calculating means, the image feature amount is used. Image type identifying means for classifying and identifying the image type of data, storage means for storing information associating the correspondence rules of image types and image normalization processing, classification results and image types and image normalization by the image type identifying means Selection means for selecting an image normalization processing method based on information associated with correspondence rules for normalization processing, and an image normalization processing method selected by the selection means. And Zui, and a normalization means for performing normalization processing on the image data.

また、請求項２にかかる発明は、請求項１記載の画像処理装置において、前記画像特徴量計算手段は、前記画像データを矩形ブロックに排他的に分割するブロック分割手段と、分割された前記各ブロックを、当該画像データを構成する所定の構成要素に分類するブロック分類手段と、前記ブロックの分類結果に基づいて前記画像データの画像特徴量を計算する計算手段と、を備える。 According to a second aspect of the present invention, in the image processing apparatus according to the first aspect, the image feature amount calculating unit includes a block dividing unit that exclusively divides the image data into rectangular blocks, and each of the divided pieces. Block classification means for classifying the blocks into predetermined constituent elements constituting the image data; and calculation means for calculating an image feature amount of the image data based on the classification result of the blocks.

また、請求項３にかかる発明は、請求項２記載の画像処理装置において、前記ブロック分類手段は、前記ブロックから複数の異なる解像度の画像を生成する画像生成手段と、前記各解像度の画像から特徴量ベクトルを計算する特徴量ベクトル計算手段と、前記特徴量ベクトルに基づいて前記各ブロックを所定の構成要素に分類する分類手段と、を備える。 According to a third aspect of the present invention, in the image processing apparatus according to the second aspect, the block classification unit is characterized by an image generation unit that generates a plurality of images having different resolutions from the block, and the image of each resolution. A feature vector calculating unit that calculates a quantity vector; and a classifying unit that classifies the blocks into predetermined components based on the feature vector.

また、請求項４にかかる発明は、請求項３記載の画像処理装置において、前記特徴量ベクトル計算手段は、前記各解像度の画像を２値化する２値化手段と、２値画像の各々の画素について当該画素及びその近傍画素で構成する局所パターンの対応する画素の値を使って特徴を計算する画素特徴計算手段と、前記各画素について計算された特徴を画像全体にわたって加算する加算手段と、を備える。 According to a fourth aspect of the present invention, in the image processing apparatus according to the third aspect, the feature quantity vector calculating unit includes a binarizing unit that binarizes the image of each resolution and each of the binary images. A pixel feature calculation means for calculating a feature using a value of a corresponding pixel of a local pattern constituted by the pixel and its neighboring pixels, and an addition means for adding the feature calculated for each pixel over the entire image; Is provided.

また、請求項５にかかる発明は、請求項３記載の画像処理装置において、前記特徴量ベクトル計算手段は、前記各解像度の画像の各々の画素について当該画素及びその近傍画素で構成する局所パターンの対応する画素の値を使って特徴を計算する画素特徴計算手段と、前記各画素について計算された特徴を画像全体にわたって加算する加算手段と、を備える。 According to a fifth aspect of the present invention, in the image processing apparatus according to the third aspect, the feature amount vector calculating unit is configured to generate a local pattern composed of the pixel and its neighboring pixels for each pixel of the image of each resolution. Pixel feature calculation means for calculating a feature using the value of the corresponding pixel, and addition means for adding the feature calculated for each pixel over the entire image.

また、請求項６にかかる発明は、請求項３記載の画像処理装置において、前記分類手段は、前記特徴量ベクトル計算手段により計算された前記特徴量ベクトルを、予め計算されている文字画素の特徴量ベクトル及び非文字画素の特徴量ベクトルの線形結合に分解して、前記各ブロックを所定の構成要素に分類する。 According to a sixth aspect of the present invention, in the image processing apparatus according to the third aspect, the classification means uses the feature quantity vector calculated by the feature quantity vector calculation means as the feature of a character pixel calculated in advance. The blocks are classified into predetermined constituent elements by decomposing them into linear combinations of quantity vectors and feature quantity vectors of non-character pixels.

また、請求項７にかかる発明は、請求項１ないし６のいずれか一記載の画像処理装置において、前記正規化処理方法の一つは、前記画像データ中の中間調ドットパターンを、当該中間調ドットパターンの統計量に基づき連続階調に変換する中間調変換処理である。 According to a seventh aspect of the present invention, in the image processing apparatus according to any one of the first to sixth aspects, one of the normalization processing methods is to convert a halftone dot pattern in the image data into the halftone. This is a halftone conversion process for converting to continuous tone based on the statistic of the dot pattern.

また、請求項８にかかる発明は、請求項１ないし６のいずれか一記載の画像処理装置において、前記正規化処理方法の一つは、前記画像データ中の黒文字色と紙面色を、当該画像データの統計量に基づき推定し、推定した黒文字色と紙面色とをもとに階調補正を行う階調補正処理である。 According to an eighth aspect of the present invention, in the image processing apparatus according to any one of the first to sixth aspects, one of the normalization processing methods is configured to convert a black character color and a paper surface color in the image data into the image image. This is a gradation correction process that is estimated based on the statistics of data and performs gradation correction based on the estimated black character color and paper color.

また、請求項９にかかる発明は、請求項１ないし６のいずれか一記載の画像処理装置において、前記正規化処理方法の一つは、前記画像データの傾きを、当該画像データの統計量に基づき推定し、推定した傾きを補正するスキュー補正処理である。 According to a ninth aspect of the present invention, in the image processing apparatus according to any one of the first to sixth aspects, the normalization processing method uses the inclination of the image data as a statistic of the image data. This is a skew correction process that estimates based on this and corrects the estimated inclination.

また、請求項１０にかかる発明は、画像読取手段により読み取られた画像データを理想的な形に変換するための正規化処理を行ない、画像を用紙上に印刷する画像形成装置において、前記画像データの画像特徴量を、レイアウトの概略に基づいて計算する画像特徴量計算手段と、この画像特徴量計算手段により計算された前記画像特徴量を用い、前記画像データの画像タイプを分類識別する画像タイプ識別手段と、画像タイプと画像正規化処理の対応規則を対応付けた情報を記憶する記憶手段と、前記画像タイプ識別手段による分類結果及び画像タイプと画像正規化処理の対応規則を対応付けた情報に基づいて、画像正規化処理方法を選択する選択手段と、この選択手段で選択された画像正規化処理方法に基づいて、前記画像データに対して正規化処理を施す正規化手段と、を備える。 According to a tenth aspect of the present invention, in the image forming apparatus for performing normalization processing for converting the image data read by the image reading means into an ideal form and printing the image on a sheet, the image data The image feature quantity calculating means for calculating the image feature quantity of the image based on the outline of the layout, and the image type for classifying the image type of the image data using the image feature quantity calculated by the image feature quantity calculation means Information for associating identification means, storage means for storing information associating correspondence types of image types with image normalization processing, and information for associating classification results and image types with correspondence rules of image normalization processing by the image type identification means Based on the image normalization processing method, and the image normalization processing method selected by the selection means, based on the image data Comprising a normalization means for applying-normalized process, the.

また、請求項１１にかかる発明は、請求項１０記載の画像形成装置において、前記画像特徴量計算手段は、前記画像データを矩形ブロックに排他的に分割するブロック分割手段と、分割された前記各ブロックを、当該画像データを構成する所定の構成要素に分類するブロック分類手段と、前記ブロックの分類結果に基づいて前記画像データの画像特徴量を計算する計算手段と、を備える。 According to an eleventh aspect of the present invention, in the image forming apparatus according to the tenth aspect, the image feature amount calculating means includes a block dividing means for exclusively dividing the image data into rectangular blocks, and each of the divided pieces. Block classification means for classifying the blocks into predetermined constituent elements constituting the image data; and calculation means for calculating an image feature amount of the image data based on the classification result of the blocks.

また、請求項１２にかかる発明は、請求項１１記載の画像形成装置において、前記ブロック分類手段は、前記ブロックから複数の異なる解像度の画像を生成する画像生成手段と、前記各解像度の画像から特徴量ベクトルを計算する特徴量ベクトル計算手段と、前記特徴量ベクトルに基づいて前記各ブロックを所定の構成要素に分類する分類手段と、を備える。 According to a twelfth aspect of the present invention, in the image forming apparatus according to the eleventh aspect, the block classification unit is characterized by an image generation unit that generates a plurality of images having different resolutions from the block, and the image of each resolution. A feature vector calculating unit that calculates a quantity vector; and a classifying unit that classifies the blocks into predetermined components based on the feature vector.

また、請求項１３にかかる発明は、請求項１２記載の画像形成装置において、前記特徴量ベクトル計算手段は、前記各解像度の画像を２値化する２値化手段と、２値画像の各々の画素について当該画素及びその近傍画素で構成する局所パターンの対応する画素の値を使って特徴を計算する画素特徴計算手段と、前記各画素について計算された特徴を画像全体にわたって加算する加算手段と、を備える。 According to a thirteenth aspect of the present invention, in the image forming apparatus according to the twelfth aspect, the feature amount vector calculating unit includes a binarizing unit that binarizes the image of each resolution, and each of the binary images. A pixel feature calculation means for calculating a feature using a value of a corresponding pixel of a local pattern constituted by the pixel and its neighboring pixels, and an addition means for adding the feature calculated for each pixel over the entire image; Is provided.

また、請求項１４にかかる発明は、請求項１２記載の画像形成装置において、前記特徴量ベクトル計算手段は、前記各解像度の画像の各々の画素について当該画素及びその近傍画素で構成する局所パターンの対応する画素の値を使って特徴を計算する画素特徴計算手段と、前記各画素について計算された特徴を画像全体にわたって加算する加算手段と、を備える。 According to a fourteenth aspect of the present invention, in the image forming apparatus according to the twelfth aspect, the feature amount vector calculating unit is configured to generate a local pattern composed of the pixel and its neighboring pixels for each pixel of the image of each resolution. Pixel feature calculation means for calculating a feature using the value of the corresponding pixel, and addition means for adding the feature calculated for each pixel over the entire image.

また、請求項１５にかかる発明は、請求項１２記載の画像形成装置において、前記分類手段は、前記特徴量ベクトル計算手段により計算された前記特徴量ベクトルを、予め計算されている文字画素の特徴量ベクトル及び非文字画素の特徴量ベクトルの線形結合に分解して、前記各ブロックを所定の構成要素に分類する。 According to a fifteenth aspect of the present invention, in the image forming apparatus according to the twelfth aspect, the classification unit uses the feature vector calculated by the feature vector calculation unit as the feature of the character pixel calculated in advance. The blocks are classified into predetermined constituent elements by decomposing them into linear combinations of quantity vectors and feature quantity vectors of non-character pixels.

また、請求項１６にかかる発明は、請求項１０ないし１５のいずれか一記載の画像形成装置において、前記正規化処理方法の一つは、前記画像データ中の中間調ドットパターンを、当該中間調ドットパターンの統計量に基づき連続階調に変換する中間調変換処理である。 According to a sixteenth aspect of the present invention, in the image forming apparatus according to any one of the tenth to fifteenth aspects, one of the normalization processing methods is to convert a halftone dot pattern in the image data into the halftone. This is a halftone conversion process for converting to continuous tone based on the statistic of the dot pattern.

また、請求項１７にかかる発明は、請求項１０ないし１５のいずれか一記載の画像形成装置において、前記正規化処理方法の一つは、前記画像データ中の黒文字色と紙面色を、当該画像データの統計量に基づき推定し、推定した黒文字色と紙面色とをもとに階調補正を行う階調補正処理である。 According to a seventeenth aspect of the present invention, in the image forming apparatus according to any one of the tenth to fifteenth aspects, one of the normalization processing methods is to convert the black character color and the paper surface color in the image data into the image image. This is a gradation correction process that is estimated based on the statistics of data and performs gradation correction based on the estimated black character color and paper color.

また、請求項１８にかかる発明は、請求項１０ないし１５のいずれか一記載の画像形成装置において、前記正規化処理方法の一つは、前記画像データの傾きを、当該画像データの統計量に基づき推定し、推定した傾きを補正するスキュー補正処理である。 According to an eighteenth aspect of the present invention, in the image forming apparatus according to any one of the tenth to fifteenth aspects, one of the normalization processing methods uses the inclination of the image data as a statistic of the image data. This is a skew correction process that estimates based on this and corrects the estimated inclination.

また、請求項１９にかかる発明は、受け付けた画像データを蓄積または送信する際に、前記画像データを理想的な形に変換するための正規化処理をコンピュータに実行させるプログラムであって、前記コンピュータに、前記画像データの画像特徴量を、レイアウトの概略に基づいて計算する画像特徴量計算機能と、この画像特徴量計算機能により計算された前記画像特徴量を用い、前記画像データの画像タイプを分類識別する画像タイプ識別機能と、画像タイプと画像正規化処理の対応規則を対応付けた情報を記憶する記憶機能と、前記画像タイプ識別機能による分類結果及び画像タイプと画像正規化処理の対応規則を対応付けた情報に基づいて、画像正規化処理方法を選択する選択機能と、この選択機能で選択された画像正規化処理方法に基づいて、前記画像データに対して正規化処理を施す正規化機能と、を実行させる。 The invention according to claim 19 is a program for causing a computer to execute normalization processing for converting the image data into an ideal form when storing or transmitting the received image data. In addition, the image feature amount calculation function for calculating the image feature amount of the image data based on the outline of the layout, and the image feature amount calculated by the image feature amount calculation function, the image type of the image data is Image type identification function for classifying identification, storage function for storing information in which correspondence rules for image types and image normalization processing are associated, classification results by the image type identification function, and correspondence rules for image types and image normalization processing A selection function for selecting an image normalization processing method based on information associated with the image normalization processing method and an image normalization processing method selected by the selection function. Zui and to execute, and normalized function of applying normalization process on the image data.

また、請求項２０にかかる発明は、受け付けた画像データを蓄積または送信する際に、前記画像データを理想的な形に変換するための正規化処理を実行するコンピュータにおける画像処理方法であって、前記画像データの画像特徴量を、レイアウトの概略に基づいて計算する画像特徴量計算工程と、この画像特徴量計算工程により計算された前記画像特徴量を用い、前記画像データの画像タイプを分類識別する画像タイプ識別工程と、画像タイプと画像正規化処理の対応規則を対応付けた情報を記憶する記憶工程と、前記画像タイプ識別工程による分類結果及び画像タイプと画像正規化処理の対応規則を対応付けた情報に基づいて、画像正規化処理方法を選択する選択工程と、この選択工程で選択された画像正規化処理方法に基づいて、前記画像データに対して正規化処理を施す正規化工程と、を含む。 The invention according toclaim 20 is an image processing method in a computer that executes normalization processing for converting the image data into an ideal form when storing or transmitting received image data. An image feature amount calculating step for calculating an image feature amount of the image data based on an outline of the layout, and using the image feature amount calculated by the image feature amount calculating step, classifying and identifying an image type of the image data Corresponding image type identification process to be performed, storage process for storing information that associates correspondence type of image type and image normalization process, classification result by image type identification process and correspondence type of image type and image normalization process A selection step of selecting an image normalization processing method based on the attached information, and the image normalization processing method selected in this selection step, Including the normalized step of applying normalization processing to the image data.

請求項１，１９，２０にかかる発明によれば、レイアウトの概略（文字や写真・絵の大体の空間的配置や分布など）に基づいて計算された画像データの画像特徴量を用いて当該画像データの画像タイプが分類識別された後、分類結果及び画像タイプと画像正規化処理の対応規則を対応付けた情報に基づいて画像正規化処理方法が選択され、選択された画像正規化処理方法に基づいて画像データに対して正規化処理が施され、送信または蓄積される。これにより、レイアウトの概略（文字や写真・絵の大体の空間的配置や文字と写真・絵の分布など）に従うことで画像のタイプを特徴付ける画像特徴量を高速に計算することができるとともに、画像データの画像タイプに適した正規化処理を適用して、画像の歪みをもたらすことなく、画像データを理想的な表現に変換した形で送信または蓄積することができるという効果を奏する。 According to the inventions according toclaims 1, 19, and 20, the image is used by using the image feature amount of the image data calculated based on the outline of the layout (generally the spatial arrangement and distribution of characters, pictures and pictures). After the image type of the data is classified and identified, the image normalization processing method is selected based on the classification result and the information that associates the correspondence type of the image type and the image normalization processing, and the selected image normalization processing method Based on this, normalization processing is performed on the image data and transmitted or stored. As a result, image features that characterize the type of image can be calculated at high speed by following the outline of the layout (such as the spatial arrangement of characters, photos, and pictures, and the distribution of characters, photos, and pictures). By applying a normalization process suitable for the image type of the data, there is an effect that the image data can be transmitted or accumulated in a form converted into an ideal expression without causing distortion of the image.

また、請求項２，１１にかかる発明によれば、文字や写真・絵の大体の空間的配置、文字と写真・絵の分布などのレイアウトの概略をブロック単位で取得することができるので、文書画像データの画像特徴量を簡潔に計算することができるという効果を奏する。 Further, according to the second and eleventh aspects of the present invention, it is possible to obtain an outline of the layout of characters, photographs / pictures in general, the layout of characters and photographs / pictures, etc., in units of blocks. There is an effect that the image feature amount of the image data can be simply calculated.

また、請求項３，１２にかかる発明によれば、画像の粗い特徴と細かい特徴を表す特徴を効率的に抽出することができるという効果を奏する。 Further, according to the third and twelfth aspects of the present invention, there is an effect that it is possible to efficiently extract a rough feature and a feature representing a fine feature of an image.

また、請求項４，１３にかかる発明によれば、画像データにおける黒画素と白画素の局所的配置を表す表現力の高い統計的情報を効率的に計算することができるという効果を奏する。 Further, according to the fourth and thirteenth aspects, there is an effect that it is possible to efficiently calculate statistical information having high expressive power representing the local arrangement of black pixels and white pixels in image data.

また、請求項５，１４にかかる発明によれば、画像データにおける黒画素と白画素の局所的配置を表す表現力の高い統計的情報を効率的に計算することができるという効果を奏する。 In addition, according to the fifth and fourteenth aspects, there is an effect that it is possible to efficiently calculate statistical information having high expressive power representing the local arrangement of black pixels and white pixels in image data.

また、請求項６，１５にかかる発明によれば、文字や絵（非文字）の分布に応じた文書画像データの分類線形演算により簡単に行うことができるという効果を奏する。 Further, according to the sixth and fifteenth inventions, there is an effect that it can be easily performed by classification linear calculation of document image data in accordance with the distribution of characters and pictures (non-characters).

また、請求項７，１６にかかる発明によれば、ドットパターンの持つ周波数特性を持つ印刷物を、光学的にスキャンし、閲覧あるいは紙出力する場合、読み取り装置／閲覧装置／出力装置の持つそれぞれの周波数特性と干渉して生じるモアレ現象を防止することができるという効果を奏する。 According to the inventions according toclaims 7 and 16, when the printed matter having the frequency characteristics of the dot pattern is optically scanned and browsed or outputted, each of the reading device / browsing device / output device has There is an effect that a moire phenomenon caused by interference with the frequency characteristic can be prevented.

また、請求項８，１７にかかる発明によれば、文字と地肌のコントラストが貧弱であったりするために、文字の視認性が低下するといった、階調性の問題を防止することができるという効果を奏する。 In addition, according to the inventions according toclaims 8 and 17, it is possible to prevent a gradation problem such as a decrease in the visibility of characters due to poor contrast between the characters and the background. Play.

また、請求項９，１８にかかる発明によれば、ページ枠に平行であるべき文字行が傾いて再現されるために生じる、読み手への不快感や、レイアウト解析処理への弊害を防止することができる。 In addition, according to the inventions according to claims 9 and 18, it is possible to prevent discomfort to the reader and adverse effects on the layout analysis processing, which are caused when the character lines that should be parallel to the page frame are reproduced with an inclination. Can do.

また、請求項１０にかかる発明によれば、レイアウトの概略（文字や写真・絵の大体の空間的配置や分布など）に基づいて計算された画像データの画像特徴量を用いて当該画像データの画像タイプが分類識別された後、分類結果及び画像タイプと画像正規化処理の対応規則を対応付けた情報に基づいて画像正規化処理方法が選択され、選択された画像正規化処理方法に基づいて画像データに対して正規化処理が施され、正規化処理が施された画像が用紙上に印刷される。これにより、レイアウトの概略（文字や写真・絵の大体の空間的配置や文字と写真・絵の分布など）に従うことで画像のタイプを特徴付ける画像特徴量を高速に計算することができるとともに、画像データの画像タイプに適した正規化処理を適用して、画像の歪みをもたらすことなく、画像データを理想的な表現に変換した形で画像を用紙上に印刷することができるという効果を奏する。 According to the invention ofclaim 10, the image feature of the image data is calculated using the image feature amount of the image data calculated based on the outline of the layout (generally the spatial arrangement and distribution of characters, pictures and pictures). After the image type is classified and identified, an image normalization processing method is selected based on the classification result and information that associates the correspondence type of the image type with the image normalization processing, and based on the selected image normalization processing method Normalization processing is performed on the image data, and the image subjected to the normalization processing is printed on paper. As a result, image features that characterize the type of image can be calculated at high speed by following the outline of the layout (such as the spatial arrangement of characters, photos, and pictures, and the distribution of characters, photos, and pictures). By applying a normalization process suitable for the image type of data, the image can be printed on paper in a form in which the image data is converted into an ideal expression without causing distortion of the image.

［第１の実施の形態］
本発明の第１の実施の形態を図１ないし図１４に基づいて説明する。[First Embodiment]
A first embodiment of the present invention will be described with reference to FIGS.

図１は、本発明の第１の実施の形態にかかる画像処理装置１の電気的な接続を示すブロック図である。図１に示すように、画像処理装置１は、ＰＣ（Personal Computer）などのコンピュータであり、画像処理装置１の各部を集中的に制御するＣＰＵ（Central Processing Unit）２、情報を格納するＲＯＭ（Read Only Memory）３及びＲＡＭ（Random Access Memory）４等の一次記憶装置５、データファイル（例えば、カラービットマップ画像データ）を記憶する記憶部であるＨＤＤ（Hard Disk Drive）６等の二次記憶装置７、情報を保管したり外部に情報を配布したり外部から情報を入手するためのＣＤ−ＲＯＭドライブ等のリムーバブルディスク装置８、ネットワーク９を介して外部の他のコンピュータと通信により情報を伝達するためのネットワークインターフェース１０、処理経過や結果等を操作者に表示するＣＲＴ（Cathode Ray Tube）やＬＣＤ（Liquid Crystal Display）等の表示装置１１、並びに操作者がＣＰＵ２に命令や情報等を入力するためのキーボード１２、マウス等のポインティングデバイス１３等から構成されており、これらの各部間で送受信されるデータをバスコントローラ１４が調停して動作する。 FIG. 1 is a block diagram showing electrical connections of theimage processing apparatus 1 according to the first embodiment of the present invention. As shown in FIG. 1, animage processing apparatus 1 is a computer such as a PC (Personal Computer), and includes a CPU (Central Processing Unit) 2 that centrally controls each unit of theimage processing apparatus 1 and a ROM ( Secondary storage such asprimary storage device 5 such as Read Only Memory (RAM) 3 and RAM (Random Access Memory) 4 and HDD (Hard Disk Drive) 6 that is a storage unit for storing data files (for example, color bitmap image data). Information is transmitted by communication with other external computers via thedevice 7, aremovable disk device 8 such as a CD-ROM drive for storing information, distributing information to the outside, and obtaining information from the outside, and a network 9.Network interface 10, CRT (Cathode Ray Tube), LCD (Liquid Crystal Display), etc. for displaying the process progress and results to the operator It comprises adisplay device 11 and akeyboard 12 for an operator to input commands and information to theCPU 2, apointing device 13 such as a mouse, etc., and the bus controller 14 arbitrates data transmitted and received between these components. Works.

なお、本実施の形態においては、画像処理装置１として一般的なパーソナルコンピュータを適用して説明しているが、これに限るものではなく、ＰＤＡ（Personal Digital Assistants）と称される携帯用情報端末、palmTopＰＣ、携帯電話、ＰＨＳ（Personal Handyphone System）等であっても良い。 In the present embodiment, a general personal computer is applied as theimage processing apparatus 1. However, the present invention is not limited to this, and a portable information terminal called PDA (Personal Digital Assistants). , PalmTopPC, mobile phone, PHS (Personal Handyphone System), etc.

このような画像処理装置１では、ユーザが電源を投入するとＣＰＵ２がＲＯＭ３内のローダーというプログラムを起動させ、ＨＤＤ６よりオペレーティングシステムというコンピュータのハードウェアとソフトウェアとを管理するプログラムをＲＡＭ７に読み込み、このオペレーティングシステムを起動させる。このようなオペレーティングシステムは、ユーザの操作に応じてプログラムを起動したり、情報を読み込んだり、保存を行ったりする。オペレーティングシステムのうち代表的なものとしては、Ｗｉｎｄｏｗｓ（登録商標）、ＵＮＩＸ（登録商標）等が知られている。これらのオペレーティングシステム上で走る動作プログラムをアプリケーションプログラムと呼んでいる。 In such animage processing apparatus 1, when the user turns on the power, theCPU 2 activates a program called a loader in theROM 3, loads a program for managing the computer hardware and software called the operating system from theHDD 6 into theRAM 7, and Start the system. Such an operating system starts a program, reads information, and performs storage according to a user operation. As typical operating systems, Windows (registered trademark), UNIX (registered trademark), and the like are known. An operation program running on these operating systems is called an application program.

ここで、画像処理装置１は、アプリケーションプログラムとして、画像処理プログラムをＨＤＤ６に記憶している。この意味で、ＨＤＤ６は、画像処理プログラムを記憶する記憶媒体として機能する。 Here, theimage processing apparatus 1 stores an image processing program in theHDD 6 as an application program. In this sense, theHDD 6 functions as a storage medium that stores the image processing program.

また、一般的には、画像処理装置１のＨＤＤ６等の二次記憶装置７にインストールされるアプリケーションプログラムは、ＣＤ−ＲＯＭやＤＶＤ−ＲＯＭ等の光情報記録メディアやＦＤ等の磁気メディア等の記憶媒体８ａに記録され、この記憶媒体８ａに記録されたアプリケーションプログラムがＨＤＤ６等の二次記憶装置７にインストールされる。このため、ＣＤ−ＲＯＭ等の光情報記録メディアやＦＤ等の磁気メディア等の可搬性を有する記憶媒体８ａも、画像処理プログラムを記憶する記憶媒体となり得る。さらには、画像処理プログラムを、インターネット等のネットワークに接続されたコンピュータ上に格納し、例えばネットワークインターフェース１０を介して外部からダウンロードさせることにより、ＨＤＤ６等の二次記憶装置７にインストールするように構成しても良い。また、本実施の形態の画像処理装置１で実行される画像処理プログラムをインターネット等のネットワーク経由で提供または配布するように構成しても良い。 In general, the application program installed in thesecondary storage device 7 such as theHDD 6 of theimage processing apparatus 1 is stored in an optical information recording medium such as a CD-ROM or DVD-ROM, or a magnetic medium such as an FD. The application program recorded on the medium 8 a and recorded on thestorage medium 8 a is installed in thesecondary storage device 7 such as theHDD 6. Therefore, theportable storage medium 8a such as an optical information recording medium such as a CD-ROM or a magnetic medium such as an FD can also be a storage medium for storing an image processing program. Further, the image processing program is stored on a computer connected to a network such as the Internet, and is installed in thesecondary storage device 7 such as theHDD 6 by being downloaded from the outside via thenetwork interface 10, for example. You may do it. The image processing program executed by theimage processing apparatus 1 according to the present embodiment may be provided or distributed via a network such as the Internet.

画像処理装置１は、オペレーティングシステム上で動作する画像処理プログラムが起動すると、この画像処理プログラムに従い、ＣＰＵ２が各種の演算処理を実行して各部を集中的に制御する。画像処理装置１のＣＰＵ２が実行する各種の演算処理のうち、本実施の形態の特長的な処理である画像データの蓄積／送信の際の正規化処理について以下に説明する。ここで、正規化処理とは、画像処理装置１に接続された外部機器（例えば、スキャナやデジタルカメラなど）やネットワーク９を通じて受け付けたデジタル画像データを理想的な形に変換するための処理である。 In theimage processing apparatus 1, when an image processing program that operates on an operating system is started, theCPU 2 executes various arithmetic processes according to the image processing program and centrally controls each unit. Of various types of arithmetic processing executed by theCPU 2 of theimage processing apparatus 1, normalization processing at the time of image data accumulation / transmission, which is a characteristic processing of the present embodiment, will be described below. Here, the normalization process is a process for converting digital image data received through an external device (for example, a scanner or a digital camera) connected to theimage processing apparatus 1 or the network 9 into an ideal form. .

なお、リアルタイム性が重要視される場合には、処理を高速化する必要がある。そのためには、論理回路（図示せず）を別途設け、論理回路の動作により各種の演算処理を実行するようにするのが望ましい。 In addition, when real-time property is regarded as important, it is necessary to speed up the processing. For this purpose, it is desirable to separately provide a logic circuit (not shown) and execute various arithmetic processes by the operation of the logic circuit.

ここで、画像処理装置１のＣＰＵ２が実行する画像データの蓄積／送信の際の正規化処理について説明する。図２は画像処理装置１のＣＰＵ２が実行する画像データの蓄積／送信の際の正規化処理にかかる機能を示す機能ブロック図、図３はその流れを概略的に示すフローチャートである。図２に示すように、画像処理装置１は、画像入力処理部２１と、画像特徴量計算部２２と、画像タイプ識別部２３と、画像正規化処理方法の選択部２４と、画像正規化処理部２５と、記憶部２６と、画像蓄積／送信処理部２７とを備えている。以下において、各構成部の動作と作用を詳述する。 Here, normalization processing at the time of accumulation / transmission of image data executed by theCPU 2 of theimage processing apparatus 1 will be described. FIG. 2 is a functional block diagram showing functions related to normalization processing at the time of image data storage / transmission executed by theCPU 2 of theimage processing apparatus 1, and FIG. 3 is a flowchart schematically showing the flow thereof. As illustrated in FIG. 2, theimage processing apparatus 1 includes an imageinput processing unit 21, an image feature amount calculation unit 22, an imagetype identification unit 23, an image normalization processingmethod selection unit 24, and an image normalization process.Unit 25,storage unit 26, and image accumulation /transmission processing unit 27. Hereinafter, the operation and action of each component will be described in detail.

画像入力処理部２１は、画像処理装置１に接続された外部機器（例えば、スキャナやデジタルカメラなど）やネットワーク９を通じて、デジタル画像データの入力を受け付ける。ここで、デジタル画像データは、例えば印刷物を光学的にスキャンしてデジタル変換したものである。 The imageinput processing unit 21 receives input of digital image data through an external device (for example, a scanner or a digital camera) connected to theimage processing apparatus 1 or the network 9. Here, the digital image data is, for example, digitally converted by optically scanning a printed material.

画像特徴量計算部２２は、画像特徴量計算手段として機能するものであって、画像全体の特徴量を出力するものである。図４は、画像特徴量計算部２２における画像特徴量計算処理の流れを概略的に示すフローチャートである。図４に示すように、まず、入力した画像を同じ大きさの矩形ブロックに排他的に分割し（ステップＳ１：ブロック分割手段）、各ブロックを、“絵”“文字”“他”の３種類のいずれかに分類する（ステップＳ２：ブロック分類手段）。次に、すべてのブロックの分類結果をもとに画像全体の画像特徴量を計算する（ステップＳ３：計算手段）。最後に、画像全体の画像特徴量を出力する（ステップＳ４）。以下において、各ステップの動作を説明する。 The image feature quantity calculation unit 22 functions as an image feature quantity calculation unit, and outputs the feature quantity of the entire image. FIG. 4 is a flowchart schematically showing the flow of the image feature quantity calculation processing in the image feature quantity calculator 22. As shown in FIG. 4, first, the input image is exclusively divided into rectangular blocks of the same size (step S1: block dividing means), and each block is divided into three types: “picture”, “character”, and “other”. (Step S2: block classification means). Next, the image feature amount of the entire image is calculated based on the classification result of all the blocks (step S3: calculation means). Finally, the image feature amount of the entire image is output (step S4). Hereinafter, the operation of each step will be described.

（１）ブロック分割（ステップＳ１）
入力画像を同じサイズのブロック、たとえば、１ｃｍ×１ｃｍ（解像度が２００ｄｐｉであれば８０画素×８０画素、解像度が３００ｄｐｉであれば１２０画素×高さ１２０画素）の矩形に分割する。(1) Block division (step S1)
The input image is divided into blocks of the same size, for example, 1 cm × 1 cm (80 pixels × 80 pixels if the resolution is 200 dpi, 120 pixels × 120 pixels if the resolution is 300 dpi).

（２）ブロックの分類（ステップＳ２）
各ブロックを、“絵”“文字”“他”の３種類のいずれかに分類する。この処理のフローを図５に示し、以下において詳述する。(2) Block classification (step S2)
Each block is classified into one of three types of “picture”, “character”, and “other”. The flow of this process is shown in FIG. 5 and will be described in detail below.

図５に示すように、まず、処理対象となるブロック画像を１００ｄｐｉ程度の低解像度に縮小した画像Ｉを生成するとともに（ステップＳ１１：画像生成手段）、解像度のレベル数Ｌを設定し（ステップＳ１２）、解像度縮小レベルｋを初期化（ｋ←０）する（ステップＳ１３）。このようなステップＳ１１〜Ｓ１３の処理を行うのは、図６に示すように、画像Ｉとともに、さらに低解像度化した画像からも特徴を抽出するためである。詳細は後述するが、例えば、解像度レベル数Ｌを２にした場合には、画像Ｉと、解像度が１／２の画像Ｉ₁と、解像度が１／４の画像の画像Ｉ₂との計３つの画像から特徴を抽出する。As shown in FIG. 5, first, an image I obtained by reducing a block image to be processed to a low resolution of about 100 dpi is generated (step S11: image generation means), and a resolution level number L is set (step S12). ), The resolution reduction level k is initialized (k ← 0) (step S13). The reason why the processes in steps S11 to S13 are performed is to extract features from an image with a further reduced resolution as well as an image I as shown in FIG. Although details will be described later, for example, when the resolutionlevel number L 2, the image I, the images I₁ resolution 1/2, the resolution is the image I₂ 1/4image meter 3 Extract features from two images.

解像度縮小レベルｋが解像度レベル数Ｌに達していない場合には（ステップＳ１４のＹｅｓ）、ステップＳ１１で生成した画像Ｉから解像度を１／２^kに縮小した画像Ｉ_k（ｋ＝０，・・・，Ｌ）を生成し（ステップＳ１５）、画像Ｉ_kを２値化する（ステップＳ１６：２値化手段）。ただし、２値画像において、黒画素は値１、白画素は値０をとるとする。If the resolution reduction level k has not reached the resolution level number L (Yes in step S14), the image I_k (k = 0,...) Obtained by reducing the resolution to 1/2^k from the image I generated in step S11. ., L) is generated (step S15), and the image I_k is binarized (step S16: binarization means). However, in a binary image, a black pixel has avalue 1 and a white pixel has a value 0.

次いで、２値化した解像度が１／２^kの画像Ｉ_kから、Ｍ次元の特徴量ベクトルｆ_kを計算した後（ステップＳ１７）、解像度縮小レベルｋを“１”だけインクリメント（ｋ←ｋ＋１）する（ステップＳ１８）。Then, from the image I_k ofbinarized resolution 1/2^k, after calculating the feature vectors f_k M-dimensional (step S17), the resolution reduction level k by "1" is incremented (k ← k + 1) (Step S18).

ここで、画像Ｉ_k（ｋ＝０，・・・，Ｌ）を２値化した画像から特徴を抽出する方法を述べる。自己相関関数を高次（Ｎ次）へと拡張した「高次自己相関関数（Ｎ次自己相関関数）」は、画面内の対象画像をＩ（ｒ）とすると、変位方向（Ｓ₁，Ｓ₂，…，Ｓ_N）に対して、

で定義される。ただし、和Σは画像全体の画素rについての加算である。従って、高次自己相関関数は、次数や変位方向（Ｓ₁，Ｓ₂，…，Ｓ_N）の取り方により、無数に考えられる。ここでは、簡単のため高次自己相関係数の次数Ｎを“２”までとする。また、変位方向を参照画素ｒの周りの局所的な３×３画素の領域に限定する。平行移動により等価な特徴を除くと、２値画像に対して、図７に示すように特徴の数は全部で２５個になる。各特徴の計算は、局所パターンの対応する画素の値の積を全画像に対して足し合わせればよい。例えば、「Ｎｏ．３」の局所パターンに対応する特徴は、参照画素ｒでの濃淡値とそのすぐ右隣の点での濃淡値との全画像に対する積和を取ることによって計算される。このようにして、解像度が１／２^kの画像から、Ｍ＝２５次元の特徴量ベクトルｆ_k＝（ｇ（ｋ，１），・・・，ｇ（ｋ，２５））が計算される。ここに、画素特徴計算手段の機能および加算手段の機能が実行される。Here, a method for extracting features from an image obtained by binarizing the image I_k (k = 0,..., L) will be described. The “higher order autocorrelation function (Nth order autocorrelation function)”, which is an extension of the autocorrelation function to the higher order (Nth order), indicates that the displacement direction (S₁ , S₂ , ..., S_N )

Defined by However, the sum Σ is addition for the pixel r of the entire image. Therefore, an infinite number of high-order autocorrelation functions can be considered depending on the order and the direction of displacement (S₁ , S₂ ,..., S_N ). Here, for simplicity, the order N of the higher-order autocorrelation coefficient is set to “2”. Further, the displacement direction is limited to a local 3 × 3 pixel region around the reference pixel r. Excluding equivalent features by translation, the total number of features is 25 for a binary image as shown in FIG. For the calculation of each feature, the product of the corresponding pixel values of the local pattern may be added to the entire image. For example, the feature corresponding to the local pattern of “No. 3” is calculated by taking the sum of products for the entire image of the gray value at the reference pixel r and the gray value at the point immediately adjacent to the reference pixel r. In this way, M = 25-dimensional feature vector f_k = (g (k, 1),..., G (k, 25)) is calculated from an image having a resolution of 1/2^k . Here, the function of the pixel feature calculation means and the function of the addition means are executed.

上述したようなステップＳ１５〜Ｓ１８の処理（特徴量ベクトル計算手段）は、ステップＳ１８でインクリメントされた解像度縮小レベルｋが解像度レベル数Ｌを超える迄（ステップＳ１４のＮｏ）、繰り返される。 The processes in steps S15 to S18 (feature vector calculation means) as described above are repeated until the resolution reduction level k incremented in step S18 exceeds the number L of resolution levels (No in step S14).

ステップＳ１８でインクリメントされた解像度縮小レベルｋが解像度レベル数Ｌを超えた場合には（ステップＳ１４のＮｏ）、特徴量ベクトルｆ₀，・・・，ｆ_Lをもとにして、ブロックを、“絵”“文字”“他”の３種類のいずれかに分類する（ステップＳ１９：分類手段）。If incremented resolution reduction level k has exceeded the number of resolution levels L in step S18 (No in step S14), and feature vectors f_0, · · ·, based on f_L, the block, " Classification is made into one of three types of picture, “character” and “other” (step S19: classification means).

ここで、ブロックの分類の方法について詳述する。まず、前述したＭ＝２５次元の特徴量ベクトルｆ_k＝（ｇ（ｋ，１），・・・，ｇ（ｋ，２５））（ｋ＝０，・・・，Ｌ）から（２５×Ｌ）次元の特徴量ベクトルｘ＝（ｇ（０，１），・・・，ｇ（０，２５），・・・，ｇ（Ｌ，１），・・・，ｇ（Ｌ，２５））を生成する。このようなブロックの特徴量ベクトルｘを用いて分類を行うためには、前もって学習を行うことが必要である。そこで、本実施の形態においては、学習用データを文字だけ含むようなものと文字を含まないようなものの２種類に分けて特徴量ベクトルｘを計算する。その後、それぞれの平均をとることによって、文字画素の特徴量ベクトルｐ₀と非文字画素の特徴量ベクトルｐ₁を前もって計算しておく。そして、分類しようとしているブロック画像から得られた特徴量ベクトルｘを、既知の特徴量ベクトルｐ₀とｐ₁の線形結合に分解すれば、その結合係数ａ₀，ａ₁が文字画素と非文字画素の比率、あるいは、ブロックの「文字らしさ」と「非文字らしさ」を表すことになる。このような分解が可能であるのは、高次局所自己相関に基づく特徴が画面内の対象の位置に不変で、しかも、対象の数に関して加法性を持つことによる。特徴量ベクトルｘの分解を、
ｘ＝ａ₀・ｐ₀＋ａ₀・ｐ₁＝Ｆ^Tａ＋ｅ
とする。ここで、ｅは誤差ベクトル、Ｆ＝［ｐ₀，ｐ₁］^T、ａ＝（ａ₀，ａ₁）^Tである。最小二乗法により、最適な結合係数ベクトルａは、
ａ＝（ＦＦ^T）^-1・Ｆｘ
で与えられる。各ブロックについて、「非文字らしさ」を表すパラメータａ₁について閾値処理することにより、そのブロックを「絵」、「絵でない」、「未定」に分類する。各ブロックについて、「未定」または「絵でない」に分類されていて、文字らしさを表すパラメータａ₀が閾値以上であれば「文字」に、そうでなければ「その他」に分類する。図８にブロック分類の例を示す。図８の例においては、黒部分は「文字」、グレイ部分は「絵」、白部分は「他」を表わしている。Here, the block classification method will be described in detail. First, from the aforementioned M = 25-dimensional feature vector f_k = (g (k, 1),..., G (k, 25)) (k = 0,..., L) to (25 × L ) Dimension feature vector x = (g (0,1),..., G (0,25),..., G (L, 1),. Generate. In order to perform classification using such a block feature quantity vector x, it is necessary to perform learning in advance. Therefore, in the present embodiment, the feature amount vector x is calculated by dividing the learning data into two types, one containing only characters and one not containing characters. Thereafter, the feature quantity vector p₀ of the character pixel and the feature quantity vector p₁ of the non-character pixel are calculated in advance by taking the respective averages. Then, if the feature vector x obtained from the block image to be classified is decomposed into a linear combination of the known feature vectors p₀ and p₁ , the coupling coefficients a₀ and a₁ become character pixels and non-characters. It represents the ratio of pixels or the “characteristic” and “non-characteristic” of the block. Such decomposition is possible because the feature based on the higher-order local autocorrelation is invariant to the position of the object in the screen, and is additive with respect to the number of objects. Decompose feature vector x
x = a₀ · p₀ + a₀ · p₁ = F^T a + e
And Here, e is an error vector, F = [p₀ , p₁ ]^T , and a = (a₀ , a₁ )^T. By the least square method, the optimal coupling coefficient vector a is
a = (FF^T )⁻¹ · Fx
Given in. Each block is classified into “picture”, “not a picture”, and “undecided” by performing threshold processing on the parameter a₁ representing “non-characteristic”. Each block is classified as “undecided” or “not a picture”, and is classified as “character” if the parameter a₀ representing the character character is greater than or equal to a threshold value, and “other” otherwise. FIG. 8 shows an example of block classification. In the example of FIG. 8, the black portion represents “character”, the gray portion represents “picture”, and the white portion represents “other”.

（３）画像特徴量の計算（ステップＳ３）
ブロックの分類結果をもとにして、画像のタイプ分けのための画像特徴量を計算する。特に、
・文字、絵の割合
・密集率：レイアウトの混み方（狭いところに詰め込まれている度合い）
・文字、絵の散乱度：文字や写真が紙面全体に散らばって分布している度合い
を計算する。具体的には、次の５つの画像特徴量を計算する。
・文字の割合Ｒｔ∈［０，１］：全ブロックの中で「文字」に分類されたブロックの割合
・非文字の割合Ｒｐ∈［０，１］：全ブロックの中で「絵」に分類されたブロックの割合
・レイアウト密度Ｄ∈［０，１］：「文字」と「絵」のブロック数の面積の和を、描画領域の面積で割ったもの
・文字散乱度Ｓｔ（＞０）：文字ブロックのｘ，ｙ方向の空間的分布について、分散・共分散行列の行列式を、画像の面積で正規化したもの
・非文字散乱度Ｓｐ（＞０）：絵ブロックのｘ，ｙ方向の空間的分布について、分散・共分散行列の行列式を、画像の面積で正規化したもの
表１は、図８の例についての画像特徴量の計算結果を示すものである。

(3) Image feature amount calculation (step S3)
Based on the block classification result, an image feature amount for image type classification is calculated. In particular,
・ Percentage of characters and pictures ・ Denseness: how to lay out the layout (how much is packed in a narrow space)
-Scattering degree of characters and pictures: The degree to which characters and pictures are scattered and distributed throughout the paper is calculated. Specifically, the following five image feature amounts are calculated.
-Character ratio Rt ∈ [0, 1]: Ratio of blocks classified as "character" in all blocks-Non-character ratio Rp ∈ [0, 1]: Classification as "pictures" in all blocks Ratio of blocks formed: Layout density Dε [0, 1]: the sum of the area of the number of blocks of “character” and “picture” divided by the area of the drawing area • Character scattering degree St (> 0): For the spatial distribution of character blocks in the x and y directions, the determinant of the variance / covariance matrix normalized by the area of the image. Non-character scattering degree Sp (> 0): in the x and y directions of the picture block Determining the dispersion / covariance matrix with respect to the spatial distribution normalized by the area of the image Table 1 shows the calculation result of the image feature amount for the example of FIG.

次に、画像タイプ識別部２３について説明する。画像タイプ識別部２３は、画像タイプ識別手段として機能するものであって、画像特徴量計算部２２で計算した画像特徴量を用い、画像のタイプを分類識別する。本実施の形態においては、画像特徴量計算部２２で計算した特徴量を用いることにより、「スキュー補正、コントラスト強調処理をかけてはいけない」画像のタイプについて、例えば線形判別関数により簡単に表現するものとする。
・絵が主体で、文字が殆どない画像のタイプ：すなわち、Ｒｐについて単調増加し、Ｒｔについて単調減少するような判別関数
Ｒｐ−ａ₀・Ｒｔ−ａ₁＞０（ａ₀＞１）
を満たす画像のタイプである。より具体的には、大きな写真や絵が張り付いているもの、あるいは、小さい写真が多数張り付いているものがこのタイプに分類される。
・文字が少なく、ページ全体に散らばっているような画像のタイプ：Ｒｔについて単調減少し、Ｓｔについて単調増加するような判別関数
Ｓｔ−ｃ₀・Ｒｔ−ｃ₁＞０（ｃ₀＞０）
を満たす画像のタイプである。より具体的には、写真や絵が占める割合がそれほど多くなくても、文字が写真の絵の説明に添えられているようなものがこのタイプに分類される。表２は、図８の例についてのタイプ識別例を示すものである。

Next, the imagetype identification unit 23 will be described. The imagetype identification unit 23 functions as an image type identification unit, and classifies and identifies image types using the image feature amount calculated by the image feature amount calculation unit 22. In the present embodiment, by using the feature amount calculated by the image feature amount calculation unit 22, the type of an image that should not be subjected to skew correction and contrast enhancement processing is simply expressed by a linear discriminant function, for example. Shall.
Discrimination function Rp-a₀ · Rt-a₁ > 0 (a₀ > 1) which is monotonically increasing with respect to Rp and monotonically decreasing with respect to Rt.
The type of image that satisfies More specifically, a large picture or picture is attached to this type, or a large number of small pictures are attached to this type.
A type of image that has few characters and is scattered throughout the page: a discriminant function that monotonously decreases with respect to Rt and monotonously increases with respect to St. St-c₀ .Rt-c₁ > 0 (c₀ > 0)
The type of image that satisfies More specifically, even if the proportion of photographs and pictures is not so large, those in which characters are attached to the picture description of the photograph are classified into this type. Table 2 shows an example of type identification for the example of FIG.

次に、画像正規化処理方法の選択部２４について説明する。画像正規化処理方法の選択部２４は、選択手段として機能するものであって、画像タイプ識別部２３における画像のタイプ分類の結果に基づいて、画像データの蓄積／送信の際における画像正規化の方法を選択する。例えば、図９に示すような画像タイプと画像正規化処理の対応規則を記憶手段である記憶部２６に保持しておき、この画像タイプと画像正規化処理の対応規則に従って画像正規化の方法を選択するようにすれば良い。 Next, theselection unit 24 of the image normalization processing method will be described. Theselection unit 24 of the image normalization processing method functions as a selection unit. Based on the result of the image type classification in the imagetype identification unit 23, the image normalization processing method is performed. Select a method. For example, the correspondence rule between the image type and the image normalization process as shown in FIG. 9 is held in thestorage unit 26 as storage means, and the image normalization method is performed according to the correspondence rule for the image type and the image normalization process. You may make it choose.

具体的には、図９に示すような対応規則においては、「絵が主体で、文字が殆どない」デジタル画像データ（図８の（ｄ）が該当）では、文字と地肌間のコントラスト強調のための階調変換は意味を持たず、また、前述したようにスキュー補正も十分な数の文字がないためにうまく働かないことが多い。したがって、中間調ドットパターンを連続階調に変換する処理だけを施す。 Specifically, in the correspondence rule as shown in FIG. 9, in the case of digital image data (corresponding to (d) in FIG. 8) of “the picture is mainly and there are almost no characters”, the contrast enhancement between the characters and the background is enhanced. For this reason, the tone conversion has no meaning, and as described above, the skew correction often does not work well because there is not a sufficient number of characters. Therefore, only the process of converting the halftone dot pattern into a continuous tone is performed.

「文字が少なく、ページ全体に散らばっている」デジタル画像データ（図８の（ａ）（ｃ）（ｄ）が該当）では、統計的に十分な数の文字がないためにスキュー補正がうまく働く保障がない。そこで、中間調ドットパターンを連続階調に変換する処理と、文字と地肌間のコントラスト強調のための階調変換だけを施す。 In digital image data (applicable to (a), (c), and (d) in FIG. 8), “skew correction works well because there is not a statistically sufficient number of characters”. There is no guarantee. Therefore, only the process of converting the halftone dot pattern into a continuous tone and the tone conversion for enhancing the contrast between the character and the background are performed.

それ以外の場合（図８の（ｂ）（ｅ）（ｆ）が該当）には、十分な数の文字が画像に存在するので、中間調ドットパターンを連続階調に変換する処理と、文字と地肌間のコントラスト強調のための階調変換処理とに加えて、スキュー補正も施す。 In other cases (corresponding to (b), (e), and (f) in FIG. 8), a sufficient number of characters are present in the image. In addition to the tone conversion processing for enhancing contrast between the background and the background, skew correction is also performed.

このようにして選択された画像正規化処理方法にしたがってパラメータが変更される。なお、複数の画像正規化処理方法が選択されるような場合には、例えば画像タイプに優先順位を付しておき、優先順位が高い画像タイプについての画像正規化処理方法を優先する。 The parameters are changed according to the image normalization processing method selected in this way. When a plurality of image normalization processing methods are selected, for example, priorities are assigned to image types, and image normalization processing methods for image types with higher priorities are prioritized.

画像正規化処理部２５は、正規化手段として機能するものであって、画像正規化処理方法の選択部２４で選択された画像正規化処理方法に基づいて、デジタル画像データに対して画像正規化処理を施す。 The imagenormalization processing unit 25 functions as a normalization unit, and performs image normalization on the digital image data based on the image normalization processing method selected by the image normalization processingmethod selection unit 24. Apply processing.

ここで、画像処理装置１のＣＰＵ２が実行する各種の画像正規化処理について簡単に説明する。 Here, various image normalization processes executed by theCPU 2 of theimage processing apparatus 1 will be briefly described.

（１）中間調変換処理
まず、中間調ドットパターンを連続階調に変換する中間調変換処理について説明する。図１０は、中間調変換処理の流れを示すフローチャートである。図１０に示すように、デジタル画像データである原画像Ｉ₀を入力あるいは受信していることが前提となっている（ステップＳ５０１）。このような前提の下、まず、原画像Ｉ₀から低解像度の縮小画像Ｉを生成する（ステップＳ５０２）。次いで、低解像度画像Ｉへの処理として、生成した低解像度の縮小画像Ｉ上で、局所的特徴（エッジ、色信号の局所的統計量など）に基づいて前景画像Ｆを抽出する（ステップＳ５０３）。前景画像Ｆ以外の画素が背景画像Ｂとなる。そして、背景画像Ｂを連続階調表現に変換し、連続階調変換された背景画像Ｊとする（ステップＳ５０４）。連続階調とは、ディザ等の擬似階調とは異なり、個々の画素に画素値（カラーの場合はＲ、Ｇ、Ｂそれぞれの輝度）を付与して階調を表現したものである。(1) Halftone Conversion Processing First, halftone conversion processing for converting a halftone dot pattern to continuous tone will be described. FIG. 10 is a flowchart showing the flow of halftone conversion processing. As shown in FIG. 10, it is assumed that input or receives an original image I₀ is a digital image data (step S501). Under such a premise, first, a low-resolution reduced image I is generated from the original image I₀ (step S502). Next, as processing for the low-resolution image I, the foreground image F is extracted on the generated low-resolution reduced image I based on local features (edges, local statistics of color signals, etc.) (step S503). . Pixels other than the foreground image F become the background image B. Then, the background image B is converted into a continuous tone expression to obtain a background image J subjected to continuous tone conversion (step S504). A continuous tone is different from a pseudo tone such as dither, and expresses a tone by assigning pixel values to individual pixels (in the case of color, the respective luminances of R, G, and B).

次に、原解像度画像Ｉ₀への処理として、低解像度の縮小画像Ｉ中から前景として抽出された前景画像Ｆについて、受信したままの原画像Ｉ₀での処理を行う。縮小画像Ｉ中から前景として抽出された前景画像Ｆ内で、局所的特徴（エッジ、色信号の局所的統計量など）をもとに前景画像ＦＦを抽出し、それ以外の画素を背景画像ＢＦとする（ステップＳ５０５）。そして、その背景画像ＢＦを連続階調表現に変換し、連続階調変換された背景画像Ｊ₀とする（ステップＳ５０６）。Next, as the processing for the original resolution image I₀ , the foreground image F extracted as the foreground from the low-resolution reduced image I is processed with the original image I_{0 as} received. In the foreground image F extracted as the foreground from the reduced image I, the foreground image FF is extracted based on local features (edges, local statistics of color signals, etc.), and other pixels are extracted from the background image BF. (Step S505). Then, the background image BF is converted into a continuous tone expression to obtain a background image J₀ subjected to continuous tone conversion (step S506).

最後に、縮小画像Ｉから連続階調表現された背景画像Ｊ、前景画像ＦＦ、原画像Ｉ₀から連続階調表現された背景画像Ｊ₀を合成して、補正画像Ｑを得る（ステップＳ５０７）。Finally, the corrected image Q is obtained by synthesizing the background image J expressed in continuous tone from the reduced image I, the foreground image FF, and the background image J₀ expressed in continuous tone from the original image I₀ (step S507). .

背景画像Ｂ、ＢＦを連続階調表現に変換する処理については、まず、前景画像Ｆを抽出した後に残る領域の画像、すなわち背景画像Ｂ内に、Ｗ×Ｗの大きさ（ウィンドウサイズｓ）の局所的領域Ｒを設定する。ここでは、ウィンドウサイズｓがＷ×Ｗの大きさであることから、局所的領域Ｒは正方形であることになるが、局所的領域Ｒは、矩形であれば必ずしも正方形に限定されない。また、この局所的領域Ｒは、それが適切なサイズであれば、鋭いエッジ、文字、グラフィックスなどを持たない。したがって、ウィンドウサイズｓの大きさと局所的領域Ｒの位置とが適切であれば、局所的領域Ｒ内の画素をＲ内の平均色で置き換えればよい。背景画像ＢＦについても同様である。 Regarding the process of converting the background images B and BF into the continuous tone representation, first, in the image of the region remaining after the foreground image F is extracted, that is, in the background image B, the size of W × W (window size s). A local region R is set. Here, since the window size s is W × W, the local region R is a square, but the local region R is not necessarily limited to a square as long as it is a rectangle. Also, this local region R does not have sharp edges, characters, graphics, etc. if it is of an appropriate size. Therefore, if the size of the window size s and the position of the local region R are appropriate, the pixels in the local region R may be replaced with the average color in R. The same applies to the background image BF.

また、ステップＳ５０７のＦＦ、Ｊ、Ｊ０の合成を行い補正画像Ｑを得るには、Ｒ、Ｇ、Ｂ各色について次の処理を行えば良い。即ち、Ｑ［ｉ，ｊ］には、対応する低解像度画像での画素がＢ（背景）なら、低解像度連続階調画像Ｊの画素値を代入する。背景でない場合（Ｆ：前景）は、Ｑ［ｉ，ｊ］に対応する原解像度画像の画素がＦＦ（前景）なら、Ｉ０［ｉ，ｊ］を、ＢＦならＪ０［ｉ，ｊ］を代入すれば良い。 Further, in order to obtain the corrected image Q by combining FF, J, and J0 in step S507, the following processing may be performed for each of the R, G, and B colors. That is, Q [i, j] is substituted with the pixel value of the low resolution continuous tone image J if the corresponding pixel in the low resolution image is B (background). If it is not the background (F: foreground), if the pixel of the original resolution image corresponding to Q [i, j] is FF (foreground), I0 [i, j] is substituted, and if it is BF, J0 [i, j] is substituted. It ’s fine.

なお、この中間調変換処理の詳細については、本出願人が出願した特開２００３−２８
１５２６号公報（特許文献１）などに詳述されている。The details of the halftone conversion process are described in Japanese Patent Application Laid-Open No. 2003-28 filed by the present applicant.
No. 1526 (Patent Document 1) and the like.

（２）階調補正処理
次に、文字と地肌間のコントラスト強調のための階調変換処理について説明する。図１１は、階調補正処理の流れを示すフローチャートである。まず、処理の概要について説明する。文書画像には多くの文字が印刷されているが、一般的に通常の文書には、紙面の何も印刷されていない部分に黒い文字が直接印刷されている部分がある。このため、入力画像から黒文字がありそうな領域を抽出する。入力画像を十分に小さいブロックに分割することで、内部に黒い文字が紙面に直接印刷されているようなあるブロックが存在すると仮定できる。このことから、下地色となる紙面色が白であるとすると、次のような画像処理を行えば良い。以下、図１１を参照して処理の流れを説明する。(2) Tone Correction Processing Next, tone conversion processing for enhancing contrast between characters and the background will be described. FIG. 11 is a flowchart showing the flow of gradation correction processing. First, an overview of the process will be described. Many characters are printed on a document image. Generally, a normal document has a portion in which black characters are directly printed on an unprinted portion of paper. Therefore, an area where black characters are likely to be extracted is extracted from the input image. By dividing the input image into sufficiently small blocks, it can be assumed that there is a block in which black characters are directly printed on the paper. For this reason, if the paper color as the base color is white, the following image processing may be performed. Hereinafter, the flow of processing will be described with reference to FIG.

図１１に示すように、デジタル画像データである原画像Ｉ₀を受信していることが前提となっている（ステップＳ６０１）。このような前提の下、まず、原画像Ｉ₀から低解像度の縮小画像Ｉを生成する（ステップＳ６０２）。As shown in FIG. 11, it is assumed that an original image I₀ that is digital image data is received (step S601). Under such a premise, first, a low-resolution reduced image I is generated from the original image I₀ (step S602).

次に、ステップＳ６０２で生成した低解像度画像Ｉを平滑化する（ステップＳ６０３）。そして、低解像度画像Ｉの各画素の周りに固定サイズのウィンドウを設定して、Ｒ、Ｇ、Ｂの各チャンネル（色）で信号（画素値）の平均値μと標準偏差σとを計算することで低解像度画像Ｉの特徴量を計算する（ステップＳ６０４）。画像データの平滑化は公知の技術であり、ノイズの除去を目的とする。また、ステップＳ６０４で求めた統計量は、次のステップでの文字領域の判定に用いる。 Next, the low resolution image I generated in step S602 is smoothed (step S603). Then, a fixed-size window is set around each pixel of the low-resolution image I, and an average value μ and a standard deviation σ of signals (pixel values) are calculated for each channel (color) of R, G, and B. Thus, the feature amount of the low resolution image I is calculated (step S604). The smoothing of the image data is a known technique and aims at removing noise. Further, the statistic obtained in step S604 is used for determination of the character area in the next step.

次いで、低解像度画像Ｉに対して局所適応的閾値処理と膨張処理とを行ってカラー成分の局所適応的二値化を行うことにより、文字領域Ｃの抽出検出を行う（ステップＳ６０５）。例えば、下地に黒文字が直接印刷された画像である場合、Ｒ、Ｇ、Ｂのすべてのチャンネル（色）においてコントラストが強くなる傾向がある。このため、信号値（画素値）が、すべてのチャンネル（色）において、閾値μ（ａ＋ｂσ）よりも低い画素［ｉ，ｊ］を文字領域Ｃの要素として設定する。ａ、ｂはパラメータであり、文書原稿に含まれるイメージ成分によって調整する。なお、文字領域Ｃの抽出検出の方法は、文字認識等で行われている既存の方法を用いても良い。 Next, the character region C is extracted and detected by performing local adaptive threshold processing and expansion processing on the low-resolution image I to perform local adaptive binarization of the color components (step S605). For example, in the case of an image in which black characters are directly printed on the background, the contrast tends to be strong in all channels (colors) of R, G, and B. Therefore, a pixel [i, j] whose signal value (pixel value) is lower than the threshold value μ (a + bσ) is set as an element of the character region C in all channels (colors). a and b are parameters which are adjusted according to the image components included in the document original. In addition, as a method for detecting detection of the character region C, an existing method used in character recognition or the like may be used.

次に、入力された原画像Ｉ₀を固定サイズの互いに重なりのないブロックに分割し（ステップＳ６０６）、分割した各ブロックにおいて、文字領域Ｃに属する画素を２つの代表色に応じて第１のクラスと第２のクラスとの２クラスに分類する（ステップＳ６０７）。そして、文字領域Ｃに属する画素の輝度に基づいて、通常は、明るい色の方を文字領域の背景色に、暗い色の方を文字色に対応させる。なお、ブロックの大きさは辺の長さが２０ｍｍ程度の正方形で良い（２００ｄｐｉでは１６０画素×１６０画素）。Next, the input original image I₀ is divided into fixed-size non-overlapping blocks (step S606), and in each divided block, the pixels belonging to the character area C are set to the first according to the two representative colors. Classification into two classes, a class and a second class (step S607). Then, based on the luminance of the pixels belonging to the character area C, normally, the lighter color corresponds to the background color of the character area and the darker color corresponds to the character color. The size of the block may be a square with a side length of about 20 mm (160 pixels × 160 pixels at 200 dpi).

さらに、文字領域Ｃに属する画素が２つの代表色に分類された各ブロックから、一方のクラスに分類された画素数が最大になるブロックＢをウィンドウＷとして選択し、このウィンドウＷにおける２つの代表色を入力画像における下地の平均色および黒文字の平均色としてそれぞれ設定し、さらに、輝度の統計量に基づいて、黒文字色と下地色とを推定する（ステップＳ６０８）。ここで、輝度は、例えば、
輝度＝（Ｒ＋Ｇ＋Ｂ）／３
に示す式の演算により取得されるＲ、Ｇ、Ｂ信号の平均値であり、この輝度から取得される輝度の平均値および標準偏差を輝度の統計量とする。Further, a block B in which the number of pixels classified into one class is maximized is selected as a window W from each block in which pixels belonging to the character region C are classified into two representative colors. The colors are respectively set as the average color of the background and the average color of black characters in the input image, and the black character color and the background color are estimated based on the statistics of luminance (step S608). Here, the luminance is, for example,
Luminance = (R + G + B) / 3
The average value of the R, G, and B signals acquired by the calculation of the equation shown in FIG.

このようにして求めた輝度の統計量に基づいて、各ブロック内における各画素の階調補正を行う（ステップＳ６０９）。ここでは、下地色を白（輝度最大）とし、黒文字色を黒（輝度最小）とする。具体的には、クラス１を文字色の代表色とし、クラス２を下地色の代表色とすれば、文字色の代表色より暗い画素を黒、下地色の代表色より明るい画素を白として、二つの色の中間の画素に対しては、黒と白の中間調に線形マップする。つまり、
最大画素値を２５５とすると、
画素値＝２５５＊（ｘ−Ｂ）／（Ｗ−Ｂ）
ここで、ｘは注目している画素の画素値、Ｂは文字色の代表色の画素値、Ｗは文字色の
代表色の画素値である。Based on the luminance statistics thus obtained, gradation correction of each pixel in each block is performed (step S609). Here, the background color is white (luminance maximum), and the black character color is black (luminance minimum). Specifically, ifclass 1 is the representative color of the character color andclass 2 is the representative color of the background color, pixels darker than the representative color of the character color are black, and pixels brighter than the representative color of the background color are white, For pixels in the middle of the two colors, a linear map is made to a halftone between black and white. That means
If the maximum pixel value is 255,
Pixel value = 255 * (x−B) / (W−B)
Here, x is the pixel value of the pixel of interest, B is the pixel value of the representative color of the character color, and W is the pixel value of the representative color of the character color.

このようにして求めた画素値をＲ、Ｇ、Ｂ各色の画素値に設定する。なお、ここでは画素値という表現を用いたが、輝度と同じ意味である。 The pixel values obtained in this way are set to the pixel values of R, G, and B colors. Although the expression pixel value is used here, it has the same meaning as luminance.

なお、この階調補正処理の詳細については、本出願人が出願した特開２００５−１１０１８４号公報（特許文献２）などに詳述されている。 Details of the gradation correction processing are described in detail in Japanese Patent Application Laid-Open No. 2005-110184 (Patent Document 2) filed by the present applicant.

（３）スキュー補正処理
次に、スキュー補正処理について説明する。スキューは当該分野において良く知られた問題であり、文書のラインが水平線上にないドキュメントイメージを指す。スキュー検出方法には、スキュー角の決定処理が設けられている。イメージを表現している抽出された矩形領域のリストからドキュメントのスキュー角を決定することができる。スキューを決定する方法は、抽出された矩形領域がどのようにして導き出されたかに依らない。従って、矩形領域に関してドキュメント表現を正確に行なうことができる方法であれば、これをスキューの検出および補正方法に用いることができる。(3) Skew Correction Processing Next, the skew correction processing will be described. Skew is a well-known problem in the art and refers to a document image in which the lines of the document are not on a horizontal line. The skew detection method includes a skew angle determination process. A document skew angle can be determined from a list of extracted rectangular regions representing the image. The method for determining the skew does not depend on how the extracted rectangular region was derived. Therefore, any method that can accurately express a document with respect to a rectangular region can be used as a skew detection and correction method.

図１２は、スキュー角の決定処理の流れを示すフローチャートである。前提条件として、Ｘ−Ｙ平面上での座標点として矩形領域をアドレスすることができることが先ず理解されるべきである。 FIG. 12 is a flowchart showing the flow of the skew angle determination process. As a prerequisite, it should first be understood that a rectangular region can be addressed as a coordinate point on the XY plane.

最初、所定数の関連した矩形領域のアドレス情報を矩形領域バッファに格納する（ステップＳ７０１）。スキュー角を検出するために、正確には８０個の矩形領域を用いることとする。さらに、格納されるアドレス情報は、矩形領域の左上隅の座標である。全ての矩形領域アドレスについて一貫性をもたせれば、これのかわりに、矩形領域の右下隅の座標をアドレス情報として用いることもできる。 First, address information of a predetermined number of related rectangular areas is stored in the rectangular area buffer (step S701). To detect the skew angle, 80 rectangular regions are used accurately. Further, the stored address information is the coordinates of the upper left corner of the rectangular area. If all the rectangular area addresses are consistent, the coordinates of the lower right corner of the rectangular area can be used as address information instead.

次いで、各々のアドレスのＸ座標をＸ座標アドレスのヒストグラム上に投射し、コラムエッジを検出する（ステップＳ７０２）。図１３には、このようなヒストグラムが示されている。このヒストグラムは、最も頻度が高いＸ座標値を示している。この最も頻度が高いＸ座標値により、文書のコラムのエッジを検出することができる。すなわち、矩形領域をこれらの左上隅点により検出する場合には、左側のコラムエッジが検出される。これとは逆に、右下隅点が用いられる場合には、右側のコラムエッジが検出される。図１３を参照すると、符号７０１の部分は、Ｘ座標を示しており、符号７０２の部分は頻度を示しており、符号７０３の部分は、各Ｘ座標のカウント値の関係をグラフィックに示したものである。符号７０４で示すＸ座標値は、最も頻度が高く、このＸ座標値がコラムエッジとして定められる。コラムの検出は、スキュー角を決定するときに比較対象となる矩形領域をコラム検出結果を用いて制限することができる点で重要である。なお、この制限とは、同じコラム内の矩形領域のみを比較することである。 Next, the X coordinate of each address is projected on the histogram of the X coordinate address, and a column edge is detected (step S702). FIG. 13 shows such a histogram. This histogram shows the X coordinate value with the highest frequency. The edge of the column of the document can be detected by the most frequently used X coordinate value. That is, when the rectangular area is detected by these upper left corner points, the left column edge is detected. In contrast, when the lower right corner point is used, the right column edge is detected. Referring to FIG. 13, theportion 701 indicates the X coordinate, theportion 702 indicates the frequency, and theportion 703 graphically shows the relationship between the count values of the respective X coordinates. It is. The X coordinate value indicated byreference numeral 704 has the highest frequency, and this X coordinate value is determined as the column edge. The column detection is important in that the rectangular area to be compared can be limited using the column detection result when determining the skew angle. This restriction means that only rectangular areas in the same column are compared.

図１２を再び参照すると、矩形領域バッファに格納されている全てのあるいは限られた一部の関連した矩形領域間の正接角（タンジェント角）を決定し、ヒストグラム上に投射してスキュー角を検出する（ステップＳ７０３）。図１４には、２つの矩形領域間の正接角が示されている。第１の矩形領域８０１と第２の矩形領域８０２とは、対角線８０４と水平線８０３とによって定まるスキュー角をもつ。対角線８０４は、矩形領域８０１の右下隅８０６から矩形領域８０２の右下隅８０７まで延びている。水平線８０３は、矩形領域８０１の隅８０６から延びている。スキュー角８０５は、良く知られた三角法の計算により、次のようにして計算される。
ΔＸ＝｜（点８０６のＸ座標）−（点８０７のＸ座標）｜
ΔＹ＝｜（点８０６のＹ座標）−（点８０７のＹ座標）｜
スキュー角＝（１８０×ΔＹ）／（π×ΔＸ）Referring again to FIG. 12, tangent angles (tangent angles) between all or a limited part of the associated rectangular areas stored in the rectangular area buffer are determined and projected onto the histogram to detect the skew angle. (Step S703). FIG. 14 shows a tangent angle between two rectangular regions. The firstrectangular area 801 and the secondrectangular area 802 have a skew angle determined by adiagonal line 804 and ahorizontal line 803. Thediagonal line 804 extends from the lowerright corner 806 of therectangular area 801 to the lowerright corner 807 of therectangular area 802. Thehorizontal line 803 extends from thecorner 806 of therectangular area 801. Theskew angle 805 is calculated by the well-known trigonometric calculation as follows.
ΔX = | (X coordinate of point 806) − (X coordinate of point 807) |
ΔY = | (Y coordinate of point 806) − (Y coordinate of point 807) |
Skew angle = (180 × ΔY) / (π × ΔX)

すなわち、対角線８０４を形成する座標点間の絶対値を計算し、スキュー角の式に挿入することで、スキュー角を計算できる。 That is, the skew angle can be calculated by calculating the absolute value between the coordinate points forming thediagonal line 804 and inserting it into the skew angle equation.

なお、このスキュー補正処理の詳細については、本出願人が出願した特許第３３０８０３２号公報（特許文献３）などに詳述されている。 The details of the skew correction processing are described in detail in Japanese Patent No. 33008032 (Patent Document 3) filed by the present applicant.

最後に、画像蓄積／送信処理部２７は、画像正規化処理方法の選択部２４で選択された画像正規化処理方法に基づいて画像正規化処理部２５で正規化された画像をＨＤＤ６などの記憶媒体に蓄積したり、あるいは、他の機能が動作する外部機器にネットワーク９を介して送信したりする。 Finally, the image accumulation /transmission processing unit 27 stores the image normalized by the imagenormalization processing unit 25 based on the image normalization processing method selected by the image normalization processingmethod selection unit 24 in theHDD 6 or the like. The information is stored in a medium, or transmitted to an external device that operates other functions via the network 9.

このように本実施の形態によれば、レイアウトの概略（文字や写真・絵の大体の空間的配置や分布など）に基づいて計算された画像データの画像特徴量を用いて当該画像データの画像タイプが分類識別された後、分類結果及び画像タイプと画像正規化処理の対応規則を対応付けた情報に基づいて画像正規化処理方法が選択され、選択された画像正規化処理方法に基づいて画像データに対して正規化処理が施され、送信または蓄積される。これにより、レイアウトの概略（文字や写真・絵の大体の空間的配置や文字と写真・絵の分布など）に従うことで画像のタイプを特徴付ける画像特徴量を高速に計算することができるとともに、画像データの画像タイプに適した正規化処理を適用して、画像の歪みをもたらすことなく、画像データを理想的な表現に変換した形で送信または蓄積することができる。 As described above, according to the present embodiment, an image of the image data is calculated using the image feature amount of the image data calculated based on the outline of the layout (such as the general spatial arrangement and distribution of characters, photographs, and pictures). After the type is classified and identified, the image normalization processing method is selected based on the classification result and the information that associates the correspondence type of the image type with the image normalization processing, and the image based on the selected image normalization processing method Data is normalized and transmitted or stored. As a result, image features that characterize the type of image can be calculated at high speed by following the outline of the layout (such as the spatial arrangement of characters, photos, and pictures, and the distribution of characters, photos, and pictures). By applying a normalization process suitable for the image type of the data, the image data can be transmitted or stored in a form converted into an ideal representation without causing image distortion.

なお、本実施の形態の「（２）ブロックの分類（ステップＳ２）」においては、ブロックから計算された（２５×Ｌ）次元の特徴量ベクトルｘについて、行列Ｆを用いて、ブロックの文字らしさと非文字らしさを表す係数成分から成る係数ベクトルａを計算したが、これに限るものではない。例えば、学習データから計算された特徴量ベクトルｘと、学習データに付属した教師信号（文字か、文字でないか）を用いた教師つき学習を前もって行い、識別関数を構築しておくようにしても良い。例えば、学習や識別関数は、線形判別分析と線形判別関数、ニューラルネットワークの誤差逆伝播とネットワークの重み係数などの既知のものを用いればよい。分類すべきブロックで計算された特徴量ベクトルｘについて、予め計算されておいた識別関数を用いて、ブロックを“絵”“文字”“他”のいずれかに分類する。 Note that in “(2) Block classification (step S2)” in the present embodiment, the character value of the block is calculated using the matrix F for the (25 × L) -dimensional feature vector x calculated from the block. However, the present invention is not limited to this. For example, supervised learning using a feature vector x calculated from learning data and a teacher signal (character or not) attached to the learning data may be performed in advance to construct an identification function. good. For example, the learning and discriminant functions may be known ones such as linear discriminant analysis and linear discriminant function, neural network back propagation error and network weight coefficient. With respect to the feature quantity vector x calculated for the block to be classified, the block is classified into one of “picture”, “character”, and “other” by using a discrimination function calculated in advance.

また、本実施の形態の「（２）ブロックの分類（ステップＳ２）」においては、２値画像から特徴を抽出するようにしたが、２値画像ではなく、多値画像から特徴を抽出するようにしても良い。この場合、３×３近傍の局所パターンの数は３５になる。これは、図７に示した局所パターンに加えて、１次自己相関において注目画素自身の濃淡値の２乗、２次自己相関において注目画素自身の濃淡値の３乗、８近傍の画素のそれぞれについて近傍画素の濃淡値の２乗と注目画素の濃淡値の積、合計１０個の相関値を計算しなければならないからである。２値画像では、濃淡値が１または０だけなので、濃淡値を２乗、３乗しても、もとの値と変わらないが、多値画像ではこれらのケースを考慮しなければならない。 Further, in “(2) Block classification (step S2)” of the present embodiment, features are extracted from a binary image, but features are extracted from a multi-valued image instead of a binary image. Anyway. In this case, the number of local patterns in the vicinity of 3 × 3 is 35. This is because, in addition to the local pattern shown in FIG. 7, the square of the gray value of the target pixel itself in the first-order autocorrelation, the third power of the gray value of the target pixel itself in the second-order autocorrelation, and the pixels near eight This is because a total of ten correlation values, the product of the square of the gray value of the neighboring pixels and the gray value of the target pixel, must be calculated. In a binary image, since the gray value is only 1 or 0, even if the gray value is squared or raised to the third power, it does not change from the original value, but in a multi-value image, these cases must be considered.

そして、これに応じて，特徴量ｆｋの次元もＭ＝３５になり、特徴量ベクトルｆ_k＝（ｇ（ｋ，１），ｇ（ｋ，１），・・・，ｇ（ｋ，３５））が計算される。また、ブロックの分類においても、（３５×Ｌ）次元の特徴量ベクトルｘ＝（ｇ（０，１），・・・，ｇ（０，２５），・・・，ｇ（Ｌ，１），・・・，ｇ（Ｌ，２５））を用いる。Accordingly, the dimension of the feature quantity fk is also M = 35, and the feature quantity vector f_k = (g (k, 1), g (k, 1),..., G (k, 35). ) Is calculated. Also in the block classification, (35 × L) -dimensional feature vector x = (g (0,1),..., G (0,25),..., G (L, 1), ..., G (L, 25)) are used.

［第２の実施の形態］
次に、本発明の第２の実施の形態を図１５に基づいて説明する。なお、前述した第１の実施の形態と同じ部分は同じ符号で示し説明も省略する。[Second Embodiment]
Next, a second embodiment of the present invention will be described with reference to FIG. The same parts as those in the first embodiment described above are denoted by the same reference numerals, and description thereof is also omitted.

第１の実施の形態においては、画像処理装置１としてＰＣなどのコンピュータを適用したが、本実施の形態は、画像処理装置１としてデジタル複合機などに備えられる情報処理装置を適用したものである。 In the first embodiment, a computer such as a PC is applied as theimage processing apparatus 1, but in the present embodiment, an information processing apparatus provided in a digital multifunction peripheral or the like is applied as theimage processing apparatus 1. .

図１５は、本発明の第２の実施の形態にかかるデジタル複合機５０を示す外観斜視図である。図１５に示すように、画像読取手段であるスキャナ部５１及び画像印刷装置であるプリンタ部５２を備えた画像形成装置であるデジタル複合機５０に備えられる情報処理装置に画像処理装置１を適用し、デジタル複合機５０のスキャナ部５１で読み取ったスキャン画像に対して画像データの蓄積／送信の際の正規化処理を施すようにしたものである。 FIG. 15 is an external perspective view showing a digitalmulti-function device 50 according to the second embodiment of the present invention. As shown in FIG. 15, theimage processing apparatus 1 is applied to an information processing apparatus provided in a digital multi-function peripheral 50 that is an image forming apparatus including ascanner unit 51 that is an image reading unit and aprinter unit 52 that is an image printing apparatus. The scan image read by thescanner unit 51 of the digitalmulti-function device 50 is subjected to normalization processing when storing / transmitting image data.

この場合、以下に示す３つの態様が考えられる。
１．スキャナ部５１におけるスキャン時に、画像タイプ識別部２３における画像タイプ識別処理まで実行し、画像データのヘッダに画像タイプ情報として記録する。
２．スキャナ部５１におけるスキャン時には特に何もせず、データ配信時またはデータ蓄積時に、画像正規化処理部２５による正規化処理まで行う。
３．スキャナ部５１におけるスキャン時に、画像正規化処理部２５による正規化処理まで行う。In this case, the following three modes are conceivable.
1. At the time of scanning in thescanner unit 51, the processing up to the image type identification process in the imagetype identification unit 23 is executed and recorded as image type information in the header of the image data.
2. No particular processing is performed at the time of scanning by thescanner unit 51, and normalization processing by the imagenormalization processing unit 25 is performed at the time of data distribution or data storage.
3. When scanning by thescanner unit 51, the normalization processing by the imagenormalization processing unit 25 is also performed.

［第３の実施の形態］
次に、本発明の第３の実施の形態を図１６に基づいて説明する。なお、前述した第１の実施の形態と同じ部分は同じ符号で示し説明も省略する。[Third Embodiment]
Next, a third embodiment of the present invention will be described with reference to FIG. The same parts as those in the first embodiment described above are denoted by the same reference numerals, and description thereof is also omitted.

第１の実施の形態においては、画像処理装置１としてローカルなシステム（例えば、パーソナルコンピュータ単体）を適用したが、本実施の形態は、画像処理装置１としてサーバクライアントシステムを構成するサーバコンピュータを適用したものである。 In the first embodiment, a local system (for example, a personal computer alone) is applied as theimage processing apparatus 1, but in this embodiment, a server computer constituting a server client system is applied as theimage processing apparatus 1. It is a thing.

図１６は、本発明の第３の実施の形態にかかるサーバクライアントシステムを示す模式図である。図１６に示すように、サーバコンピュータＳにネットワークＮを介してクライアントコンピュータＣが複数台接続されたサーバクライアントシステムを適用しており、各クライアントコンピュータＣからサーバコンピュータＳに対して画像を送信し、サーバコンピュータＳ（画像処理装置１）において画像に対して画像データの蓄積／送信の際の正規化処理を施すようにしたものである。また、ネットワークＮ上には、ネットワークスキャナＮＳが設けられている。 FIG. 16 is a schematic diagram showing a server client system according to the third embodiment of the present invention. As shown in FIG. 16, a server client system in which a plurality of client computers C are connected to a server computer S via a network N is applied, and an image is transmitted from each client computer C to the server computer S. In the server computer S (image processing apparatus 1), the image is subjected to normalization processing when storing / transmitting image data. A network scanner NS is provided on the network N.

この場合、以下に示す３つの態様が考えられる。
１．ネットワークスキャナＮＳを用いたサーバコンピュータＳ（画像処理装置１）によるスキャン時に、画像タイプ識別部２３における画像タイプ識別処理まで実行し、画像データのヘッダに画像タイプ情報として記録する。
２．ネットワークスキャナＮＳを用いたサーバコンピュータＳ（画像処理装置１）によるスキャン時には特に何もせず、データ配信時またはデータ蓄積時に、画像正規化処理部２５による正規化処理まで行う。
３．ネットワークスキャナＮＳを用いたサーバコンピュータＳ（画像処理装置１）によるスキャン時に、画像正規化処理部２５による正規化処理まで行う。In this case, the following three modes are conceivable.
1. At the time of scanning by the server computer S (image processing apparatus 1) using the network scanner NS, the processing up to the image type identification process in the imagetype identification unit 23 is executed and recorded as image type information in the header of the image data.
2. No particular processing is performed at the time of scanning by the server computer S (image processing apparatus 1) using the network scanner NS, and normalization processing by the imagenormalization processing unit 25 is performed at the time of data distribution or data storage.
3. At the time of scanning by the server computer S (image processing apparatus 1) using the network scanner NS, normalization processing by the imagenormalization processing unit 25 is also performed.

本発明の第１の実施の形態にかかる画像処理装置の電気的な接続を示すブロック図である。1 is a block diagram showing electrical connections of an image processing apparatus according to a first embodiment of the present invention.画像処理装置のＣＰＵが実行する画像データの蓄積／送信の際の正規化処理にかかる機能を示す機能ブロック図である。FIG. 3 is a functional block diagram illustrating functions related to normalization processing at the time of image data storage / transmission executed by the CPU of the image processing apparatus.その流れを概略的に示すフローチャートである。It is a flowchart which shows the flow roughly.画像特徴計算部における画像特徴量計算処理の流れを概略的に示すフローチャートである。It is a flowchart which shows roughly the flow of the image feature-value calculation process in an image feature calculation part.ブロック分類処理の流れを概略的に示すフローチャートである。It is a flowchart which shows the flow of a block classification process roughly.多重解像度処理を示す模式図である。It is a schematic diagram which shows multi-resolution processing.高次自己相関関数計算のためのマスクパターンの一例を示す模式図である。It is a schematic diagram which shows an example of the mask pattern for high-order autocorrelation function calculation.ブロック分類の例を示す模式図である。It is a schematic diagram which shows the example of a block classification | category.画像タイプと画像正規化処理の対応規則の一例を示すフローチャートである。It is a flowchart which shows an example of the correspondence rule of an image type and an image normalization process.中間調変換処理の流れを示すフローチャートである。It is a flowchart which shows the flow of a halftone conversion process.階調補正処理の流れを示すフローチャートである。It is a flowchart which shows the flow of a gradation correction process.スキュー角の決定処理の流れを示すフローチャートである。It is a flowchart which shows the flow of the determination process of a skew angle.コラムエッジを検出するためのヒストグラムを示す図である。It is a figure which shows the histogram for detecting a column edge.２つの矩形領域間の正接角を説明するための図である。It is a figure for demonstrating the tangent angle between two rectangular areas.本発明の第２の実施の形態にかかるデジタル複合機を示す外観斜視図である。It is an external appearance perspective view which shows the digital multifunctional device concerning the 2nd Embodiment of this invention.本発明の第３の実施の形態にかかるサーバクライアントシステムを示す模式図である。It is a schematic diagram which shows the server client system concerning the 3rd Embodiment of this invention.

符号の説明Explanation of symbols

１画像処理装置
２２画像特徴量計算手段
２３画像タイプ識別手段
２４選択手段
２５正規化手段
２６記憶手段
５０画像形成装置
５１画像読取手段DESCRIPTION OFSYMBOLS 1 Image processing device 22 Image feature-value calculation means 23 Image type identification means 24 Selection means 25 Normalization means 26 Storage means 50Image forming apparatus 51 Image reading means