Movatterモバイル変換


[0]ホーム

URL:


CN1945598A - Image recognition device - Google Patents

Image recognition device
Download PDF

Info

Publication number
CN1945598A
CN1945598ACN 200610141555CN200610141555ACN1945598ACN 1945598 ACN1945598 ACN 1945598ACN 200610141555CN200610141555CN 200610141555CN 200610141555 ACN200610141555 ACN 200610141555ACN 1945598 ACN1945598 ACN 1945598A
Authority
CN
China
Prior art keywords
image data
scanning
normalization
pattern
feature quantity
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN 200610141555
Other languages
Chinese (zh)
Other versions
CN100419781C (en
Inventor
田中昭二
大津留诚
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Mitsubishi Electric Corp
Original Assignee
Mitsubishi Electric Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Mitsubishi Electric CorpfiledCriticalMitsubishi Electric Corp
Publication of CN1945598ApublicationCriticalpatent/CN1945598A/en
Application grantedgrantedCritical
Publication of CN100419781CpublicationCriticalpatent/CN100419781C/en
Expired - Fee Relatedlegal-statusCriticalCurrent
Anticipated expirationlegal-statusCritical

Links

Images

Landscapes

Abstract

Translated fromChinese

本发明提供一种在对每个规定大小的识别区域扫描每个帧的标准化了的图像数据时,能够在减轻了扫描所需要的计算量的同时,提高识别精度的图像识别装置。具有:记录了多个用于在图像数据内间隔抽取像素而扫描规定大小的识别区域的扫描模式的扫描模式记录部件,从扫描模式记录部件中读出并设置在前后的帧中不同的扫描模式,进行识别区域的扫描。

The present invention provides an image recognition device capable of improving recognition accuracy while reducing the calculation amount required for scanning when scanning standardized image data of each frame for each recognition area of a predetermined size. There is a scanning pattern recording unit that records a plurality of scanning patterns for scanning a recognition area of a predetermined size by thinning out pixels in the image data, and different scanning patterns are read from the scanning pattern recording unit and set in the preceding and following frames , scan the recognition area.

Description

Translated fromChinese
图像识别装置Image recognition device

技术领域technical field

本发明涉及进行用于检测对象物体的存在的识别处理的图像识别装置。The present invention relates to an image recognition device that performs recognition processing for detecting the presence of a target object.

背景技术Background technique

在现有的图像识别装置中,在通过间隔抽取图像数据内的像素而进行识别处理的高速化时,与识别对象的大小对应地设置像素的间隔抽取幅度,用设置的间隔抽取幅度扫描识别区域内的图像(例如参考专利文献1)。In conventional image recognition devices, when speeding up recognition processing by thinning pixels in image data, the pixel thinning width is set in accordance with the size of the recognition object, and the recognition area is scanned with the set thinning width. images within (for example, refer to Patent Document 1).

另外,在现有的图像识别装置中,为了强化与检测精度对应的对亮度变化的耐性,对图像全体或识别区域进行亮度的色阶修正等的标准化处理,但为了求出进行标准化处理的参数,而计算图像全体或识别区域内的像素值等的特征量,生成与该计算出的特征量对应的修正表,进行标准化处理。In addition, in conventional image recognition devices, normalization processing such as gradation correction of brightness is performed on the entire image or recognition area in order to strengthen the resistance to brightness changes corresponding to the detection accuracy. However, in order to obtain parameters for normalization processing , and calculate feature quantities such as pixel values in the entire image or in the recognition area, generate a correction table corresponding to the calculated feature quantities, and perform normalization processing.

专利文献1:特开2000-99724号公报(图2)Patent Document 1: JP-A-2000-99724 (FIG. 2)

在现有的图像识别装置中,由于如果暂时设置了间隔抽取幅度则原样地被固定,所以有以下的问题,即在间隔抽取幅度内存在识别对象的情况下会产生无法发现识别对象的情况。In a conventional image recognition device, if the thinning width is temporarily set, it is fixed as it is. Therefore, there is a problem that the recognition object may not be found if there is a recognition object within the thinning width.

另外,在现有的图像识别装置中,用于对图像全体或识别区域内的图像的像素值进行标准化处理的算法是固定的,另外如果像素多则也有无法忽视计算量的问题。In addition, in conventional image recognition devices, the algorithm for normalizing the pixel values of the entire image or the image in the recognition area is fixed, and there is also a problem that the amount of calculation cannot be ignored if there are many pixels.

发明内容Contents of the invention

本发明的图像识别装置就是为了解决上述那样的问题而提出的,其特征在于包括:输入每个帧的图像数据的图像数据输入部件;计算出所输入的上述图像数据的特征量的特征量计算部件;根据该特征量,对图像数据进行标准化的标准化处理部件;记录了多个用于在图像数据内对像素进行间隔抽取而对规定大小的识别区域进行扫描的扫描模式(pattern)的扫描模式记录部件;针对前后的帧设置不同的扫描模式的扫描模式设置部件;使用由扫描模式设置部件对该帧设置了的扫描模式,针对每个识别区域对每个帧的标准化了的图像数据进行扫描的扫描部件;对由扫描部件扫描了的识别区域内的图像数据和识别对象的模型数据进行对照,输出该对照结果的对照部件。The image recognition device of the present invention is proposed in order to solve the above-mentioned problems, and is characterized in that it includes: an image data input part that inputs image data of each frame; a feature value calculation part that calculates a feature value of the input image data ; Based on the feature quantity, a normalization processing part for standardizing image data; a plurality of scanning pattern records for scanning a recognition area of a predetermined size by thinning out pixels in the image data Components; a scanning mode setting component for setting different scanning modes for the frames before and after; scanning the standardized image data of each frame for each recognition area using the scanning mode set by the scanning mode setting component for the frame A scanning part; a comparing part that compares the image data in the recognition area scanned by the scanning part with the model data of the recognition object, and outputs the comparison result.

本发明具有记录了多个扫描模式的扫描模式记录部件,其中该扫描模式用于指定在图像数据内对像素进行间隔抽取而扫描识别区域时的扫描顺序和间隔抽取幅度,可以通过根据在前后的帧中不同的种类的扫描模式,在图像数据内对识别区域进行扫描,来减轻每个帧的识别处理所需要的计算量,并且可以用其他扫描模式对在各扫描模式中间隔抽取出的像素进行补足,因此能够提高识别精度。The present invention has a scanning pattern recording means that records a plurality of scanning patterns for specifying the scanning order and the thinning width when scanning the recognition area by thinning pixels in the image data, which can be determined according to the preceding and following Different types of scanning modes in the frame scan the recognition area in the image data to reduce the amount of calculation required for the recognition processing of each frame, and other scanning modes can be used to scan the pixels extracted at intervals in each scanning mode Complementing it can improve the recognition accuracy.

附图说明Description of drawings

图1是表示本发明的实施例1的图像识别装置的结构图。FIG. 1 is a configuration diagram showing an image recognition device according toEmbodiment 1 of the present invention.

图2是表示本发明的实施例1的图像识别装置的动作的流程图。FIG. 2 is a flowchart showing the operation of the image recognition device according toEmbodiment 1 of the present invention.

图3是表示标准化模式记录部件的查找表的图。Fig. 3 is a diagram showing a lookup table of a normalization pattern recording unit.

图4是表示扫描模式记录部件的扫描模式的图。Fig. 4 is a diagram showing a scanning pattern of a scanning pattern recording unit.

图5是表示识别区域的扫描顺序的图。FIG. 5 is a diagram showing the scanning order of the recognition area.

图6是表示识别区域的扫描顺序的图。FIG. 6 is a diagram showing the scanning order of the recognition area.

图7是表示识别区域的扫描顺序的图。FIG. 7 is a diagram showing the scanning order of the recognition area.

图8是表示本发明的实施例2的图像识别装置的结构图。Fig. 8 is a configuration diagram showing an image recognition device according toEmbodiment 2 of the present invention.

图9是表示本发明的实施例2的图像识别装置的动作的流程图。Fig. 9 is a flowchart showing the operation of the image recognition device according to the second embodiment of the present invention.

图10是表示图像数据的像素值的图。FIG. 10 is a diagram showing pixel values of image data.

图11是表示积分图像数据的图。FIG. 11 is a diagram showing integrated image data.

图12是表示对图像数据设置了识别区域的状态的图。FIG. 12 is a diagram showing a state where a recognition area is set for image data.

具体实施方式Detailed ways

实施例1Example 1

图1是用于实施本发明的实施例1的图像识别装置的结构图。在图1中,将通过对来自未图示的摄像装置的视频信号进行数字化而输入到输入部件1的图像数据转送到特征量计算部件2和标准化处理部件4。特征量计算部件2针对每个帧计算出其图像数据的全部像素的平均值和分布等特征量,并发送到标准化模式选择部件3。在标准化模式记录部件8中记录有多种标准化模式的查找表,标准化模式选择部件3参照记录在标准化模式记录部件8中的查找表,选择与特征量计算部件2计算出的特征量对应的标准化模式,并展开到存储器(未图示)中。标准化处理部件4使用该被展开到存储器内的标准化模式,对图像数据进行标准化。扫描模式设置部件6对每个帧设置扫描模式记录部件9内的扫描模式。扫描部件5根据由扫描模式设置部件6设置了的扫描模式,在扫描部件5中扫描识别区域。对照部件7对由扫描部件5扫描了的识别区域内的图像数据和识别对象的模型数据进行对照,输出对照结果。FIG. 1 is a block diagram of an image recognition device for implementingEmbodiment 1 of the present invention. In FIG. 1 , image data input to aninput section 1 by digitizing a video signal from an unillustrated imaging device is transferred to a featurequantity calculation section 2 and anormalization processing section 4 . The featurequantity calculation unit 2 calculates feature quantities such as an average value and a distribution of all pixels of the image data for each frame, and sends it to the normalizationmode selection unit 3 . A lookup table of multiple normalization patterns is recorded in the normalizationpattern recording part 8, and the normalizationpattern selection part 3 refers to the lookup table recorded in the normalizationpattern recording part 8, and selects the normalization pattern corresponding to the feature quantity calculated by the featurequantity calculation part 2. schema, and expanded into memory (not shown). Thenormalization processing unit 4 normalizes the image data using the normalization pattern expanded in the memory. The scanningpattern setting section 6 sets the scanning pattern in the scanningpattern recording section 9 for each frame. Thescanning part 5 scans the recognition area in thescanning part 5 according to the scanning mode set by the scanningmode setting part 6 . The matchingunit 7 compares the image data in the recognition area scanned by thescanning unit 5 with the model data of the recognition target, and outputs a comparison result.

接着,使用图2说明这样构成的图像识别装置的动作。图2是表示本实施例1的图像识别装置的动作的流程图。首先,向输入部件1输入每个帧的图像数据(st101)。将输入到输入部件1的帧的图像数据转送到特征量计算部件2和标准化处理部件4(st102)。特征量计算部件2对每个帧计算出转送了的图像数据的特征量(st103)。在此,说明作为特征量计算出图像数据内的全部像素值的平均值的情况,但特征量也可以是像素值的分布、直方图、最小值、最大值等。另外,在此用0~255的8比特表现图像数据的各像素值。进而,用INT型定义作为特征量的像素的平均值,对小数点以下进行舍入而取得0~255的整数值。Next, the operation of the image recognition device configured in this way will be described using FIG. 2 . FIG. 2 is a flowchart showing the operation of the image recognition device of the first embodiment. First, image data of each frame is input to the input section 1 (st101). The image data of the frame input to theinput section 1 is transferred to the featurequantity calculation section 2 and the normalization processing section 4 (st102). The featureamount calculation unit 2 calculates the feature amount of the transferred image data for each frame (st103). Here, a case will be described where the average value of all pixel values in the image data is calculated as the feature quantity, but the feature quantity may be a distribution of pixel values, a histogram, a minimum value, a maximum value, or the like. In addition, here, each pixel value of the image data is represented by 8 bits of 0 to 255. Furthermore, an average value of pixels serving as a feature value is defined in INT type, and an integer value of 0 to 255 is obtained by rounding off the decimal point.

标准化模式选择部件3参照记录在标准化模式记录部件8中的查找表,选择与由特征量计算部件2计算出的特征量对应的标准化模式,并展开到存储器中(st104)。在图3中表示在设记录在标准化模式记录部件8中的特征量为像素的平均值时的查找表。在图3中,在查找表100中与0~255的特征量对应地记录有多种标准化模式。通过组合多种函数能够构成各标准化模式,例如组合一次函数、二次函数、三角函数、指数函数等,预先构成与特征量对应的模式。例如在由特征量计算部件2计算出的特征量是0的情况下,选择标准化模式101,在是252的情况下选择标准化模式102,在是255的情况下选择标准化模式103,并展开到存储器中。在标准化模式101、102、103中,横轴表示标准化处理前的像素值,纵轴表示标准化处理后的像素值。The normalizationpattern selection unit 3 refers to the lookup table recorded in the normalizationpattern recording unit 8, selects a normalization pattern corresponding to the feature quantity calculated by the featurequantity calculation unit 2, and expands it into the memory (st104). FIG. 3 shows a lookup table when the feature quantity recorded in the normalizationpattern recording unit 8 is an average value of pixels. In FIG. 3 , a plurality of normalization patterns are recorded in the lookup table 100 corresponding to feature quantities of 0 to 255. Each standardized pattern can be formed by combining various functions, such as combining linear functions, quadratic functions, trigonometric functions, exponential functions, etc., to form patterns corresponding to feature quantities in advance. For example, when the feature quantity calculated by the featurequantity calculation unit 2 is 0, select the normalization mode 101, select the normalization mode 102 when it is 252, select the normalization mode 103 when it is 255, and expand to the memory middle. In the normalization patterns 101 , 102 , and 103 , the horizontal axis represents pixel values before normalization processing, and the vertical axis represents pixel values after normalization processing.

另外,在作为特征量使用了像素的分布值的情况下,也可以与上述的平均值的情况一样,作成具有与各分布值对应的标准化模式的查找表。Also, when distribution values of pixels are used as feature quantities, a lookup table having a normalization pattern corresponding to each distribution value may be created similarly to the case of the above-mentioned average value.

接着,标准化处理部件4通过使用由标准化模式选择部件3选择并展开到存储器中的标准化模式,对图像数据的像素值进行变换,而进行标准化处理(st105)。这样,通过预先将作为与图像数据的特征量对应地对像素值进行了变换的结果的标准化模式,作为查找表记录在标准化模式记录部件8中,在进行标准化处理时就不需要进行计算处理了。另外,标准化模式选择部件3与特征量对应地选择标准化模式,因此能够灵活地对应摄影时的照明变化。Next, thenormalization processing unit 4 performs normalization processing by converting the pixel values of the image data using the normalization mode selected by the normalizationmode selection unit 3 and expanded into the memory (st105). In this way, by pre-recording the normalization pattern which is the result of converting the pixel values corresponding to the feature value of the image data in the normalizationpattern recording part 8 as a lookup table, calculation processing is not required when normalization processing is performed. . In addition, the normalizationmode selection unit 3 selects the normalization mode according to the feature value, so it can flexibly respond to changes in lighting during photography.

接着,扫描模式设置部件6依照记录在扫描模式记录部件9中的那样设置扫描模式(st106)。扫描部件5根据由扫描模式设置部件6设置了的扫描模式,对识别区域进行扫描(st107)。Next, the scanningpattern setting section 6 sets the scanning pattern as recorded in the scanning pattern recording section 9 (st106). Thescanning unit 5 scans the recognition area according to the scanning mode set by the scanning mode setting unit 6 (st107).

在此,说明识别区域的扫描的方法。如图4所示,在扫描模式记录部件9中,记录有使用扫描模式个数、扫描模式顺序、扫描模式。扫描模式个数表示所使用的扫描模式的个数,扫描模式顺序表示所使用的扫描模式的顺序。另外,扫描模式由模式ID、座标数、座标表构成。模式ID表示用于确定该扫描模式的ID,座标数表示适用于1帧的图像数据的识别区域的个数。另外,座标表只包含座标个数的表示图像数据内的x座标、y座标的座标值,各座标值表示图像数据内的识别区域的位置。Here, a method of scanning the recognition area will be described. As shown in FIG. 4 , in the scanpattern recording unit 9 , the number of scan patterns used, the order of the scan patterns, and the scan patterns are recorded. The number of scan modes indicates the number of scan modes used, and the order of scan modes indicates the order of scan modes used. In addition, the scan pattern is composed of a pattern ID, a coordinate number, and a coordinate table. The mode ID indicates an ID for specifying the scan mode, and the coordinate number indicates the number of identification regions applicable to one frame of image data. In addition, the coordinate table includes only coordinate values indicating x-coordinates and y-coordinates in the image data, and each coordinate value indicates the position of the recognition area in the image data.

扫描部件5针对标准化处理了的图像数据,使用具有由扫描模式顺序指定的模式ID的扫描模式,扫描识别区域。在此,为了说明的方便,说明使用扫描模式个数为3,检测模式顺序为1、2、3的顺序,各帧的图像数据的大小为横8×纵10的80个像素,识别区域的大小为横2×纵2的4个像素的情况,但可以将它们设置为适当的值。Thescanning unit 5 scans the recognition area with respect to the normalized image data using a scan pattern having a pattern ID specified in the order of the scan patterns. Here, for the convenience of description, the number of scanning modes is 3, the sequence of detection modes is 1, 2, 3, the size of the image data of each frame is 80 pixels of horizontal 8 × vertical 10, and the recognition area The case where the size is 4 pixels of horizontal 2 × vertical 2, but they can be set to appropriate values.

由于先头的检测模式顺序是1,所以扫描部件5针对最初输入的帧的图像数据,根据模式ID为1的扫描模式21进行识别区域的扫描。另外,针对接着输入的帧的图像数据,根据模式ID为2的扫描模式22进行识别区域的扫描,进而针对其后输入的帧的图像数据,根据模式ID为3的扫描模式23进行识别区域的扫描。这样,以在扫描模式顺序中指定的顺序,根据各扫描模式进行识别区域的扫描。Since the first detection pattern sequence is 1, thescanning unit 5 scans the recognition area in accordance with thescanning pattern 21 whose pattern ID is 1 with respect to the image data of the first input frame. In addition, for the image data of the next input frame, the scanning of the recognition area is performed according to thescanning mode 22 whose mode ID is 2, and for the image data of the frame input thereafter, the scanning of the recognition area is performed according to thescanning mode 23 whose mode ID is 3. scanning. In this way, the scanning of the recognition area is performed according to each scanning mode in the order specified in the scanning mode order.

图5是表示根据扫描模式21进行识别区域的扫描时的扫描顺序的图。图5的各模块表示图像数据内的像素。附加了数字的像素表示由座标表内的座标值指定。另外,数字的顺序与座标表内的座标值的顺序对应。即,扫描模式数据21内的座标表的最初的座标值(x1-1,y1-1)是附加了1的像素的图像数据内的座标,下一个座标值(x1-2,y1-2)是附加了2的像素的图像数据内的座标。现在,扫描模式21的座标个数是20,因此向图像数据内的像素附加1到20的数字。FIG. 5 is a diagram showing a scanning sequence when scanning the recognition area according to thescanning pattern 21 . Each block in FIG. 5 represents a pixel within image data. Pixel representations appended with numbers are specified by coordinate values in the coordinate table. In addition, the order of numbers corresponds to the order of coordinate values in the coordinate table. That is, the first coordinate value (x1-1, y1-1) of the coordinate table in thescan pattern data 21 is the coordinate in the image data of the pixel to which 1 is added, and the next coordinate value (x1-2, y1-2) is the coordinate in the image data of the pixel to which 2 is added. Now, the number of coordinates of thescan pattern 21 is 20, so numbers from 1 to 20 are added to pixels in the image data.

接着,扫描部件5以记录在座标表中的座标值的顺序进行识别区域的扫描。这时,各座标值表示识别区域的左上端的座标。最初,在将图5中的1的位置作为识别区域的左上端的用斜线所示的位置处设置识别区域。接着,在将2的位置作为识别区域的左上端的位置处设置识别区域。循环这些动作共座标个数的次数,在此为20次,进行识别区域的扫描。Next, thescanning unit 5 scans the recognition area in the order of the coordinate values recorded in the coordinate table. In this case, each coordinate value represents the coordinate of the upper left end of the recognition area. Initially, a recognition area is set at a position indicated by oblique lines with the position of 1 in FIG. 5 being the upper left end of the recognition area. Next, a recognition area is set at a position where the position of 2 is the upper left end of the recognition area. These actions are repeated for the number of coordinates, 20 times here, to scan the recognition area.

另外,针对下一个帧的图像数据,扫描部件5根据扫描模式22循环进行同样的动作,进行识别区域的扫描。在图6中表示根据扫描模式22扫描识别区域时的扫描顺序。如从图6可知的那样,扫描模式22的座标个数为16,座标表内的各座标值(x2-1,y2-1)~(x2-16,y2-16)是逐个像素地将扫描模式21的座标表内的座标值(x1-1,y1-1)~(x1-16,y1-16)向下方移动了的座标值。In addition, with respect to the image data of the next frame, thescanning unit 5 performs the same operations in a cycle according to thescanning pattern 22 to scan the recognition area. FIG. 6 shows the scanning sequence when scanning the recognition area according to thescanning pattern 22 . As can be seen from FIG. 6, the number of coordinates in thescanning pattern 22 is 16, and each coordinate value (x2-1, y2-1)-(x2-16, y2-16) in the coordinate table is pixel by pixel Coordinate values (x1-1, y1-1) to (x1-16, y1-16) in the coordinate table of thescan pattern 21 are shifted downward.

这样,通过针对连续的帧根据不同的扫描模式进行识别区域的扫描,能够用其他的扫描模式对在各扫描模式中间隔抽取出的像素进行补足,能够提高识别精度。In this way, by performing scanning of the recognition area in accordance with different scan patterns for consecutive frames, the pixels extracted in each scan pattern can be supplemented with other scan patterns, and the recognition accuracy can be improved.

另外,如果将例如如图7所示那样从图像数据的中心开始螺旋状地扫描识别区域那样的扫描模式记录在扫描模式记录部件9中,则能够提供对识别对象存在于图像的中心的可能性高的情况有效的识别区域的扫描顺序。这样,用户能够与目的对应地设置识别区域的扫描顺序。In addition, if a scan pattern that scans the identification area spirally from the center of the image data as shown in FIG. A high case effectively identifies the scan order of the region. In this way, the user can set the scanning order of the recognition areas corresponding to the purpose.

另外,在此说明了预先设置了扫描模式顺序的情况,但也可以不特别地决定这些顺序。即,也可以构成为随机地选择记录在扫描模式记录部件9中的扫描模式。In addition, the case where the order of the scan patterns is set in advance has been described here, but these orders may not be particularly determined. That is, the scan pattern recorded in the scanpattern recording section 9 may be randomly selected.

接着,对照部件7对扫描部件5扫描的识别区域内的图像数据和识别对象的模型数据进行对照(st108)。该对照的方法例如有求出图像相关性的方法或以下的方法。Next, thematching unit 7 compares the image data in the recognition area scanned by thescanning unit 5 with the model data of the recognition target (st108). The method of this comparison includes, for example, the method of obtaining image correlation or the following methods.

例如抽出学习了物体的特征后的数据作为被称为矩形特征(Rectangle Feature:Rf)的特征量。使用组合了矩形特征的被称为分类器(Classifier)的判别函数来表现物体的特征。具体地说,如公式(1)所示那样,与各矩形特征的值(Rf(i))是否超过了阈值(th)对应地赋予加权(pv或nv)。接着,定义公式(2)所示那样的全部的矩形特征的加权的总和(Cls),如公式(3)所示那样,在该总和超过了某阈值(th2)的情况下,将其判断为识别对象(obiect),在没有超过的情况下,判断为非识别对象(nonobject)。将该判断结果作为对照结果输出(st109)。For example, data obtained by learning the features of an object is extracted as a feature quantity called a rectangle feature (Rectangle Feature: Rf). Object features are expressed using a discriminant function called a classifier that combines rectangular features. Specifically, as shown in formula (1), weighting (pv or nv) is given according to whether the value (Rf(i)) of each rectangular feature exceeds a threshold value (th). Next, the weighted sum (Cls) of all rectangular features as shown in formula (2) is defined, and when the sum exceeds a certain threshold (th2) as shown in formula (3), it is judged as If the recognition object (obiect) does not exceed, it is judged as a non-recognition object (nonobject). This judgment result is output as a collation result (st109).

公式(1)Formula 1)

RfRf((ii))ii∈∈∀∀==RfRf((ii))>>ththe th→&Right Arrow;pvPVRfRf((ii))≤≤ththe th→&Right Arrow;nvnv

公式(2)Formula (2)

ClsCls==ΣΣii∈∈∀∀RfRf((ii))

公式(3)Formula (3)

clscls>>ththeth22→&Right Arrow;objectobjectclscls≤≤ththeth22→&Right Arrow;nonobjectnon-object

在本实施例1的图像识别装置中,说明了用0~255的8比特表现输入的图像数据的各像素值的灰度等级(gray scale)的图像,但也可以是用R、G、B分别为8比特的合计24比特表现各像素值的全彩色等的图像。在该情况下,在st103中作为图像数据的特征量求出像素值的平均值的情况下,求出每个颜色的像素值的平均值,在st104中选择标准化模式时,通过与各色的特征量对应地选择不同种类的标准化模式,能够起到同样的效果。In the image recognition device of thepresent embodiment 1, 8 bits of 0 to 255 are used to express the image of the gray scale (gray scale) of each pixel value of the input image data, but it is also possible to use R, G, B A total of 24 bits each of 8 bits expresses an image such as full color of each pixel value. In this case, when the average value of the pixel values is obtained as the feature value of the image data in st103, the average value of the pixel values for each color is obtained, and when the normalization mode is selected in st104, the average value of the pixel values is obtained by combining with the characteristics of each color The same effect can be achieved by selecting different types of normalization modes corresponding to the quantity.

另外,记录在标准化模式记录部件8中的查找表的标准化模式也可以是对于全部的特征量的每个都不同的标准化模式,根据特征量的范围,也可以是相同的标准化模式。这样,通过使用在某种程度的特征量的范围内相同的标准化模式,能够削减标准化模式记录部件8的存储器容量。In addition, the normalization pattern of the lookup table recorded in the normalizationpattern recording unit 8 may be a different normalization pattern for all the feature quantities, or may be the same normalization pattern depending on the range of the feature quantity. In this way, by using the same normalization pattern within a certain range of feature values, it is possible to reduce the memory capacity of the normalizationpattern recording unit 8 .

实施例2Example 2

图8是用于实施本发明的实施例2的图像识别装置的结构图,向与图1对应的部分附加相同的编号。在图8中,输入到输入部件1的图像数据被转送到扫描部件5和积分图像数据计算部件10。扫描模式设置部件6针对每个帧设置扫描模式记录部件9内的扫描模式,扫描部件5根据所设置的扫描模式,扫描识别区域。另外,积分图像数据计算部件10计算每个帧的图像数据的积分图像数据。特征量计算部件2使用该积分图像数据,计算识别区域内的图像数据的特征量。标准化模式选择部件3参照记录在标准化模式记录部件8中的查找表,选择与由特征量计算部件2计算出的特征量对应的标准化模式并展开到存储器(在图中未表示)中。标准化处理部件4使用该被展开在存储器内的标准化模式,对识别区域内的图像数据进行标准化。对照部件7对该标准化了的图像数据和识别对象的模型数据进行对照,输出对照结果。FIG. 8 is a configuration diagram of an image recognition apparatus according toEmbodiment 2 of the present invention, and the parts corresponding to those in FIG. 1 are assigned the same reference numerals. In FIG. 8 , the image data input to theinput section 1 is transferred to thescanning section 5 and the integrated imagedata calculation section 10 . The scanningpattern setting part 6 sets the scanning pattern in the scanningpattern recording part 9 for each frame, and thescanning part 5 scans the identification area according to the set scanning pattern. In addition, the integral imagedata calculation section 10 calculates integral image data of the image data of each frame. The featurequantity calculating means 2 calculates the feature quantity of the image data in the recognition area using this integrated image data. The normalizationpattern selection unit 3 refers to the lookup table recorded in the normalizationpattern recording unit 8, selects a normalization pattern corresponding to the feature quantity calculated by the featurequantity calculation unit 2, and expands it into a memory (not shown). Thenormalization processing unit 4 normalizes the image data in the recognition area using the normalization pattern developed in the memory. Thecollation unit 7 collates the normalized image data and the model data of the recognition target, and outputs the collation result.

接着,使用图9说明这样构成的图像识别装置的动作。图9是表示本实施例2的图像识别装置的动作的流程图。首先,向输入部件1输入每个帧的图像数据(st201)。输入到输入部件1的图像数据被转送到扫描部件5和积分图像数据计算部件10(st202)。扫描模式设置部件6与st106一样地(参照图2,下同),根据记录在扫描模式记录部件9中的扫描模式,设置扫描模式(st203)。另外,与st107一样地,扫描部件5根据所设置的扫描模式,扫描识别区域(st204)。Next, the operation of the image recognition device configured in this way will be described using FIG. 9 . FIG. 9 is a flowchart showing the operation of the image recognition device according to the second embodiment. First, image data of each frame is input to the input section 1 (st201). The image data input to theinput section 1 is transferred to thescanning section 5 and the integral image data calculation section 10 (st202). The scanmode setting unit 6 sets the scan mode based on the scan mode recorded in the scan mode recording unit 9 (st203) as in st106 (see FIG. 2 , the same applies hereinafter). In addition, similarly to st107, thescanning unit 5 scans the recognition area according to the set scanning mode (st204).

另一方面,积分图像数据计算部件10计算从输入部件1转送的图像数据的积分图像数据(st205)。在此,积分图像数据是针对图像数据在水平垂直方向上求出了像素值的累计的图像数据。在设原始的图像数据的像素值为I(x,y)时,用公式(4)表示积分图像数据I’(x,y)。On the other hand, the integral imagedata calculation section 10 calculates integral image data of the image data transferred from the input section 1 (st205). Here, the integrated image data is image data in which pixel values are accumulated horizontally and vertically for image data. When the pixel value of the original image data is I(x, y), the integrated image data I'(x, y) is represented by formula (4).

公式(4)Formula (4)

II′′((xx,,ythe y))==ΣΣxx′′≤≤xxΣΣythe y′′≤≤ythe yII((xx′′,,ythe y′′))

例如,在原始的图像数据是具有图10所示那样的像素值的图像数据的情况下,该图像数据的积分图像数据如图11所示。For example, when the original image data is image data having pixel values as shown in FIG. 10 , integrated image data of the image data is as shown in FIG. 11 .

接着,特征量计算部件2使用积分图像数据,计算出识别区域内的图像数据的特征量(st206)。在此,在设特征量为像素值的平均的情况下,需要计算出识别区域内的像素值的和。现在,如果将识别部件5识别的识别区域设置为图12所示那样附加了斜线的区域,设A、B、C、D的图像数据内的座标分别为(xA,yA)、(xB,yB)、(xC,yC)、(xD,yD),则可以根据公式(5)求出识别区域内的像素值的和S。Next, the featurequantity calculation unit 2 calculates the feature quantity of the image data in the recognition area using the integrated image data (st206). Here, when the feature amount is an average of pixel values, it is necessary to calculate the sum of the pixel values in the recognition area.Now , if the recognition area recognized bytherecognition part 5 is set as the area with oblique lines as shown in FIG. (xB , yB ), (xC , yC ), (xD , yD ), then the sum S of the pixel values in the recognition area can be calculated according to formula (5).

公式(5)Formula (5)

S=I′(xA,yA)+I′(xD,yD)-(I′(xB,yB)+I′(xC,yC))S=I'(xA , yA )+I'(xD , yD )-(I'(xB , yB )+I'(xC , yC ))

这样,通过使用积分图像数据,容易地求出任意的矩形区域的像素值的和,能够减少特征量计算部件2中的计算量。In this way, by using the integrated image data, the sum of the pixel values of an arbitrary rectangular area can be easily obtained, thereby reducing the amount of calculation in the featurevalue calculation unit 2 .

接着,与st104一样地,标准化模式选择部件3参照记录在标准化模式记录部件8中的查找表,选择与由特征量计算部件2计算出的特征量对应的标准化模式,并展开到存储器中(st207)。标准化处理部件4针对识别区域内的图像数据,通过与st105一样的操作,进行标准化处理(st208)。然后,对照部件7通过与st108、st109一样的步骤,进行与模型数据的对照(st209),输出对照结果(st210)。Next, as in st104, the normalizationpattern selection unit 3 refers to the lookup table recorded in the normalizationpattern recording unit 8, selects the normalization pattern corresponding to the feature quantity calculated by the featurequantity calculation unit 2, and expands it into the memory (st207 ). Thenormalization processing unit 4 performs normalization processing (st208) on the image data in the recognition area by the same operation as st105. Then, the collation means 7 performs collation with the model data through the same steps as st108 and st109 (st209), and outputs the collation result (st210).

在本实施例2的图像识别装置中,由于以识别区域为单位进行图像数据的标准化处理,所以对局部的照明变化的耐性变强,起到能够提高识别精度的效果。In the image recognition device according to the second embodiment, since the image data is normalized in units of recognition regions, the resistance to local illumination changes is increased, and the recognition accuracy can be improved.

另外,在本实施例中,说明了作为特征量使用了图像数据的像素值的平均值的情况,但作为特征量也可以是像素值的分布。在该情况下,也可以在st205中求出积分图像数据的同时,在将图像数据的各像素值进行了平方的基础上,求出在水平垂直方向上求出了像素值的累计值后的平方积分图像数据,使用积分图像数据和平方积分图像数据,计算出识别区域内的像素值的分布值v。可以根据公式(6)计算出分布值v。In addition, in this embodiment, the case where the average value of the pixel values of the image data is used as the feature quantity has been described, but the feature quantity may be a distribution of pixel values. In this case, while calculating the integrated image data in st205, the pixel values of the image data are squared, and the integrated value of the pixel values obtained in the horizontal and vertical directions may be obtained. In the square integral image data, the distribution value v of the pixel values in the recognition area is calculated using the integral image data and the square integral image data. The distribution value v can be calculated according to formula (6).

公式(6)Formula (6)

vv==11nnoSSSS--((11nnoSS))22

在公式(6)中,In formula (6),

SS:识别区域内的平方积分图像数据的和SS: sum of squared integral image data within the recognition area

S:识别区域内的积分图像数据的和S: sum of integral image data within the recognition area

n:识别区域内的像素的总数。n: the total number of pixels in the recognition area.

这样,即使在作为特征量使用像素值的分布值的情况下,也能够减轻特征量计算部件2中的计算量。In this way, even in the case where distribution values of pixel values are used as feature quantities, the amount of calculation in the featurequantity calculation section 2 can be reduced.

Claims (6)

Translated fromChinese
1.一种图像识别装置,其特征在于包括:1. An image recognition device, characterized in that it comprises:输入每个帧的图像数据的图像数据输入部件;an image data input part that inputs image data of each frame;计算出所输入的上述图像数据的特征量的特征量计算部件;a feature quantity calculating means for calculating a feature quantity of the inputted image data;根据该特征量,对上述图像数据进行标准化的标准化处理部件;A normalization processing unit for standardizing the above-mentioned image data according to the feature quantity;记录了多个用于在图像数据内对像素进行间隔抽取而对规定大小的识别区域进行扫描的扫描模式的扫描模式记录部件;a scan pattern recording unit that records a plurality of scan patterns for scanning a recognition area of a predetermined size by thinning out pixels within the image data;针对前后的帧设置不同的上述扫描模式的扫描模式设置部件;A scanning mode setting component for setting different above-mentioned scanning modes for the frames before and after;使用由上述扫描模式设置部件对该帧设置了的上述扫描模式,针对每个上述识别区域对每个帧的标准化了的图像数据进行扫描的扫描部件;A scanning unit that scans the normalized image data of each frame for each of the identification regions using the scanning mode set for the frame by the scanning mode setting unit;对由上述扫描部件扫描了的识别区域内的图像数据和识别对象的模型数据进行对照,输出该对照结果的对照部件。A collating unit that compares the image data in the recognition area scanned by the scanning unit with the model data of the recognition target, and outputs the result of the comparison.2.一种图像识别装置,其特征在于包括:2. An image recognition device, characterized in that it comprises:输入每个帧的图像数据的图像数据输入部件;an image data input part that inputs image data of each frame;预先记录了与图像数据的特征量对应的多种标准化模式的标准化模式记录部件;a normalization pattern recording means pre-recording a plurality of normalization patterns corresponding to the feature quantities of the image data;计算出所输入的上述图像数据的特征量的特征量计算部件;a feature quantity calculating means for calculating a feature quantity of the inputted image data;从上述标准化模式记录部件中选择与由上述特征量计算部件计算出的特征量对应的标准化模式的标准化模式选择部件;A normalization pattern selection part for selecting a normalization pattern corresponding to the feature quantity calculated by the above-mentioned feature quantity calculation part from the above-mentioned normalization pattern recording part;使用由上述标准化模式选择部件选择出的标准化模式,对上述图像数据进行标准化的标准化处理部件;a normalization processing unit for normalizing the image data using the normalization mode selected by the normalization mode selection unit;针对每个规定大小的识别区域,扫描标准化了的图像数据的扫描部件;For each identification area of specified size, scan the scanning part of the standardized image data;对由上述扫描部件扫描了的识别区域内的图像数据和识别对象的模型数据进行对照,输出该对照结果的对照部件。A collating unit that compares the image data in the recognition area scanned by the scanning unit with the model data of the recognition target, and outputs the result of the comparison.3.一种图像识别装置,其特征在于包括:3. An image recognition device, characterized in that it comprises:输入每个帧的图像数据的图像数据输入部件;an image data input part that inputs image data of each frame;预先记录了与图像数据的特征量对应的多种标准化模式的标准化模式记录部件;a normalization pattern recording means pre-recording a plurality of normalization patterns corresponding to the feature quantities of the image data;计算出所输入的上述图像数据的特征量的特征量计算部件;a feature quantity calculating means for calculating a feature quantity of the inputted image data;从上述标准化模式记录部件中选择与由上述特征量计算部件计算出的特征量对应的标准化模式的标准化模式选择部件;A normalization pattern selection part for selecting a normalization pattern corresponding to the feature quantity calculated by the above-mentioned feature quantity calculation part from the above-mentioned normalization pattern recording part;使用由上述标准化模式选择部件选择出的标准化模式,对上述图像数据进行标准化的标准化处理部件;a normalization processing unit for normalizing the image data using the normalization mode selected by the normalization mode selection unit;记录了多个扫描模式的扫描模式记录部件,其中该扫描模式是用于在图像数据内间隔抽取像素而扫描规定大小的识别区域的模式;a scanning pattern recording means that records a plurality of scanning patterns, wherein the scanning pattern is a pattern for scanning an identification area of a predetermined size by thinning out pixels in the image data;针对前后的帧设置不同的上述扫描模式的扫描模式设置部件;A scanning mode setting component for setting different above-mentioned scanning modes for the frames before and after;针对每个上述识别区域,使用由上述扫描模式设置部件对该帧设置了的扫描模式,扫描每个帧的标准化了的图像数据的扫描部件;A scanning unit that scans the standardized image data of each frame using the scanning mode set by the scanning mode setting unit for the frame for each of the above-mentioned identification regions;对由上述扫描部件扫描了的识别区域内的图像数据和识别对象的模型数据进行对照,输出该对照结果的对照部件。A collating unit that compares the image data in the recognition area scanned by the scanning unit with the model data of the recognition target, and outputs the result of the comparison.4.一种图像识别装置,其特征在于包括:4. An image recognition device, characterized in that it comprises:输入每个帧的图像数据的图像数据输入部件;an image data input part that inputs image data of each frame;记录了多个扫描模式的扫描模式记录部件,其中该扫描模式是用于在图像数据内间隔抽取像素而扫描规定大小的识别区域的模式;a scanning pattern recording means that records a plurality of scanning patterns, wherein the scanning pattern is a pattern for scanning an identification area of a predetermined size by thinning out pixels in the image data;针对前后的帧设置不同的上述扫描模式的扫描模式设置部件;A scanning mode setting component for setting different above-mentioned scanning modes for the frames before and after;针对每个上述识别区域,使用对该帧设置了的扫描模式,扫描每个帧的图像数据的扫描部件;A scanning component that scans the image data of each frame by using the scanning mode set for the frame for each of the above identified regions;计算出上述识别区域的图像数据的特征量的特征量计算部件;A feature quantity calculating means for calculating the feature quantity of the image data of the above-mentioned identification area;预先记录了与图像数据的特征量对应的多种标准化模式的标准化模式记录部件;a normalization pattern recording means pre-recording a plurality of normalization patterns corresponding to the feature quantities of the image data;从上述标准化模式记录部件中选择与由上述特征量计算部件计算出的特征量对应的标准化模式的标准化模式选择部件;A normalization pattern selection part for selecting a normalization pattern corresponding to the feature quantity calculated by the above-mentioned feature quantity calculation part from the above-mentioned normalization pattern recording part;使用由上述标准化模式选择部件选择出的标准化模式,对上述识别区域的图像数据进行标准化的标准化处理部件;a normalization processing part for normalizing the image data of the recognition area using the normalization mode selected by the normalization mode selection part;在由上述扫描部件进行了扫描后,对由上述标准化处理部件进行了标准化的识别区域内的图像数据和识别对象的模型数据进行对照,输出对照结果的对照部件。A collating unit that compares the image data in the recognition area normalized by the normalization processing unit with the model data of the recognition object after scanning by the scanning unit, and outputs a comparison result.5.根据权利要求4所述的图像识别装置,其特征在于还包括:5. The image recognition device according to claim 4, further comprising:计算出对输入到上述图像数据输入部件的图像数据的像素值进行了累计的积分图像数据的积分图像数据计算部件,其中integral image data calculation means for calculating integral image data in which pixel values of image data input to the above-mentioned image data input means are accumulated, wherein上述特征量计算部件使用上述积分图像数据,计算出上述识别区域的图像数据的特征量。The feature amount calculation means calculates feature amounts of the image data of the recognition region using the integral image data.6.根据权利要求1~5的任意一个所述的图像识别装置,其特征在于:6. The image recognition device according to any one of claims 1-5, characterized in that:上述特征量是像素值的平均或分布。The aforementioned feature quantity is an average or distribution of pixel values.
CNB2006101415553A2005-10-052006-09-28 Image recognition deviceExpired - Fee RelatedCN100419781C (en)

Applications Claiming Priority (5)

Application NumberPriority DateFiling DateTitle
JP20052925252005-10-05
JP20052925252005-10-05
JP2005-2925252005-10-05
JP20060240862006-02-01
JP2006-0240862006-02-01

Publications (2)

Publication NumberPublication Date
CN1945598Atrue CN1945598A (en)2007-04-11
CN100419781C CN100419781C (en)2008-09-17

Family

ID=38044997

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CNB2006101415553AExpired - Fee RelatedCN100419781C (en)2005-10-052006-09-28 Image recognition device

Country Status (1)

CountryLink
CN (1)CN100419781C (en)

Cited By (2)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN101571698B (en)*2008-05-022011-12-07夏普株式会社Method for matching images, image matching device, image data output apparatus, and recording medium
CN114219839A (en)*2021-12-242022-03-22欧波同科技产业有限公司Frame image position calculation method based on matrix filling algorithm

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
JP4091174B2 (en)*1998-08-112008-05-28株式会社東芝 Image processing apparatus and image processing method
JP3529640B2 (en)*1998-09-172004-05-24松下電器産業株式会社 Image data recognition method and apparatus
JP2001006599A (en)*1999-06-222001-01-12Jeol Ltd Electron beam control method in electron beam device
JP3712234B2 (en)*2002-03-192005-11-02株式会社日立製作所 Region of interest extraction method and image processing server
JP4521747B2 (en)*2002-12-132010-08-11シャープ株式会社 Object information processing apparatus, image processing system, game apparatus, and image processing method
JP4603807B2 (en)*2004-03-102010-12-22富士通株式会社 Character recognition device, character recognition method, medium processing method, character recognition program, and computer-readable recording medium on which character recognition program is recorded

Cited By (2)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN101571698B (en)*2008-05-022011-12-07夏普株式会社Method for matching images, image matching device, image data output apparatus, and recording medium
CN114219839A (en)*2021-12-242022-03-22欧波同科技产业有限公司Frame image position calculation method based on matrix filling algorithm

Also Published As

Publication numberPublication date
CN100419781C (en)2008-09-17

Similar Documents

PublicationPublication DateTitle
JP4670664B2 (en) Image recognition device
JP4771906B2 (en) Method for classifying images with respect to JPEG compression history
US6985631B2 (en)Systems and methods for automatically detecting a corner in a digitally captured image
KR101795823B1 (en)Text enhancement of a textual image undergoing optical character recognition
Khanna et al.Forensic techniques for classifying scanner, computer generated and digital camera images
JP6139396B2 (en) Method and program for compressing binary image representing document
KR100874561B1 (en) Image Processing Apparatus and Method
CN101228550A (en)Binarization of images
Khanna et al.Scanner identification using feature-based processing and analysis
EP2605186B1 (en)Method and apparatus for recognizing a character based on a photographed image
CN103177249A (en)Image processing apparatus and image processing method
CN101042735A (en)Image binarization method and device
JP2011128990A (en)Image processor and image processing method
CN102737240B (en)Method of analyzing digital document images
CN102750686B (en)Super-resolution file image restoration processing method based on learning
JP2005521116A (en) Device for detecting edges in image blocks
US9332154B2 (en)Image binarization using dynamic sub-image division
CN100433047C (en)Device and method for detecting blurring of image
Lins et al.Doceng'2020 time-quality competition on binarizing photographed documents
US9167129B1 (en)Method and apparatus for segmenting image into halftone and non-halftone regions
CN1237485C (en)Method for covering face of news interviewee using quick face detection
CN1945598A (en)Image recognition device
CN1797428A (en)Method and device for self-adaptive binary state of text, and storage medium
CN1920853A (en) Systems and methods for content identification
CN1941838A (en)File and picture binary coding method

Legal Events

DateCodeTitleDescription
C06Publication
PB01Publication
C10Entry into substantive examination
SE01Entry into force of request for substantive examination
C14Grant of patent or utility model
GR01Patent grant
C17Cessation of patent right
CF01Termination of patent right due to non-payment of annual fee

Granted publication date:20080917

Termination date:20120928


[8]ページ先頭

©2009-2025 Movatter.jp