Movatterモバイル変換


[0]ホーム

URL:


CN112906699A - Method for detecting and identifying enlarged number of license plate - Google Patents

Method for detecting and identifying enlarged number of license plate
Download PDF

Info

Publication number
CN112906699A
CN112906699ACN202011532814.1ACN202011532814ACN112906699ACN 112906699 ACN112906699 ACN 112906699ACN 202011532814 ACN202011532814 ACN 202011532814ACN 112906699 ACN112906699 ACN 112906699A
Authority
CN
China
Prior art keywords
license plate
image
layer
network
sample
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Granted
Application number
CN202011532814.1A
Other languages
Chinese (zh)
Other versions
CN112906699B (en
Inventor
刘毛溪
梁添才
赵清利
徐天适
潘新生
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Shenzhen Radio & Tv Xinyi Technology Co ltd
Original Assignee
Shenzhen Xinyi Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Shenzhen Xinyi Technology Co LtdfiledCriticalShenzhen Xinyi Technology Co Ltd
Priority to CN202011532814.1ApriorityCriticalpatent/CN112906699B/en
Publication of CN112906699ApublicationCriticalpatent/CN112906699A/en
Application grantedgrantedCritical
Publication of CN112906699BpublicationCriticalpatent/CN112906699B/en
Activelegal-statusCriticalCurrent
Anticipated expirationlegal-statusCritical

Links

Images

Classifications

Landscapes

Abstract

Translated fromChinese

本发明属于智能交通领域,为车牌放大号的检测识别方法,包括:对车牌放大号所在区域进行检测定位,得到原始车牌放大号的样本图像;基于深度卷积的识别网络对车牌放大号字符进行识别,在训练阶段对原始车牌放大号的样本图像进行扩展获得训练样本集,然后构建识别网络并对实际的车牌放大号图像进行特征提取,得到最终的文本识别结果。本发明对识别网络在训练过程中文本识别部分和超分辨图像重构部分产生的损失进行加权计算,提升识别网络对低质量图像的特征表达能力,得到最优化的权重参数,降低训练样本集创建难度的同时,有效提高了车牌字符的识别效率,解决了车牌放大号的字符检测受字符大小、样式及间距不一致等影响而效果不佳的问题。

Figure 202011532814

The invention belongs to the field of intelligent transportation, and relates to a method for detecting and identifying an enlarged license plate number. In the training phase, the sample image of the original license plate magnification is expanded to obtain a training sample set, and then a recognition network is constructed and the actual license plate magnification image is extracted to obtain the final text recognition result. The invention performs weighted calculation on the loss generated by the text recognition part and the super-resolution image reconstruction part in the training process of the recognition network, improves the feature expression ability of the recognition network for low-quality images, obtains optimized weight parameters, and reduces the creation of training sample sets. At the same time, it effectively improves the recognition efficiency of license plate characters, and solves the problem that the character detection of the enlarged license plate is affected by the inconsistency of character size, style and spacing, and the effect is not good.

Figure 202011532814

Description

Method for detecting and identifying enlarged number of license plate
Technical Field
The invention belongs to the field of intelligent transportation, and particularly relates to a method for detecting and identifying an enlarged license plate number.
Background
The license plate recognition technology is a key and core module of an intelligent traffic system, and the detection and recognition technology of the enlarged license plate number can technically promote the existing license plate recognition technology in the following three aspects:
1. the number plate related to the existing number plate recognition technology is mainly a standard motor vehicle number plate, but relates to little or little number of the number plate amplified numbers (namely the amplified number plate numbers), and the thirteenth regulation of the national road traffic safety law enforcement regulations: the heavy and medium-sized truck and the trailer thereof, the tractor and the trailer thereof, the body or the rear part of the carriage of the tractor and the trailer thereof should be sprayed with enlarged marks, and characters should be aligned and kept clear. Therefore, the method for detecting and identifying the enlarged license plate number can supplement the existing license plate identification technology and improve the functional integrity of the license plate identification technology.
2. Due to the influences of the working environment, the driving road condition and the suspension position of the number plate of the vehicle, the situations of the number plate missing, blocking, fouling, blurring, overexposure and the like may exist in the images of the passenger and freight vehicles captured by the camera, and the detection and identification effects of the number plate identification technology on the standard number plate in the images of the passenger and freight vehicles captured by the tail are further influenced. Compared with a standard license plate, the license plate enlarged number is larger in size and more obvious in position, and the license plate enlarged number can be clearly captured even if the standard license plate is fuzzy.
3. The existing license plate recognition technology is directly applied to recognition of the enlarged license plate number, and the effect is poor. The main reasons are that: the spraying position of the license plate enlarged number is generally the rear part of a passenger and freight vehicle carriage, the background of the license plate enlarged number is not uniform, and more interference exists; the font size, font style and character spacing of the license plate enlarged number spray coating are inconsistent.
Disclosure of Invention
In order to solve the technical problems in the prior art, the invention provides a detection and recognition method for a license plate enlarged number, which effectively improves the recognition efficiency of license plate characters while reducing the difficulty in creating a training sample set by improving the sample labeling efficiency, performs weighted calculation and analysis on the losses of a text recognition part and a super-resolution image reconstruction part in the training process, and improves the feature extraction effect of a network model on low-quality images, thereby effectively solving the technical problem that the detection of the characters of the license plate enlarged number is not good in effect due to the influences of inconsistent character sizes, patterns and intervals and the like.
The invention is realized by adopting the following technical scheme: a method for detecting and identifying an enlarged number of a license plate comprises the following steps:
s1, detecting and positioning the area where the license plate enlarged number is located to obtain a sample image of the original license plate enlarged number;
s2, recognizing the license plate enlarged number characters based on a deep convolution recognition network;
step S2 includes:
s21, in the training stage of the recognition network model, expanding the sample image of the original license plate enlarged number to obtain a training sample set;
s22, constructing a recognition network based on the expanded training sample set, and performing feature extraction on the actual license plate enlarged number image by using the constructed recognition network to obtain a final text recognition result; and performing weighted calculation analysis on losses generated by the text recognition part and the super-resolution image reconstruction part of the recognition network in the training process, so as to improve the feature expression capability of the recognition network on low-quality images and obtain optimized weight parameters of the recognition network.
Compared with the prior art, the invention has the following advantages and beneficial effects:
1. the license plate is the most distinctive feature of a vehicle, compared with a standard motor vehicle license plate, the license plate enlarged number has no fixed background, the length-width ratio is greatly changed, the fonts and the sizes of characters are diversified, and the character intervals are inconsistent, so that the existing license plate recognition technology is difficult to be directly applied to recognition of the license plate enlarged number.
Aiming at the characteristics of the license plate enlarged number, the invention designs a license plate enlarged number detection and recognition algorithm based on a deep convolutional network. According to the algorithm, in the network model training process, through a mode of increasing sample image diversity, weighted calculation analysis is carried out on losses generated by a text recognition part and a super-resolution image reconstruction part in the training process, the characteristic extraction effect of the network model on low-quality images is improved, the characteristics of the license plate enlarged number can be well learned, and the algorithm has the characteristics of strong robustness and high applicability.
2. The existing mainstream license plate recognition technology mainly comprises the following processes: the three processes of license plate detection and positioning, license plate character segmentation and positioning and license plate character classification and recognition are carried out, three models of license plate detection, character segmentation and character recognition need to be constructed respectively, time and resources are consumed, and when a training sample set is established in a model training stage, single-character position labeling needs to be carried out on a complete license plate sample, so that too much labor is consumed.
In the license plate amplified number recognition process, the invention simplifies the flow of the license plate recognition technology, provides an end-to-end license plate amplified number recognition algorithm to replace the existing license plate character segmentation positioning algorithm and the existing character classification recognition algorithm, and improves the recognition efficiency; the diversity of sample images is enriched through data preprocessing, data enhancement and other modes, and the robustness of the algorithm is improved; by reducing the labeling process and improving the labeling efficiency, the difficulty in creating the training sample set is reduced.
Drawings
FIG. 1 is a schematic diagram of a license plate information detection and identification process in an embodiment of the present invention;
FIG. 2 is an illustration of an enlarged license plate position in a captured image of a vehicle according to an embodiment of the present invention;
FIG. 3 is a diagram of a license plate number amplification detection network structure in an embodiment of the present invention;
FIG. 4 is a schematic diagram illustrating a training sample image expansion process according to an embodiment of the present invention;
FIG. 5 is an exemplary diagram of a positive sample image of a license plate with an enlarged number, wherein three sub-images (a), (b), and (c) respectively illustrate one form of the positive sample image;
FIG. 6 is a schematic structural diagram of a license plate amplification number recognition network in a training phase in the embodiment of the invention;
FIG. 7 is a schematic diagram of a CNN layer structure of a license plate number amplification identification network in an embodiment of the present invention;
FIG. 8 is a schematic diagram of an RNN layer structure of a license plate number-enlarging recognition network according to an embodiment of the present invention;
FIG. 9 is a schematic diagram of an SR layer structure of a license plate amplified number recognition network in the embodiment of the present invention;
FIG. 10 is a schematic diagram of the RG module structure of the SR layer;
FIG. 11 is a schematic diagram of the RCAB sub-module in the RG module of the SR layer;
FIG. 12 is a schematic structural diagram of a license plate amplification number recognition network in an inference stage in the embodiment of the invention.
Detailed Description
The present invention will be described in further detail with reference to examples and drawings, but the present invention is not limited thereto.
Examples
The embodiment provides a method for detecting and identifying an enlarged license plate number based on a deep neural network, which mainly comprises the steps of detecting and identifying the enlarged license plate number as shown in figure 1, wherein the position label information of the enlarged license plate number is shown in figure 2; the detailed steps are as follows:
and S1, detecting and positioning the area where the license plate enlarged number is located, and obtaining a sample image of the original license plate enlarged number.
In the step, a detection network (such as a convolutional neural network) based on deep convolution is used for detecting and positioning the enlarged license plate number, and a lightweight network architecture MobileNet-SSD is taken as an example for detailed description:
s11, firstly, according to the label sample data distribution in the training sample, calculating the generation parameters of each layer of default box by using a k-means clustering algorithm (k-means clustering, k-means) of a convolutional neural network (YOLOv 3). Because the license plate amplification number sample image generally has a large width-to-height ratio, the size of the input image of the detection network is set to be w x h, (1.5 w < h < 2 w) so as to eliminate the influence on the detection effect.
And S12, using various data enhancement methods in the training process to increase the diversity of the sample images and improve the detection performance of the detection network, including horizontal turning, cutting, zooming in and zooming out and the like.
And S13, extracting the features of the sample image by using a backbone convolutional network (MobileNet), and constructing a feature pyramid network with 6 layers for position regression and class classification.
And S14, processing the output of the multilayer characteristic pyramid network through the non-maximum suppression unit to obtain a final detection positioning result of the area where the license plate amplification number is located.
The structure of the detection network is shown in fig. 3, and includes a backbone convolutional network MobileNet, a Non-Maximum Suppression unit (NMS), and a multilayer feature pyramid network, where the backbone convolutional network MobileNet is connected to an input end of the multilayer feature pyramid network, an output end of each layer of feature pyramid network is connected to the Non-Maximum Suppression unit, and the Non-Maximum Suppression unit outputs a final detection positioning result.
Step S2, recognizing characters of the enlarged number of the license plate
The method comprises the following steps of identifying license plate enlarged number characters by a deep convolution-based identification network (such as a convolution neural network (CRNN)), specifically comprising the following steps:
and S21, expanding the training sample image to obtain a training sample set.
In the training stage of identifying the network model, the CRNN convolutional neural network uses an end-to-end (end-to-end) training mode, and needs a large number of input sample images to perform network optimization training, the invention firstly labels the sample images of the original license plate enlarged number, and then expands the labeled sample images of the original license plate enlarged number, and the expansion process is shown in fig. 4 and mainly comprises the following steps:
s211, cropping the sample image to generate area images with different sizes, as shown in (a) - (c) of fig. 5, the area images obtained after cropping specifically include the following categories:
the original license plate enlarged sample (7-8 characters): such a sample image is an enlarged-size area image of the original license plate, as shown in fig. 5 (a);
② defective license plate enlarged sample (5-7 characters): the sample image is a region image obtained by cutting after discarding the original license plate province region for short, as shown in the diagram (b) of fig. 5;
③ sample after boundary expansion: the sample image is an area image obtained by carrying out random boundary expansion on the two types of license plate amplified number area images. The extended formula is specifically as follows:
Figure BDA0002852496220000041
wherein l, r, u and b are the expansion sizes of the license plate magnified region image at the left, right, upper and lower boundaries respectively, w and h are the width and height of the original license plate magnified region image, and random is a random function.
Fourthly, loading samples: the samples are false detection samples of a detection network, namely non-license plate amplified number areas.
S212, image normalization processing and color transformation: before model training, the convolutional neural network CRNN needs to normalize the four types of region images obtained after the sample is cut in step S211, normalize the size to W × 32, where W is the normalized image width, and then perform color transformation; the method mainly comprises the following steps:
keeping the height h unchanged, and stretching the random width of the image to improve the recognition capability of a convolutional neural network (CRNN) on narrower characters; the formula for the random width stretch transform is:
w*=w*(random(0.4*w,0.8*w)+1)
wherein, w*And w is the original image width, and random is a random function.
② judging the aspect ratio w of the image after width stretching*Whether/h is equal to the normalized size, i.e., W/32:
1) if w is*W/32, scaling the image to W32;
2) if w is*W/32 is smaller than h, the image is scaled to W***32,w**=w*(32/h), then expanding the left and right image boundaries, the formula is as follows:
Figure BDA0002852496220000051
wherein l and r are the expansion sizes of the left and right boundaries respectively, and random is a random function. In this embodiment, the convolutional neural network CRNN has no requirement on the width of the image, and therefore, in the size normalization process, the normalized size is W × 32, but the maximum width value is set to 280 in the width stretching transformation and left and right boundary expansion processes.
3) If w is*W/32, scaling the image to W x h**,h**=h*(W/w*) Then, the upper and lower boundaries of the image are expanded, and the formula is as follows:
Figure BDA0002852496220000052
wherein u and b are the extension sizes of the upper and lower boundaries respectively, and random is a random function.
And thirdly, random color space transformation is carried out, the diversity of the samples is further increased, and the sample images which are finally input into the identification network are generated.
S213, generating a sample label: and storing each license plate character of the license plate number in an array, and then generating a sample label of the license plate number according to the index value of the license plate character corresponding to the array.
The convolutional neural network CRNN needs to set a space (blank) tag, which is generally set as the first bit ("0") or the last bit ("n-1") of the tag list, where n is the length of the tag list, i.e., the number of character classes), the length of the sample tag is 8, and less than 8 bits are complemented with "0" after the tag value.
For example, if the label value of blank is set to be "0", the label value of the positive sample image with the license plate number of "87569" is "986710000", that is, the label value of the license plate character is obtained by adding 1 to the corresponding index value of the license plate character in the label list, and the label value of the license plate character is obtained by adding 1 to the corresponding index value of the character in the label list no matter whether the license plate character is a number, a letter or a Chinese character; for a negative sample image, its label value is "00000000".
S22, constructing a recognition network based on the expanded sample image, and performing feature extraction on the actual license plate enlarged number image by using the constructed recognition network.
In this embodiment, a feature extraction network is constructed as an identification network, and specifically, the feature extraction network is a deep convolutional network including a convolutional layer (CNN), a feature super-resolution branch network (SR layer), a cyclic layer (RNN), a transcription layer (CTC), and a loss function layer, where the convolutional layer is connected to the SR layer and the cyclic layer, the transcription layer is connected to the cyclic layer, the transcription layer and the feature super-resolution branch network are connected to the loss function layer, the size of an input image is W × 32, W is an image width, and 32 is an image height.
In the invention, the SR layer and the RNN layer share the characteristic sequence of the image and do not need an additional characteristic extraction network, so the number of network layers of the SR layer is less, the SR layer has a simpler structure than the existing super-resolution network, the occupied video memory of a video card is less in the training process, and the training time is shorter.
S221, a feature sequence is extracted from the input image by the convolutional layer (CNN).
Taking a dense convolutional network (DenseNet) as an example, when constructing a feature extraction network, connecting CNN layers in series by using 3 DenseNet blocks, wherein the depth of each DenseNet block is d, the feature map growth rate is r, connecting convolution layers with the kernel size of k × k and random inactivation layers (dropouts) between every two DenseNet blocks, setting the proportion of the random inactivation layers dropouts as ratio, finally connecting a pooling layer with the kernel size of m × N, and outputting a feature map with dimensions of N × C × H × W, wherein N, C, H and W are the batch processing size, the feature map channel number, the feature map height and the feature map width respectively.
S222, in the training stage, the feature expression capability of the CNN layer is improved through a feature super-resolution branch network (SR layer), and a super-resolution image is reconstructed and output.
The purpose of the feature super-resolution branch network is to obtain high-resolution image features using low-resolution images. Due to the influence of hardware conditions, working environments and driving road conditions, the camera can often acquire a large amount of low-quality license plate amplified number images, and the recognition result is influenced. Therefore, in the training process, the characteristic super-resolution branch network is added to improve the characteristic expression capability of the CNN layer, namely, the characteristic sequence obtained by the CNN layer is input into the SR layer to reconstruct the super-resolution image, so that the low-resolution characteristics are restored into the corresponding super-resolution image.
Because the license plate amplification number identification data set does not distinguish high-resolution images and low-resolution images, in the training process, the invention uses two image expansion modes of Gaussian blur processing and 4-time up-down sampling to perform online expansion preprocessing on the original image to generate the low-resolution images so as to enrich the diversity of sample images in the training data set; and after the generated low-resolution image is subjected to feature sequence extraction by the convolutional layer, the low-resolution image is input to a SR layer of a feature super-resolution branch network and is reconstructed into a super-resolution image. In this embodiment, the image after the "gaussian blur processing" and the "4-fold up-down sampling" processing is represented as:
Figure BDA0002852496220000061
wherein, IblurFor processed low resolution images, fd-uAnd fgauRespectively representing 4 times of up-down sampling and Gaussian blur processing, O is an original image, p1And p2Are two random parameters and alpha is a threshold.
The SR layer is mainly implemented by 2 super-resolution base units based on a residual network structure (Resnet) and an upsampling unit (UpSample), where the super-resolution base unit is a residual channel attention block RG, the RCAB is a sub-module of the residual channel attention block RG, and two RCAB sub-blocks constitute a residual attention module RG.
SR layer uses characteristic sequence F output by CNN layerCNNPerforming super-resolution reconstruction, and outputting deeper features through two RG layers, namely:
FRG=HRG(HRG(FCNN))
wherein, FRGFeatures processed by two layers of RG modules, HRGCorresponding operation for the RG module; then using the upsampling layer UpSample, convolution operation pair FRGAnd processing the characteristics to obtain a super-resolution reconstructed image O with the same size as the input image.
Figure BDA0002852496220000071
Wherein, FUPFor the feature processed by the UpSample module at the up-sampling layer, HUPFor the corresponding operation of UpSample Module, HConvCorresponding operations for the convolution module. And finally, the original high-resolution image in the training sample set is used as a real sample label, the loss of the reconstructed super-resolution image is calculated by using the super-resolution loss function of S225, and the reconstruction effect of the super-resolution image is judged and evaluated according to the loss value.
S223, the tag value distribution, i.e., the true value distribution of the feature sequence obtained from the convolutional layer (CNN) is predicted by the cycle layer (RNN).
The circulation layer RNN comprises two bidirectional long and short term memory networks (BilSTM), the features extracted by the convolution layer CNN are converted by the circulation layer to obtain T x N x M dimensional features, the T is the time sequence length of the circulation layer RNN, N is the batch processing size, M is the input feature length, then T x N x N dimensional label distribution results are obtained by the full connection layer, and N is the length of a label list (character category number); the loop layer RNN may be expressed as y ═ Rw(x) Where x is the input, w is the RNN layer parameter, and y is the output.
S224, the tag value distribution obtained from the cycle layer RNN is converted into the final recognition result by the transcription layer (CTC) through operations such as de-duplication integration.
A blank mechanism is introduced into CTC in a transcription layer, and the purpose is to obtain a final predicted text sequence through operations such as de-duplication integration. Taking the "-" symbol representing blank as an example, the CTC in the transcription layer considers that "continuous repeated characters without blank intervals" are the same character, deletes "continuous repeated characters without blank intervals" for the character sequence, and then deletes all "-" characters from the path to obtain the final predicted text sequence.
For the input x given by the cycle layer RNN, the probability that the transcription layer outputs the correct license plate is as follows:
Figure BDA0002852496220000072
wherein, pi ∈ B-1(l) Representing all paths of which the result is correct license plate L after B conversion (namely after cycle layer RNN processing), and L is a prediction output sequence (namely a predicted license plate number); for any path π is:
Figure BDA0002852496220000081
where L' is all paths. In the training process, the training target of the CTC of the transcription layer is essentially through the gradient
Figure BDA0002852496220000082
Adjusting the parameter w of the loop layer RNN such that for an input sample, pi ∈ B-1(l) The probability p (l | x) of the correct license plate is the greatest.
And S225, calculating and identifying the total loss of the network through a loss function.
In the training process, the loss function simultaneously contains the loss of the text recognition part and the loss of the super-resolution branch network part, so that the feature sequence extracted by the CNN layer simultaneously contains the information of the recognition part and the super-resolution branch network part, the feature expression capability of the recognition network on low-quality images is improved, and the feature extraction effect of the recognition network on the low-quality images is improved.
That is, in the present invention, the total loss of recognition network is the text recognition loss L generated by the transcription layer CTCrecAnd super-resolution image loss L generated by super-resolution branch networksrSumming and using a hyper-parameter lambda to the super-resolution image loss LsrThe weights of (a) are adjusted, i.e. weighted summation; the loss function can be described as:
Figure BDA0002852496220000083
wherein O is the original image, Oi,jIs the pixel value of the original image at the (I, j) position, Ii,jThe pixel value of a super-resolution image output by a SR layer of the characteristic super-resolution branch network at the (i, j) position is represented by x, S is a training sample set and z is a sample real label. And reducing the total loss of the recognition network through training to obtain the optimized weight parameter of the recognition network.
The five stages of steps S221 to S225 form a training stage of the recognition network, and refer to fig. 6 to 11 in detail.
S226, reasoning output stage
Using the trained recognition network model to perform inference output, specifically referring to fig. 12, the main process includes: directly inputting an actual license plate amplification number image into a CNN layer without image preprocessing such as Gaussian blur processing or up-down sampling to obtain a corresponding characteristic sequence; directly inputting the characteristic sequence output by the CNN layer into the RNN layer to obtain the probability distribution of all character types of each time step; inputting the character type probability distribution output by the RNN layer into a CTC layer, taking the characters with the maximum probability distribution in all the character types of each time step as the output characters of the time step by the CTC layer, splicing the output characters with all the time steps to obtain a sequence path as the maximum probability path, and finally obtaining the final text recognition result by using a blank mechanism in the CTC layer.
That is, in the training stage, the SR layer continuously updates the network weights through iterative training to minimize the loss function, so as to obtain the optimized weight parameters. In the inference stage, the input of the CNN layer is an actually acquired license plate amplification number image, and image preprocessing such as Gaussian blur processing or up-down sampling is not performed; and the recognition network does not use the SR layer any more, and directly uses the trained weight parameters, and because the output result of the SR layer is a super-resolution image and is useless for reasoning of the recognition network, the SR layer is discarded in the reasoning stage, and the character recognition result cannot be influenced.
The above embodiments are preferred embodiments of the present invention, but the present invention is not limited to the above embodiments, and any other changes, modifications, substitutions, combinations, and simplifications which do not depart from the spirit and principle of the present invention should be construed as equivalents thereof, and all such changes, modifications, substitutions, combinations, and simplifications are intended to be included in the scope of the present invention.

Claims (10)

1. A method for detecting and identifying an enlarged number of a license plate is characterized by comprising the following steps:
s1, detecting and positioning the area where the license plate enlarged number is located to obtain a sample image of the original license plate enlarged number;
s2, recognizing the license plate enlarged number characters based on a deep convolution recognition network;
step S2 includes:
s21, in the training stage of the recognition network model, expanding the sample image of the original license plate enlarged number to obtain a training sample set;
s22, constructing a recognition network based on the expanded training sample set, and performing feature extraction on the actual license plate enlarged number image by using the constructed recognition network to obtain a final text recognition result; and performing weighted calculation analysis on losses generated by the text recognition part and the super-resolution image reconstruction part of the recognition network in the training process, so as to improve the feature expression capability of the recognition network on low-quality images and obtain optimized weight parameters of the recognition network.
2. The method for detecting and identifying the enlarged license plate number of claim 1, wherein the step S21 includes:
s211, cutting the sample image to generate area images with different sizes;
s212, before training of the recognition network model, normalizing the multi-class area image obtained after the sample is cut in the step S211, and then performing color transformation; normalizing the size to W32, W being the normalized image width;
s213, storing each license plate character of the license plate number in an array, and then generating a sample label of the license plate number according to the corresponding index value of the license plate character in the stored array.
3. The method for detecting and identifying the enlarged license plate number according to claim 2, wherein the area image obtained after clipping in step S211 includes the following categories:
the original license plate amplified number sample is an area image of the original license plate amplified number;
the defective license plate enlarged number sample is an area image obtained by cutting after discarding the original license plate province, namely the area;
the sample after the boundary expansion is a regional image obtained by carrying out random boundary expansion on the original license plate amplified number sample and the defective license plate amplified number sample;
and the negative sample is a false detection sample of the detection network, namely a non-license plate amplified number area.
4. The method for detecting and identifying the enlarged license plate number of claim 2, wherein step S212 comprises:
(1) keeping the height h unchanged, and stretching the image in random width;
(2) determining the aspect ratio w of the width-stretched image*Whether/h is equal to W/32; if yes, scaling the image to W32; if w is*W/32 is smaller than h, the image is scaled to W***32,w**=w*(32/h), then expand the left and right image borders:
Figure FDA0002852496210000011
wherein l and r are the expansion sizes of the left and right boundaries respectively, and random is a random function; if w is*W/32, scaling the image to W x h**,h**=h*(W/w*) Then is aligned withAnd (3) expanding the upper and lower boundaries of the image:
Figure FDA0002852496210000021
wherein u and b are respectively the expansion sizes of the upper and lower boundaries, and random is a random function;
(3) and carrying out random color space transformation to generate a sample image which is finally input into the identification network.
5. The method for detecting and identifying an enlarged license plate number of claim 1, wherein the identification network constructed in step S22 is a deep convolutional network comprising a convolutional layer, a characteristic super-resolution tributary network, a cyclic layer, a transcription layer, and a loss function layer, wherein the convolutional layer is connected to the characteristic super-resolution tributary network and the cyclic layer, the transcription layer is connected to the cyclic layer, and the transcription layer and the characteristic super-resolution tributary network are connected to the loss function layer.
6. The method for detecting and identifying the enlarged license plate number of claim 5, wherein the step S22 includes:
s221, extracting a characteristic sequence from the input image through the convolution layer;
s222, in the training stage, the original image is expanded to generate a low-resolution image, and the generated low-resolution image is input into a characteristic super-resolution branch network to be reconstructed into a super-resolution image after a characteristic sequence is extracted from a convolutional layer; meanwhile, a characteristic sequence extracted from the convolutional layer of the original high-resolution image in the training sample set is used as a real sample label, and the loss of the reconstructed super-resolution image is calculated;
s223, predicting the label value distribution of the characteristic sequence acquired from the convolutional layer through the loop layer;
s224, converting the label value distribution obtained from the circulation layer into a final recognition result through a duplication elimination integration operation by the transcription layer;
s225, calculating and identifying the total loss of the network through a loss function, and transcribing the layersResulting text recognition loss LrecAnd super-resolution image loss L generated by super-resolution branch networksrWeighted summation as the total loss of the identified network, where the super-resolution image loses LsrThe weight of (a) is adjusted by a hyperparameter λ; the total loss of the recognition network is reduced through training, and the optimized weight parameter of the recognition network is obtained;
s226, reasoning and outputting by using the recognition network model obtained after training, and directly inputting the actual license plate enlarged number image into the convolutional layer to obtain a corresponding characteristic sequence; inputting the characteristic sequence into a circulation layer to obtain the probability distribution of all character types of each time step; inputting the character type probability distribution output by the circulation layer into the transcription layer, taking the character with the maximum probability distribution in all the character types of each time step as the output character of the time step by the transcription layer, splicing the output characters with all the time steps to obtain a sequence path as the maximum probability path, and finally obtaining the final text recognition result by using a blank mechanism in the transcription layer.
7. The method for detecting and identifying enlarged license plate number of claim 5, wherein step S222 is performed by expanding the original image through Gaussian blur processing and multiple up-down sampling to generate a low-resolution image, and setting IblurFor processed low resolution images, fd-uAnd fgauRespectively representing multiple up-down sampling and Gaussian blur processing, wherein O is an original image, and the image after multiple up-down sampling and Gaussian blur processing is represented as follows:
Figure FDA0002852496210000031
wherein p is1And p2Are two random parameters and alpha is a threshold.
8. The method for detecting and identifying the enlarged license plate number of claim 7, wherein the loss function of step S225 is described as:
Figure FDA0002852496210000032
wherein O is the original image, Oi,jIs the pixel value of the original image at the (I, j) position, Ii,jAnd (3) outputting the pixel value of the super-resolution image at the (i, j) position by the characteristic super-resolution branch network, wherein x is input, S is a training sample set, and z is a sample real label.
9. The method for detecting and identifying an enlarged license plate according to claim 6, wherein the loop layer in step S223 includes two bidirectional long-short term memory networks, the features extracted from the convolution layer are transformed by the loop layer to obtain T × N × M dimensional features, and the T × N × M dimensional features are input into the loop layer, where T is the length of the time sequence of the loop layer, N is the size of batch processing, M is the length of the input features, and then T × N dimensional label distribution results are obtained through the fully connected layer, and N is the length of the label list; the cyclic layer is denoted by y ═ Rw(x) Where x is the input, w is the parameter of the loop layer, and y is the output.
10. The method for detecting and identifying the enlarged license plate number according to claim 9, wherein in step S224, for the input x given by the loop layer, the probability that the transcription layer outputs the correct license plate is:
Figure FDA0002852496210000033
wherein, pi ∈ B-1(l) Representing all paths of which the result is the correct license plate L after the conversion of the circulation layer, wherein L is a prediction output sequence; for any path π is:
Figure FDA0002852496210000034
wherein L' is all paths; in the training process, the transcription layerBy gradient of training target
Figure FDA0002852496210000035
Adjusting the parameter w of the loop layer so that for the input sample, pi ∈ B-1(l) The probability p (l | x) of the correct license plate is the greatest.
CN202011532814.1A2020-12-232020-12-23 A detection and recognition method for enlarged license plate numbersActiveCN112906699B (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
CN202011532814.1ACN112906699B (en)2020-12-232020-12-23 A detection and recognition method for enlarged license plate numbers

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
CN202011532814.1ACN112906699B (en)2020-12-232020-12-23 A detection and recognition method for enlarged license plate numbers

Publications (2)

Publication NumberPublication Date
CN112906699Atrue CN112906699A (en)2021-06-04
CN112906699B CN112906699B (en)2024-06-14

Family

ID=76111578

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CN202011532814.1AActiveCN112906699B (en)2020-12-232020-12-23 A detection and recognition method for enlarged license plate numbers

Country Status (1)

CountryLink
CN (1)CN112906699B (en)

Cited By (14)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN113538347A (en)*2021-06-292021-10-22中国电子科技集团公司电子科学研究院 Image detection method and system based on efficient bidirectional path aggregation attention network
CN113610770A (en)*2021-07-152021-11-05浙江大华技术股份有限公司License plate recognition method, device and equipment
CN113887370A (en)*2021-09-262022-01-04深圳市爱深盈通信息技术有限公司 A method, system, storage medium and device for batch generation of enlarged license plate
CN113935899A (en)*2021-09-062022-01-14杭州志创科技有限公司Ship plate image super-resolution method based on semantic information and gradient supervision
CN114118199A (en)*2021-09-012022-03-01济宁安泰矿山设备制造有限公司Image classification method and system for fault diagnosis of intelligent pump cavity endoscope
CN114187434A (en)*2021-11-052022-03-15同济大学 An end-to-end license plate recognition method based on Raspberry Pi 4B
CN114332844A (en)*2022-03-162022-04-12武汉楚精灵医疗科技有限公司Intelligent classification application method, device, equipment and storage medium of medical image
CN114639090A (en)*2021-10-282022-06-17中国公路工程咨询集团有限公司Robust Chinese license plate recognition method under uncontrolled environment
CN114882492A (en)*2022-07-112022-08-09浙江大华技术股份有限公司License plate recognition method, device, terminal and computer readable storage medium
CN115035526A (en)*2022-06-212022-09-09亿嘉和科技股份有限公司Deep learning-based automatic LED character positioning and identifying method
CN115063786A (en)*2022-08-182022-09-16松立控股集团股份有限公司High-order distant view fuzzy license plate detection method
CN115761760A (en)*2022-11-292023-03-07江苏理工学院 Method and system for first-born character detection and recognition based on 3D vision technology
US20230281782A1 (en)*2022-02-222023-09-07Yancheng Ducheng Construction Co., LtdClosed-circuit television (cctv) online pipeline detection system
CN119810811A (en)*2024-11-072025-04-11华南师范大学 A method for recognizing enlarged license plate numbers based on overall and character region detection

Citations (12)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US20050257748A1 (en)*2002-08-022005-11-24Kriesel Marshall SApparatus and methods for the volumetric and dimensional measurement of livestock
CN104809443A (en)*2015-05-052015-07-29上海交通大学Convolutional neural network-based license plate detection method and system
CN106446895A (en)*2016-10-282017-02-22安徽四创电子股份有限公司License plate recognition method based on deep convolutional neural network
CN108764228A (en)*2018-05-282018-11-06嘉兴善索智能科技有限公司Word object detection method in a kind of image
US20190095730A1 (en)*2017-09-252019-03-28Beijing University Of Posts And TelecommunicationsEnd-To-End Lightweight Method And Apparatus For License Plate Recognition
CN110020651A (en)*2019-04-192019-07-16福州大学Car plate detection localization method based on deep learning network
GB201912428D0 (en)*2019-08-292019-10-16Sita Information Networking Computing Uk LtdLuggage identification via image recognition
RU2706891C1 (en)*2019-06-062019-11-21Самсунг Электроникс Ко., Лтд.Method of generating a common loss function for training a convolutional neural network for converting an image into an image with drawn parts and a system for converting an image into an image with drawn parts
CN110633755A (en)*2019-09-192019-12-31北京市商汤科技开发有限公司Network training method, image processing method and device and electronic equipment
CN111461134A (en)*2020-05-182020-07-28南京大学 A low-resolution license plate recognition method based on generative adversarial network
CN111626295A (en)*2020-07-272020-09-04杭州雄迈集成电路技术股份有限公司Training method and device for license plate detection model
US20200301799A1 (en)*2019-03-232020-09-24Uatc, LlcSystems and Methods for Generating Synthetic Sensor Data via Machine Learning

Patent Citations (12)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US20050257748A1 (en)*2002-08-022005-11-24Kriesel Marshall SApparatus and methods for the volumetric and dimensional measurement of livestock
CN104809443A (en)*2015-05-052015-07-29上海交通大学Convolutional neural network-based license plate detection method and system
CN106446895A (en)*2016-10-282017-02-22安徽四创电子股份有限公司License plate recognition method based on deep convolutional neural network
US20190095730A1 (en)*2017-09-252019-03-28Beijing University Of Posts And TelecommunicationsEnd-To-End Lightweight Method And Apparatus For License Plate Recognition
CN108764228A (en)*2018-05-282018-11-06嘉兴善索智能科技有限公司Word object detection method in a kind of image
US20200301799A1 (en)*2019-03-232020-09-24Uatc, LlcSystems and Methods for Generating Synthetic Sensor Data via Machine Learning
CN110020651A (en)*2019-04-192019-07-16福州大学Car plate detection localization method based on deep learning network
RU2706891C1 (en)*2019-06-062019-11-21Самсунг Электроникс Ко., Лтд.Method of generating a common loss function for training a convolutional neural network for converting an image into an image with drawn parts and a system for converting an image into an image with drawn parts
GB201912428D0 (en)*2019-08-292019-10-16Sita Information Networking Computing Uk LtdLuggage identification via image recognition
CN110633755A (en)*2019-09-192019-12-31北京市商汤科技开发有限公司Network training method, image processing method and device and electronic equipment
CN111461134A (en)*2020-05-182020-07-28南京大学 A low-resolution license plate recognition method based on generative adversarial network
CN111626295A (en)*2020-07-272020-09-04杭州雄迈集成电路技术股份有限公司Training method and device for license plate detection model

Non-Patent Citations (2)

* Cited by examiner, † Cited by third party
Title
SUMMA, S: "Gait changes after weight loss on adolescent with severe obesity after sleeve gastrectomy", SURGERY FOR OBESITY AND RELATED DISEASES, vol. 15, no. 3, pages 374 - 381, XP085685502, DOI: 10.1016/j.soard.2019.01.007*
何鎏一;杨国为;: "基于深度学习的光照不均匀文本图像的识别系统", 计算机应用与软件, no. 06, pages 196 - 223*

Cited By (17)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN113538347A (en)*2021-06-292021-10-22中国电子科技集团公司电子科学研究院 Image detection method and system based on efficient bidirectional path aggregation attention network
CN113538347B (en)*2021-06-292023-10-27中国电子科技集团公司电子科学研究院Image detection method and system based on efficient bidirectional path aggregation attention network
CN113610770A (en)*2021-07-152021-11-05浙江大华技术股份有限公司License plate recognition method, device and equipment
CN114118199A (en)*2021-09-012022-03-01济宁安泰矿山设备制造有限公司Image classification method and system for fault diagnosis of intelligent pump cavity endoscope
CN113935899A (en)*2021-09-062022-01-14杭州志创科技有限公司Ship plate image super-resolution method based on semantic information and gradient supervision
CN113887370A (en)*2021-09-262022-01-04深圳市爱深盈通信息技术有限公司 A method, system, storage medium and device for batch generation of enlarged license plate
CN114639090A (en)*2021-10-282022-06-17中国公路工程咨询集团有限公司Robust Chinese license plate recognition method under uncontrolled environment
CN114639090B (en)*2021-10-282025-05-13中国公路工程咨询集团有限公司 A robust Chinese license plate recognition method in uncontrolled environment
CN114187434A (en)*2021-11-052022-03-15同济大学 An end-to-end license plate recognition method based on Raspberry Pi 4B
US20230281782A1 (en)*2022-02-222023-09-07Yancheng Ducheng Construction Co., LtdClosed-circuit television (cctv) online pipeline detection system
CN114332844A (en)*2022-03-162022-04-12武汉楚精灵医疗科技有限公司Intelligent classification application method, device, equipment and storage medium of medical image
CN115035526A (en)*2022-06-212022-09-09亿嘉和科技股份有限公司Deep learning-based automatic LED character positioning and identifying method
CN114882492A (en)*2022-07-112022-08-09浙江大华技术股份有限公司License plate recognition method, device, terminal and computer readable storage medium
CN114882492B (en)*2022-07-112022-11-22浙江大华技术股份有限公司License plate recognition method, device, terminal and computer readable storage medium
CN115063786A (en)*2022-08-182022-09-16松立控股集团股份有限公司High-order distant view fuzzy license plate detection method
CN115761760A (en)*2022-11-292023-03-07江苏理工学院 Method and system for first-born character detection and recognition based on 3D vision technology
CN119810811A (en)*2024-11-072025-04-11华南师范大学 A method for recognizing enlarged license plate numbers based on overall and character region detection

Also Published As

Publication numberPublication date
CN112906699B (en)2024-06-14

Similar Documents

PublicationPublication DateTitle
CN112906699A (en)Method for detecting and identifying enlarged number of license plate
CN109035149B (en) A deep learning-based motion blurring method for license plate images
CN109840521B (en)Integrated license plate recognition method based on deep learning
CN111914797B (en) Traffic sign recognition method based on multi-scale lightweight convolutional neural network
CN112990065B (en)Vehicle classification detection method based on optimized YOLOv5 model
CN110059768B (en) Semantic segmentation method and system for fusion of point and area features for street view understanding
CN101944174B (en)Identification method of characters of licence plate
CN111986125B (en)Method for multi-target task instance segmentation
CN109815956B (en)License plate character recognition method based on self-adaptive position segmentation
CN106845478A (en)The secondary licence plate recognition method and device of a kind of character confidence level
CN113255659A (en)License plate correction detection and identification method based on MSAFF-yolk 3
CN112036231A (en)Vehicle-mounted video-based lane line and road surface indication mark detection and identification method
CN114463715B (en) A lane line detection method
CN115375959B (en) A vehicle image recognition model establishment and recognition method
CN110321803B (en)Traffic sign identification method based on SRCNN
Dorbe et al.FCN and LSTM based computer vision system for recognition of vehicle type, license plate number, and registration country
CN114998815B (en)Traffic vehicle identification tracking method and system based on video analysis
CN113239865A (en)Deep learning-based lane line detection method
Shanthakumari et al.Mask RCNN and Tesseract OCR for vehicle plate character recognition
CN109800762A (en)A kind of fuzzy license plate recognizer based on the Dynamic Matching factor
CN114898290A (en)Real-time detection method and system for marine ship
Latha et al.Image understanding: semantic segmentation of graphics and text using faster-RCNN
Ahmed et al.A deep learning based bangladeshi vehicle classification using fine-tuned multi-class vehicle image network (mvinet) model
Das et al.Object detection on scene images: a novel approach
Ibrahim et al.Offline Kurdish Character Handwritten Recognition (Okchr) Using Cnn With Various Preprocessing Techniques

Legal Events

DateCodeTitleDescription
PB01Publication
PB01Publication
SE01Entry into force of request for substantive examination
SE01Entry into force of request for substantive examination
GR01Patent grant
GR01Patent grant
CP03Change of name, title or address

Address after:518000 1001, building T3, Hualian Business Center, Nanshan community, Nanshan street, Shenzhen City, Guangdong Province

Patentee after:Shenzhen Radio & TV Xinyi Technology Co.,Ltd.

Country or region after:China

Address before:518000 1001, building T3, Hualian Business Center, Nanshan community, Nanshan street, Shenzhen City, Guangdong Province

Patentee before:SHENZHEN XINYI TECHNOLOGY Co.,Ltd.

Country or region before:China

CP03Change of name, title or address
CP03Change of name, title or address

Address after:518000 Guangdong Province Shenzhen City Nanshan District Nanshan Street Nanshan Community Hualian Business Center T3 Building 901 Office

Patentee after:Shenzhen Radio & TV Xinyi Technology Co.,Ltd.

Country or region after:China

Address before:518000 1001, building T3, Hualian Business Center, Nanshan community, Nanshan street, Shenzhen City, Guangdong Province

Patentee before:Shenzhen Radio & TV Xinyi Technology Co.,Ltd.

Country or region before:China

CP03Change of name, title or address

[8]ページ先頭

©2009-2025 Movatter.jp