Movatterモバイル変換


[0]ホーム

URL:


CN108681735A - Optical character recognition method based on convolutional neural networks deep learning model - Google Patents

Optical character recognition method based on convolutional neural networks deep learning model
Download PDF

Info

Publication number
CN108681735A
CN108681735ACN201810270374.3ACN201810270374ACN108681735ACN 108681735 ACN108681735 ACN 108681735ACN 201810270374 ACN201810270374 ACN 201810270374ACN 108681735 ACN108681735 ACN 108681735A
Authority
CN
China
Prior art keywords
model
deep learning
optical character
neural networks
convolutional neural
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201810270374.3A
Other languages
Chinese (zh)
Inventor
陆成学
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
China Science And Technology Co Ltd (beijing) Technology Co Ltd
Original Assignee
China Science And Technology Co Ltd (beijing) Technology Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by China Science And Technology Co Ltd (beijing) Technology Co LtdfiledCriticalChina Science And Technology Co Ltd (beijing) Technology Co Ltd
Priority to CN201810270374.3ApriorityCriticalpatent/CN108681735A/en
Publication of CN108681735ApublicationCriticalpatent/CN108681735A/en
Pendinglegal-statusCriticalCurrent

Links

Classifications

Landscapes

Abstract

The present invention discloses a kind of optical character recognition method based on convolutional neural networks deep learning model.This approach includes the following steps:The Chinese characters in common use and 10 Arabic numerals of collection different fonts and 26 English alphabet data sets are simultaneously converted into picture format;Slight distortion and rotation are carried out to enhance the robustness of model to picture, generate model training database;Establish the deep learning model of optical character identification;Training set image input model is continued to optimize into object function using convolutional neural networks model by the method for supervised learning, learns a multi-categorizer;For new test sample, feature extraction is carried out to it based on model obtained in the previous step and application model grader obtains final classification result.The present invention proposes new model and method to application of the deep learning based on convolutional neural networks in optical character identification, this method can be applied in general pattern classification task, especially text identification problem, the optical character identification model proposed by the present invention based on deep learning can significantly improve the recognition correct rate of character recognition.

Description

Optical character recognition method based on convolutional neural networks deep learning model
Technical field
The present invention relates to computer vision, pattern-recognition, the technical fields such as natural scene feature recognition, especially a kind of basesIn the optical character recognition method of convolutional neural networks deep learning model.
Background technology
Optical character identification is because it in real-life practicability has obtained the extensive concern of domestic and foreign scholars, base at presentIt is concentrated mainly on scanned document character recognition in the application of optical character identification.Optical character identification streetscape identifier identify,The foreground of being widely applied also is used in bank's ID card information identification, classroom blackboard-writing identification etc..Optical character identification has heightThe advantages of effect property and convenience.There is a large amount of research effort just constantly promoting the development of field of optical character recognition at present.
A usual character recognition system is acquired by character, Character segmentation, feature extraction, several step structures such as characteristic matchingAt.Wherein feature extraction has most important influence for the accuracy of character recognition.When the feature using most identificationWhen matching is compared to character, better discrimination can be usually obtained, it is on the contrary then will be greatly reduced character recognition systemAccuracy.And the research of character recognition is also concentrated mainly in the method for character feature extraction, it is based on convolutional neural networksDeep learning method in detection feature automatically and extract characteristic aspect and have big advantage.
In recent years, the deep learning model based on convolutional neural networks is prominent in numerous computer vision problems because of itGo out performance and obtains great concern.Its basic thought is to carry an original image automatically behind multilayer convolution sum pondTake wherein most representational feature.Deep learning is all obtained in character recognition, image classification, natural language processing etc. in fieldsObtained howling success.And with the development of technology, how to learn to suitable for particular problem (such as be used for image classification, characterIdentification) model become scholars' focus of attention.
Using the method for deep learning, a weight matrix with identification can be obtained by study and is biased towardsAmount.Weight vector and biasing constitute a grader, and classification knot will be can be obtained after character input grader to be testedFruit.Research under this theoretical frame is mainly concentrated in so that the model learnt has differentiation performance more outstanding.
However, in character recognition problem under practical application scene, it is not usually to mark that we, which can be obtained character picture,Accurate character picture.Due to intensity of illumination, the factors such as placement position, character picture usually has a degree of rotation or torsionIt is bent.If the character picture of standard is directly used in above-mentioned model, is had in the model acquired and greatly represent judgement indexWeaker requires character picture very stringent information, then the recognition correct rate of model can substantially reduce.And if it is intended to obtainingObtain good recognition effect, it usually needs the additional capacity for increasing character training set is to expand its coverage area.
For deep learning model have the characteristics that good ability in feature extraction this, it is proposed that existing optics wordSymbol identification model is improved, and learns a grader under deep learning frame to complete the identification to character.In this way in realityIt, can be in a unification from the identification of the input character picture of non-standard (including but not limited to) to the end under the application environment of borderFrame in be resolved.
Invention content
(1) technical problems to be solved
The problem of for input picture in character recognition problem under actual environment may be non-standard image, the present invention proposeCharacter feature extraction and character recognition are placed on by a kind of optical character recognition method based on neural network deep learning modelIt is resolved under one unified frame so that it is correct that the interaction of above-mentioned two step improves final character recognition jointlyRate.
(2) technical solution
A kind of technical solution of optical character recognition method based on neural network deep learning model proposed by the present inventionIt is as follows:
Step S1 collects the Chinese character of common different fonts, 10 Arabic numerals and 26 English alphabets and generates figureThe data set of piece format.
Step S2, training set and test set sample to acquisition suitably carry out slight rotation and distortion.
Step S3, each layer weight matrix parameter W and biasing b of the grader of Optimization Learning training set, passes through stochastic gradientThe optimal way of descent method (SGD) minimizes object function, study optimum classifier parameter W and b.
Step S4 carries out a propagated forward, calculates the probability value of its affiliated each classification, obtains the classification of test characterAs a result.
Beneficial effects of the present invention:The present invention is directed to the character recognition problem under actual application environment, can directly inputNon-standard character image carries out character recognition.It is placed on a unified model frame by expressing character feature, with character recognitionIt is solved under frame, it is hereby achieved that higher discrimination, enhances the robustness of algorithm.
Description of the drawings
Fig. 1 is the system flow chart of the optical character recognition method based on neural network deep learning model.
Specific implementation mode
To make the objectives, technical solutions, and advantages of the present invention clearer, below in conjunction with specific example, and with reference to detailedThin attached drawing, the present invention is described in more detail.But described embodiment is intended merely to facilitate the understanding of the present invention, and rightIt does not play any restriction effect.
Fig. 1 is flow chart of the method for the present invention, as shown in Figure 1, proposed by the present invention a kind of based on neural network depthThe optical character recognition method for practising model includes following steps:
Step S1 collects the Chinese character of common different fonts, 26 English alphabets of 10 Arabic numerals and English alphabetAnd generate the data set of picture format.
Step S2, training set and test set sample to acquisition suitably carry out slight rotation and distortion.
Step S3, each layer weight matrix parameter W of the grader of Optimization Learning training set, passes through stochastic gradient descent method(SGD) optimal way minimizes object function, study optimum classifier parameter W and b.
S31 initializes weights square for multiple convolution kernels of each convolutional layer in training set by Gaussian ProfileBattle array.Next, entering alternately error propagated forward and gradient back-propagation process, each of which volume is provided simultaneously by SGD algorithmsThe weights of product core.S32 and S33 is recycled until restraining or reaching iterations requirement.
This is the object function of a typical classification problem, and the optimization for completing this object function can be in the hope of one groupSorting parameter W and b.
S32, the value of propagated forward counting loss function:
S33, the Grad of backpropagation counting loss function pair parameters.
Wherein, f is hidden layer.
Step S4 carries out a propagated forward, calculates the probability value of its affiliated each classification.
Wherein, s=g (xi;W, b).
Case study on implementation:
For the specific implementation mode and verification effectiveness of the invention that the present invention will be described in detail, we propose the present inventionMethod be applied to the database that forms of picture generated by Chinese characters in common use, 10 Arabic numerals and 26 letters.The dataLibrary is included in the image that rotation in various degree and distortion obtain.In our embodiment, we extract every in image firstA character.Using the single character after extraction as the input feature vector of training and test.
According to the step S3 in the technical detail introduced before, we first carry out all training set data input modelsTraining, wherein training parameter W are set as Gaussian Profile, mean value 0, standard deviation 0.01.Next according to step S31, S32 andS33 completes the training to model.Grader is inputted to obtain final classification results by step S4 to new test image.
Particular embodiments described above has carried out further in detail the purpose of the present invention, technical solution and advantageous effectIt describes in detail bright, it should be understood that the above is only a specific embodiment of the present invention, is not intended to restrict the invention, it is allWithin the spirit and principles in the present invention, any modification, equivalent substitution, improvement and etc. done should be included in the guarantor of the present inventionWithin the scope of shield.

Claims (6)

CN201810270374.3A2018-03-282018-03-28Optical character recognition method based on convolutional neural networks deep learning modelPendingCN108681735A (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
CN201810270374.3ACN108681735A (en)2018-03-282018-03-28Optical character recognition method based on convolutional neural networks deep learning model

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
CN201810270374.3ACN108681735A (en)2018-03-282018-03-28Optical character recognition method based on convolutional neural networks deep learning model

Publications (1)

Publication NumberPublication Date
CN108681735Atrue CN108681735A (en)2018-10-19

Family

ID=63800544

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CN201810270374.3APendingCN108681735A (en)2018-03-282018-03-28Optical character recognition method based on convolutional neural networks deep learning model

Country Status (1)

CountryLink
CN (1)CN108681735A (en)

Cited By (9)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN109598270A (en)*2018-12-042019-04-09龙马智芯(珠海横琴)科技有限公司Distort recognition methods and the device, storage medium and processor of text
CN109858305A (en)*2019-01-172019-06-07柳州康云互联科技有限公司A kind of two dimensional code positioning identification system and method based on deep learning
CN110059705A (en)*2019-04-222019-07-26厦门商集网络科技有限责任公司A kind of OCR recognition result decision method and equipment based on modeling
CN110956133A (en)*2019-11-292020-04-03上海眼控科技股份有限公司Training method of single character text normalization model, text recognition method and device
CN111797908A (en)*2020-06-182020-10-20浪潮金融信息技术有限公司Training set generation method of deep learning model for print character recognition
CN112580657A (en)*2020-12-232021-03-30陕西天诚软件有限公司Self-learning character recognition method
CN113191251A (en)*2021-04-282021-07-30北京有竹居网络技术有限公司Method and device for detecting stroke order, electronic equipment and storage medium
US11295155B2 (en)2020-04-082022-04-05Konica Minolta Business Solutions U.S.A., Inc.Online training data generation for optical character recognition
CN117173716A (en)*2023-09-012023-12-05湖南天桥嘉成智能科技有限公司Deep learning-based high-temperature slab ID character recognition method and system

Citations (4)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
EP0784828A2 (en)*1994-10-051997-07-23United Parcel Service Of America, Inc.Method of and apparatus for segmenting foreground and background information for optical character recognition of labels employing single layer recurrent neural network
CN104966097A (en)*2015-06-122015-10-07成都数联铭品科技有限公司Complex character recognition method based on deep learning
CN105184312A (en)*2015-08-242015-12-23中国科学院自动化研究所Character detection method and device based on deep learning
CN107273897A (en)*2017-07-042017-10-20华中科技大学A kind of character recognition method based on deep learning

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
EP0784828A2 (en)*1994-10-051997-07-23United Parcel Service Of America, Inc.Method of and apparatus for segmenting foreground and background information for optical character recognition of labels employing single layer recurrent neural network
CN104966097A (en)*2015-06-122015-10-07成都数联铭品科技有限公司Complex character recognition method based on deep learning
CN105184312A (en)*2015-08-242015-12-23中国科学院自动化研究所Character detection method and device based on deep learning
CN107273897A (en)*2017-07-042017-10-20华中科技大学A kind of character recognition method based on deep learning

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
张超群: ""基于深度学习的字符识别"", 《中国优秀硕士学位论文全文数据库信息科技辑》*

Cited By (13)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN109598270B (en)*2018-12-042020-05-05龙马智芯(珠海横琴)科技有限公司Method and device for identifying distorted characters, storage medium and processor
CN109598270A (en)*2018-12-042019-04-09龙马智芯(珠海横琴)科技有限公司Distort recognition methods and the device, storage medium and processor of text
CN109858305A (en)*2019-01-172019-06-07柳州康云互联科技有限公司A kind of two dimensional code positioning identification system and method based on deep learning
CN110059705A (en)*2019-04-222019-07-26厦门商集网络科技有限责任公司A kind of OCR recognition result decision method and equipment based on modeling
CN110956133A (en)*2019-11-292020-04-03上海眼控科技股份有限公司Training method of single character text normalization model, text recognition method and device
US11295155B2 (en)2020-04-082022-04-05Konica Minolta Business Solutions U.S.A., Inc.Online training data generation for optical character recognition
CN111797908B (en)*2020-06-182022-08-09浪潮金融信息技术有限公司Training set generation method of deep learning model for print character recognition
CN111797908A (en)*2020-06-182020-10-20浪潮金融信息技术有限公司Training set generation method of deep learning model for print character recognition
CN112580657A (en)*2020-12-232021-03-30陕西天诚软件有限公司Self-learning character recognition method
CN112580657B (en)*2020-12-232022-11-01陕西天诚软件有限公司Self-learning character recognition method
CN113191251A (en)*2021-04-282021-07-30北京有竹居网络技术有限公司Method and device for detecting stroke order, electronic equipment and storage medium
CN117173716A (en)*2023-09-012023-12-05湖南天桥嘉成智能科技有限公司Deep learning-based high-temperature slab ID character recognition method and system
CN117173716B (en)*2023-09-012024-03-26湖南天桥嘉成智能科技有限公司Deep learning-based high-temperature slab ID character recognition method and system

Similar Documents

PublicationPublication DateTitle
CN108681735A (en)Optical character recognition method based on convolutional neural networks deep learning model
JP2022532177A (en) Forged face recognition methods, devices, and non-temporary computer-readable storage media
CN112329779B (en)Method and related device for improving certificate identification accuracy based on mask
JP2016134175A (en) Method and system for performing text-image queries using wildcards
WO2017016240A1 (en)Banknote serial number identification method
CN108133212A (en)A kind of quota invoice amount identifying system based on deep learning
CN103544504B (en)Scene character recognition method based on multi-scale map matching core
CN106372624B (en)Face recognition method and system
CN106570475B (en)A kind of dark-red enameled pottery seal search method
CN112069900A (en)Bill character recognition method and system based on convolutional neural network
CN106228166B (en)The recognition methods of character picture
CN108364037A (en)Method, system and the equipment of Handwritten Chinese Character Recognition
Hossain et al.Recognition and solution for handwritten equation using convolutional neural network
CN110414506B (en)Bank card number automatic identification method based on data augmentation and convolution neural network
CN110610174A (en) Identification method of bank card number under complex conditions
Harizi et al.Convolutional neural network with joint stepwise character/word modeling based system for scene text recognition
Liu et al.Scene text recognition with high performance CNN classifier and efficient word inference
He et al.Aggregating local context for accurate scene text detection
Zhang et al.OCR with the Deep CNN Model for Ligature Script‐Based Languages like Manchu
Kobchaisawat et al.Thai text localization in natural scene images using convolutional neural network
CN111242114B (en)Character recognition method and device
Yildirim et al.Text recognition in natural images using multiclass hough forests
Dat et al.An improved CRNN for Vietnamese Identity Card Information Recognition.
Zuo et al.An intelligent knowledge extraction framework for recognizing identification information from real-world ID card images
CN114694133A (en)Text recognition method based on combination of image processing and deep learning

Legal Events

DateCodeTitleDescription
PB01Publication
PB01Publication
SE01Entry into force of request for substantive examination
SE01Entry into force of request for substantive examination
RJ01Rejection of invention patent application after publication

Application publication date:20181019

RJ01Rejection of invention patent application after publication

[8]ページ先頭

©2009-2025 Movatter.jp