Movatterモバイル変換


[0]ホーム

URL:


CN109726715A - A kind of character image serializing identification, structural data output method - Google Patents

A kind of character image serializing identification, structural data output method
Download PDF

Info

Publication number
CN109726715A
CN109726715ACN201811614263.6ACN201811614263ACN109726715ACN 109726715 ACN109726715 ACN 109726715ACN 201811614263 ACN201811614263 ACN 201811614263ACN 109726715 ACN109726715 ACN 109726715A
Authority
CN
China
Prior art keywords
character image
text
exports
structural data
character
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Pending
Application number
CN201811614263.6A
Other languages
Chinese (zh)
Inventor
雷钧
林路
林康
王慜骊
安通鉴
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
SUNYARD SYSTEM ENGINEERING Co Ltd
Original Assignee
SUNYARD SYSTEM ENGINEERING Co Ltd
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by SUNYARD SYSTEM ENGINEERING Co LtdfiledCriticalSUNYARD SYSTEM ENGINEERING Co Ltd
Priority to CN201811614263.6ApriorityCriticalpatent/CN109726715A/en
Publication of CN109726715ApublicationCriticalpatent/CN109726715A/en
Pendinglegal-statusCriticalCurrent

Links

Landscapes

Abstract

The present invention discloses a kind of method that character image serializes identification, structural data output, this method specifically: obtain multiple character image blocks;Text image feature extraction is carried out to each character image block using full depth convolutional neural networks, each character image block is expressed as feature vector;The feature vector is handled using deep neural network, and exports a probability distribution about character set;Using connection chronological classification layer as transcription layer, the dynamic programming algorithm that the probability distribution about character set is propagated using forward calculation and reversed gradient exports computer readable text;Error correction is carried out to computer readable text using language model, structural data is obtained and exports.Method recognition accuracy of the invention is high, and robustness is good, and discrimination is high.

Description

A kind of character image serializing identification, structural data output method
Technical field
The present invention relates to image identification technical fields in computer software more particularly to a kind of serializing of character image to knowNot, structural data output method.
Background technique
Referring to through the equipment such as computer, benefit for financial field word area detection fixation and recognition technology based on OCRThe effective information in paper material is automatically extracted and identified with OCR technique (optical character identification), and carries out corresponding positionReason.It is one of the key technology that the computer for realizing that bank is with no paper automatically processes.In financial industry OCR, text often withThe form of sequence occurs, rather than occurs in isolation.Traditional document OCR identification technology is very weak to anti-interference ability, can not knowPicture in the case of other complex background, and low efficiency when output " row text information " and " column text information ".
Summary of the invention
In view of the deficiencies of the prior art, the present invention provides a kind of character image serializing identifications, structural data output sideMethod, specific technical solution are as follows:
A kind of character image serializing identification, structural data output method, which is characterized in that this method includes following stepIt is rapid:
S1: multiple character image blocks are obtained;
S2: text image feature extraction is carried out to each character image block using full depth convolutional neural networks, eachCharacter image block is expressed as feature vector;
S3: being handled the feature vector using deep neural network, and exports one about the general of character setRate distribution;
S4: using connection chronological classification layer as transcription layer, by the probability distribution about character set using forward calculation andThe dynamic programming algorithm that reversed gradient is propagated, exports computer readable text;
S5: error correction is carried out to computer readable text using language model, structural data is obtained and exports.
Further, the S2 specifically:
Using the picture of arbitrary size as input, response diagram of corresponding size is exported, each position is corresponding in the response diagramOriginal image an acceptance region and full convolutional neural networks share convolution response diagram, as feature vector.
Further, the deep neural network is the double-deck Recognition with Recurrent Neural Network.
Further, the S5 specifically:
S5.1: establishing corpus, and with training term vector and language model;
S5.2: the computer readable text that S4 is obtained is put into the language model after training, and by beam-search modeIn the insertion language model, revised text is exported.
Beneficial effects of the present invention are as follows:
(1) directly sequence label can be learnt, is marked without others;
(2) original image pixels are directly based upon and extract feature, do not need to carry out binaryzation, Character segmentation, character locating etc.Image pretreatment operation;
(3) Tag Estimation is carried out using recurrent neural network, it can direct output character sequence prediction result;
(4) length of recognition result is not limited, while calculating loss using CTC, so that character is in character stringIn position be also not limited;
(5) recurrent neural network is used, it is empty to consume less storage using less network weight for the layer that connects more complete than traditionBetween, there is preferable recognition accuracy and robustness, while full convolution is decoded using beam-search method insertion language model and is passedReturn network, further increases discrimination.
Detailed description of the invention
Fig. 1 is the flow diagram of character image serializing identification of the invention, structural data output method.
Specific embodiment
Below according to attached drawing and preferred embodiment the present invention is described in detail, the objects and effects of the present invention will become brighterWhite, below in conjunction with drawings and examples, the present invention will be described in further detail.It should be appreciated that described herein specificEmbodiment is only used to explain the present invention, is not intended to limit the present invention.
As shown in Figure 1, a kind of character image serializing identification, structural data output method, this method include following stepIt is rapid:
S1: multiple character image blocks are obtained;
S2: using full depth convolutional neural networks (deep neural network is the double-deck Recognition with Recurrent Neural Network) to each text figureAs block progress text image feature extraction, each character image block is expressed as feature vector;Specially with the figure of arbitrary sizePiece exports response diagram of corresponding size as input, and each position corresponds to an acceptance region of original image and complete in the response diagramConvolutional neural networks share convolution response diagram, as feature vector;
S3: being handled the feature vector using deep neural network, and exports one about the general of character setRate distribution;
S4: using connection chronological classification layer (Connectionist Temporal Classifier, hereinafter referred to as CTC)As transcription layer, the dynamic programming algorithm that the probability distribution about character set is propagated using forward calculation and reversed gradient is defeatedComputer readable text out;CTC is a kind of probability function for converting prediction result to label sequence, for input feature vector andThe uncertain time series problem of alignment relation between output label, can automatic end-to-end ground Optimized model parameter and right simultaneouslyThe boundary of neat cutting.
The picture of 256 size of 32x in example, maximum can cutting 256 arrange, that is, input feature vector maximum 256, and exportingThe length maximum setting of label is 18, this to be optimized with CTC model.
About CTC model, it is assumed that the picture of 32x 256, numeric string label are " 123 ", and picture is pressed column cutting (CTC meetingOptimize segmentation model), every piece then branched away goes identification number again, and it is each digital or spcial character general for finding out this blockRate (unrecognized to be then labeled as spcial character "-"), has thus obtained each based on input feature vector sequence (picture)The generic probability distribution of mutually indepedent modeling unit individual (marking off the block come) (including "-" node).Based on probability pointCloth calculates the probability P (123) that sequence label is " 123 ", sets the probability of " 123 " here as the sum of all subsequences, this liningSequence include '-' and ' 1', ' 2', ' 3' continuously repeats.
S5: carrying out error correction to computer readable text using language model, obtain structural data and export, specifically:
Establish corpus, and with training term vector and language model;
The computer readable text that S4 is obtained is put into the language model after training, and beam-search mode is embedded in instituteIn the language model stated, revised text is exported.
The image data that daily workout generates belongs under more satisfactory, noiseless environment, is easy accuracy rate and just reaches100%, actual production environment picture may some line segments or discrete point noise, can be voluntarily in generating training setIncrease some noises, improves test model training effect.
It will appreciated by the skilled person that being not used to limit the foregoing is merely the preferred embodiment of inventionSystem invention, although invention is described in detail referring to previous examples, for those skilled in the art, stillIt can modify to the technical solution of aforementioned each case history or equivalent replacement of some of the technical features.It is allWithin the spirit and principle of invention, modification, equivalent replacement for being made etc. be should be included within the protection scope of invention.

Claims (4)

CN201811614263.6A2018-12-272018-12-27A kind of character image serializing identification, structural data output methodPendingCN109726715A (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
CN201811614263.6ACN109726715A (en)2018-12-272018-12-27A kind of character image serializing identification, structural data output method

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
CN201811614263.6ACN109726715A (en)2018-12-272018-12-27A kind of character image serializing identification, structural data output method

Publications (1)

Publication NumberPublication Date
CN109726715Atrue CN109726715A (en)2019-05-07

Family

ID=66296477

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CN201811614263.6APendingCN109726715A (en)2018-12-272018-12-27A kind of character image serializing identification, structural data output method

Country Status (1)

CountryLink
CN (1)CN109726715A (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN110399486A (en)*2019-07-022019-11-01精硕科技(北京)股份有限公司A kind of classification method, device and equipment, storage medium
CN110837838A (en)*2019-11-062020-02-25创新奇智(重庆)科技有限公司End-to-end frame number identification system and method based on deep learning
CN110942004A (en)*2019-11-202020-03-31深圳追一科技有限公司Handwriting recognition method and device based on neural network model and electronic equipment
CN111062397A (en)*2019-12-182020-04-24厦门商集网络科技有限责任公司Intelligent bill processing system
CN111353397A (en)*2020-02-222020-06-30郑州铁路职业技术学院 Structured sharing system of Chinese blackboard writing in online classroom based on big data and OCR
CN112508023A (en)*2020-10-272021-03-16重庆大学Deep learning-based end-to-end identification method for code-spraying characters of parts
CN114757840A (en)*2022-03-242022-07-15北京字跳网络技术有限公司 Image processing method, apparatus, readable medium and electronic device
CN119314187A (en)*2024-09-292025-01-14中国人民解放军海军工程大学 A method and system for correcting errors in paper equipment operation record recognition results

Citations (4)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN106570456A (en)*2016-10-132017-04-19华南理工大学Handwritten Chinese character recognition method based on full-convolution recursive network
CN107145853A (en)*2017-04-282017-09-08深圳市唯特视科技有限公司A kind of scene image text suggesting method based on full convolutional network
WO2018089762A1 (en)*2016-11-112018-05-17Ebay Inc.Online personal assistant with image text localization
CN108090400A (en)*2016-11-232018-05-29中移(杭州)信息技术有限公司A kind of method and apparatus of image text identification

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN106570456A (en)*2016-10-132017-04-19华南理工大学Handwritten Chinese character recognition method based on full-convolution recursive network
WO2018089762A1 (en)*2016-11-112018-05-17Ebay Inc.Online personal assistant with image text localization
CN108090400A (en)*2016-11-232018-05-29中移(杭州)信息技术有限公司A kind of method and apparatus of image text identification
CN107145853A (en)*2017-04-282017-09-08深圳市唯特视科技有限公司A kind of scene image text suggesting method based on full convolutional network

Cited By (9)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN110399486A (en)*2019-07-022019-11-01精硕科技(北京)股份有限公司A kind of classification method, device and equipment, storage medium
CN110837838A (en)*2019-11-062020-02-25创新奇智(重庆)科技有限公司End-to-end frame number identification system and method based on deep learning
CN110942004A (en)*2019-11-202020-03-31深圳追一科技有限公司Handwriting recognition method and device based on neural network model and electronic equipment
CN111062397A (en)*2019-12-182020-04-24厦门商集网络科技有限责任公司Intelligent bill processing system
CN111353397A (en)*2020-02-222020-06-30郑州铁路职业技术学院 Structured sharing system of Chinese blackboard writing in online classroom based on big data and OCR
CN111353397B (en)*2020-02-222021-01-01郑州铁路职业技术学院 Structured sharing system of Chinese blackboard writing in online classroom based on big data and OCR
CN112508023A (en)*2020-10-272021-03-16重庆大学Deep learning-based end-to-end identification method for code-spraying characters of parts
CN114757840A (en)*2022-03-242022-07-15北京字跳网络技术有限公司 Image processing method, apparatus, readable medium and electronic device
CN119314187A (en)*2024-09-292025-01-14中国人民解放军海军工程大学 A method and system for correcting errors in paper equipment operation record recognition results

Similar Documents

PublicationPublication DateTitle
CN109726715A (en)A kind of character image serializing identification, structural data output method
CN109635150B (en)Text generation method, device and storage medium
CN112733866B (en)Network construction method for improving text description correctness of controllable image
CN110609891A (en) A Visual Dialogue Generation Method Based on Context-Aware Graph Neural Network
CN111460824B (en)Unmarked named entity identification method based on anti-migration learning
CN111738169A (en) A Handwritten Formula Recognition Method Based on End-to-End Network Model
CN111274804A (en)Case information extraction method based on named entity recognition
CN104680178B (en)Image classification method based on transfer learning multi attractor cellular automaton
CN110415309B (en) Method to realize automatic generation of fingerprint images based on generative adversarial network
CN109657039A (en)A kind of track record information extraction method based on the double-deck BiLSTM-CRF
CN108427740A (en)A kind of Image emotional semantic classification and searching algorithm based on depth measure study
CN110175248A (en)A kind of Research on face image retrieval and device encoded based on deep learning and Hash
BoscoA genetic algorithm for image segmentation
CN113420552B (en)Biomedical multi-event extraction method based on reinforcement learning
CN108446334A (en)Image retrieval method based on content for unsupervised countermeasure training
Uehara et al.Visual question generation for class acquisition of unknown objects
CN111159332A (en)Text multi-intention identification method based on bert
CN113919358A (en)Named entity identification method and system based on active learning
Georgieva et al.Optical character recognition for autonomous stores
CN117746078B (en)Object detection method and system based on user-defined category
CN109740151A (en)Public security notes name entity recognition method based on iteration expansion convolutional neural networks
CN109992770A (en) A Lao Named Entity Recognition Method Based on Combinatorial Neural Network
CN116450829A (en)Medical text classification method, device, equipment and medium
CN112084913A (en) An end-to-end human detection and attribute recognition method
CN111291705B (en)Pedestrian re-identification method crossing multiple target domains

Legal Events

DateCodeTitleDescription
PB01Publication
PB01Publication
SE01Entry into force of request for substantive examination
SE01Entry into force of request for substantive examination
WD01Invention patent application deemed withdrawn after publication

Application publication date:20190507

WD01Invention patent application deemed withdrawn after publication

[8]ページ先頭

©2009-2025 Movatter.jp