Movatterモバイル変換


[0]ホーム

URL:


US20170352170A1 - Nearsighted camera object detection - Google Patents

Nearsighted camera object detection
Download PDF

Info

Publication number
US20170352170A1
US20170352170A1US15/626,416US201715626416AUS2017352170A1US 20170352170 A1US20170352170 A1US 20170352170A1US 201715626416 AUS201715626416 AUS 201715626416AUS 2017352170 A1US2017352170 A1US 2017352170A1
Authority
US
United States
Prior art keywords
image
camera
document
characters
source
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US15/626,416
Inventor
Scott E. Barton
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Sage Software Inc
Original Assignee
Sage Software Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by Sage Software IncfiledCriticalSage Software Inc
Priority to US15/626,416priorityCriticalpatent/US20170352170A1/en
Assigned to SAGE SOFTWARE, INC.reassignmentSAGE SOFTWARE, INC.ASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS).Assignors: BARTON, Scott E.
Publication of US20170352170A1publicationCriticalpatent/US20170352170A1/en
Abandonedlegal-statusCriticalCurrent

Links

Images

Classifications

Definitions

Landscapes

Abstract

A system and process of nearsighted (myopia) camera object detection involves detecting the objects through edge detection and outlining or thickening them with a heavy border. Thickening may include making the object bold in the case of text characters. The bold characters are then much more apparent and heavier weighted than the background. Thresholding operations are then applied (usually multiple times) to the grayscale image to remove all but the darkest foreground objects in the background resulting in a nearsighted (myopic) image. Additional processes may be applied to the nearsighted image, such as morphological closing, contour tracing and bounding of the objects or characters. The bound objects or characters can then be averaged to provide repositioning feedback for the camera user. Processed images can then be captured and subjected to OCR to extract relevant information from the image.

Description

Claims (21)

48. A method of generating, during acquisition via a camera of an image of a document, a plurality of pre-processed images of the document, the pre-processed images being used to optimize a capture position of the camera when capturing the image of the document for optical character recognition, the method comprising:
obtaining a plurality of source images, including a first source image and a second source image, continuously acquired via the camera of a computing device, each of the obtained plurality of source images containing characters associated with the document, wherein the first source image is acquired by the camera at a first capture position, and wherein the second source image is acquired by the camera at a second capture position, wherein the first capture position is different from the second capture position;
for each of the plurality of obtained source images, pre-processing a given obtained source image to generate, by a processor of the computing device, a pre-processed image of the given obtained source image; and
presenting, on a graphical user interface, via a display of the computing device, i) the pre-processed image and ii) a graphical indicator to guide physical repositioning of the camera to capture an image to be used by an optical character recognition operation to determine characters of the image of the document, wherein the graphical widget presents one or more parameters associated with the camera being in an determined appropriate range position.
60. A system of generating, during acquisition via a camera of an image of a document, a plurality of pre-processed images of the document, the pre-processed images being used to optimize a capture position of the camera when capturing the image of the document for optical character recognition, the system comprising:
a camera;
a processor; and
a memory operatively coupled to the processor, the memory having instructions stored thereon, wherein execution of the instructions by the processor, cause the processor to:
obtain a plurality of source images, including a first source image and a second source image, continuously acquired via the camera, each of the obtained plurality of source images containing characters associated with the document, wherein the first source image is acquired by the camera at a first capture position, and wherein the second source image is acquired by the camera at a second capture position, wherein the first capture position is different from the second capture position;
for each of the plurality of obtained source images, pre-process a given obtained source image to generate a pre-processed image of the given obtained source image; and
present, on a graphical user interface, via a display, i) the pre-processed image and ii) a graphical indicator to guide physical repositioning of the camera to capture an image to be used by an optical character recognition operation to determine characters of the image of the document, wherein the graphical widget presents one or more parameters associated with the camera being in an determined appropriate range position.
67. A non-transitory computer readable medium for capturing the image of the document for optical character recognition, the computer readable medium having instructions stored thereon, wherein execution of the instructions by a processor of a computing device, cause the processor to:
obtain a plurality of source images, including a first source image and a second source image, continuously acquired via a camera, each of the obtained plurality of source images containing characters associated with the document, wherein the first source image is acquired by the camera at a first capture position, and wherein the second source image is acquired by the camera at a second capture position, wherein the first capture position is different from the second capture position;
for each of the plurality of obtained source images, pre-process a given obtained source image to generate a pre-processed image of the given obtained source image; and
present, on a graphical user interface, via a display of the computing device, i) the pre-processed image and ii) a graphical indicator to guide physical repositioning of the camera to capture an image to be used by an optical character recognition operation to determine characters of the image of the document, wherein the graphical widget presents one or more parameters associated with the camera being in an determined appropriate range position.
US15/626,4162015-07-082017-06-19Nearsighted camera object detectionAbandonedUS20170352170A1 (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
US15/626,416US20170352170A1 (en)2015-07-082017-06-19Nearsighted camera object detection

Applications Claiming Priority (2)

Application NumberPriority DateFiling DateTitle
US14/794,328US9684984B2 (en)2015-07-082015-07-08Nearsighted camera object detection
US15/626,416US20170352170A1 (en)2015-07-082017-06-19Nearsighted camera object detection

Related Parent Applications (1)

Application NumberTitlePriority DateFiling Date
US14/794,328ContinuationUS9684984B2 (en)2015-07-082015-07-08Nearsighted camera object detection

Publications (1)

Publication NumberPublication Date
US20170352170A1true US20170352170A1 (en)2017-12-07

Family

ID=57686064

Family Applications (2)

Application NumberTitlePriority DateFiling Date
US14/794,328Expired - Fee RelatedUS9684984B2 (en)2015-07-082015-07-08Nearsighted camera object detection
US15/626,416AbandonedUS20170352170A1 (en)2015-07-082017-06-19Nearsighted camera object detection

Family Applications Before (1)

Application NumberTitlePriority DateFiling Date
US14/794,328Expired - Fee RelatedUS9684984B2 (en)2015-07-082015-07-08Nearsighted camera object detection

Country Status (2)

CountryLink
US (2)US9684984B2 (en)
WO (1)WO2017008029A1 (en)

Families Citing this family (3)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US10037459B2 (en)*2016-08-192018-07-31Sage Software, Inc.Real-time font edge focus measurement for optical character recognition (OCR)
CN109684511A (en)*2018-12-102019-04-26上海七牛信息技术有限公司A kind of video clipping method, video aggregation method, apparatus and system
CN109727192B (en)*2018-12-282023-06-27北京旷视科技有限公司 Method and device for image processing

Family Cites Families (25)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
JPS5539025B2 (en)1973-07-091980-10-08
CA2077969C (en)*1991-11-191997-03-04Daniel P. HuttenlocherMethod of deriving wordshapes for subsequent comparison
JP3148072B2 (en)*1994-04-082001-03-19シャープ株式会社 Exposure control method and exposure control device
JP3777785B2 (en)1998-03-182006-05-24コニカミノルタビジネステクノロジーズ株式会社 Image processing device
US6671395B1 (en)1999-10-152003-12-30D. Michael OttDocument image processing with stroke preservation and background suppression
US6640010B2 (en)*1999-11-122003-10-28Xerox CorporationWord-to-word selection on images
US7043080B1 (en)2000-11-212006-05-09Sharp Laboratories Of America, Inc.Methods and systems for text detection in mixed-context documents using local geometric signatures
US6778700B2 (en)2001-03-142004-08-17Electronics For Imaging, Inc.Method and apparatus for text detection
JP2004040395A (en)*2002-07-022004-02-05Fujitsu Ltd Image distortion correction apparatus, method and program
US8320708B2 (en)2004-04-022012-11-27K-Nfb Reading Technology, Inc.Tilt adjustment for optical character recognition in portable reading machine
US7899258B2 (en)*2005-08-122011-03-01Seiko Epson CorporationSystems and methods to convert images into high-quality compressed documents
US8059170B2 (en)*2006-08-282011-11-15Creative Technology Ltd.Method and system for processing a video instant message
US7899248B2 (en)*2007-08-302011-03-01Seiko Epson CorporationFast segmentation of images
JP2009193356A (en)*2008-02-142009-08-27Canon Inc Image processing apparatus, image processing method, program, and storage medium
US8649600B2 (en)*2009-07-102014-02-11Palo Alto Research Center IncorporatedSystem and method for segmenting text lines in documents
JP5709906B2 (en)2010-02-242015-04-30アイピープレックス ホールディングス コーポレーション Augmented reality panorama for the visually impaired
US8391602B2 (en)*2010-04-082013-03-05University Of CalcuttaCharacter recognition
CN103177709B (en)2011-12-202015-03-11北大方正集团有限公司Method and device for displaying characters
US9992471B2 (en)*2012-03-152018-06-05Fuji Xerox Co., Ltd.Generating hi-res dewarped book images
GB201217721D0 (en)*2012-10-032012-11-14Holition LtdVideo image processing
US9058644B2 (en)2013-03-132015-06-16Amazon Technologies, Inc.Local image enhancement for text recognition
US8837833B1 (en)*2013-06-302014-09-16Google Inc.Payment card OCR with relaxed alignment
US9697431B2 (en)2013-08-162017-07-04Conduent Business Services, LlcMobile document capture assist for optimized text recognition
US9262689B1 (en)*2013-12-182016-02-16Amazon Technologies, Inc.Optimizing pre-processing times for faster response
US9563812B2 (en)*2015-04-082017-02-07Toshiba Tec Kabushiki KaishaImage processing apparatus, image processing method and computer-readable storage medium

Also Published As

Publication numberPublication date
US20170011275A1 (en)2017-01-12
US9684984B2 (en)2017-06-20
WO2017008029A1 (en)2017-01-12

Similar Documents

PublicationPublication DateTitle
US9785850B2 (en)Real time object measurement
JP6501092B2 (en) Image processing apparatus and method for foreground mask correction for object segmentation
US8594439B2 (en)Image processing
US9576210B1 (en)Sharpness-based frame selection for OCR
CN108090511B (en)Image classification method and device, electronic equipment and readable storage medium
US8811751B1 (en)Method and system for correcting projective distortions with elimination steps on multiple levels
US20070253040A1 (en)Color scanning to enhance bitonal image
US8897600B1 (en)Method and system for determining vanishing point candidates for projective correction
US20050249429A1 (en)Method, apparatus, and program for image processing
CN107886026B (en)graphic code processing method and device
CN103034856B (en)The method of character area and device in positioning image
CN118275449B (en)Copper strip surface defect detection method, device and equipment
CN112396050B (en)Image processing method, device and storage medium
US9418316B1 (en)Sharpness-based frame selection for OCR
JP2016523397A (en) Method and system for information recognition
WO2013148566A1 (en)Image blur detection
CN109993739B (en)Seal authenticity identification method and device
US11037017B2 (en)Method and device for obtaining image of form sheet
CN108764328A (en)The recognition methods of Terahertz image dangerous material, device, equipment and readable storage medium storing program for executing
CN106934806A (en)It is a kind of based on text structure without with reference to figure fuzzy region dividing method out of focus
US8913836B1 (en)Method and system for correcting projective distortions using eigenpoints
US10037459B2 (en)Real-time font edge focus measurement for optical character recognition (OCR)
US20170352170A1 (en)Nearsighted camera object detection
CN110473222A (en)Image-element extracting method and device
CN105139391A (en)Edge detecting method for traffic image in fog-and-haze weather

Legal Events

DateCodeTitleDescription
ASAssignment

Owner name:SAGE SOFTWARE, INC., CALIFORNIA

Free format text:ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:BARTON, SCOTT E.;REEL/FRAME:042758/0053

Effective date:20170412

STCBInformation on status: application discontinuation

Free format text:EXPRESSLY ABANDONED -- DURING EXAMINATION


[8]ページ先頭

©2009-2025 Movatter.jp