Movatterモバイル変換


[0]ホーム

URL:


US20090116757A1 - Systems and methods for classifying electronic documents by extracting and recognizing text and image features indicative of document categories - Google Patents

Systems and methods for classifying electronic documents by extracting and recognizing text and image features indicative of document categories
Download PDF

Info

Publication number
US20090116757A1
US20090116757A1US12/266,472US26647208AUS2009116757A1US 20090116757 A1US20090116757 A1US 20090116757A1US 26647208 AUS26647208 AUS 26647208AUS 2009116757 A1US2009116757 A1US 2009116757A1
Authority
US
United States
Prior art keywords
document
image
documents
text
job
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US12/266,472
Inventor
Depankar Neogi
Steven K. Ladd
Venugopal Govindaraju
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Gruntworx LLC
Original Assignee
COPANION Inc
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by COPANION IncfiledCriticalCOPANION Inc
Priority to US12/266,472priorityCriticalpatent/US20090116757A1/en
Publication of US20090116757A1publicationCriticalpatent/US20090116757A1/en
Assigned to COPANION, INC.reassignmentCOPANION, INC.PROPRIETARY INFORMATION AND INVENTIONS AGREEMENTAssignors: LADD, STEVEN, NEOGI, DEPANKAR, GOVINDARAJU, VENUGOPAL
Assigned to GRUNTWORX, LLCreassignmentGRUNTWORX, LLCASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS).Assignors: COPANION, INC.
Assigned to GRUNTWORX, LLCreassignmentGRUNTWORX, LLCASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS).Assignors: COPANION, INC.
Abandonedlegal-statusCriticalCurrent

Links

Images

Classifications

Definitions

Landscapes

Abstract

A method in a document analysis system automatically extracts from each received electronic document image and text features, in which the image features are indicative of how the document is laid out or textually-organized and therefore indicative of a corresponding document category, next compares the extracted image and text features with feature sets associated with each document category, and then classifies each document to a document category, the feature set of which best matches the extracted features of the document.

Description

Claims (1)

1. In a document analysis system that receives and processes jobs, a method of automatically recognizing and classifying each document in a job into a corresponding document category by automatically recognizing image and text features in the document so that each job may be automatically organized according to the categories of documents it contains, the method comprising:
automatically extracting from each received document image and text features, in which the image features are indicative of how the document is laid out or textually-organized and therefore indicative of a corresponding document category, and the text features are distinctive words that are indicative of a corresponding document category;
comparing the extracted image and text features with feature sets associated with each category of document, in which each feature set includes a subset of text features and corresponding weights and a subset of image features and corresponding weights;
classifying each document to a document category, the feature set of which best matches the extracted features of said document; and
organizing each job according to the categories of documents it contains.
US12/266,4722007-11-062008-11-06Systems and methods for classifying electronic documents by extracting and recognizing text and image features indicative of document categoriesAbandonedUS20090116757A1 (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
US12/266,472US20090116757A1 (en)2007-11-062008-11-06Systems and methods for classifying electronic documents by extracting and recognizing text and image features indicative of document categories

Applications Claiming Priority (2)

Application NumberPriority DateFiling DateTitle
US98585107P2007-11-062007-11-06
US12/266,472US20090116757A1 (en)2007-11-062008-11-06Systems and methods for classifying electronic documents by extracting and recognizing text and image features indicative of document categories

Publications (1)

Publication NumberPublication Date
US20090116757A1true US20090116757A1 (en)2009-05-07

Family

ID=40588156

Family Applications (6)

Application NumberTitlePriority DateFiling Date
US12/266,454AbandonedUS20090116755A1 (en)2007-11-062008-11-06Systems and methods for enabling manual classification of unrecognized documents to complete workflow for electronic jobs and to assist machine learning of a recognition system using automatically extracted features of unrecognized documents
US12/266,472AbandonedUS20090116757A1 (en)2007-11-062008-11-06Systems and methods for classifying electronic documents by extracting and recognizing text and image features indicative of document categories
US12/266,462AbandonedUS20090116736A1 (en)2007-11-062008-11-06Systems and methods to automatically classify electronic documents using extracted image and text features and using a machine learning subsystem
US12/266,468AbandonedUS20090116746A1 (en)2007-11-062008-11-06Systems and methods for parallel processing of document recognition and classification using extracted image and text features
US12/266,469AbandonedUS20090116756A1 (en)2007-11-062008-11-06Systems and methods for training a document classification system using documents from a plurality of users
US12/266,465Active2029-05-25US8538184B2 (en)2007-11-062008-11-06Systems and methods for handling and distinguishing binarized, background artifacts in the vicinity of document text and image features indicative of a document category

Family Applications Before (1)

Application NumberTitlePriority DateFiling Date
US12/266,454AbandonedUS20090116755A1 (en)2007-11-062008-11-06Systems and methods for enabling manual classification of unrecognized documents to complete workflow for electronic jobs and to assist machine learning of a recognition system using automatically extracted features of unrecognized documents

Family Applications After (4)

Application NumberTitlePriority DateFiling Date
US12/266,462AbandonedUS20090116736A1 (en)2007-11-062008-11-06Systems and methods to automatically classify electronic documents using extracted image and text features and using a machine learning subsystem
US12/266,468AbandonedUS20090116746A1 (en)2007-11-062008-11-06Systems and methods for parallel processing of document recognition and classification using extracted image and text features
US12/266,469AbandonedUS20090116756A1 (en)2007-11-062008-11-06Systems and methods for training a document classification system using documents from a plurality of users
US12/266,465Active2029-05-25US8538184B2 (en)2007-11-062008-11-06Systems and methods for handling and distinguishing binarized, background artifacts in the vicinity of document text and image features indicative of a document category

Country Status (2)

CountryLink
US (6)US20090116755A1 (en)
WO (1)WO2009061917A1 (en)

Cited By (12)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US20090116746A1 (en)*2007-11-062009-05-07Copanion, Inc.Systems and methods for parallel processing of document recognition and classification using extracted image and text features
US20110255790A1 (en)*2010-01-152011-10-20Copanion, Inc.Systems and methods for automatically grouping electronic document pages
US8428375B2 (en)*2010-11-172013-04-23Via Technologies, Inc.System and method for data compression and decompression in a graphics processing system
US8701167B2 (en)*2009-05-282014-04-15Kjaya, LlcMethod and system for fast access to advanced visualization of medical scans using a dedicated web portal
CN105469028A (en)*2014-09-292016-04-06株式会社东芝Information processing device, information processing method
RU2641225C2 (en)*2014-01-212018-01-16Общество с ограниченной ответственностью "Аби Девелопмент"Method of detecting necessity of standard learning for verification of recognized text
US9881053B2 (en)*2016-05-132018-01-30Maana, Inc.Machine-assisted object matching
US10204143B1 (en)2011-11-022019-02-12Dub Software Group, Inc.System and method for automatic document management
US10482462B1 (en)2016-03-182019-11-19Wells Fargo Bank, N.A.Automatic teller machine game-based authentication functionality
US10726955B2 (en)*2009-05-282020-07-28Ai Visualize, Inc.Method and system for fast access to advanced visualization of medical scans using a dedicated web portal
US10839302B2 (en)2015-11-242020-11-17The Research Foundation For The State University Of New YorkApproximate value iteration with complex returns by bounding
US20230260657A1 (en)*2009-05-282023-08-17Ai Visualize, Inc.Method and system for fast access to advanced visualization of medical scans using a dedicated web portal

Families Citing this family (99)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
RU2635259C1 (en)*2016-06-222017-11-09Общество с ограниченной ответственностью "Аби Девелопмент"Method and device for determining type of digital document
WO2007011841A2 (en)*2005-07-152007-01-25Indxit Systems, Inc.Systems and methods for data indexing and processing
US7917492B2 (en)*2007-09-212011-03-29Limelight Networks, Inc.Method and subsystem for information acquisition and aggregation to facilitate ontology and language-model generation within a content-search-service system
US8966389B2 (en)*2006-09-222015-02-24Limelight Networks, Inc.Visual interface for identifying positions of interest within a sequentially ordered information encoding
US9015172B2 (en)2006-09-222015-04-21Limelight Networks, Inc.Method and subsystem for searching media content within a content-search service system
US20090210786A1 (en)*2008-02-192009-08-20Kabushiki Kaisha ToshibaImage processing apparatus and image processing method
US8671112B2 (en)*2008-06-122014-03-11Athenahealth, Inc.Methods and apparatus for automated image classification
US8713007B1 (en)*2009-03-132014-04-29Google Inc.Classifying documents using multiple classifiers
EP2488963A1 (en)*2009-10-152012-08-22Rogers Communications Inc.System and method for phrase identification
JP2011123598A (en)*2009-12-092011-06-23Canon IncImage discriminating apparatus and method, and program
US20110191145A1 (en)*2010-02-022011-08-04Bank Of America CorporationDigital Records Management
JP5703748B2 (en)*2010-03-172015-04-22株式会社リコー Management system, management method, and temporary storage document server
WO2011140536A1 (en)*2010-05-072011-11-10Purdue Research FoundationQuantitative image analysis for wound healing assay
US8885931B2 (en)*2011-01-262014-11-11Microsoft CorporationMitigating use of machine solvable HIPs
US8379980B2 (en)2011-03-252013-02-19Intel CorporationSystem, method and computer program product for document image analysis using feature extraction functions
KR101849696B1 (en)*2011-07-192018-04-17삼성전자주식회사Method and apparatus for obtaining informaiton of lighting and material in image modeling system
US8527532B2 (en)*2012-01-312013-09-03Adobe Systems IncorporatedTransforming function calls for interaction with hierarchical data structures
JP2014036314A (en)*2012-08-082014-02-24Canon IncScan service system, scan service method, and scan service program
JP5895777B2 (en)*2012-09-062016-03-30富士ゼロックス株式会社 Information classification program and information processing apparatus
US9348899B2 (en)2012-10-312016-05-24Open Text CorporationAuto-classification system and method with dynamic user feedback
US9286379B2 (en)*2012-11-262016-03-15Wal-Mart Stores, Inc.Document quality measurement
US8885951B1 (en)2012-12-142014-11-11Tony CristofanoSystem and method for data identification and extraction of forms
FR3000585A1 (en)*2012-12-312014-07-04Eads Europ Aeronautic Defence METHOD FOR ANALYZING GEOGRAPHIC REGIONS AND DETECTING ZONES OF INTEREST
US9703855B1 (en)*2013-04-152017-07-11Ca, Inc.System and method for classifying content with a web service
US10162829B2 (en)*2013-09-032018-12-25Adobe Systems IncorporatedAdaptive parallel data processing
US9286372B2 (en)2013-11-062016-03-15Sap SeContent management with RDBMS
TWI505207B (en)*2014-03-262015-10-21Excellence Inc E Electronic official document automatic delivery system and method
US20170109439A1 (en)*2014-06-032017-04-20Hewlett-Packard Development Company, L.P.Document classification based on multiple meta-algorithmic patterns
US20170046350A1 (en)*2014-09-242017-02-16Hewlett-Packard Development Company, L.P.Media organization
US9367899B1 (en)*2015-05-292016-06-14Konica Minolta Laboratory U.S.A., Inc.Document image binarization method
US10726281B2 (en)*2015-07-292020-07-28Invensense, Inc.Method and apparatus for user and moving vehicle detection
CN105426462A (en)*2015-11-132016-03-23深圳码隆科技有限公司Image searching method and device based on image element
EP3196811A1 (en)*2016-01-202017-07-26Accenture Global Services LimitedCognitive document reader
US10776399B1 (en)2016-06-062020-09-15Casepoint LLCDocument classification prediction and content analytics using artificial intelligence
US10095747B1 (en)2016-06-062018-10-09@Legal Discovery LLCSimilar document identification using artificial intelligence
US10725896B2 (en)2016-07-152020-07-28Intuit Inc.System and method for identifying a subset of total historical users of a document preparation system to represent a full set of test scenarios based on code coverage
US10579721B2 (en)2016-07-152020-03-03Intuit Inc.Lean parsing: a natural language processing system and method for parsing domain-specific languages
US11222266B2 (en)2016-07-152022-01-11Intuit Inc.System and method for automatic learning of functions
US10140277B2 (en)2016-07-152018-11-27Intuit Inc.System and method for selecting data sample groups for machine learning of context of data fields for various document types and/or for test data generation for quality assurance systems
US11049190B2 (en)2016-07-152021-06-29Intuit Inc.System and method for automatically generating calculations for fields in compliance forms
US9984471B2 (en)*2016-07-262018-05-29Intuit Inc.Label and field identification without optical character recognition (OCR)
CN109964224A (en)2016-09-222019-07-02恩芙润斯公司 Systems, methods, and computer-readable media for semantic information visualization and temporal signal inference indicating significant associations between life science entities
US10607101B1 (en)*2016-12-142020-03-31Revenue Management Solutions, LlcSystem and method for patterned artifact removal for bitonal images
US10331732B1 (en)*2016-12-162019-06-25National Technology & Engineering Solutions Of Sandia, LlcInformation searching system
US11568148B1 (en)2017-02-172023-01-31Narrative Science Inc.Applied artificial intelligence technology for narrative generation based on explanation communication goals
WO2018150211A1 (en)*2017-02-202018-08-23Csiba AndrasMethod for handling documents on ontology base
US10884981B1 (en)2017-06-192021-01-05Wells Fargo Bank, N.A.Tagging tool for managing data
US10663298B2 (en)*2017-06-252020-05-26Invensense, Inc.Method and apparatus for characterizing platform motion
CN107480711B (en)*2017-08-042020-09-01合肥美的智能科技有限公司Image recognition method and device, computer equipment and readable storage medium
CN107563379B (en)*2017-09-022019-12-24西安电子科技大学 A method for localizing text in images of natural scenes
EP3685284A4 (en)*2017-09-222021-06-16Intuit Inc.Lean parsing: a natural language processing system and method for parsing domain-specific languages
RU2672395C1 (en)*2017-09-292018-11-14Акционерное общество "Лаборатория Касперского"Method for training a classifier designed for determining the category of a document
US11176363B2 (en)2017-09-292021-11-16AO Kaspersky LabSystem and method of training a classifier for determining the category of a document
US11816435B1 (en)2018-02-192023-11-14Narrative Science Inc.Applied artificial intelligence technology for contextualizing words to a knowledge base using natural language processing
US10546054B1 (en)*2018-02-282020-01-28Intuit Inc.System and method for synthetic form image generation
RU2695489C1 (en)*2018-03-232019-07-23Общество с ограниченной ответственностью "Аби Продакшн"Identification of fields on an image using artificial intelligence
US10162850B1 (en)*2018-04-102018-12-25Icertis, Inc.Clause discovery for validation of documents
US11042713B1 (en)2018-06-282021-06-22Narrative Scienc Inc.Applied artificial intelligence technology for using natural language processing to train a natural language generation system
US10936974B2 (en)2018-12-242021-03-02Icertis, Inc.Automated training and selection of models for document analysis
CN111383299B (en)*2018-12-282022-09-06Tcl科技集团股份有限公司Image processing method and device and computer readable storage medium
US11462037B2 (en)2019-01-112022-10-04Walmart Apollo, LlcSystem and method for automated analysis of electronic travel data
US10990767B1 (en)2019-01-282021-04-27Narrative Science Inc.Applied artificial intelligence technology for adaptive natural language understanding
US10726374B1 (en)2019-02-192020-07-28Icertis, Inc.Risk prediction based on automated analysis of documents
JP7243286B2 (en)*2019-02-252023-03-22コニカミノルタ株式会社 Image forming device and document management system
US11373029B2 (en)2019-04-012022-06-28Hyland Uk Operations LimitedSystem and method integrating machine learning algorithms to enrich documents in a content management system
US10657603B1 (en)*2019-04-032020-05-19Progressive Casualty Insurance CompanyIntelligent routing control
US11151660B1 (en)*2019-04-032021-10-19Progressive Casualty Insurance CompanyIntelligent routing control
US11783005B2 (en)2019-04-262023-10-10Bank Of America CorporationClassifying and mapping sentences using machine learning
US11328025B1 (en)2019-04-262022-05-10Bank Of America CorporationValidating mappings between documents using machine learning
US11163956B1 (en)2019-05-232021-11-02Intuit Inc.System and method for recognizing domain specific named entities using domain specific word embeddings
WO2020243846A1 (en)*2019-06-062020-12-10Bear Health Technologies Inc.System and method for automated file reporting
US11487902B2 (en)2019-06-212022-11-01nference, inc.Systems and methods for computing with private healthcare data
US11545242B2 (en)2019-06-212023-01-03nference, inc.Systems and methods for computing with private healthcare data
US12333393B2 (en)2019-06-212025-06-17nference, inc.Systems and methods for adaptively improving the performance of locked machine learning programs
KR20210001760A (en)2019-06-282021-01-06휴렛-팩커드 디벨롭먼트 컴퍼니, 엘.피.Detecting and processing multi feeding
JP7698626B2 (en)2019-07-162025-06-25エヌフェレンス,インコーポレイテッド A system and method for inserting data into a database structured based on a pictorial representation of a data table.
US11556711B2 (en)2019-08-272023-01-17Bank Of America CorporationAnalyzing documents using machine learning
US11423231B2 (en)2019-08-272022-08-23Bank Of America CorporationRemoving outliers from training data for machine learning
US11526804B2 (en)2019-08-272022-12-13Bank Of America CorporationMachine learning model training for reviewing documents
US11449559B2 (en)2019-08-272022-09-20Bank Of America CorporationIdentifying similar sentences for machine learning
RU2019128026A (en)2019-09-052021-03-05Общество С Ограниченной Ответственностью «Яндекс» METHOD AND SYSTEM FOR RANKING A SET OF DIGITAL DOCUMENTS
CN110781234A (en)*2019-10-242020-02-11北京锐安科技有限公司TRS database retrieval method, device, equipment and storage medium
KR20210066398A (en)*2019-11-282021-06-07휴렛-팩커드 디벨롭먼트 컴퍼니, 엘.피.Document management of image forming device
US11783128B2 (en)2020-02-192023-10-10Intuit Inc.Financial document text conversion to computer readable operations
US20210294851A1 (en)*2020-03-232021-09-23UiPath, Inc.System and method for data augmentation for document understanding
US11829661B2 (en)2020-04-212023-11-28Hewlett-Packard Development Company, L.P.Media feed rate adjustments
US11335108B2 (en)2020-08-102022-05-17Marlabs IncorporatedSystem and method to recognise characters from an image
US12354022B2 (en)*2020-11-122025-07-08Samsung Electronics Co., Ltd.On-device knowledge extraction from visually rich documents
RU2764705C1 (en)2020-12-222022-01-19Общество с ограниченной ответственностью «Аби Продакшн»Extraction of multiple documents from a single image
US11930153B2 (en)2021-01-082024-03-12Hewlett-Packard Development Company, L.P.Feature extractions to optimize scanned images
US11704352B2 (en)2021-05-032023-07-18Bank Of America CorporationAutomated categorization and assembly of low-quality images into electronic documents
US11798258B2 (en)2021-05-032023-10-24Bank Of America CorporationAutomated categorization and assembly of low-quality images into electronic documents
IT202100016208A1 (en)*2021-06-212022-12-21Witit S R L Start Up Costituita A Norma Dellarticolo 4 Comma 10 Bis Del Decreto Legge 24 Gennaio 201 Method and system for the digital acquisition of paper documents
US12046011B2 (en)*2021-06-222024-07-23Docusign, Inc.Machine learning-based document splitting and labeling in an electronic document system
US11830267B2 (en)2021-08-272023-11-28Optum, Inc.Techniques for digital document analysis using document image fingerprinting
US11881041B2 (en)2021-09-022024-01-23Bank Of America CorporationAutomated categorization and processing of document images of varying degrees of quality
WO2023081795A1 (en)2021-11-052023-05-11nference, inc.Method and system for determining relationships between linguistic entities
US11361034B1 (en)2021-11-302022-06-14Icertis, Inc.Representing documents using document keys
GB2634464A (en)*2022-07-282025-04-09Wisedocs IncSystem and method for automated file reporting

Citations (35)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US5642288A (en)*1994-11-101997-06-24Documagix, IncorporatedIntelligent document recognition and handling
US5680478A (en)*1992-04-241997-10-21Canon Kabushiki KaishaMethod and apparatus for character recognition
US5778103A (en)*1992-10-191998-07-07TmssequoiaOCR image pre-processor
US5943669A (en)*1996-11-251999-08-24Fuji Xerox Co., Ltd.Document retrieval device
US5995665A (en)*1995-05-311999-11-30Canon Kabushiki KaishaImage processing apparatus and method
US5999664A (en)*1997-11-141999-12-07Xerox CorporationSystem for searching a corpus of document images by user specified document layout components
US6006226A (en)*1997-09-241999-12-21Ricoh Company LimitedMethod and system for document image feature extraction
US6094653A (en)*1996-12-252000-07-25Nec CorporationDocument classification method and apparatus therefor
US6101515A (en)*1996-05-312000-08-08Oracle CorporationLearning system for classification of terminology
US6243501B1 (en)*1998-05-202001-06-05Canon Kabushiki KaishaAdaptive recognition of documents using layout attributes
US20020022956A1 (en)*2000-05-252002-02-21Igor UkrainczykSystem and method for automatically classifying text
US6393150B1 (en)*1998-12-042002-05-21Eastman Kodak CompanyRegion-based image binarization system
US20030226100A1 (en)*2002-05-172003-12-04Xerox CorporationSystems and methods for authoritativeness grading, estimation and sorting of documents in large heterogeneous document collections
US6823331B1 (en)*2000-08-282004-11-23Entrust LimitedConcept identification system and method for use in reducing and/or representing text content of an electronic document
US20050060643A1 (en)*2003-08-252005-03-17Miavia, Inc.Document similarity detection and classification system
US20050117803A1 (en)*2003-11-282005-06-02Canon Kabushiki KaishaDocument recognition device, document recognition method and program, and storage medium
US6943905B2 (en)*2001-12-202005-09-13Sharp Laboratories Of America, Inc.Virtual print driver system and method
US6947933B2 (en)*2003-01-232005-09-20Verdasys, Inc.Identifying similarities within large collections of unstructured data
US20050244060A1 (en)*2004-04-302005-11-03Xerox CorporationReformatting binary image data to generate smaller compressed image data size
US6976207B1 (en)*1999-04-282005-12-13Ser Solutions, Inc.Classification method and apparatus
US20060036649A1 (en)*2004-08-122006-02-16Simske Steven JIndex extraction from documents
US7039856B2 (en)*1998-09-302006-05-02Ricoh Co., Ltd.Automatic document classification using text and images
US20060190489A1 (en)*2005-02-232006-08-24Janet VohariwattSystem and method for electronically processing document images
US7190477B2 (en)*2001-02-222007-03-13Sharp Laboratories Of America, Inc.System and method for managing and processing a print job using print job tickets
US7194471B1 (en)*1998-04-102007-03-20Ricoh Company, Ltd.Document classification system and method for classifying a document according to contents of the document
US20070118391A1 (en)*2005-10-242007-05-24Capsilon Fsg, Inc.Business Method Using The Automated Processing of Paper and Unstructured Electronic Documents
US20070203885A1 (en)*2006-02-282007-08-30Korea Advanced Institute Of Science & TechnologyDocument Classification Method, and Computer Readable Record Medium Having Program for Executing Document Classification Method By Computer
US20070201764A1 (en)*2006-02-272007-08-30Samsung Electronics Co., Ltd.Apparatus and method for detecting key caption from moving picture to provide customized broadcast service
US20070211964A1 (en)*2006-03-092007-09-13Gad AgamImage-based indexing and classification in image databases
US20070247531A1 (en)*2006-04-192007-10-25Yining DengMethod and system to reduce flicker artifacts in captured images
US20080062472A1 (en)*2006-09-122008-03-13Morgan StanleyDocument handling
US20090116755A1 (en)*2007-11-062009-05-07Copanion, Inc.Systems and methods for enabling manual classification of unrecognized documents to complete workflow for electronic jobs and to assist machine learning of a recognition system using automatically extracted features of unrecognized documents
US7623712B2 (en)*2005-06-092009-11-24Canon Kabushiki KaishaImage processing method and apparatus
US7783117B2 (en)*2005-08-122010-08-24Seiko Epson CorporationSystems and methods for generating background and foreground images for document compression
US7797260B2 (en)*2008-02-112010-09-14Yahoo! Inc.Automated document classifier tuning including training set adaptive to user browsing behavior

Family Cites Families (6)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US116756A (en)*1871-07-04Improvement in spring vehicles
US119296A (en)*1871-09-26Improvement in whip-stocks
US116736A (en)*1871-07-04Improvement in weeding-tools
US116746A (en)*1871-07-04Improvement in sleeping-cars
US116757A (en)*1871-07-04Improvement in book-binding apparatus
US5778106A (en)*1996-03-141998-07-07Polaroid CorporationElectronic camera with reduced color artifacts

Patent Citations (41)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US5680478A (en)*1992-04-241997-10-21Canon Kabushiki KaishaMethod and apparatus for character recognition
US5778103A (en)*1992-10-191998-07-07TmssequoiaOCR image pre-processor
US5642288A (en)*1994-11-101997-06-24Documagix, IncorporatedIntelligent document recognition and handling
US5995665A (en)*1995-05-311999-11-30Canon Kabushiki KaishaImage processing apparatus and method
US6101515A (en)*1996-05-312000-08-08Oracle CorporationLearning system for classification of terminology
US5943669A (en)*1996-11-251999-08-24Fuji Xerox Co., Ltd.Document retrieval device
US6094653A (en)*1996-12-252000-07-25Nec CorporationDocument classification method and apparatus therefor
US6006226A (en)*1997-09-241999-12-21Ricoh Company LimitedMethod and system for document image feature extraction
US5999664A (en)*1997-11-141999-12-07Xerox CorporationSystem for searching a corpus of document images by user specified document layout components
US7194471B1 (en)*1998-04-102007-03-20Ricoh Company, Ltd.Document classification system and method for classifying a document according to contents of the document
US6243501B1 (en)*1998-05-202001-06-05Canon Kabushiki KaishaAdaptive recognition of documents using layout attributes
US7039856B2 (en)*1998-09-302006-05-02Ricoh Co., Ltd.Automatic document classification using text and images
US6393150B1 (en)*1998-12-042002-05-21Eastman Kodak CompanyRegion-based image binarization system
US6976207B1 (en)*1999-04-282005-12-13Ser Solutions, Inc.Classification method and apparatus
US20020022956A1 (en)*2000-05-252002-02-21Igor UkrainczykSystem and method for automatically classifying text
US6823331B1 (en)*2000-08-282004-11-23Entrust LimitedConcept identification system and method for use in reducing and/or representing text content of an electronic document
US7190477B2 (en)*2001-02-222007-03-13Sharp Laboratories Of America, Inc.System and method for managing and processing a print job using print job tickets
US6943905B2 (en)*2001-12-202005-09-13Sharp Laboratories Of America, Inc.Virtual print driver system and method
US20030226100A1 (en)*2002-05-172003-12-04Xerox CorporationSystems and methods for authoritativeness grading, estimation and sorting of documents in large heterogeneous document collections
US6947933B2 (en)*2003-01-232005-09-20Verdasys, Inc.Identifying similarities within large collections of unstructured data
US20050060643A1 (en)*2003-08-252005-03-17Miavia, Inc.Document similarity detection and classification system
US20050117803A1 (en)*2003-11-282005-06-02Canon Kabushiki KaishaDocument recognition device, document recognition method and program, and storage medium
US20050244060A1 (en)*2004-04-302005-11-03Xerox CorporationReformatting binary image data to generate smaller compressed image data size
US20060036649A1 (en)*2004-08-122006-02-16Simske Steven JIndex extraction from documents
US20060190489A1 (en)*2005-02-232006-08-24Janet VohariwattSystem and method for electronically processing document images
US7623712B2 (en)*2005-06-092009-11-24Canon Kabushiki KaishaImage processing method and apparatus
US7783117B2 (en)*2005-08-122010-08-24Seiko Epson CorporationSystems and methods for generating background and foreground images for document compression
US20070118391A1 (en)*2005-10-242007-05-24Capsilon Fsg, Inc.Business Method Using The Automated Processing of Paper and Unstructured Electronic Documents
US7747495B2 (en)*2005-10-242010-06-29Capsilon CorporationBusiness method using the automated processing of paper and unstructured electronic documents
US20070201764A1 (en)*2006-02-272007-08-30Samsung Electronics Co., Ltd.Apparatus and method for detecting key caption from moving picture to provide customized broadcast service
US20070203885A1 (en)*2006-02-282007-08-30Korea Advanced Institute Of Science & TechnologyDocument Classification Method, and Computer Readable Record Medium Having Program for Executing Document Classification Method By Computer
US20070211964A1 (en)*2006-03-092007-09-13Gad AgamImage-based indexing and classification in image databases
US7787711B2 (en)*2006-03-092010-08-31Illinois Institute Of TechnologyImage-based indexing and classification in image databases
US20070247531A1 (en)*2006-04-192007-10-25Yining DengMethod and system to reduce flicker artifacts in captured images
US20080062472A1 (en)*2006-09-122008-03-13Morgan StanleyDocument handling
US20090116756A1 (en)*2007-11-062009-05-07Copanion, Inc.Systems and methods for training a document classification system using documents from a plurality of users
US20090116736A1 (en)*2007-11-062009-05-07Copanion, Inc.Systems and methods to automatically classify electronic documents using extracted image and text features and using a machine learning subsystem
US20090119296A1 (en)*2007-11-062009-05-07Copanion, Inc.Systems and methods for handling and distinguishing binarized, background artifacts in the vicinity of document text and image features indicative of a document category
US20090116746A1 (en)*2007-11-062009-05-07Copanion, Inc.Systems and methods for parallel processing of document recognition and classification using extracted image and text features
US20090116755A1 (en)*2007-11-062009-05-07Copanion, Inc.Systems and methods for enabling manual classification of unrecognized documents to complete workflow for electronic jobs and to assist machine learning of a recognition system using automatically extracted features of unrecognized documents
US7797260B2 (en)*2008-02-112010-09-14Yahoo! Inc.Automated document classifier tuning including training set adaptive to user browsing behavior

Cited By (30)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US8538184B2 (en)2007-11-062013-09-17Gruntworx, LlcSystems and methods for handling and distinguishing binarized, background artifacts in the vicinity of document text and image features indicative of a document category
US20090116736A1 (en)*2007-11-062009-05-07Copanion, Inc.Systems and methods to automatically classify electronic documents using extracted image and text features and using a machine learning subsystem
US20090116756A1 (en)*2007-11-062009-05-07Copanion, Inc.Systems and methods for training a document classification system using documents from a plurality of users
US20090119296A1 (en)*2007-11-062009-05-07Copanion, Inc.Systems and methods for handling and distinguishing binarized, background artifacts in the vicinity of document text and image features indicative of a document category
US20090116746A1 (en)*2007-11-062009-05-07Copanion, Inc.Systems and methods for parallel processing of document recognition and classification using extracted image and text features
US10084846B2 (en)*2009-05-282018-09-25Ai Visualize, Inc.Method and system for fast access to advanced visualization of medical scans using a dedicated web portal
US12148533B2 (en)*2009-05-282024-11-19Ai Visualize, Inc.Method and system for fast access to advanced visualization of medical scans using a dedicated web portal
US8701167B2 (en)*2009-05-282014-04-15Kjaya, LlcMethod and system for fast access to advanced visualization of medical scans using a dedicated web portal
US9106609B2 (en)2009-05-282015-08-11Kovey KovalanMethod and system for fast access to advanced visualization of medical scans using a dedicated web portal
US10930397B2 (en)*2009-05-282021-02-23Al Visualize, Inc.Method and system for fast access to advanced visualization of medical scans using a dedicated web portal
US20160373514A1 (en)*2009-05-282016-12-22Kovey KovalanMethod and system for fast access to advanced visualization of medical scans using a dedicated web portal
US9749389B2 (en)*2009-05-282017-08-29Ai Visualize, Inc.Method and system for fast access to advanced visualization of medical scans using a dedicated web portal
US20170374126A1 (en)*2009-05-282017-12-28Ai Visualize, Inc.Method and system for fast access to advanced visualization of medical scans using a dedicated web portal
US10726955B2 (en)*2009-05-282020-07-28Ai Visualize, Inc.Method and system for fast access to advanced visualization of medical scans using a dedicated web portal
US11676721B2 (en)*2009-05-282023-06-13Ai Visualize, Inc.Method and system for fast access to advanced visualization of medical scans using a dedicated web portal
US20210174964A1 (en)*2009-05-282021-06-10Ai Visualize, Inc.Method and system for fast access to advanced visualization of medical scans using a dedicated web portal
US20230260657A1 (en)*2009-05-282023-08-17Ai Visualize, Inc.Method and system for fast access to advanced visualization of medical scans using a dedicated web portal
US20110255790A1 (en)*2010-01-152011-10-20Copanion, Inc.Systems and methods for automatically grouping electronic document pages
US8428375B2 (en)*2010-11-172013-04-23Via Technologies, Inc.System and method for data compression and decompression in a graphics processing system
US12045244B1 (en)2011-11-022024-07-23Autoflie Inc.System and method for automatic document management
US10204143B1 (en)2011-11-022019-02-12Dub Software Group, Inc.System and method for automatic document management
RU2641225C2 (en)*2014-01-212018-01-16Общество с ограниченной ответственностью "Аби Девелопмент"Method of detecting necessity of standard learning for verification of recognized text
CN105469028A (en)*2014-09-292016-04-06株式会社东芝Information processing device, information processing method
US10839302B2 (en)2015-11-242020-11-17The Research Foundation For The State University Of New YorkApproximate value iteration with complex returns by bounding
US12169793B2 (en)2015-11-242024-12-17The Research Foundation For The State University Of New YorkApproximate value iteration with complex returns by bounding
US11238422B1 (en)2016-03-182022-02-01Wells Fargo Bank, N.A.Automatic teller machine game-based transaction functionality
US10685354B1 (en)2016-03-182020-06-16Wells Fargo Bank, N.A.Automatic teller machine game-based authentication functionality
US10600040B1 (en)2016-03-182020-03-24Wells Fargo Bank, N.A.Automatic teller machine game-based transaction functionality
US10482462B1 (en)2016-03-182019-11-19Wells Fargo Bank, N.A.Automatic teller machine game-based authentication functionality
US9881053B2 (en)*2016-05-132018-01-30Maana, Inc.Machine-assisted object matching

Also Published As

Publication numberPublication date
WO2009061917A1 (en)2009-05-14
US20090116746A1 (en)2009-05-07
US20090116736A1 (en)2009-05-07
US20090116755A1 (en)2009-05-07
US8538184B2 (en)2013-09-17
US20090119296A1 (en)2009-05-07
US20090116756A1 (en)2009-05-07

Similar Documents

PublicationPublication DateTitle
US8538184B2 (en)Systems and methods for handling and distinguishing binarized, background artifacts in the vicinity of document text and image features indicative of a document category
US8897563B1 (en)Systems and methods for automatically processing electronic documents
US11501061B2 (en)Extracting structured information from a document containing filled form images
US20110249905A1 (en)Systems and methods for automatically extracting data from electronic documents including tables
US11816165B2 (en)Identification of fields in documents with neural networks without templates
US10621727B1 (en)Label and field identification without optical character recognition (OCR)
JP4698289B2 (en) Low resolution OCR for documents acquired with a camera
Christy et al.Mass digitization of early modern texts with optical character recognition
Al-MaadeedText‐Dependent Writer Identification for Arabic Handwriting
Hussain et al.Deep learning-based recognition system for pashto handwritten text: benchmark on PHTI
JosephAdvanced digital image processing technique based optical character recognition of scanned document
Barrett et al.Digital mountain: From granite archive to global access
MarinerOptical Character Recognition (OCR)
Jabde et al.A systematic review of multilingual numeral recognition systems
Semertzidis et al.Social Media: Trends, Events, and Influential Users
Kumar et al.Optical character recognition using Split Profile Algorithm
CN120375391A (en)Method and device for identifying fund warehouse picture, computer equipment and storage medium
MEHRIÉCOLE DOCTORALE S2IM
Mapari et al.A Study Of Devnagri Handwritten Character Recognition System
Gupta et al.Automated transfer of information from paper documents to computer-accessible media

Legal Events

DateCodeTitleDescription
ASAssignment

Owner name:COPANION, INC., MASSACHUSETTS

Free format text:PROPRIETARY INFORMATION AND INVENTIONS AGREEMENT;ASSIGNORS:NEOGI, DEPANKAR;GOVINDARAJU, VENUGOPAL;LADD, STEVEN;SIGNING DATES FROM 20061002 TO 20110727;REEL/FRAME:027585/0825

ASAssignment

Owner name:GRUNTWORX, LLC, NORTH CAROLINA

Free format text:ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:COPANION, INC.;REEL/FRAME:027685/0352

Effective date:20110707

ASAssignment

Owner name:GRUNTWORX, LLC, NORTH CAROLINA

Free format text:ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:COPANION, INC.;REEL/FRAME:028157/0982

Effective date:20110727

STCBInformation on status: application discontinuation

Free format text:ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION


[8]ページ先頭

©2009-2025 Movatter.jp