Movatterモバイル変換


[0]ホーム

URL:


US20020176628A1 - Document imaging and indexing system - Google Patents

Document imaging and indexing system
Download PDF

Info

Publication number
US20020176628A1
US20020176628A1US09/862,728US86272801AUS2002176628A1US 20020176628 A1US20020176628 A1US 20020176628A1US 86272801 AUS86272801 AUS 86272801AUS 2002176628 A1US2002176628 A1US 2002176628A1
Authority
US
United States
Prior art keywords
text
document
file
files
digitized image
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US09/862,728
Inventor
Gary Starkweather
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Microsoft Technology Licensing LLC
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by IndividualfiledCriticalIndividual
Priority to US09/862,728priorityCriticalpatent/US20020176628A1/en
Assigned to MICROSOFT CORPORATIONreassignmentMICROSOFT CORPORATIONASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS).Assignors: STARKWEATHER, GARY K.
Publication of US20020176628A1publicationCriticalpatent/US20020176628A1/en
Priority to US11/053,079prioritypatent/US8380012B2/en
Assigned to MICROSOFT TECHNOLOGY LICENSING, LLCreassignmentMICROSOFT TECHNOLOGY LICENSING, LLCASSIGNMENT OF ASSIGNORS INTEREST (SEE DOCUMENT FOR DETAILS).Assignors: MICROSOFT CORPORATION
Abandonedlegal-statusCriticalCurrent

Links

Images

Classifications

Definitions

Landscapes

Abstract

A document digitizing method digitizes and automatically indexes documents in printed form. The method includes optically scanning the document, forming and storing a digitized image file from the optically scanned document, optically recognizing characters in the optically scanned document, and forming and storing a text file of the optically recognized characters in document. A retrieval method for retrieving the digitized image file for a document includes searching the text files to identify any having a selected text string and providing access to the digitized image files that correspond to those text files. The digital image file and the text file together represent a digitized document data structure that combines a digital image of a document with a text file of optically recognized characters in the digital image.

Description

Claims (26)

US09/862,7282001-05-222001-05-22Document imaging and indexing systemAbandonedUS20020176628A1 (en)

Priority Applications (2)

Application NumberPriority DateFiling DateTitle
US09/862,728US20020176628A1 (en)2001-05-222001-05-22Document imaging and indexing system
US11/053,079US8380012B2 (en)2001-05-222005-02-08Document imaging and indexing system

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
US09/862,728US20020176628A1 (en)2001-05-222001-05-22Document imaging and indexing system

Related Child Applications (1)

Application NumberTitlePriority DateFiling Date
US11/053,079DivisionUS8380012B2 (en)2001-05-222005-02-08Document imaging and indexing system

Publications (1)

Publication NumberPublication Date
US20020176628A1true US20020176628A1 (en)2002-11-28

Family

ID=25339174

Family Applications (2)

Application NumberTitlePriority DateFiling Date
US09/862,728AbandonedUS20020176628A1 (en)2001-05-222001-05-22Document imaging and indexing system
US11/053,079Expired - Fee RelatedUS8380012B2 (en)2001-05-222005-02-08Document imaging and indexing system

Family Applications After (1)

Application NumberTitlePriority DateFiling Date
US11/053,079Expired - Fee RelatedUS8380012B2 (en)2001-05-222005-02-08Document imaging and indexing system

Country Status (1)

CountryLink
US (2)US20020176628A1 (en)

Cited By (16)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US20030042319A1 (en)*2001-08-312003-03-06Xerox CorporationAutomatic and semi-automatic index generation for raster documents
US20040010757A1 (en)*2002-07-092004-01-15Mccoy Craig G.Method and system to place a scanned document in the body of an email
US20040218205A1 (en)*2003-04-292004-11-04Cory IrwinMethod and system of using a multifunction printer to identify pages having a text string
US20040228512A1 (en)*2003-05-152004-11-18Warren Joel EdwardMethod and system for communicating and matching electronic files for financial transactions
US20050210047A1 (en)*2004-03-182005-09-22Zenodata CorporationPosting data to a database from non-standard documents using document mapping to standard document types
US20050210048A1 (en)*2004-03-182005-09-22Zenodata CorporationAutomated posting systems and methods
US20060170984A1 (en)*2005-02-012006-08-03Canon Kabushiki KaishaData processing apparatus, image processing apparatus, data processing method, image processing method, and programs for implementing the methods
US7099869B1 (en)*2001-07-112006-08-29Apple Computer, Inc.Method and apparatus for managing file extensions in a digital processing system
US20060245005A1 (en)*2005-04-292006-11-02Hall John MSystem for language translation of documents, and methods
US20090323134A1 (en)*2008-06-302009-12-31Kabushiki Kaisha ToshibaApparatus and method for generating segmentation data of a scanned image
CN102819612A (en)*2012-08-292012-12-12北京鼎盾信息科技有限公司Full text search method based on print documents
US20140380253A1 (en)*2012-03-022014-12-25Sony CorporationInformation processing apparatus and method of processing information
US20170061809A1 (en)*2015-01-302017-03-02Xerox CorporationMethod and system for importing hard copy assessments into an automatic educational system assessment
US11023654B2 (en)*2013-12-102021-06-01International Business Machines CorporationAnalyzing document content and generating an appendix
US20210295033A1 (en)*2020-03-182021-09-23Fujifilm Business Innovation Corp.Information processing apparatus and non-transitory computer readable medium
US11294553B2 (en)*2015-08-242022-04-05Evernote CorporationRestoring full online documents from scanned paper fragments

Families Citing this family (58)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US8559764B2 (en)*2004-06-152013-10-15At&T Intellectual Property I, L.P.Editing an image representation of a text
US9171202B2 (en)*2005-08-232015-10-27Ricoh Co., Ltd.Data organization and access for mixed media document system
US8184155B2 (en)*2007-07-112012-05-22Ricoh Co. Ltd.Recognition and tracking using invisible junctions
US7702673B2 (en)*2004-10-012010-04-20Ricoh Co., Ltd.System and methods for creation and use of a mixed media environment
US8600989B2 (en)*2004-10-012013-12-03Ricoh Co., Ltd.Method and system for image matching in a mixed media environment
US8825682B2 (en)2006-07-312014-09-02Ricoh Co., Ltd.Architecture for mixed media reality retrieval of locations and registration of images
US8369655B2 (en)*2006-07-312013-02-05Ricoh Co., Ltd.Mixed media reality recognition using multiple specialized indexes
US8086038B2 (en)*2007-07-112011-12-27Ricoh Co., Ltd.Invisible junction features for patch recognition
US9373029B2 (en)*2007-07-112016-06-21Ricoh Co., Ltd.Invisible junction feature recognition for document security or annotation
US8195659B2 (en)*2005-08-232012-06-05Ricoh Co. Ltd.Integration and use of mixed media documents
US9405751B2 (en)*2005-08-232016-08-02Ricoh Co., Ltd.Database for mixed media document system
US8276088B2 (en)2007-07-112012-09-25Ricoh Co., Ltd.User interface for three-dimensional navigation
US8332401B2 (en)*2004-10-012012-12-11Ricoh Co., LtdMethod and system for position-based image matching in a mixed media environment
US7669148B2 (en)*2005-08-232010-02-23Ricoh Co., Ltd.System and methods for portable device for mixed media system
US7991778B2 (en)*2005-08-232011-08-02Ricoh Co., Ltd.Triggering actions with captured input in a mixed media environment
US7639387B2 (en)*2005-08-232009-12-29Ricoh Co., Ltd.Authoring tools using a mixed media environment
US7587412B2 (en)*2005-08-232009-09-08Ricoh Company, Ltd.Mixed media reality brokerage network and methods of use
US8838591B2 (en)*2005-08-232014-09-16Ricoh Co., Ltd.Embedding hot spots in electronic documents
US8949287B2 (en)*2005-08-232015-02-03Ricoh Co., Ltd.Embedding hot spots in imaged documents
US8856108B2 (en)*2006-07-312014-10-07Ricoh Co., Ltd.Combining results of image retrieval processes
US8144921B2 (en)2007-07-112012-03-27Ricoh Co., Ltd.Information retrieval using invisible junctions and geometric constraints
US9530050B1 (en)2007-07-112016-12-27Ricoh Co., Ltd.Document annotation sharing
US8176054B2 (en)2007-07-122012-05-08Ricoh Co. LtdRetrieving electronic documents by converting them to synthetic text
US7812986B2 (en)2005-08-232010-10-12Ricoh Co. Ltd.System and methods for use of voice mail and email in a mixed media environment
US8005831B2 (en)*2005-08-232011-08-23Ricoh Co., Ltd.System and methods for creation and use of a mixed media environment with geographic location information
US7970171B2 (en)*2007-01-182011-06-28Ricoh Co., Ltd.Synthetic image and video generation from ground truth data
US8156427B2 (en)*2005-08-232012-04-10Ricoh Co. Ltd.User interface for mixed media reality
US8521737B2 (en)*2004-10-012013-08-27Ricoh Co., Ltd.Method and system for multi-tier image matching in a mixed media environment
US8385589B2 (en)*2008-05-152013-02-26Berna ErolWeb-based content detection in images, extraction and recognition
US7917554B2 (en)*2005-08-232011-03-29Ricoh Co. Ltd.Visibly-perceptible hot spots in documents
US7551780B2 (en)*2005-08-232009-06-23Ricoh Co., Ltd.System and method for using individualized mixed document
US9384619B2 (en)*2006-07-312016-07-05Ricoh Co., Ltd.Searching media content for objects specified using identifiers
US7920759B2 (en)*2005-08-232011-04-05Ricoh Co. Ltd.Triggering applications for distributed action execution and use of mixed media recognition as a control input
US8868555B2 (en)2006-07-312014-10-21Ricoh Co., Ltd.Computation of a recongnizability score (quality predictor) for image retrieval
US8989431B1 (en)2007-07-112015-03-24Ricoh Co., Ltd.Ad hoc paper-based networking with mixed media reality
US8510283B2 (en)*2006-07-312013-08-13Ricoh Co., Ltd.Automatic adaption of an image recognition system to image capture devices
US8156116B2 (en)2006-07-312012-04-10Ricoh Co., LtdDynamic presentation of targeted information in a mixed media reality recognition system
US8335789B2 (en)*2004-10-012012-12-18Ricoh Co., Ltd.Method and system for document fingerprint matching in a mixed media environment
US7672543B2 (en)*2005-08-232010-03-02Ricoh Co., Ltd.Triggering applications based on a captured text in a mixed media environment
US7885955B2 (en)*2005-08-232011-02-08Ricoh Co. Ltd.Shared document annotation
US7773822B2 (en)*2005-05-022010-08-10Colormax, Inc.Apparatus and methods for management of electronic images
US7769772B2 (en)2005-08-232010-08-03Ricoh Co., Ltd.Mixed media reality brokerage network with layout-independent recognition
US7567267B2 (en)*2006-07-312009-07-28Hewlett-Packard Development Company, L.P.System and method for calibrating a beam array of a printer
US8073263B2 (en)*2006-07-312011-12-06Ricoh Co., Ltd.Multi-classifier selection and monitoring for MMR-based image recognition
US9176984B2 (en)2006-07-312015-11-03Ricoh Co., LtdMixed media reality retrieval of differentially-weighted links
US8676810B2 (en)*2006-07-312014-03-18Ricoh Co., Ltd.Multiple index mixed media reality recognition using unequal priority indexes
US8489987B2 (en)2006-07-312013-07-16Ricoh Co., Ltd.Monitoring and analyzing creation and usage of visual content using image and hotspot interaction
US9020966B2 (en)*2006-07-312015-04-28Ricoh Co., Ltd.Client device for interacting with a mixed media reality recognition system
US8201076B2 (en)2006-07-312012-06-12Ricoh Co., Ltd.Capturing symbolic information from documents upon printing
US9063952B2 (en)*2006-07-312015-06-23Ricoh Co., Ltd.Mixed media reality recognition with image tracking
JP5415736B2 (en)*2008-10-012014-02-12キヤノン株式会社 Document processing system, control method therefor, program, and storage medium
JP5173721B2 (en)*2008-10-012013-04-03キヤノン株式会社 Document processing system, control method therefor, program, and storage medium
US8385660B2 (en)*2009-06-242013-02-26Ricoh Co., Ltd.Mixed media reality indexing and retrieval for repeated content
US9058331B2 (en)2011-07-272015-06-16Ricoh Co., Ltd.Generating a conversation in a social network based on visual search results
EP3329354A4 (en)*2015-07-312019-03-20WiseTech Global Limited METHODS AND SYSTEMS FOR CREATING CONFIGURABLE FORMS, CONFIGURING FORMS AND FOR FORMULAR AND FORMULA FLOW CORRELATION
WO2018218032A1 (en)2017-05-242018-11-29Taco Marketing LlcConsumer purchasing and inventory control assistant apparatus, system and methods
US12148022B2 (en)2019-07-182024-11-19Taco Marketing LlcConsumer purchasing and inventory control assistant apparatus, system and methods
WO2022241241A1 (en)*2021-05-142022-11-17Taco Marketing LlcConsumer purchasing and inventory control assistant apparatus, system and methods

Citations (14)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US5133024A (en)*1989-10-241992-07-21Horst FroesslImage data bank system with selective conversion
US5168565A (en)*1988-01-201992-12-01Ricoh Company, Ltd.Document retrieval system
US5623679A (en)*1993-11-191997-04-22Waverley Holdings, Inc.System and method for creating and manipulating notes each containing multiple sub-notes, and linking the sub-notes to portions of data objects
US5765176A (en)*1996-09-061998-06-09Xerox CorporationPerforming document image management tasks using an iconic image having embedded encoded information
US5825943A (en)*1993-05-071998-10-20Canon Inc.Selective document retrieval method and system
US6182090B1 (en)*1995-04-282001-01-30Ricoh Company, Ltd.Method and apparatus for pointing to documents electronically using features extracted from a scanned icon representing a destination
US20010041021A1 (en)*2000-02-042001-11-15Boyle Dennis J.System and method for synchronization of image data between a handheld device and a computer
US6341176B1 (en)*1996-11-202002-01-22Matsushita Electric Industrial Co., Ltd.Method and apparatus for character recognition
US6389163B1 (en)*1994-11-182002-05-14Xerox CorporationMethod and apparatus for automatic image segmentation using template matching filters
US6480838B1 (en)*1998-04-012002-11-12William PetermanSystem and method for searching electronic documents created with optical character recognition
US6687404B1 (en)*1997-06-202004-02-03Xerox CorporationAutomatic training of layout parameters in a 2D image model
US6704465B2 (en)*1998-03-122004-03-09Canon Kabushiki KaishaImage processing apparatus and its processing method, storage medium, and image file format
US6765559B2 (en)*2000-03-212004-07-20Nec CorporationPage information display method and device and storage medium storing program for displaying page information
US6836565B1 (en)*1998-10-292004-12-28Canon Kabushiki KaishaImage processing apparatus and method, and recording medium

Family Cites Families (5)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US5819261A (en)*1995-03-281998-10-06Canon Kabushiki KaishaMethod and apparatus for extracting a keyword from scheduling data using the keyword for searching the schedule data file
US5873076A (en)*1995-09-151999-02-16Infonautics CorporationArchitecture for processing search queries, retrieving documents identified thereby, and method for using same
US5987459A (en)*1996-03-151999-11-16Regents Of The University Of MinnesotaImage and document management system for content-based retrieval
US6208988B1 (en)*1998-06-012001-03-27Bigchalk.Com, Inc.Method for identifying themes associated with a search query using metadata and for organizing documents responsive to the search query in accordance with the themes
US6466952B2 (en)*1999-04-082002-10-15Hewlett-Packard CompanyMethod for transferring and indexing data from old media to new media

Patent Citations (14)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US5168565A (en)*1988-01-201992-12-01Ricoh Company, Ltd.Document retrieval system
US5133024A (en)*1989-10-241992-07-21Horst FroesslImage data bank system with selective conversion
US5825943A (en)*1993-05-071998-10-20Canon Inc.Selective document retrieval method and system
US5623679A (en)*1993-11-191997-04-22Waverley Holdings, Inc.System and method for creating and manipulating notes each containing multiple sub-notes, and linking the sub-notes to portions of data objects
US6389163B1 (en)*1994-11-182002-05-14Xerox CorporationMethod and apparatus for automatic image segmentation using template matching filters
US6182090B1 (en)*1995-04-282001-01-30Ricoh Company, Ltd.Method and apparatus for pointing to documents electronically using features extracted from a scanned icon representing a destination
US5765176A (en)*1996-09-061998-06-09Xerox CorporationPerforming document image management tasks using an iconic image having embedded encoded information
US6341176B1 (en)*1996-11-202002-01-22Matsushita Electric Industrial Co., Ltd.Method and apparatus for character recognition
US6687404B1 (en)*1997-06-202004-02-03Xerox CorporationAutomatic training of layout parameters in a 2D image model
US6704465B2 (en)*1998-03-122004-03-09Canon Kabushiki KaishaImage processing apparatus and its processing method, storage medium, and image file format
US6480838B1 (en)*1998-04-012002-11-12William PetermanSystem and method for searching electronic documents created with optical character recognition
US6836565B1 (en)*1998-10-292004-12-28Canon Kabushiki KaishaImage processing apparatus and method, and recording medium
US20010041021A1 (en)*2000-02-042001-11-15Boyle Dennis J.System and method for synchronization of image data between a handheld device and a computer
US6765559B2 (en)*2000-03-212004-07-20Nec CorporationPage information display method and device and storage medium storing program for displaying page information

Cited By (28)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US7099869B1 (en)*2001-07-112006-08-29Apple Computer, Inc.Method and apparatus for managing file extensions in a digital processing system
US8190656B2 (en)2001-07-112012-05-29Apple Inc.Method and apparatus for managing file extensions in a digital processing system
US7792881B2 (en)2001-07-112010-09-07Apple Inc.Method and apparatus for managing file extensions in a digital processing system
US20060265419A1 (en)*2001-07-112006-11-23Scott ForstallMethod and apparatus for managing file extensions in a digital processing system
US20030042319A1 (en)*2001-08-312003-03-06Xerox CorporationAutomatic and semi-automatic index generation for raster documents
US20040010757A1 (en)*2002-07-092004-01-15Mccoy Craig G.Method and system to place a scanned document in the body of an email
US20040218205A1 (en)*2003-04-292004-11-04Cory IrwinMethod and system of using a multifunction printer to identify pages having a text string
US7391527B2 (en)*2003-04-292008-06-24Hewlett-Packard Development Company, L.P.Method and system of using a multifunction printer to identify pages having a text string
US7653234B2 (en)2003-05-152010-01-26Federal Reserve Bank Of AtlantaMethod for communicating and matching electronic files for financial transactions
US20040228512A1 (en)*2003-05-152004-11-18Warren Joel EdwardMethod and system for communicating and matching electronic files for financial transactions
US20060140469A1 (en)*2003-05-152006-06-29Warren Joel EMethod for communicating and matching electronic files for financial transactions
US6990224B2 (en)*2003-05-152006-01-24Federal Reserve Bank Of AtlantaMethod and system for communicating and matching electronic files for financial transactions
US20050210048A1 (en)*2004-03-182005-09-22Zenodata CorporationAutomated posting systems and methods
US20050210047A1 (en)*2004-03-182005-09-22Zenodata CorporationPosting data to a database from non-standard documents using document mapping to standard document types
US20060170984A1 (en)*2005-02-012006-08-03Canon Kabushiki KaishaData processing apparatus, image processing apparatus, data processing method, image processing method, and programs for implementing the methods
US7787158B2 (en)*2005-02-012010-08-31Canon Kabushiki KaishaData processing apparatus, image processing apparatus, data processing method, image processing method, and programs for implementing the methods
US20060245005A1 (en)*2005-04-292006-11-02Hall John MSystem for language translation of documents, and methods
US20090323134A1 (en)*2008-06-302009-12-31Kabushiki Kaisha ToshibaApparatus and method for generating segmentation data of a scanned image
US20140380253A1 (en)*2012-03-022014-12-25Sony CorporationInformation processing apparatus and method of processing information
US10198175B2 (en)*2012-03-022019-02-05Sony CorporationInformation processing apparatus for recognizing an inputted character based on coordinate data series
CN102819612A (en)*2012-08-292012-12-12北京鼎盾信息科技有限公司Full text search method based on print documents
US11023654B2 (en)*2013-12-102021-06-01International Business Machines CorporationAnalyzing document content and generating an appendix
US20170061809A1 (en)*2015-01-302017-03-02Xerox CorporationMethod and system for importing hard copy assessments into an automatic educational system assessment
US11294553B2 (en)*2015-08-242022-04-05Evernote CorporationRestoring full online documents from scanned paper fragments
US20220229543A1 (en)*2015-08-242022-07-21Evernote CorporationRestoring full online documents from scanned paper fragments
US11620038B2 (en)*2015-08-242023-04-04Evernote CorporationRestoring full online documents from scanned paper fragments
US11995299B2 (en)*2015-08-242024-05-28Bending Spoons S.P.A.Restoring full online documents from scanned paper fragments
US20210295033A1 (en)*2020-03-182021-09-23Fujifilm Business Innovation Corp.Information processing apparatus and non-transitory computer readable medium

Also Published As

Publication numberPublication date
US8380012B2 (en)2013-02-19
US20050160115A1 (en)2005-07-21

Similar Documents

PublicationPublication DateTitle
US8380012B2 (en)Document imaging and indexing system
US6263121B1 (en)Archival and retrieval of similar documents
US5706365A (en)System and method for portable document indexing using n-gram word decomposition
US5893908A (en)Document management system
JP4260790B2 (en) Filing / retrieval apparatus and filing / retrieval method
US20060085442A1 (en)Document image information management apparatus and document image information management program
US10114821B2 (en)Method and system to access to electronic business documents
JPH0683879A (en)Method and device for labelling document for preservation, handling and introduction
AU2008205134B2 (en)A document management system
JPH08305616A (en)Data management system
JPH11120202A (en) Integrated document management system, integrated document management method, and computer-readable recording medium storing a program for causing a computer to execute the method
US20070214177A1 (en)Document management system, program and method
KR100459832B1 (en) Systems and methods for indexing portable documents using the N-GRAMWORD decomposition principle
JP2005202714A (en) Document search system
JP4288761B2 (en) Mail transmitting apparatus and program storage medium thereof
JP4135659B2 (en) Format conversion device and file search device
JP2008234078A (en) Information processing apparatus, information processing method, information processing program, and recording medium on which information processing program is recorded
JPH07239854A (en) Image file system
JPH0934903A (en) File search device
JP2000020549A (en)Device for assisting input to document database system
JPH10254752A (en) Electronic filing system
US20050094188A1 (en)Image transmission device and transmission data management system
JPH08263512A (en) Document search device
JP3998201B2 (en) Document search method
JP2004078343A (en)Document management system

Legal Events

DateCodeTitleDescription
ASAssignment

Owner name:MICROSOFT CORPORATION, WASHINGTON

Free format text:ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:STARKWEATHER, GARY K.;REEL/FRAME:011840/0027

Effective date:20010518

STCBInformation on status: application discontinuation

Free format text:ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION

ASAssignment

Owner name:MICROSOFT TECHNOLOGY LICENSING, LLC, WASHINGTON

Free format text:ASSIGNMENT OF ASSIGNORS INTEREST;ASSIGNOR:MICROSOFT CORPORATION;REEL/FRAME:034766/0001

Effective date:20141014


[8]ページ先頭

©2009-2025 Movatter.jp