Movatterモバイル変換


[0]ホーム

URL:


US20060129843A1 - Method and apparatus for electronically extracting application specific multidimensional information from documents selected from a set of documents electronically extracted from a library of electronically searchable documents - Google Patents

Method and apparatus for electronically extracting application specific multidimensional information from documents selected from a set of documents electronically extracted from a library of electronically searchable documents
Download PDF

Info

Publication number
US20060129843A1
US20060129843A1US11/198,798US19879805AUS2006129843A1US 20060129843 A1US20060129843 A1US 20060129843A1US 19879805 AUS19879805 AUS 19879805AUS 2006129843 A1US2006129843 A1US 2006129843A1
Authority
US
United States
Prior art keywords
information
event
application specific
documents
web
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US11/198,798
Inventor
Narayan Srinivasa
Swarup Medasani
Yuri Owechko
Deepak Khosla
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by IndividualfiledCriticalIndividual
Priority to US11/198,798priorityCriticalpatent/US20060129843A1/en
Publication of US20060129843A1publicationCriticalpatent/US20060129843A1/en
Abandonedlegal-statusCriticalCurrent

Links

Images

Classifications

Definitions

Landscapes

Abstract

An apparatus and method is disclosed for providing application specific multi-dimensional information to an application running on a user computing device, wherein at least one dimension of the information is a category, from a plurality of member documents electronically extracted from a library of electronically searchable documents, which may comprise an application specific multidimensional information extractor adapted to extract occurrences of prospective representations of dimensions of application specific multidimensional information from the member documents, and to extract occurrences of non-application specific multidimensional information from the member documents; and, an encoder adapted to encode the occurrences of prospective dimensions of application specific multidimensional information and non-application specific multidimensional information contained in member documents according to a dimension specific coded representation of each dimension of application specific multidimensional information and a non-application specific coded representation of each non-application specific multidimensional information element. The apparatus and method may further comprise a member document identifier adapted to determine whether a member document contains coded formatting, and if not, whether the member document is a dense document, and if not, for rejecting the document from further processing, and the coded formatting may comprise network markup language coding. The apparatus and method may further comprise an application specific multidimensional information verification unit adapted verify the extraction of application specific multi-dimensional information from the member documents, and may further comprise a database for storing the application specific multi-dimensional information adapted to provide an application running on a user computing device access to the application specific multidimensional information. The application specific multidimensional information may be scheduled events having the dimensions of time, location and event identity, and the application running on the user computer can be an electronic calendar or other similar scheduling software program.

Description

Claims (5)

US11/198,7982001-12-192005-08-05Method and apparatus for electronically extracting application specific multidimensional information from documents selected from a set of documents electronically extracted from a library of electronically searchable documentsAbandonedUS20060129843A1 (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
US11/198,798US20060129843A1 (en)2001-12-192005-08-05Method and apparatus for electronically extracting application specific multidimensional information from documents selected from a set of documents electronically extracted from a library of electronically searchable documents

Applications Claiming Priority (2)

Application NumberPriority DateFiling DateTitle
US10/026,065US6965900B2 (en)2001-12-192001-12-19Method and apparatus for electronically extracting application specific multidimensional information from documents selected from a set of documents electronically extracted from a library of electronically searchable documents
US11/198,798US20060129843A1 (en)2001-12-192005-08-05Method and apparatus for electronically extracting application specific multidimensional information from documents selected from a set of documents electronically extracted from a library of electronically searchable documents

Related Parent Applications (1)

Application NumberTitlePriority DateFiling Date
US10/026,065ContinuationUS6965900B2 (en)2001-12-192001-12-19Method and apparatus for electronically extracting application specific multidimensional information from documents selected from a set of documents electronically extracted from a library of electronically searchable documents

Publications (1)

Publication NumberPublication Date
US20060129843A1true US20060129843A1 (en)2006-06-15

Family

ID=21829685

Family Applications (2)

Application NumberTitlePriority DateFiling Date
US10/026,065Expired - Fee RelatedUS6965900B2 (en)2001-12-192001-12-19Method and apparatus for electronically extracting application specific multidimensional information from documents selected from a set of documents electronically extracted from a library of electronically searchable documents
US11/198,798AbandonedUS20060129843A1 (en)2001-12-192005-08-05Method and apparatus for electronically extracting application specific multidimensional information from documents selected from a set of documents electronically extracted from a library of electronically searchable documents

Family Applications Before (1)

Application NumberTitlePriority DateFiling Date
US10/026,065Expired - Fee RelatedUS6965900B2 (en)2001-12-192001-12-19Method and apparatus for electronically extracting application specific multidimensional information from documents selected from a set of documents electronically extracted from a library of electronically searchable documents

Country Status (1)

CountryLink
US (2)US6965900B2 (en)

Cited By (38)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US20050149858A1 (en)*2003-12-292005-07-07Stern Mia K.System and method for managing documents with expression of dates and/or times
US20070143282A1 (en)*2005-03-312007-06-21Betz Jonathan TAnchor text summarization for corroboration
US20070226321A1 (en)*2006-03-232007-09-27R R Donnelley & Sons CompanyImage based document access and related systems, methods, and devices
US20070250497A1 (en)*2006-04-192007-10-25Apple Computer Inc.Semantic reconstruction
US20080033951A1 (en)*2006-01-202008-02-07Benson Gregory PSystem and method for managing context-rich database
US20080104048A1 (en)*2006-09-152008-05-01Microsoft CorporationTracking Storylines Around a Query
US20080255826A1 (en)*2007-04-162008-10-16Sony CorporationDictionary data generating apparatus, character input apparatus, dictionary data generating method, and character input method
US20080270117A1 (en)*2007-04-242008-10-30Grinblat Zinovy DMethod and system for text compression and decompression
US20090150887A1 (en)*2007-12-052009-06-11Microsoft CorporationProcess Aware Change Management
US20090248666A1 (en)*2008-03-312009-10-01Yahoo! Inc.Information retrieval using dynamic guided navigation
US20100049761A1 (en)*2008-08-212010-02-25Bijal MehtaSearch engine method and system utilizing multiple contexts
US20100250235A1 (en)*2009-03-242010-09-30Microsoft CorporationText analysis using phrase definitions and containers
US7865461B1 (en)*2005-08-302011-01-04At&T Intellectual Property Ii, L.P.System and method for cleansing enterprise data
US20110137908A1 (en)*2006-03-102011-06-09Byron Edward DomAssigning into one set of categories information that has been assigned to other sets of categories
US20110153383A1 (en)*2009-12-172011-06-23International Business Machines CorporationSystem and method for distributed elicitation and aggregation of risk information
US20120117023A1 (en)*2009-04-302012-05-10Damien TrogMethod and device for ontology evolution
US20120136859A1 (en)*2007-07-232012-05-31Farhan ShamsiEntity Type Assignment
US20120185935A1 (en)*2011-01-172012-07-19International Business Machines CorporationImplementing automatic access control list validation using automatic categorization of unstructured text
US20130013291A1 (en)*2011-07-062013-01-10Invertix CorporationSystems and methods for sentence comparison and sentence-based search
US8719260B2 (en)2005-05-312014-05-06Google Inc.Identifying the unifying subject of a set of facts
US8751498B2 (en)2006-10-202014-06-10Google Inc.Finding and disambiguating references to entities on web pages
US8812553B2 (en)*2009-04-302014-08-19Collibra Nv/SaMethod and device for improved ontology engineering
US8812435B1 (en)2007-11-162014-08-19Google Inc.Learning objects and facts from documents
US8825471B2 (en)2005-05-312014-09-02Google Inc.Unsupervised extraction of facts
US20150039368A1 (en)*2013-07-302015-02-05Delonaco LimitedSocial Event Scheduler
US8996470B1 (en)2005-05-312015-03-31Google Inc.System for ensuring the internal consistency of a fact repository
WO2015084759A1 (en)*2013-12-022015-06-11Qbase, LLCSystems and methods for in-memory database search
US9092495B2 (en)2006-01-272015-07-28Google Inc.Automatic object reference identification and linking in a browseable fact repository
US9201931B2 (en)2013-12-022015-12-01Qbase, LLCMethod for obtaining search suggestions from fuzzy score matching and population frequencies
US9208204B2 (en)2013-12-022015-12-08Qbase, LLCSearch suggestions using fuzzy-score matching and entity co-occurrence
US9230041B2 (en)2013-12-022016-01-05Qbase, LLCSearch suggestions of related entities based on co-occurrence and/or fuzzy-score matching
US9298824B1 (en)*2010-07-072016-03-29Symantec CorporationFocused crawling to identify potentially malicious sites using Bayesian URL classification and adaptive priority calculation
US9361317B2 (en)2014-03-042016-06-07Qbase, LLCMethod for entity enrichment of digital content to enable advanced search functionality in content management systems
US9449108B2 (en)*2006-11-072016-09-20At&T Intellectual Property I, L.P.Determining sort order by distance
US9619571B2 (en)2013-12-022017-04-11Qbase, LLCMethod for searching related entities through entity co-occurrence
US9892132B2 (en)2007-03-142018-02-13Google LlcDetermining geographic locations for place names in a fact repository
US9916368B2 (en)2013-12-022018-03-13QBase, Inc.Non-exclusionary search within in-memory databases
US20180159876A1 (en)*2016-12-052018-06-07International Business Machines CorporationConsolidating structured and unstructured security and threat intelligence with knowledge graphs

Families Citing this family (120)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US7024624B2 (en)*2002-01-072006-04-04Kenneth James HintzLexicon-based new idea detector
US7483910B2 (en)*2002-01-112009-01-27International Business Machines CorporationAutomated access to web content based on log analysis
US20030154071A1 (en)*2002-02-112003-08-14Shreve Gregory M.Process for the document management and computer-assisted translation of documents utilizing document corpora constructed by intelligent agents
US8527495B2 (en)*2002-02-192013-09-03International Business Machines CorporationPlug-in parsers for configuring search engine crawler
US7949648B2 (en)*2002-02-262011-05-24Soren Alain MortensenCompiling and accessing subject-specific information from a computer network
EP1485871A2 (en)*2002-02-272004-12-15Michael Rik Frans BrandsA data integration and knowledge management solution
US8260786B2 (en)2002-05-242012-09-04Yahoo! Inc.Method and apparatus for categorizing and presenting documents of a distributed database
US7231395B2 (en)*2002-05-242007-06-12Overture Services, Inc.Method and apparatus for categorizing and presenting documents of a distributed database
JP2004062446A (en)*2002-07-262004-02-26Ibm Japan Ltd Information collection system, application server, information collection method, and program
US7570262B2 (en)*2002-08-082009-08-04Reuters LimitedMethod and system for displaying time-series data and correlated events derived from text mining
US7076484B2 (en)*2002-09-162006-07-11International Business Machines CorporationAutomated research engine
WO2004025490A1 (en)*2002-09-162004-03-25The Trustees Of Columbia University In The City Of New YorkSystem and method for document collection, grouping and summarization
US20060242180A1 (en)*2003-07-232006-10-26Graf James AExtracting data from semi-structured text documents
US8548995B1 (en)*2003-09-102013-10-01Google Inc.Ranking of documents based on analysis of related documents
GB0321213D0 (en)*2003-09-102003-10-08British TelecommDiary management method and system
DE10342594B4 (en)*2003-09-152005-09-15Océ Document Technologies GmbH Method and system for collecting data from a plurality of machine readable documents
US7636919B2 (en)*2003-09-162009-12-22International Business Machines CorporationUser-centric policy creation and enforcement to manage visually notified state changes of disparate applications
DE10345526A1 (en)*2003-09-302005-05-25Océ Document Technologies GmbH Method and system for collecting data from machine-readable documents
US7475021B2 (en)*2003-10-222009-01-06International Business Machines CorporationMethod and storage medium for importing calendar data from a computer screen into a calendar application
US7451131B2 (en)*2003-12-082008-11-11Iac Search & Media, Inc.Methods and systems for providing a response to a query
US7181447B2 (en)*2003-12-082007-02-20Iac Search And Media, Inc.Methods and systems for conceptually organizing and presenting information
US20060230040A1 (en)*2003-12-082006-10-12Andy CurtisMethods and systems for providing a response to a query
US10346620B2 (en)2004-02-062019-07-09Early Warning Service, LLCSystems and methods for authentication of access based on multi-data source information
US20050177542A1 (en)*2004-02-062005-08-11Glen SgambatiAccount-owner verification database
US7363279B2 (en)*2004-04-292008-04-22Microsoft CorporationMethod and system for calculating importance of a block within a display page
US7519621B2 (en)*2004-05-042009-04-14Pagebites, Inc.Extracting information from Web pages
US20060053382A1 (en)*2004-09-032006-03-09Biowisdom LimitedSystem and method for facilitating user interaction with multi-relational ontologies
US20060053174A1 (en)*2004-09-032006-03-09Bio Wisdom LimitedSystem and method for data extraction and management in multi-relational ontology creation
US20060053172A1 (en)*2004-09-032006-03-09Biowisdom LimitedSystem and method for creating, editing, and using multi-relational ontologies
US7496593B2 (en)*2004-09-032009-02-24Biowisdom LimitedCreating a multi-relational ontology having a predetermined structure
US20060053173A1 (en)*2004-09-032006-03-09Biowisdom LimitedSystem and method for support of chemical data within multi-relational ontologies
US7505989B2 (en)*2004-09-032009-03-17Biowisdom LimitedSystem and method for creating customized ontologies
US20060053099A1 (en)*2004-09-032006-03-09Biowisdom LimitedSystem and method for capturing knowledge for integration into one or more multi-relational ontologies
US20060053175A1 (en)*2004-09-032006-03-09Biowisdom LimitedSystem and method for creating, editing, and utilizing one or more rules for multi-relational ontology creation and maintenance
US20060074833A1 (en)*2004-09-032006-04-06Biowisdom LimitedSystem and method for notifying users of changes in multi-relational ontologies
US20060053135A1 (en)*2004-09-032006-03-09Biowisdom LimitedSystem and method for exploring paths between concepts within multi-relational ontologies
US7493333B2 (en)*2004-09-032009-02-17Biowisdom LimitedSystem and method for parsing and/or exporting data from one or more multi-relational ontologies
US20060053171A1 (en)*2004-09-032006-03-09Biowisdom LimitedSystem and method for curating one or more multi-relational ontologies
US7480667B2 (en)*2004-12-242009-01-20Microsoft CorporationSystem and method for using anchor text as training data for classifier-based search systems
US7831438B2 (en)*2004-12-302010-11-09Google Inc.Local item extraction
JP2006236140A (en)*2005-02-252006-09-07Toshiba Corp Information management apparatus, information management method, and information management program
US7587387B2 (en)2005-03-312009-09-08Google Inc.User interface for facts query engine with snippets from information sources that include query terms and answer terms
US8682913B1 (en)2005-03-312014-03-25Google Inc.Corroborating facts extracted from multiple sources
US7461044B2 (en)*2005-04-272008-12-02International Business Machines CorporationIt resource event situation classification and semantics
US7467147B2 (en)*2005-06-012008-12-16Groundspeak, Inc.System and method for facilitating ad hoc compilation of geospatial data for on-line collaboration
US7606816B2 (en)*2005-06-032009-10-20Yahoo! Inc.Record boundary identification and extraction through pattern mining
US20060282270A1 (en)*2005-06-092006-12-14First Data CorporationIdentity verification noise filter systems and methods
US9026511B1 (en)2005-06-292015-05-05Google Inc.Call connection via document browsing
US8109435B2 (en)*2005-07-142012-02-07Early Warning Services, LlcIdentity verification switch
US7483903B2 (en)*2005-08-172009-01-27Yahoo! Inc.Unsupervised learning tool for feature correction
US20070073592A1 (en)*2005-09-282007-03-29Redcarpet, Inc.Method and system for network-based comparision shopping
US20070106548A1 (en)*2005-11-042007-05-10Steven Leonard BrattInternet based calendar system linking all parties relevant to the automated maintenance of scheduled events
WO2007064375A2 (en)*2005-11-302007-06-07Selective, Inc.Selective latent semantic indexing method for information retrieval applications
US20070202481A1 (en)*2006-02-272007-08-30Andrew Smith LewisMethod and apparatus for flexibly and adaptively obtaining personalized study content, and study device including the same
KR100678126B1 (en)*2006-03-242007-02-02삼성전자주식회사 Duplicate Schedule Management Method in Mobile Communication Terminal
US7933890B2 (en)*2006-03-312011-04-26Google Inc.Propagating useful information among related web pages, such as web pages of a website
US7627571B2 (en)*2006-03-312009-12-01Microsoft CorporationExtraction of anchor explanatory text by mining repeated patterns
US20070260586A1 (en)*2006-05-032007-11-08Antonio SavonaSystems and methods for selecting and organizing information using temporal clustering
CN101094194B (en)*2006-06-192010-06-23腾讯科技(深圳)有限公司Method for picking up web information needed by user in web page
US20080034305A1 (en)*2006-08-032008-02-07International Business Machines CorporationMethod for providing flexible selection time components
US8429702B2 (en)2006-09-112013-04-23At&T Intellectual Property I, L.P.Methods and apparatus for selecting and pushing customized electronic media content
US8244694B2 (en)*2006-09-122012-08-14International Business Machines CorporationDynamic schema assembly to accommodate application-specific metadata
KR100849497B1 (en)*2006-09-292008-07-31한국전자통신연구원Method of Protein Name Normalization Using Ontology Mapping
US7725466B2 (en)*2006-10-242010-05-25Tarique MustafaHigh accuracy document information-element vector encoding server
US7873640B2 (en)*2007-03-272011-01-18Adobe Systems IncorporatedSemantic analysis documents to rank terms
US8051372B1 (en)*2007-04-122011-11-01The New York Times CompanySystem and method for automatically detecting and extracting semantically significant text from a HTML document associated with a plurality of HTML documents
US20080281827A1 (en)*2007-05-102008-11-13Microsoft CorporationUsing structured database for webpage information extraction
US20080301120A1 (en)*2007-06-042008-12-04Precipia Systems Inc.Method, apparatus and computer program for managing the processing of extracted data
US7958050B2 (en)*2007-07-022011-06-07Early Warning Services, LlcPayment account monitoring system and method
CN101809574A (en)*2007-09-282010-08-18日本电气株式会社Method for classifying data and device for classifying data
US8825693B2 (en)*2007-12-122014-09-02Trend Micro IncorporatedConditional string search
US7840548B2 (en)*2007-12-272010-11-23Yahoo! Inc.System and method for adding identity to web rank
US7853583B2 (en)*2007-12-272010-12-14Yahoo! Inc.System and method for generating expertise based search results
US8046675B2 (en)*2007-12-282011-10-25Yahoo! Inc.Method of creating graph structure from time-series of attention data
US8005855B2 (en)*2007-12-282011-08-23Microsoft CorporationInterface with scheduling information during defined period
US8583639B2 (en)*2008-02-192013-11-12International Business Machines CorporationMethod and system using machine learning to automatically discover home pages on the internet
US7885944B1 (en)2008-03-282011-02-08Symantec CorporationHigh-accuracy confidential data detection
US10055392B2 (en)*2008-05-122018-08-21Adobe Systems IncorporatedHistory-based archive management
US8843384B2 (en)*2008-07-102014-09-23Avinoam EdenMethod for selecting a spatial allocation
US8180771B2 (en)2008-07-182012-05-15Iac Search & Media, Inc.Search activity eraser
TWI377478B (en)*2008-10-072012-11-21Mitac Int CorpSelf-learning method for keyword based human machine interaction and portable navigation device using the method
US9904681B2 (en)*2009-01-122018-02-27Sri InternationalMethod and apparatus for assembling a set of documents related to a triggering item
CN101876981B (en)*2009-04-292015-09-23阿里巴巴集团控股有限公司A kind of method and device building knowledge base
US20100332531A1 (en)*2009-06-262010-12-30Microsoft CorporationBatched Transfer of Arbitrarily Distributed Data
US20100332550A1 (en)*2009-06-262010-12-30Microsoft CorporationPlatform For Configurable Logging Instrumentation
US20110029516A1 (en)*2009-07-302011-02-03Microsoft CorporationWeb-Used Pattern Insight Platform
US8082247B2 (en)*2009-07-302011-12-20Microsoft CorporationBest-bet recommendations
US8135753B2 (en)*2009-07-302012-03-13Microsoft CorporationDynamic information hierarchies
US8392380B2 (en)*2009-07-302013-03-05Microsoft CorporationLoad-balancing and scaling for analytics data
US8954893B2 (en)*2009-11-062015-02-10Hewlett-Packard Development Company, L.P.Visually representing a hierarchy of category nodes
US9436726B2 (en)2011-06-232016-09-06BCM International Regulatory Analytics LLCSystem, method and computer program product for a behavioral database providing quantitative analysis of cross border policy process and related search capabilities
US8707163B2 (en)*2011-10-042014-04-22Wesley John BoudvilleTransmitting and receiving data via barcodes through a cellphone for privacy and anonymity
CN104137151B (en)*2012-02-202017-03-01三菱电机株式会社Graphic processing data device and graphic data processing system
JP5364184B2 (en)*2012-03-302013-12-11楽天株式会社 Information providing apparatus, information providing method, program, information storage medium, and information providing system
US9495664B2 (en)*2012-12-272016-11-15International Business Machines CorporationDelivering electronic meeting content
US10540373B1 (en)*2013-03-042020-01-21Jpmorgan Chase Bank, N.A.Clause library manager
US9940679B2 (en)*2014-02-142018-04-10Google LlcSystems, methods, and computer-readable media for event creation and notification
US9513961B1 (en)*2014-04-022016-12-06Google Inc.Monitoring application loading
US10565219B2 (en)2014-05-302020-02-18Apple Inc.Techniques for automatically generating a suggested contact based on a received message
US10579212B2 (en)2014-05-302020-03-03Apple Inc.Structured suggestions
US20150379010A1 (en)*2014-06-252015-12-31International Business Machines CorporationDynamic Concept Based Query Expansion
US11025565B2 (en)2015-06-072021-06-01Apple Inc.Personalized prediction of responses for instant messaging
US10885042B2 (en)*2015-08-272021-01-05International Business Machines CorporationAssociating contextual structured data with unstructured documents on map-reduce
US10445425B2 (en)2015-09-152019-10-15Apple Inc.Emoji and canned responses
US20170169032A1 (en)*2015-12-122017-06-15Hewlett-Packard Development Company, L.P.Method and system of selecting and orderingcontent based on distance scores
US11216491B2 (en)*2016-03-312022-01-04Splunk Inc.Field extraction rules from clustered data samples
US11249710B2 (en)2016-03-312022-02-15Splunk Inc.Technology add-on control console
US20180068330A1 (en)*2016-09-072018-03-08International Business Machines CorporationDeep Learning Based Unsupervised Event Learning for Economic Indicator Predictions
US10885024B2 (en)*2016-11-032021-01-05Pearson Education, Inc.Mapping data resources to requested objectives
US10319255B2 (en)2016-11-082019-06-11Pearson Education, Inc.Measuring language learning using standardized score scales and adaptive assessment engines
CN106547742B (en)*2016-11-302019-05-03百度在线网络技术(北京)有限公司Semantic parsing result treating method and apparatus based on artificial intelligence
US11158012B1 (en)2017-02-142021-10-26Casepoint LLCCustomizing a data discovery user interface based on artificial intelligence
US11275794B1 (en)*2017-02-142022-03-15Casepoint LLCCaseAssist story designer
US10740557B1 (en)2017-02-142020-08-11Casepoint LLCTechnology platform for data discovery
US12002010B2 (en)*2017-06-022024-06-04Apple Inc.Event extraction systems and methods
US11847246B1 (en)*2017-09-142023-12-19United Services Automobile Association (Usaa)Token based communications for machine learning systems
CN108073561A (en)*2017-12-182018-05-25广东广业开元科技有限公司The edit methods and Press release of a kind of Press release are write robot system
US10241992B1 (en)2018-04-272019-03-26Open Text Sa UlcTable item information extraction with continuous machine learning through local and global models
CN109933647A (en)*2019-02-122019-06-25北京百度网讯科技有限公司 Method, apparatus, electronic device and computer storage medium for determining description information
CN113177541B (en)*2021-05-172023-12-19上海云扩信息科技有限公司Method for extracting text content in PDF document and picture by computer program

Citations (5)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US5619709A (en)*1993-09-201997-04-08Hnc, Inc.System and method of context vector generation and retrieval
US5649193A (en)*1993-03-121997-07-15Kabushiki Kaisha ToshibaDocument detection system using detection result presentation for facilitating user's comprehension
US5873056A (en)*1993-10-121999-02-16The Syracuse UniversityNatural language processing system for semantic vector representation which accounts for lexical ambiguity
US5991755A (en)*1995-11-291999-11-23Matsushita Electric Industrial Co., Ltd.Document retrieval system for retrieving a necessary document
US7024407B2 (en)*2000-08-242006-04-04Content Analyst Company, LlcWord sense disambiguation

Family Cites Families (15)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US4839853A (en)*1988-09-151989-06-13Bell Communications Research, Inc.Computer information retrieval using latent semantic structure
US5675819A (en)*1994-06-161997-10-07Xerox CorporationDocument information retrieval using global word co-occurrence patterns
US5675710A (en)*1995-06-071997-10-07Lucent Technologies, Inc.Method and apparatus for training a text classifier
US6076088A (en)*1996-02-092000-06-13Paik; WoojinInformation extraction system and method using concept relation concept (CRC) triples
US6018343A (en)*1996-09-272000-01-25Timecruiser Computing Corp.Web calendar architecture and uses thereof
US5960406A (en)*1998-01-221999-09-28Ecal, Corp.Scheduling system for use between users on the web
US6446061B1 (en)*1998-07-312002-09-03International Business Machines CorporationTaxonomy generation for document collections
US6651218B1 (en)*1998-12-222003-11-18Xerox CorporationDynamic content database for multiple document genres
US6629097B1 (en)*1999-04-282003-09-30Douglas K. KeithDisplaying implicit associations among items in loosely-structured data sets
US6560597B1 (en)*2000-03-212003-05-06International Business Machines CorporationConcept decomposition using clustering
JP3672234B2 (en)*2000-06-122005-07-20インターナショナル・ビジネス・マシーンズ・コーポレーション Method for retrieving and ranking documents from a database, computer system, and recording medium
US7072061B2 (en)*2001-02-132006-07-04Ariba, Inc.Method and system for extracting information from RFQ documents and compressing RFQ files into a common RFQ file type
US20020138492A1 (en)*2001-03-072002-09-26David KilData mining application with improved data mining algorithm selection
US6778979B2 (en)*2001-08-132004-08-17Xerox CorporationSystem for automatically generating queries
US6732090B2 (en)*2001-08-132004-05-04Xerox CorporationMeta-document management system with user definable personalities

Patent Citations (5)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US5649193A (en)*1993-03-121997-07-15Kabushiki Kaisha ToshibaDocument detection system using detection result presentation for facilitating user's comprehension
US5619709A (en)*1993-09-201997-04-08Hnc, Inc.System and method of context vector generation and retrieval
US5873056A (en)*1993-10-121999-02-16The Syracuse UniversityNatural language processing system for semantic vector representation which accounts for lexical ambiguity
US5991755A (en)*1995-11-291999-11-23Matsushita Electric Industrial Co., Ltd.Document retrieval system for retrieving a necessary document
US7024407B2 (en)*2000-08-242006-04-04Content Analyst Company, LlcWord sense disambiguation

Cited By (57)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US20050149858A1 (en)*2003-12-292005-07-07Stern Mia K.System and method for managing documents with expression of dates and/or times
US20070143282A1 (en)*2005-03-312007-06-21Betz Jonathan TAnchor text summarization for corroboration
US9208229B2 (en)2005-03-312015-12-08Google Inc.Anchor text summarization for corroboration
US8996470B1 (en)2005-05-312015-03-31Google Inc.System for ensuring the internal consistency of a fact repository
US9558186B2 (en)2005-05-312017-01-31Google Inc.Unsupervised extraction of facts
US8719260B2 (en)2005-05-312014-05-06Google Inc.Identifying the unifying subject of a set of facts
US8825471B2 (en)2005-05-312014-09-02Google Inc.Unsupervised extraction of facts
US7865461B1 (en)*2005-08-302011-01-04At&T Intellectual Property Ii, L.P.System and method for cleansing enterprise data
US20080033951A1 (en)*2006-01-202008-02-07Benson Gregory PSystem and method for managing context-rich database
US8150857B2 (en)2006-01-202012-04-03Glenbrook Associates, Inc.System and method for context-rich database optimized for processing of concepts
US7941433B2 (en)2006-01-202011-05-10Glenbrook Associates, Inc.System and method for managing context-rich database
US9092495B2 (en)2006-01-272015-07-28Google Inc.Automatic object reference identification and linking in a browseable fact repository
US20110137908A1 (en)*2006-03-102011-06-09Byron Edward DomAssigning into one set of categories information that has been assigned to other sets of categories
US20070226321A1 (en)*2006-03-232007-09-27R R Donnelley & Sons CompanyImage based document access and related systems, methods, and devices
US7603351B2 (en)*2006-04-192009-10-13Apple Inc.Semantic reconstruction
US20070250497A1 (en)*2006-04-192007-10-25Apple Computer Inc.Semantic reconstruction
US7801901B2 (en)*2006-09-152010-09-21Microsoft CorporationTracking storylines around a query
US20080104048A1 (en)*2006-09-152008-05-01Microsoft CorporationTracking Storylines Around a Query
US8751498B2 (en)2006-10-202014-06-10Google Inc.Finding and disambiguating references to entities on web pages
US9760570B2 (en)2006-10-202017-09-12Google Inc.Finding and disambiguating references to entities on web pages
US9449108B2 (en)*2006-11-072016-09-20At&T Intellectual Property I, L.P.Determining sort order by distance
US9892132B2 (en)2007-03-142018-02-13Google LlcDetermining geographic locations for place names in a fact repository
US10459955B1 (en)2007-03-142019-10-29Google LlcDetermining geographic locations for place names
US20080255826A1 (en)*2007-04-162008-10-16Sony CorporationDictionary data generating apparatus, character input apparatus, dictionary data generating method, and character input method
US20080270117A1 (en)*2007-04-242008-10-30Grinblat Zinovy DMethod and system for text compression and decompression
US20120136859A1 (en)*2007-07-232012-05-31Farhan ShamsiEntity Type Assignment
US8812435B1 (en)2007-11-162014-08-19Google Inc.Learning objects and facts from documents
US8276152B2 (en)2007-12-052012-09-25Microsoft CorporationValidation of the change orders to an I T environment
US20090150887A1 (en)*2007-12-052009-06-11Microsoft CorporationProcess Aware Change Management
US9798806B2 (en)*2008-03-312017-10-24Excalibur Ip, LlcInformation retrieval using dynamic guided navigation
US20090248666A1 (en)*2008-03-312009-10-01Yahoo! Inc.Information retrieval using dynamic guided navigation
US20100049761A1 (en)*2008-08-212010-02-25Bijal MehtaSearch engine method and system utilizing multiple contexts
US8433559B2 (en)*2009-03-242013-04-30Microsoft CorporationText analysis using phrase definitions and containers
US20100250235A1 (en)*2009-03-242010-09-30Microsoft CorporationText analysis using phrase definitions and containers
US9171022B2 (en)*2009-04-302015-10-27Collibra Nv/SaMethod and device for ontology evolution
US20150046392A1 (en)*2009-04-302015-02-12Collibra Nv/SaMethod and device for ontology evolution
US20120117023A1 (en)*2009-04-302012-05-10Damien TrogMethod and device for ontology evolution
US8849874B2 (en)*2009-04-302014-09-30Collibra Nv/SaMethod and device for ontology evolution
US8812553B2 (en)*2009-04-302014-08-19Collibra Nv/SaMethod and device for improved ontology engineering
US20110153383A1 (en)*2009-12-172011-06-23International Business Machines CorporationSystem and method for distributed elicitation and aggregation of risk information
US9298824B1 (en)*2010-07-072016-03-29Symantec CorporationFocused crawling to identify potentially malicious sites using Bayesian URL classification and adaptive priority calculation
US20120185935A1 (en)*2011-01-172012-07-19International Business Machines CorporationImplementing automatic access control list validation using automatic categorization of unstructured text
US8739279B2 (en)*2011-01-172014-05-27International Business Machines CorporationImplementing automatic access control list validation using automatic categorization of unstructured text
US9176949B2 (en)*2011-07-062015-11-03Altamira Technologies CorporationSystems and methods for sentence comparison and sentence-based search
US20130013291A1 (en)*2011-07-062013-01-10Invertix CorporationSystems and methods for sentence comparison and sentence-based search
US20150039368A1 (en)*2013-07-302015-02-05Delonaco LimitedSocial Event Scheduler
US10068205B2 (en)*2013-07-302018-09-04Delonaco LimitedSocial event scheduler
US9208204B2 (en)2013-12-022015-12-08Qbase, LLCSearch suggestions using fuzzy-score matching and entity co-occurrence
US9613166B2 (en)2013-12-022017-04-04Qbase, LLCSearch suggestions of related entities based on co-occurrence and/or fuzzy-score matching
US9619571B2 (en)2013-12-022017-04-11Qbase, LLCMethod for searching related entities through entity co-occurrence
US9507834B2 (en)2013-12-022016-11-29Qbase, LLCSearch suggestions using fuzzy-score matching and entity co-occurrence
US9230041B2 (en)2013-12-022016-01-05Qbase, LLCSearch suggestions of related entities based on co-occurrence and/or fuzzy-score matching
US9916368B2 (en)2013-12-022018-03-13QBase, Inc.Non-exclusionary search within in-memory databases
US9201931B2 (en)2013-12-022015-12-01Qbase, LLCMethod for obtaining search suggestions from fuzzy score matching and population frequencies
WO2015084759A1 (en)*2013-12-022015-06-11Qbase, LLCSystems and methods for in-memory database search
US9361317B2 (en)2014-03-042016-06-07Qbase, LLCMethod for entity enrichment of digital content to enable advanced search functionality in content management systems
US20180159876A1 (en)*2016-12-052018-06-07International Business Machines CorporationConsolidating structured and unstructured security and threat intelligence with knowledge graphs

Also Published As

Publication numberPublication date
US20030115189A1 (en)2003-06-19
US6965900B2 (en)2005-11-15

Similar Documents

PublicationPublication DateTitle
US6965900B2 (en)Method and apparatus for electronically extracting application specific multidimensional information from documents selected from a set of documents electronically extracted from a library of electronically searchable documents
US20030115188A1 (en)Method and apparatus for electronically extracting application specific multidimensional information from a library of searchable documents and for providing the application specific information to a user application
Ceri et al.Web information retrieval
Paliwal et al.Semantics-based automated service discovery
RU2377645C2 (en)Method and system for classifying display pages using summaries
US7516397B2 (en)Methods, apparatus and computer programs for characterizing web resources
Sarawagi et al.Open-domain quantity queries on web tables: annotation, response, and consensus models
Chuang et al.Taxonomy generation for text segments: A practical web-based approach
CN102184262A (en)Web-based text classification mining system and web-based text classification mining method
US20120109925A1 (en)Taxonomy-Based Object Classification
US20080086457A1 (en)Method and apparatus for preprocessing a plurality of documents for search and for presenting search result
SchenkerGraph-theoretic techniques for web content mining
CN101535945A (en)Full text query and search systems and method of use
CN102119383A (en)Method and subsystem for information acquisition and aggregation to facilitate ontology and language-model generation within a content-search-service system
CN101655857A (en)Method for mining data in construction regulation field based on associative regulation mining technology
WO2004013772A2 (en)System and method for indexing non-textual data
Sabri et al.Network page building methodical reviews using involuntary manuscript classification procedures founded on deep learning
KR102753536B1 (en)System for author identification using artificial intelligence learning model and a method thereof
Wong et al.Finding structure and characteristic of Web documents for classification
CN102254025B (en)Information memory retrieving method
CN119377490B (en) A person-job matching recommendation method based on BERT and latent semantic algorithm model
Lerman et al.Semantic labeling of online information sources
CN101088082A (en)Full text query and search systems and methods of use
CN101310274B (en)A knowledge correlation search engine
ShahReview of indexing techniques applied in information retrieval

Legal Events

DateCodeTitleDescription
STCBInformation on status: application discontinuation

Free format text:ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION


[8]ページ先頭

©2009-2025 Movatter.jp