US20040243632A1 - Adaptive evaluation of text search queries with blackbox scoring functions - Google Patents

Adaptive evaluation of text search queries with blackbox scoring functions

Info

Publication number
US20040243632A1
Authority
US
United States
Prior art keywords
score
candidate location
candidate
iterator
computer program
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Abandoned
Application number
US10/740,702
Inventor
Kevin Beyer
Robert Lyle
Sridhar Rajagopalan
Eugene Shekita
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
International Business Machines Corp
Original Assignee
International Business Machines Corp
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
2003-05-30
Filing date
2003-12-19
Publication date
2004-12-02
Application filed by International Business Machines Corp
Priority to US10/740,702
Assigned to INTERNATIONAL BUSINESS MACHINES CORPORATION (assignment of assignors interest; see document for details). Assignors: SHEKITA, EUGENE; BEYER, KEVIN SCOTT; LYLE, ROBERT W.; RAJAGOPALAN, SRIDHAR
Publication of US20040243632A1
Priority to US11/561,949 (US7991771B2)
Legal status: Abandoned

Abstract

Disclosed is an evaluation technique for text search with black-box scoring functions, in which the evaluation engine need not maintain details of the scoring function. Included are a description of a system for black-box searching, proofs of correctness, and experimental evidence showing that the technique's efficiency is comparable to that of techniques used in custom-built engines.
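
As a rough illustration of what "black-box" can mean here, the sketch below treats the scoring function as an opaque callable: given what is currently known about each query term at a location (occurs, does not occur, not yet known), it returns a lower and an upper bound on the score. The name `make_weighted_sum`, the weights, and the three-state encoding are assumptions made for this example and are not taken from the patent.

```python
from typing import Callable, Optional, Sequence, Tuple

# Knowledge about each query term at one location (hypothetical encoding):
# True = term occurs, False = term does not occur, None = not yet known.
Knowledge = Sequence[Optional[bool]]
ScoreRange = Tuple[float, float]


def make_weighted_sum(weights: Sequence[float]) -> Callable[[Knowledge], ScoreRange]:
    """Build a toy scoring function: the sum of the weights of the terms present.

    The caller only ever sees the returned callable, never the weights,
    which is the sense in which the function is a black box.
    """
    def score_range(known: Knowledge) -> ScoreRange:
        # Lower bound: count only terms known to occur.
        lower = sum(w for w, k in zip(weights, known) if k is True)
        # Upper bound: also count terms whose status is still unknown.
        upper = sum(w for w, k in zip(weights, known) if k is not False)
        return lower, upper
    return score_range


score_range = make_weighted_sum([3.0, 2.0, 1.0])
print(score_range([True, None, False]))  # (3.0, 5.0): second term still unknown
print(score_range([True, True, False]))  # (5.0, 5.0): fully determined
```

The range collapses to a single value once every term's status is known, which is what lets an evaluation engine decide about a location without knowing how the score is computed.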

Claims (19)

What is claimed is:
1. A computer program product embodied on a computer readable medium, the computer program product comprising computer instructions that implement a search algorithm comprising a function having an input for receiving, while there is at least one candidate location in an order of locations, a score range for the candidate location, the algorithm comparing the score range to a threshold within a range of possible scores, wherein if a lower bound of the score range for the candidate location exceeds the threshold then the candidate location is retained as a result and a next location is selected, and wherein if an upper bound of the score range is at or below the threshold the candidate location is discarded and the next location is selected, and wherein if the score of the candidate location is indeterminate, then the score range for the candidate location is refined.
2. The computer program product as in claim 1, wherein the next location is selected by choosing the next location greater than the candidate location such that the upper bound exceeds the threshold.
3. The computer program product as in claim 1, wherein the algorithm receives the score range from a blackbox scoring function.
4. The computer program product as in claim 3, wherein the blackbox scoring function comprises a Boolean function.
5. The computer program product as in claim 4, wherein the function comprises instructions for selecting the next location by choosing the next location greater than the candidate location such that the upper bound exceeds the threshold.
6. The computer program product as in claim 3, wherein the blackbox scoring function comprises at least one of a Term Frequency Inverted Document Frequency (TFIDF) function, a Static Rank function, a Searching by Numbers function, a Lexical Affinities (LA) function, a Salience Levels (SL) function and a threshold predicate function.
7. The computer program product as in claim 3, wherein the blackbox scoring function provides an under-estimate of the score range.
8. The computer program product as in claim 1, wherein the score range for the candidate location is refined by advancing an input iterator that is set before the candidate location to the candidate location and reevaluating the score range.
9. The computer program product as in claim 8, wherein advancing the input iterator comprises advancing a randomly selected input iterator.
10. The computer program product as in claim 8, wherein advancing the input iterator comprises advancing the input iterator in a round robin fashion.
11. The computer program as in claim 8, wherein advancing the input iterator comprises advancing the input iterator that is the least expensive to advance.
12. The computer program product as in claim 8, wherein advancing the input iterator comprises advancing the sparsest input iterator.
13. The computer program product as in claim 12, wherein advancing the sparsest input iterator comprises measuring the effectiveness of the input iterator to advance the candidate location.
14. The computer program product as in claim 13, wherein measuring the effectiveness comprises dividing total progress attributed to the input iterator by a number of toss operations for the input iterator.
15. The computer program product as in claim 8, wherein advancing the input iterator comprises selecting a first iterator in an order of iterators set before the candidate location such that when all iterators after the chosen iterator are assumed to occur at the candidate location, and all iterators set before the chosen iterator are assumed to not occur at the candidate location, and the upper bound is above the threshold when the chosen iterator is assumed to occur at the candidate location, and the upper bound is below the threshold when the chosen iterator is assumed to not occur at the candidate location.
16. The computer program product as in claim 8, wherein advancing the input iterator comprises selecting a combination of techniques.
17. The computer program product as in claim 1, wherein the candidate location comprises a predetermined score for the score range.
18. A system for implementing a search algorithm, comprising:
a processor for operating an algorithm that comprises an input for receiving from a blackbox scoring function a score for at least one candidate location in an order of locations, wherein the algorithm compares the score to a threshold, and if the score exceeds the threshold then the candidate location is stored as a result and a next location is selected, and if the score is at or below the threshold the candidate location is discarded and the next location is selected, and wherein if the score of the candidate location is indeterminate, then the score for the candidate location is refined; wherein each result is stored in a table of results ordered by relevance.
19. A method for implementing a search of locations in a body of data for relevant terms, the method comprising:
providing an index of locations comprised of terms, wherein a score for the relevant terms in a candidate location is provided by a scoring function and associated with the candidate location; and,
while there are candidate locations:
refining the score range if the score of the candidate location is indeterminate, otherwise, storing each candidate location as a result if a lower bound of the score range for the candidate location exceeds a threshold within a range of possible scores, discarding the candidate location if the score range is at or below an upper bound for the score range and selecting a next location.
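
The retain / discard / refine loop recited in the claims can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the patented implementation: `PostingIterator`, `weighted_sum_range`, and `evaluate` are hypothetical names, the scoring function is a toy weighted sum, refinement simply advances the first lagging iterator (the claims list other choices such as random, round robin, least expensive, or sparsest), and the threshold is assumed to be non-negative so that a location containing no query term can never qualify.

```python
import bisect
from typing import Callable, List, Optional, Sequence, Tuple

INF = float("inf")
Knowledge = Sequence[Optional[bool]]


class PostingIterator:
    """Cursor over one term's sorted list of locations (hypothetical helper)."""

    def __init__(self, postings: Sequence[int]) -> None:
        self.postings = sorted(postings)
        self.pos = -1  # -1 means "set before every location"

    def advance_to(self, target: int) -> None:
        """Move to the first posting at or after target (INF when exhausted)."""
        i = bisect.bisect_left(self.postings, target)
        self.pos = self.postings[i] if i < len(self.postings) else INF


def weighted_sum_range(weights: Sequence[float]) -> Callable[[Knowledge], Tuple[float, float]]:
    """Toy stand-in for a black-box scorer: returns (lower, upper) score bounds."""
    def score_range(known: Knowledge) -> Tuple[float, float]:
        lower = sum(w for w, k in zip(weights, known) if k is True)
        upper = sum(w for w, k in zip(weights, known) if k is not False)
        return lower, upper
    return score_range


def evaluate(iterators: List[PostingIterator],
             score_range: Callable[[Knowledge], Tuple[float, float]],
             threshold: float,
             num_locations: int) -> List[int]:
    """Return the locations whose score is certain to exceed the threshold.

    Assumes at least one iterator, a non-negative threshold, and a scorer
    whose bounds coincide once every term's status is known.
    """
    results: List[int] = []
    candidate = 0
    while candidate < num_locations:
        while True:
            # None: iterator is still before the candidate, so the term is unknown.
            # True/False: iterator sits exactly at / strictly past the candidate.
            known = [None if it.pos < candidate else it.pos == candidate
                     for it in iterators]
            lower, upper = score_range(known)
            if lower > threshold:   # certain to qualify: retain
                results.append(candidate)
                break
            if upper <= threshold:  # certain not to qualify: discard
                break
            # Indeterminate: refine by advancing one lagging iterator and re-scoring.
            lagging = next(it for it in iterators if it.pos < candidate)
            lagging.advance_to(candidate)
        # Next candidate: jump ahead only when every iterator is already past it.
        candidate = min(it.pos if it.pos > candidate else candidate + 1
                        for it in iterators)
    return results


iterators = [PostingIterator(p) for p in ([1, 4, 7], [4, 5], [2, 4, 7])]
print(evaluate(iterators, weighted_sum_range([3.0, 2.0, 1.0]), 3.5, 10))  # [4, 7]
```

In this run, location 4 is retained as soon as the first two terms are known to occur there (bounds 5.0 to 6.0), without ever advancing the third iterator, which is the kind of work the score-range test is meant to save.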
US10/740,702 | 2003-05-30 | 2003-12-19 | Adaptive evaluation of text search queries with blackbox scoring functions | Abandoned | US20040243632A1 (en)

Priority Applications (2)

Application Number | Priority Date | Filing Date | Title
US10/740,702, US20040243632A1 (en) | 2003-05-30 | 2003-12-19 | Adaptive evaluation of text search queries with blackbox scoring functions
US11/561,949, US7991771B2 (en) | 2003-05-30 | 2006-11-21 | Adaptive evaluation of text search queries with blackbox scoring functions

Applications Claiming Priority (2)

Application Number | Priority Date | Filing Date | Title
US47487703P | 2003-05-30 | 2003-05-30 |
US10/740,702, US20040243632A1 (en) | 2003-05-30 | 2003-12-19 | Adaptive evaluation of text search queries with blackbox scoring functions

Related Child Applications (1)

Application Number | Title | Priority Date | Filing Date
US11/561,949 (Division), US7991771B2 (en) | Adaptive evaluation of text search queries with blackbox scoring functions | 2003-05-30 | 2006-11-21

Publications (1)

Publication Number | Publication Date
US20040243632A1 (en) | 2004-12-02

Family

ID=33457646

Family Applications (2)

Application Number | Title | Priority Date | Filing Date
US10/740,702 (Abandoned), US20040243632A1 (en) | Adaptive evaluation of text search queries with blackbox scoring functions | 2003-05-30 | 2003-12-19
US11/561,949 (Expired - Fee Related), US7991771B2 (en) | Adaptive evaluation of text search queries with blackbox scoring functions | 2003-05-30 | 2006-11-21

Family Applications After (1)

Application Number | Title | Priority Date | Filing Date
US11/561,949 (Expired - Fee Related), US7991771B2 (en) | Adaptive evaluation of text search queries with blackbox scoring functions | 2003-05-30 | 2006-11-21

Country Status (1)

Country | Link
US (2) | US20040243632A1 (en)

Cited By (8)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US20050149504A1 (en) * | 2004-01-07 | 2005-07-07 | Microsoft Corporation | System and method for blending the results of a classifier and a search engine
US20050154746A1 (en) * | 2004-01-09 | 2005-07-14 | Yahoo!, Inc. | Content presentation and management system associating base content and relevant additional content
US20060143197A1 (en) * | 2004-12-23 | 2006-06-29 | Become, Inc. | Method for assigning relative quality scores to a collection of linked documents
US20080010251A1 (en) * | 2006-07-07 | 2008-01-10 | Yahoo! Inc. | System and method for budgeted generalization search in hierarchies
US20080126319A1 (en) * | 2006-08-25 | 2008-05-29 | Ohad Lisral Bukai | Automated short free-text scoring method and system
US20100211569A1 (en) * | 2009-02-18 | 2010-08-19 | Avaya Inc. | System and Method for Generating Queries
US20130254220A1 (en) * | 2009-01-30 | 2013-09-26 | Lexisnexis | Methods and systems for creating and using an adaptive thesaurus
CN117910573A (en) * | 2023-12-19 | 2024-04-19 | 国家移民管理局常备力量第二总队 | A hybrid table text question answering method and system based on multi-task

Families Citing this family (10)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US8244722B1 (en) * | 2005-06-30 | 2012-08-14 | Google Inc. | Ranking documents
US7904303B2 (en) * | 2007-08-24 | 2011-03-08 | Yahoo! Inc. | Engagement-oriented recommendation principle
US8041710B2 (en) * | 2008-11-13 | 2011-10-18 | Microsoft Corporation | Automatic diagnosis of search relevance failures
US9529908B2 (en) | 2010-11-22 | 2016-12-27 | Microsoft Technology Licensing, Llc | Tiering of posting lists in search engine index
US8713024B2 (en) | 2010-11-22 | 2014-04-29 | Microsoft Corporation | Efficient forward ranking in a search engine
US8478704B2 (en) | 2010-11-22 | 2013-07-02 | Microsoft Corporation | Decomposable ranking for efficient precomputing that selects preliminary ranking features comprising static ranking features and dynamic atom-isolated components
US9424351B2 (en) | 2010-11-22 | 2016-08-23 | Microsoft Technology Licensing, Llc | Hybrid-distribution model for search engine indexes
US8620907B2 (en) | 2010-11-22 | 2013-12-31 | Microsoft Corporation | Matching funnel for large document index
JP5928248B2 (en) * | 2012-08-27 | 2016-06-01 | 富士通株式会社 | Evaluation method, information processing apparatus, and program
US11182437B2 (en) * | 2017-10-26 | 2021-11-23 | International Business Machines Corporation | Hybrid processing of disjunctive and conjunctive conditions of a search query for a similarity search

Citations (3)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US6144958A (en) * | 1998-07-15 | 2000-11-07 | Amazon.Com, Inc. | System and method for correcting spelling errors in search queries
US6353825B1 (en) * | 1999-07-30 | 2002-03-05 | Verizon Laboratories Inc. | Method and device for classification using iterative information retrieval techniques
US6785671B1 (en) * | 1999-12-08 | 2004-08-31 | Amazon.Com, Inc. | System and method for locating web-based product offerings

Family Cites Families (4)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US6243713B1 (en) * | 1998-08-24 | 2001-06-05 | Excalibur Technologies Corp. | Multimedia document retrieval by application of multimedia queries to a unified index of multimedia data for a plurality of multimedia data types
US6915308B1 (en) * | 2000-04-06 | 2005-07-05 | Claritech Corporation | Method and apparatus for information mining and filtering
US6947920B2 (en) * | 2001-06-20 | 2005-09-20 | Oracle International Corporation | Method and system for response time optimization of data query rankings and retrieval
US20040172374A1 (en) * | 2003-02-28 | 2004-09-02 | Forman George Henry | Predictive data mining process analysis and tool

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US6144958A (en) * | 1998-07-15 | 2000-11-07 | Amazon.Com, Inc. | System and method for correcting spelling errors in search queries
US6353825B1 (en) * | 1999-07-30 | 2002-03-05 | Verizon Laboratories Inc. | Method and device for classification using iterative information retrieval techniques
US6785671B1 (en) * | 1999-12-08 | 2004-08-31 | Amazon.Com, Inc. | System and method for locating web-based product offerings
US20050004889A1 (en) * | 1999-12-08 | 2005-01-06 | Bailey David R. | Search engine system and associated content analysis methods for locating web pages with product offerings

Cited By (14)

* Cited by examiner, † Cited by third party
Publication number | Priority date | Publication date | Assignee | Title
US20050149504A1 (en) * | 2004-01-07 | 2005-07-07 | Microsoft Corporation | System and method for blending the results of a classifier and a search engine
US7424469B2 (en) * | 2004-01-07 | 2008-09-09 | Microsoft Corporation | System and method for blending the results of a classifier and a search engine
US7483891B2 (en) * | 2004-01-09 | 2009-01-27 | Yahoo, Inc. | Content presentation and management system associating base content and relevant additional content
US20050154746A1 (en) * | 2004-01-09 | 2005-07-14 | Yahoo!, Inc. | Content presentation and management system associating base content and relevant additional content
US20060143197A1 (en) * | 2004-12-23 | 2006-06-29 | Become, Inc. | Method for assigning relative quality scores to a collection of linked documents
US7797344B2 (en) * | 2004-12-23 | 2010-09-14 | Become, Inc. | Method for assigning relative quality scores to a collection of linked documents
US20080010251A1 (en) * | 2006-07-07 | 2008-01-10 | Yahoo! Inc. | System and method for budgeted generalization search in hierarchies
US7991769B2 (en) * | 2006-07-07 | 2011-08-02 | Yahoo! Inc. | System and method for budgeted generalization search in hierarchies
US20080126319A1 (en) * | 2006-08-25 | 2008-05-29 | Ohad Lisral Bukai | Automated short free-text scoring method and system
US20130254220A1 (en) * | 2009-01-30 | 2013-09-26 | Lexisnexis | Methods and systems for creating and using an adaptive thesaurus
US9141728B2 (en) * | 2009-01-30 | 2015-09-22 | Lexisnexis, A Division Of Reed Elsevier Inc. | Methods and systems for creating and using an adaptive thesaurus
US20100211569A1 (en) * | 2009-02-18 | 2010-08-19 | Avaya Inc. | System and Method for Generating Queries
US8301619B2 (en) * | 2009-02-18 | 2012-10-30 | Avaya Inc. | System and method for generating queries
CN117910573A (en) * | 2023-12-19 | 2024-04-19 | 国家移民管理局常备力量第二总队 | A hybrid table text question answering method and system based on multi-task

Also Published As

Publication number | Publication date
US7991771B2 (en) | 2011-08-02
US20070150467A1 (en) | 2007-06-28

Legal Events

Date | Code | Title | Description
AS | Assignment

Owner name: INTERNATIONAL BUSINESS MACHINES CORPORATION, NEW Y

Free format text: ASSIGNMENT OF ASSIGNORS INTEREST; ASSIGNORS: BEYER, KEVIN SCOTT; LYLE, ROBERT W.; RAJAGOPALAN, SRIDHAR; AND OTHERS; REEL/FRAME: 015176/0475; SIGNING DATES FROM 20040211 TO 20040216

STCB | Information on status: application discontinuation

Free format text: ABANDONED -- FAILURE TO RESPOND TO AN OFFICE ACTION

