






相关申请案交叉参照Related Application Cross Reference
本申请案根据35U.S.C.§119(e)主张基于2004年3月29日提出申请的第60/557,783号申请案及2004年12月1日提出申请的第11/002,474号申请案的权利,上述申请案均以引用方式并入本文中。本申请案还以引用方式将第10/646,331号(2003年8月22日提出申请)及第10/641,489号(2003年8月14日提出申请)美国专利申请案并入本文中;所有上述申请案均与本申请案共同拥有。This application asserts rights under 35 U.S.C. § 119(e) based on Application Nos. 60/557,783, filed March 29, 2004, and 11/002,474, filed December 1, 2004, All of the above applications are incorporated herein by reference. This application also incorporates by reference U.S. Patent Application Nos. 10/646,331 (filed August 22, 2003) and 10/641,489 (filed August 14, 2003); all of the above applications are jointly owned with this application.
技术领域technical field
本申请案大体而言涉及在线搜索引擎,且具体而言涉及对搜索结果的个性化。This application relates generally to online search engines, and in particular to personalizing search results.
背景技术Background technique
在线搜索引擎技术的当前技术水平在其检索可回应于查询项的文档(例如网页或其某些部分、图像等等)的能力方面非常先进。现今的搜索引擎可快速检索与查询项相匹配的特定文档。然而,当前的搜索引擎常常返回虽然准确地对应于特定查询项但根本反映不出用户的基本兴趣的文档。因此,两个不同用户(其中一个对体育非常感兴趣,而另一个对政治感兴趣)在一给定查询中将得到完全相同的结果,例如“棒球中的药检”,即使第一个用户可能对了解哪些队已实施了药检更感兴趣,而后一个用户可能对了解与在体育中进行药检相关的立法更感兴趣。因此,需要提供一种根据用户的兴趣使搜索结果个性化的机理及方法。The current state of the art of online search engine technology is very advanced in its ability to retrieve documents (eg, web pages or portions thereof, images, etc.) that respond to query terms. Today's search engines quickly retrieve specific documents that match query terms. However, current search engines often return documents that, while accurately corresponding to specific query terms, do not reflect the user's underlying interests at all. Thus, two different users (one of whom is very interested in sports and the other interested in politics) will get exactly the same results for a given query, such as "drug test in baseball", even though the first user may While more interested in knowing which teams have implemented drug testing, the latter user may be more interested in knowing legislation related to drug testing in sports. Therefore, it is necessary to provide a mechanism and method for personalizing search results according to user interests.
此外,虽然用户可能具有可适用于处理其搜索查询的特定的一组兴趣,但用户可能并不想总是使这些兴趣影响查询结果。因此,将希望提供一种使用户可对他的兴趣影响一给定查询的结果的程度进行可变调整的机理及方法。Furthermore, while a user may have a specific set of interests that may be applicable to addressing their search query, the user may not always want these interests to influence the query results. Accordingly, it would be desirable to provide a mechanism and method by which a user can variably adjust the degree to which his interests affect the results of a given query.
发明内容Contents of the invention
本发明提供使一用户能够对他或她的个人兴趣影响一给定查询的结果表示形式的程度进行可变调整的系统及方式。本发明具有多个实施例,包括一种用于提供一搜索引擎的系统、一种用于处理一搜索查询的方法及一种供用户对所述影响程度进行可变调整的用户界面。The present invention provides systems and methods that enable a user to variably adjust the degree to which his or her personal interests affect the result presentation for a given query. The invention has multiple embodiments, including a system for providing a search engine, a method for processing a search query, and a user interface for variably adjusting the degree of influence by a user.
从用户的观点而言,本发明提供一种用于使搜索结果个性化的机理。所述用户可规定一组兴趣(例如所述用户感兴趣的一组主题或类别)作为一兴趣配置文件。然后,所述用户将一搜索查询提交至一搜索引擎。所述搜索引擎对与所述查询相关的文档进行检索并提供这些文档作为搜索结果。首先(虽然未必尽然),在不考虑用户兴趣的情况下对所述搜索结果进行排列。此使用户能够看到所述文档的相关性而不受他的个人兴趣影响。用户可按可变方式控制对搜索结果集合中文档的排列。在一较佳实施例中,此通过直接操纵图形控制元件(一种个性化控制)来进行。此图形元件可具有不同的图形表示形式,且在一实施例中将其表示成一滑标。调整所述控制元件便能根据他的兴趣配置文件中所表达的用户兴趣来调整所述结果的排列程度。响应于所述控制元件的位置变化,对所述搜索结果进行重新排列,从而使所述用户能够看到他的兴趣如何影响对结果的排序。From the user's point of view, the present invention provides a mechanism for personalizing search results. The user may specify a set of interests (eg, a set of topics or categories of interest to the user) as an interest profile. The user then submits a search query to a search engine. The search engine retrieves documents relevant to the query and provides these documents as search results. First (although not necessarily), the search results are ranked without regard to user interests. This enables the user to see the relevance of said documents regardless of his personal interests. The user may control the arrangement of documents in the search result set in a variable manner. In a preferred embodiment, this is done by direct manipulation of graphical control elements (a type of personalization control). This graphical element can have different graphical representations, and in one embodiment it is represented as a slider. Adjusting the control element adjusts the ranking of the results according to the user's interests expressed in his interest profile. In response to a change in the position of the control element, the search results are rearranged, thereby enabling the user to see how his interests affect the ordering of the results.
在一实施例中,搜索结果的排列会起增加与主题目录相关联的文档的相关性得分的作用。所述目录中的每一主题均与一组被认为与所述主题相关的文档(例如网站)相关联。对于每一文档(同样为网站),均提供一指示所述站点与所述主题的相关程度的提升值。一给定站点内的所有页面均获得该站点的提升值。所述提升值可理解为一表示所述站点与所述主题的相关性的比例因数。在查询处理期间,返回该组文档并进行计分而不考虑其包含于所述主题目录中。另外,确定每一文档的单独得分,其中所述文档的基本相关性得分是通过其相关联的提升值(如果有)来进行调整,例如通过将所述相关性得分乘以所述提升值,从而按比例缩放所述文档与所述查询的相关性。一般而言,只有属于所述目录中所列站点的那些文档才会具有一提升的相关性得分,而所述搜索结果中的其它文档将不会具有所述提升的相关性得分。因此,根据文档的提升的相关性得分对文档进行重新排列会随个性化控制的位置而变化-所述个性化控制的位置从毫无影响(其中所述提升值对所述相关性得分毫无影响)到完全影响(其中将每一文档的提升值100%地应用至所述相关性得分)、及任意数量的中间应用级。因此,用户可通过调整所述控制的位置来立即对所述搜索结果中的文档进行重新排列。In one embodiment, the ranking of the search results acts to increase the relevance score of the documents associated with the subject category. Each topic in the directory is associated with a set of documents (eg, web sites) that are considered to be related to the topic. For each document (also a website), a boost value is provided indicating how relevant the site is to the topic. All pages within a given site receive that site's boost value. The boost value can be understood as a scaling factor representing the relevance of the site to the topic. During query processing, the set of documents is returned and scored regardless of their inclusion in the subject catalog. Additionally, an individual score is determined for each document, wherein the document's base relevance score is adjusted by its associated boost value (if any), for example by multiplying the relevance score by the boost value, The relevance of the document to the query is thereby scaled. In general, only those documents belonging to sites listed in the directory will have a boosted relevance score, while other documents in the search results will not have the boosted relevance score. Thus, rearranging documents according to their boosted relevance scores varies with the position of the personalization control - the position of the personalization control never having any effect (wherein the boost value has no effect on the relevance score Impact) to full impact (where 100% of each document's boost value is applied to the relevance score), and any number of intermediate levels of application. Thus, the user can instantly rearrange the documents in the search results by adjusting the position of the control.
一系统实施例包括一用于向用户提供主题(类别)的主题目录、一用于响应于用户的查询而搜索带索引的网站的搜索引擎及索引、及一用于根据用户的兴趣为所述搜索结果确定所述搜索结果的至少一种(及较佳一组)排列的个性化组件。一用户界面实施例包括一具有各种主题的主题目录、及其中所述用户可选择兴趣主题的若干图形控制器(例如复选框)、及一允许用户根据他的兴趣配置文件来对所述搜索结果排列进行调整的图形控制器。One embodiment of the system includes a subject directory for providing topics (categories) to users, a search engine and index for searching indexed websites in response to user queries, and a system for providing information to the users based on their interests. The search results determine at least one (and preferably one set) permutation personalization component of said search results. One user interface embodiment includes a topic directory with various topics, and several graphical controls (such as checkboxes) where the user can select topics of interest, and a user interface that allows the user to select the topics according to his interest profile. Search results arrangement adjusted for graphics controllers.
接下来,参照各图式及图解以及技术信息来阐述本发明。所述图式只是出于举例说明的目的来描绘本发明各实施例。所属领域的技术人员通过下文说明将易知,也可采用所示及所述结构、方法及功能的替代实施例,此并不背离本发明的原理。Next, the present invention is explained with reference to the drawings and diagrams and technical information. The drawings depict various embodiments of the invention for purposes of illustration only. Those skilled in the art will readily appreciate from the following description that alternative embodiments of the structures, methods, and functions shown and described may be employed without departing from the principles of the invention.
附图说明Description of drawings
图1为一用于一搜索引擎的入口网页。FIG. 1 is a portal page for a search engine.
图2为用于编辑一用户的个人主题配置文件的网页。Figure 2 is a web page for editing a user's personal theme profile.
图3为另一主题及副主题网页。Figure 3 is another topic and subtopic webpage.
图4为另一副主题网页。Figure 4 is another sub-theme webpage.
图5显示带有查询项“stanford”的搜索引擎的入口网页。FIG. 5 shows the entry web page of the search engine with the query term "stanford".
图6显示关于查询项“stanford”的初始搜索结果以及一用于在对所述搜索结果进行排列时控制所述用户的兴趣配置文件的影响程度的控制器608。FIG. 6 shows initial search results for the query term "stanford" and a
图7显示在所述用户已将控制器608调整至一中间位置后图6所示搜索结果的排列。FIG. 7 shows the arrangement of the search results shown in FIG. 6 after the user has adjusted the
图8显示在所述用户已将控制器608调整至一最大位置后图6所示搜索结果的排列。FIG. 8 shows the arrangement of the search results shown in FIG. 6 after the user has adjusted the
图9显示在所述用户删除一“Computer”主题并选择一“Music”主题后的一主题网页。Figure 9 shows a topic web page after the user deletes a "Computer" topic and selects a "Music" topic.
图10显示在图9中改变主题兴趣后图6所示搜索结果的排列。FIG. 10 shows the arrangement of the search results shown in FIG. 6 after changing the subject interests in FIG. 9 .
具体实施方式Detailed ways
现在参见图1,图中显示一搜索引擎的入口(或登陆页面)100。所述搜索引擎包括一其中用户输入一搜索查询项的查询输入栏102。一搜索按钮106将所述查询提交给所述搜索引擎进行处理。所述用户在首次访问所述搜索引擎时,将还没有对一用于使搜索结果个性化的用户配置文件进行界定,且对此显示一消息。所述用户可访问编辑Edit Profile(配置文件)链接104以便创建其个人兴趣。Referring now to FIG. 1, an entry (or landing page) 100 for a search engine is shown. The search engine includes a
点击编辑链接104便能访问图2中所示页面200。此页面显示一主题目录202的顶层主题204。所述目录中的主题可为(例如)在存在于http://dmoz.org/处的OpenDirectory Project(开放式目录计划)中所规定的主题或者可为某一其它主题目录。(在本揭示内容中,用语“主题”与“类别”可互换,“副主题”与“子类别”也可互换。)所述顶层主题链接可具有一组由箭头206指示的副主题。在任一时刻,用户均可通过点击StartSearching(开始搜索)按钮208来开始一搜索。点击一特定顶层主题便会将用户带到一与所述主题的副主题相关联的页面。Clicking on the edit link 104 accesses the
图3图解说明其中用户已点击页面200上的主题“Computer”的此步骤的结果。如图3中所示,主题“Computer”的副主题302已显示在右手的列中。紧挨着每一副主题302的是一复选框314。用户可选择复选框314来指示他对一特定副主题感兴趣。或者,用户可针对整个副主题本身“Computer(General)”选择复选框316。此指示用户对主题“Computer”的所有副主题感兴趣。用户所选择的该组主题可称作用户配置文件、个性化配置文件或兴趣配置文件。在图3的左侧,在用户选择了一个或多个兴趣主题后,在兴趣栏306中列出这些主题。紧挨着每一主题名称的是一使用户能够分别删除各个主题的删除链接308。用户也可通过选择“Delete all preferences”链接310来删除所有主题。FIG. 3 illustrates the result of this step where the user has clicked on the topic "Computer" on
所述副主题本身可具有任一数量的附加副主题层。例如,图4图解说明副主题“Games”进一步具有副主题402“Board Games”、“Card Games”等等。这些游戏响应于用户点击紧挨着“Games”主题标签的副主题箭头206而呈现给用户。同样,用户可通过选择复选框404或各个副主题复选框406中的任何一个复选框来选择整个子类别。在一实施例中,除顶层主题204外,还有两个副主题层。此会提供足够的主题覆盖宽度及间隔尺寸,而不会施加不必要的计算复杂度。The subtopic itself may have any number of additional subtopic layers. For example, FIG. 4 illustrates that the subtopic "Games" further has subtopics 402 "Board Games," "Card Games," and so on. These games are presented to the user in response to the user clicking on the
重新参见图3,用户已指示他通常对计算机感兴趣。通过点击Start Searching(开始搜索)按钮208会再次调用搜索页面100,搜索页面100不再显示请求用户创建个性化配置文件的消息。这时,用户在所述搜索栏中输入一搜索查询,此处所述查询为图5中所示的“Stanford”。搜索引擎对其索引实施搜索并提供搜索结果。图6图解说明初始搜索结果页面600。此外,搜索结果602包括一组分别由一URL 617指示的文档604连同一通往所述页面的链接615及一向用户显示所述文档上下文的一部分的页面文本片断619。靠近所述页面顶部的是个性化控制器608。在本申请案的上下文中,“文档”应理解为可由一搜索引擎加索引并检索的任一类型的媒体,包括网页、图像、多媒体文件、文本文档、PDF等等。使用通用术语“文档”并不意含着限制本发明的范围。Referring back to Figure 3, the user has indicated that he is generally interested in computers. By clicking the Start Searching (start searching)
个性化控制器608使用户能够根据他的兴趣(较佳如所述个性化配置文件中所表示)来对搜索结果的各个文档/页面的排列加以调整。所述个性化控制器可解释为控制用户的兴趣配置文件对搜索结果的排列的影响或作用的程度。图6图解说明在一其中控制器608为一滑标型控制器的实施例中的控制器608,控制器608包括一在最小值610至最大值614之间沿线性维度移动的控制手柄610。此类型的控制器608通常非常直观且易于用户了解其操作方式。替代实施例包括同样在最小与最大值之间提供可变调整的刻度盘类型控制器、及若干分别具有一相关的预定调整量的无线电按钮(例如五个分别与0%、20%、40%、80%及100%相关联的无线电按钮)。用户界面设计领域的技术人员将了解,也可采用其它类型的控制器来达到基本相同的效果。The
在图6所示的实例中,个性化控制器608一直设定至其最小值612。因此,根据搜索结果与查询的相关性及与查询无关的信息(例如PageRank)来对相应地搜索结果进行排列,PageRank揭示于名称为“链接数据库中的节点排列方法(Method for noderanking in a linked database)”的第6,285,999号美国专利中,该专利的全文及其全部教示内容以引用方式并入本文中。应注意,所述搜索结果通常与Stanford University(斯坦福大学)有关。In the example shown in FIG. 6 , the
现在参见图7,图中显示在用户已将滑标610调整至一位于最小值与最大值之间大约处于中点的中间位置后对搜索结果602的排列。这时,各个文档604得到重新排列,且与用户的兴趣更为相关的那些(前面标识为“Computers(General))”具有更高的次序。为了使用户辨别与他的兴趣更为相关的文档,那些文档由一紧挨着链接文本615的个性化图标618指示。注意到,在滑标610处于中间位置的情况下,大约每隔一个文档便为一个性化结果,尽管并非必需如此。进一步注意到,片断619文本具有与计算机相关的文本(例如“hacking(非法闯入)”),即使该用语并未包括于原始搜索查询中。Referring now to FIG. 7, there is shown the ranking of
现在参见图8,图中显示在用户已将滑标610调整至其最大位置614后搜索结果602的排列。首先,在此实例中,搜索结果602中的所有文档604均反映用户的个性化配置文件并以个性化图标618来注解。其次,所述搜索结果中的文档与计算机及Stanford大学相关,从而反映用户对计算机的兴趣,因为其与查询“Stanford”相关。例如,前两个结果与在Stanford的“非法闯入”相关;接下来的三个文档具体与Stanford的各种计算机科学项目相关。因此,第三个文档是针对Stanford的Artificial Intelligence Laboratory(人工智能实验室),第四个文档是针对Stanford的Center for the Study of Language and Information(语言与信息研究中心),而第五个文档是针对Stanford的Database Group(数据库小组)。其余文档同样与用户的“Computers”兴趣相关。进一步注意到,此前在当不应用个性化(图6)及当只应用中等个性化(图7)时作为第一(例如最相关的)结果的与Stanford大学的主页的链接不再出现于前八个结果中。Referring now to FIG. 8 , there is shown the arrangement of
在图6-8所示实例中,用户可实时地对文档的排列加以动态调整,而搜索引擎无需重新计算搜索结果。此进一步有助于用户能够探究所述搜索结果。In the example shown in Figures 6-8, the user can dynamically adjust the arrangement of the documents in real time, and the search engine does not need to recalculate the search results. This further facilitates the user's ability to explore the search results.
如图6-8所示,一Edit Profile(编辑配置文件)链接616使用户能够随时对他的用户配置文件的兴趣进行编辑。图9图解说明用户已返回至图2中所示的主题目录页面200、删除了他现有的Computers主题兴趣、并创建了“Music(音乐)”这一不同的兴趣。然后,用户重新选择Start Searching(开始搜索)按钮208。搜索引擎提供经更新的一组搜索结果,所述经更新的一组搜索结果的文档将与以前的相同,因为尚未改变所述查询术语。然而,对于最后一次搜索,用户已将控制器610定位于完全个性化处,且因此所述搜索引擎应用此设置并根据现有的(及新的)个性化配置文件(此时针对“Music(音乐)”)对文档进行排列。图10图解说明此在所述搜索结果中所产生的巨大差异。此处,顶部的六个结果均与音乐及艺术相关,而先前关于计算机的结果均不处于所述搜索结果列表的顶部。此表明用户的个性化配置文件可如何显著地改变对给定的一组文档的排列。As shown in Figures 6-8, an Edit Profile (edit configuration file)
在本发明的一实施例中,对搜索结果的个性化是由下述方法实现的。搜索引擎设计领域的技术人员可容易理解其中可采用此方法的搜索引擎的基本结构及其操作。In an embodiment of the present invention, the personalization of search results is realized by the following method. Those skilled in the art of search engine design can readily understand the basic structure of a search engine in which this method can be employed and its operation.
1)当用户选择主题(类别)时,所述主题(类别)被添加至一包含他的兴趣IN(其中IN为“interests(兴趣)”的缩写)的列表的跟踪文件中。所述兴趣可表示成文本符记(例如“computers(计算机)”)或表示成加入目录中的索引,其中每一主题/副主题均具有一索引号。当用户访问所述站点时,系统自所述跟踪文件检索此兴趣列表。或者,所述站点可提供一使用户在其中登记一用户名及口令的用户登记页面,并将用户的兴趣在搜索引擎站点处存储于一数据库中,并响应于用户登录来对其进行检索。1) When a user selects a topic (category), said topic (category) is added to a tracking file containing a list of his interests IN (where IN is an abbreviation for "interests"). The interest can be represented as a text notation (such as "computers") or as an index added to a table of contents, where each topic/subtopic has an index number. The system retrieves this interest list from the tracking file when the user visits the site. Alternatively, the site may provide a user registration page where the user registers a username and password, and store the user's interests in a database at the search engine site and retrieve them in response to user login.
2)每当用户发出一搜索查询Q时,便将所述列表IN发送至一前端网络服务器。2) Whenever a user issues a search query Q, the list IN is sent to a front-end web server.
3)然后,所述前端网络服务器将两个请求(Q,空)及(Q,IN)发给一后端搜索引擎,所述后端搜索引擎向所述前端网络服务器返回两个别为R_base(常规的)及R_in(个性化的)结果集合。3) Then, the front-end web server sends two requests (Q, empty) and (Q, IN) to a back-end search engine, and the back-end search engine returns two R_base( regular) and R_in (personalized) result sets.
4)所述基础查询(Q,空)与在搜索引擎上在不施加任何个性化情况下对Q进行的搜索相同,因而R_base为所述搜索引擎通常将响应于所述查询而返回的结果集合。如上文所述,根据其与所述查询的相关性及例如PageRank等任何其它因素来相应地对所述搜索结果进行排列,PageRank揭示于名称为“链接数据库中的节点排列方法(Method for node ranking in a linked database)的第6,285,999号美国专利中,该专利的全文及其全部教示内容以引用方式并入本文中。4) The base query (Q, empty) is the same as a search on Q performed on a search engine without any personalization applied, so R_base is the set of results that the search engine would normally return in response to the query . As mentioned above, the search results are ranked accordingly according to their relevance to the query and any other factors such as PageRank disclosed in the article titled "Method for node ranking in link database". 6,285,999 in a linked database), which is hereby incorporated by reference in its entirety and for its entire teachings.
5)个性化的查询(Q,IN)包含用户的兴趣列表。所述搜索引擎遵循一类似于常规查询的处理控制流程,但其在记分功能期间激活一个性化组件,以“提升”与所述兴趣IN集合相关的文档。5) The personalized query (Q, IN) contains the user's interest list. The search engine follows a process control flow similar to a regular query, but it activates a personalization component during the scoring function to "promote" documents relevant to the IN set of interest.
在本发明的一实施例中,使用一组下列形式的映射(所述目录中的每一主题/副主题各一个映射)来实施这些提升:In one embodiment of the invention, these promotions are implemented using a set of maps (one for each topic/subtopic in the catalog) of the form:
{站点->提升}{site->boost}
因而例如,类别“Health(健康)”具有如下形式的映射So for example, the category "Health" has a mapping of the form
{站点->提升}{site->boost}
其包含下列输入:It contains the following inputs:
nih.gov->5.8nih.gov->5.8
cdc.gov-->7.9cdc.gov-->7.9
med.stanford.edu->3.5med.stanford.edu->3.5
其中nih.gov,cdc.gov及med.Stanford.edu为已确定出(以手动或自动方式)与主题“Health”相关的各个站点。换句话说,对于所述目录中的每一主题,均存在一组已确定出与所述主题相关的站点,且对于这些站点中的每一站点,均界定一提升值。Among them, nih.gov, cdc.gov and med.Stanford.edu are the sites that have been determined (manually or automatically) related to the topic "Health". In other words, for each topic in the directory there is a set of sites that have been determined to be relevant to the topic, and for each of these sites a lift value is defined.
通常按下述方式来确定所述主题目录中所列出的站点的提升:The promotion of a site listed in the subject directory is generally determined as follows:
a)产生一“站点图”,其中所述图的各节点为站点(基本上为同一主机上的页面),并根据来自一个站点的链接至另一主机上的页面的的页面数量来对各节点之间的边缘进行加权。可使用此同一类型的图来计算所有主题提升映射。a) Create a "site graph" where the nodes of the graph are sites (essentially pages on the same host) and rank each Edges between nodes are weighted. All topic lift maps can be calculated using this same type of graph.
b)对于所述目录中的每一主题,例如“Health”,选择若干个其主页列示于OpenDirectory(开放式目录)中的站点作为“起始”站点S0。例如,对于一所大学,如Stanford,起始站点可选择成任一以.stanford.edu结尾的站点。b) For each topic in the directory, eg "Health", select several sites whose home pages are listed in OpenDirectory (Open Directory) as "starting" sites S0. For example, for a university such as Stanford, the starting site can be any site ending in .stanford.edu.
c)分两遍进行计算:c) Calculate in two passes:
i)首先,识别一组由S0中的那些站点所大量链接的站点S1,其中根据S1中的每一站点如何大量地被S0中的站点所链接来为其指配一权重。i) First, identify a set of sites S1 that are heavily linked by those sites in S0, where each site in S1 is assigned a weight according to how heavily linked by sites in S0.
ii)然后,识别那些由S1中的那些站点所大量链接的站点S2,并如同S1一样对其进行加权。ii) Then, identify those sites S2 that are heavily linked by those in S1 and weight them like S1.
d)以其所指配到的权重来提升S2中的站点。d) Promote the station in S2 by its assigned weight.
当个性化查询具有一兴趣集合(例如{Health(健康),Music(音乐)})时,则所述个性化组件查阅各自的映射,且随后对于处于这些映射中的查询结果(即文档),所述个性化组件提升这些查询结果的IR得分,以便其将获得更高的排列次序。所述IR得分为所述搜索引擎为每一文档提供的信息检索得分,并可随文档相关性及例如PageRank等其它因素而变化。When a personalized query has a collection of interests (e.g., {Health (health), Music (music)}), then the personalization component consults the respective mappings, and then for the query results (i.e., documents) in these mappings, The personalization component boosts the IR score of these query results so that it will get a higher rank. The IR score is the information retrieval score provided by the search engine for each document and may vary with document relevance and other factors such as PageRank.
例如,假如http://www.med.stanford.edu/research/为查询Q及包含“Health”的IN的结果之一,则因所述页面位于站点med.stanford.edu上,所述结果的IR得分将得到一3.5倍的倍增提升。对于多个兴趣而言,将各个对应映射中的值一起相乘以得到一组合效果。一种用于为所述目录提升映射中的站点确定所述提升值的实施方式的进一步细节阐述于第10/646,331号美国专利申请案中,该专利申请案以引用方式并入本文中。For example, if http://www.med.stanford.edu/research/ is one of the results for query Q and IN containing "Health", then since the page is on the site med.stanford.edu, the The IR score will get a 3.5x multiplier. For multiple interests, the values in each corresponding map are multiplied together to obtain a combined effect. Further details of an embodiment for determining the boost value for a site in the directory boost map are set forth in US patent application Ser. No. 10/646,331, which is incorporated herein by reference.
6)所述前端网络服务器接收R_base及R_in。R_base代表在个性化控制器608设定为0(或其最小位置612)处时用户所看到的结果602。R_in代表在所述滑标处于其最大位置614(即10)时用户所看到的结果602。在一实施例中,所述控制器具有11个位置,但也可使用任何数量的位置。对显示给用户的介于0与10之间的排列进行内插;这意味着,借助R_base及R_in,所述个性化组件通过使每一单独结果沿一自其在控制器608处于其最小位置时的位置至在控制器608处于其最大位置时的位置的虚拟路径移动来构造中间排列(R_1...R_9)。6) The front-end network server receives R_base and R_in. R_base represents the
7)如何向用户提供滑标效果:7) How to provide users with a sliding effect:
所述前端网络服务器将R_in中不处于R_base中的结果附加至R_base,以获得一称作R_all的关于所有唯一结果(在R_base或R_in中)的列表。然后,所述前端网络服务器为R_all中的结果指配连续的识别号以充当识别符。然后,所述前端网络服务器向用户的浏览器发送两个项:The front-end web server appends the results in R_in that are not in R_base to R_base to obtain a list of all unique results (either in R_base or R_in) called R_all. The front-end web server then assigns consecutive identification numbers to the results in R_all to serve as identifiers. The front-end web server then sends two items to the user's browser:
a)R_all中每一结果的映射{识别号->结果},对于每一文档其均在所述结果中包含其识别号;及a) a mapping {ID -> Result} for each result in R_all, which includes, for each document, its ID in the result; and
b)N个向量,其标记用户应看到(以结果识别号表示)的排列,其中N为滑标的位置数量,且每一向量均代表根据个性化程度的结果排序。在此实例中,将具有11个向量对应于控制器608的位置0至10。在一实施例中,向量0为[0,1,2,3,4,5,6,7,8,9,10],其反映对指配有以R_0开始的本地ID的结果的非个性化排列。向量10为完全个性化的结果R_in中的排序;举例而言,其可能是[15,17,5,0,1,2,3,4,8,9,10],其中文档15及17(例如)为其IR得分根据映射图中的提升值及与控制器608的位置相关联的值得到提升的主题目录的相关映射中的文档。将所述N个排列向量传输至客户端便会使所述客户端浏览器能够快速地重新计算所述排列,而不是使服务器承担所述计算。b) N vectors, which mark the arrangement that the user should see (represented by the result identification number), where N is the number of positions of the slider, and each vector represents the ranking of results according to the degree of personalization. In this example, there will be 11 vectors corresponding to positions 0-10 of the
8)最后,所述结果页面中的Javascript代码使用户的浏览器响应于用户改变控制器608的位置而使用所述11个位置排列向量中适当的一个来对所述搜索结果进行重新排序。上述步骤6-8的一实施例的进一步细节阐述于第10/641,489号美国申请案中,该申请案以引用方式并入本文中。8) Finally, the Javascript code in the results page causes the user's browser to reorder the search results using the appropriate one of the 11 position ordering vectors in response to the user changing the position of the
上文已参照一可能的实施例具体地详细阐述了本发明。所属领域的技术人员将了解,本发明也可实施为其它实施例。首先,对各组件的具体命名、大写的用语、属性、数据结构、或任何其它编程或结构方面均不是强制性的或有特殊意义的,且用于构建本发明或其特征的机理可具有不同的名称、格式或协议。此外,所述系统可如上文所述通过一硬件与软件的组合构建而成,或完全构建于硬件元件中。此外,本文中所述的在各种系统组件之间进行的特定的功能划分仅为实例性的,而不是强制性的;由单个系统组件所实施的功能也可由多个组件实施,而由多个组件所实施的功能也可改为由单个组件实施。The invention has been explained in detail above with reference to a possible embodiment. Those skilled in the art will appreciate that the invention can be practiced in other embodiments as well. First, the specific naming, capitalization of terms, attributes, data structures, or any other programming or structural aspects of each component are not mandatory or have special meaning, and the mechanisms used to construct the invention or its features may have different The name, format or protocol of the . Furthermore, the system may be implemented as described above by a combination of hardware and software, or entirely in hardware elements. In addition, the specific functional division among various system components described herein is only exemplary, not mandatory; functions implemented by a single system component may also be implemented by multiple components, and multiple components may be implemented by multiple components. Functions implemented by individual components may instead be implemented by a single component.
上述说明的某些部分以对信息的操作的算法及符号表示形式来提供本发明的特征。这些算法说明及表示形式是数据处理领域的技术人员用于最有效地将其工作的实质传达给所属领域的其它技术人员的手段。这些操作在以功能或逻辑方式加以阐述时应理解为由计算机程序实施。此外,还已证明,有时将这些操作安排称作模块或按功能名称来称谓是方便的,此并不失其一般性。Certain portions of the above description present features of the invention in terms of algorithms and symbolic representations of operations on information. These algorithmic descriptions and representations are the means used by those skilled in the data processing arts to most effectively convey the substance of their work to others skilled in the art. These operations, when described in a functional or logical manner, should be understood as being implemented by computer programs. Furthermore, it has also proven convenient at times, to refer to these arrangements of operations as modules or by functional names, without loss of generality.
除非根据上述说明显而易见地另有具体规定,否则应了解,在本说明的通篇中,利用诸如“计算”或“确定”或“识别”或类似用语进行的论述是指一计算机系统或类似电子计算装置所进行的步骤及过程,所述计算机系统或类似电子计算装置对在计算机系统存储器或寄存器或其它此类信息存储器、传输或显示装置中被表示成物理(电子)量的数据进行处理及变换。Unless it is obvious from the above description that it is specifically stated otherwise, it should be understood that throughout this specification, discussions using terms such as "calculate" or "determine" or "identify" or similar terms refer to a computer system or similar electronic steps and processes performed by a computing device that processes and processes data represented as physical (electronic) quantities in computer system memory or registers or other such information storage, transmission or display devices transform.
本发明的某些方面包括在本文中描述成算法形式的过程步骤及指令。应注意,本发明的过程步骤及指令可收录于软件、固件或硬件中,且当收录于软件中时,可下载驻存于实时网络操作系统所使用的不同平台上并可自所述不同平台运行。Certain aspects of the invention include process steps and instructions described herein in algorithmic form. It should be noted that the process steps and instructions of the present invention may be embodied in software, firmware, or hardware, and when embodied in software, may be downloaded, resident, and accessible from different platforms used by real-time network operating systems. run.
本发明还涉及一种用于实施本文中的操作的设备。此设备可针对所需用途专门构造,或者其可包括一通用计算机,所述通用计算机由一存储于一可由计算机存取的计算可读媒体上的计算机程序有选择地启动或重新配置。此种计算机程序可存储于一计算机可读存储媒体中,例如(但不限于)任何类型的磁盘,包括软盘、光盘、CD-ROM、磁-光盘、只读存储器(ROM)、随机存取存储器(RAM)、EPROM、EEPROM、磁卡或光卡,或适于存储电子指令并分别耦接至一计算机系统总线的任何类型的媒体。集成电路设计及视频编解码器领域的技术人员应了解,本发明可容易地制作于基于上述功能及结构说明的各种类型的集成电路(包括应用专用集成电路(ASIC))中。另外,本发明可包含于各种类型的视频编码装置内。The present invention also relates to an apparatus for carrying out the operations herein. This apparatus may be specially constructed for the required purposes, or it may comprise a general purpose computer selectively activated or reconfigured by a computer program stored on a computer-readable medium accessible by the computer. Such a computer program may be stored on a computer readable storage medium such as, but not limited to, any type of disk, including floppy disk, compact disk, CD-ROM, magneto-optical disk, read-only memory (ROM), random-access memory (RAM), EPROM, EEPROM, magnetic or optical card, or any type of media suitable for storing electronic instructions and respectively coupled to a computer system bus. Those skilled in the art of integrated circuit design and video codecs should understand that the present invention can be easily fabricated in various types of integrated circuits (including application specific integrated circuits (ASICs)) based on the above functional and structural descriptions. Additionally, the present invention can be included within various types of video encoding devices.
本文中所提供的算法及操作并非与任一特定计算机或其它设备内在地相关。各种通用系统也可与根据本文教示内容的程序一起使用,或者可证明能方便地构造更专门的设备来实施所需方法步骤。所属领域的技术人员将易知各种此类系统所需的结构以及等效变化形式。另外,本发明未参照任一特定编程语言加以阐述。应了解,可使用各种编程语言来实施本文中所述的本发明教示内容,且特定语言的任何参考均旨在揭示本发明的实现方式及最佳模式。The algorithms and operations presented herein are not inherently related to any particular computer or other device. Various general-purpose systems may also be used with programs in accordance with the teachings herein, or it may prove convenient to construct more specialized apparatus to carry out the required method steps. The required structure, and equivalent variations, for a variety of such systems will be readily apparent to those skilled in the art. In addition, the invention has not been described with reference to any particular programming language. It should be appreciated that a variety of programming languages can be used to implement the teachings of the invention described herein, and any reference to a specific language is intended to reveal the best mode and implementation of the invention.
最后,应注意,本说明书中所使用的语言原则上是出于可读性和指导性目的而选取,且可能不是选取用来描绘或限制本发明标的物。因此,本发明揭示内容旨在例示而非限制本发明的范围。Finally, it should be noted that the language used in this specification has been chosen for readability and instructional purposes in principle, and may not have been chosen to delineate or limit the inventive subject matter. Accordingly, the present disclosure is intended to be illustrative, not limiting, of the scope of the present invention.
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| US55778304P | 2004-03-29 | 2004-03-29 | |
| US60/557,783 | 2004-03-29 | ||
| US11/002,474 | 2004-12-01 |
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN2010100006457ADivisionCN101814079B (en) | 2004-03-29 | 2005-03-21 | Method and apparatus for variable personalization of search results |
| Publication Number | Publication Date |
|---|---|
| CN101080709Atrue CN101080709A (en) | 2007-11-28 |
| CN100593168C CN100593168C (en) | 2010-03-03 |
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN200580014339AExpired - Fee RelatedCN100593168C (en) | 2004-03-29 | 2005-03-21 | Method and apparatus for variable personalization of search results |
| Country | Link |
|---|---|
| CN (1) | CN100593168C (en) |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101853308A (en)* | 2010-06-11 | 2010-10-06 | 中兴通讯股份有限公司 | Method and application terminal for personalized meta-search |
| CN102687145A (en)* | 2009-12-28 | 2012-09-19 | 乐天株式会社 | Information searching apparatus, number deciding method, information searching program, information searching system and recording medium |
| CN101556614B (en)* | 2008-04-11 | 2012-10-17 | 富士通株式会社 | Conveniently display an interactive dynamic project cloud about one or more input projects |
| CN105205139A (en)* | 2015-09-17 | 2015-12-30 | 罗旭斌 | Personalized literature searching method |
| CN109299378A (en)* | 2018-10-26 | 2019-02-01 | Oppo广东移动通信有限公司 | Method, device, terminal and storage medium for displaying search results |
| CN113508248A (en)* | 2019-03-04 | 2021-10-15 | 利纳克有限公司 | Linear actuator system and method for setting and programming a power limit value of the actuator system |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| DE102020204619A1 (en)* | 2020-04-09 | 2021-10-14 | Siemens Aktiengesellschaft | Monitoring an electrical power transmission system |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US6421675B1 (en)* | 1998-03-16 | 2002-07-16 | S. L. I. Systems, Inc. | Search engine |
| US20020078045A1 (en)* | 2000-12-14 | 2002-06-20 | Rabindranath Dutta | System, method, and program for ranking search results using user category weighting |
| US20030018659A1 (en)* | 2001-03-14 | 2003-01-23 | Lingomotors, Inc. | Category-based selections in an information access environment |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| CN101556614B (en)* | 2008-04-11 | 2012-10-17 | 富士通株式会社 | Conveniently display an interactive dynamic project cloud about one or more input projects |
| CN102687145A (en)* | 2009-12-28 | 2012-09-19 | 乐天株式会社 | Information searching apparatus, number deciding method, information searching program, information searching system and recording medium |
| CN102687145B (en)* | 2009-12-28 | 2015-05-20 | 乐天株式会社 | Information retrieval device, method for determining number of items, information retrieval program, information retrieval system, and recording medium |
| CN101853308A (en)* | 2010-06-11 | 2010-10-06 | 中兴通讯股份有限公司 | Method and application terminal for personalized meta-search |
| US8898155B2 (en) | 2010-06-11 | 2014-11-25 | Zte Corporation | Personalized meta-search method and application terminal thereof |
| CN105205139A (en)* | 2015-09-17 | 2015-12-30 | 罗旭斌 | Personalized literature searching method |
| CN105205139B (en)* | 2015-09-17 | 2019-06-14 | 罗旭斌 | A kind of personalization document retrieval method |
| CN109299378A (en)* | 2018-10-26 | 2019-02-01 | Oppo广东移动通信有限公司 | Method, device, terminal and storage medium for displaying search results |
| CN113508248A (en)* | 2019-03-04 | 2021-10-15 | 利纳克有限公司 | Linear actuator system and method for setting and programming a power limit value of the actuator system |
| CN113508248B (en)* | 2019-03-04 | 2023-12-22 | 利纳克有限公司 | Linear actuator system and method for setting and programming a power limit value of the actuator system |
| Publication number | Publication date |
|---|---|
| CN100593168C (en) | 2010-03-03 |
| Publication | Publication Date | Title |
|---|---|---|
| CN101814079B (en) | Method and apparatus for variable personalization of search results | |
| US10929487B1 (en) | Customization of search results for search queries received from third party sites | |
| TWI452476B (en) | Method,computer readable storage media,and system for relevancy sorting of user's browser history | |
| US7707208B2 (en) | Identifying sight for a location | |
| US7657504B2 (en) | User interface for displaying images of sights | |
| US7917464B2 (en) | Geotext searching and displaying results | |
| US8341143B1 (en) | Multi-category searching | |
| CN101405734A (en) | Automated tool for human-assisted excavation and capturing of accurate results | |
| WO2004013776A2 (en) | Desktop client interaction with a geographic text search system | |
| KR20080091821A (en) | Automated tool for human aid mining and capturing detailed results | |
| CN100593168C (en) | Method and apparatus for variable personalization of search results |
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| C14 | Grant of patent or utility model | ||
| GR01 | Patent grant | ||
| CP01 | Change in the name or title of a patent holder | Address after:California, USA Patentee after:Google Inc. Address before:California, USA Patentee before:Google Inc. | |
| CP01 | Change in the name or title of a patent holder | ||
| CF01 | Termination of patent right due to non-payment of annual fee | Granted publication date:20100303 | |
| CF01 | Termination of patent right due to non-payment of annual fee |