技术领域technical field
本发明涉及互联网搜索技术领域,尤其涉及一种搜索结果中图片网页自定义摘要的显示方法和系统。The invention relates to the technical field of Internet search, in particular to a method and system for displaying a self-defined abstract of a picture webpage in search results.
背景技术Background technique
随着国内互联网技术的迅速发展,如何在互联网上快速准确的找到自己所需要的信息,成为互联网搜索中的一个关键问题。With the rapid development of domestic Internet technology, how to quickly and accurately find the information you need on the Internet has become a key issue in Internet search.
在现有的搜索结果页显示中,一般是按照关键词与服务器中存储的关键词的匹配程度来进行排序显示,或者通过统计用户对该关键词的网页的历史点击率进行排序显示。但是,在通过上述排序得到的搜索结果页中,如果仅仅呈现的是各个搜索结果项以及搜索结果项的相关简介,需要用户一一查看搜索结果项的简介才能找到感兴趣的搜索内容,因而就有了在搜索结果列表中显示网页摘要的需求。In the display of existing search result pages, it is generally sorted and displayed according to the degree of matching between the keyword and the keyword stored in the server, or sorted and displayed by counting the historical click-through rate of the web pages of the keyword. However, in the search result page obtained by the above sorting, if only each search result item and the relevant brief introduction of the search result item are presented, the user needs to view the brief introductions of the search result item one by one to find the search content of interest. There is a need to display web page snippets in the search result list.
现有的网页摘要中,一种是根据网页的主题为中心生成的摘要,这种方法虽然很好地表达了网页中内容的意思,但是可以并不是用户需要的信息;一种是根据用户的关键词,从网页中提取与关键词最相关的部分作为摘要,但是很多时候关键词并不能体现用户的查询需求,因此该摘要也很难包括用户需要的信息。Among the existing webpage summaries, one is based on the theme of the webpage as the center. Although this method expresses the meaning of the content in the webpage well, it may not be the information that the user needs; the other is based on the user's Keywords, extract the most relevant part of the keyword from the webpage as a summary, but in many cases keywords cannot reflect the user's query needs, so it is difficult for the summary to include the information that the user needs.
因此,如何提供一种摘要生成方法,使得摘要能够满足用户的查询需求,并尽可能多的提供相关信息,成为当前需要解决的问题之一。Therefore, how to provide a summary generation method so that the summary can meet the user's query requirements and provide as much relevant information as possible has become one of the current problems to be solved.
发明内容Contents of the invention
鉴于上述问题,提出了本发明以便提供一种克服上述问题或者至少部分地解决或者减缓上述问题的一种搜索结果中图片网页自定义摘要的显示方法及系统。In view of the above problems, the present invention is proposed to provide a method and system for displaying custom abstracts of image webpages in search results that overcome the above problems or at least partially solve or alleviate the above problems.
根据本发明的一个方面,提供一种搜索结果中图片网页自定义摘要的显示方法,包括:According to one aspect of the present invention, there is provided a method for displaying a custom abstract of an image webpage in search results, including:
获取待生成自定义摘要的网页源代码数据;Obtain the source code data of the webpage to generate the custom summary;
提取所述网页源代码数据中第一meta标签的type字段,确定网页类型是否为image;Extract the type field of the first meta tag in the webpage source code data, and determine whether the webpage type is image;
如果所述网页类型是image时,进一步提取其余meta标签中的一个或多个图片信息数据字段;If the webpage type is image, further extract one or more image information data fields in the remaining meta tags;
基于所述一个或多个图片信息数据字段构建自定义摘要,并在搜索结果中显示。Build a custom summary based on the one or more image information data fields, and display it in the search results.
优选地,所述在获取待生成自定义摘要的网页源代码数据之前,还包括:Preferably, before obtaining the source code data of the webpage to generate the custom abstract, it also includes:
通过第三方在所述网页源代码数据的头部部分添加以meta标签标识的自定义数据字段。A custom data field identified by a meta tag is added to the head part of the webpage source code data by a third party.
其中,所述提取其余meta标签中的一个或多个图片信息数据字段,进一步包括:Wherein, said extracting one or more picture information data fields in the remaining meta tags further includes:
根据字段标识信息从所述网页源代码数据中识别图片的URL字段。The URL field of the picture is identified from the webpage source code data according to the field identification information.
其中,所述基于所述一个或多个图片信息数据字段构建自定义摘要,进一步包括:Wherein, said constructing a custom summary based on said one or more picture information data fields further includes:
根据所述字段标识信息将所述一个或多个图片信息数据字段的内容嵌入在与上述字段标识信息对应的预设区域,构建成自定义摘要。Embedding the content of the one or more picture information data fields in a preset area corresponding to the field identification information according to the field identification information to construct a custom summary.
优选地,所述在获取待生成自定义摘要的网页源代码数据之前,还包括:Preferably, before obtaining the source code data of the webpage to generate the custom abstract, it also includes:
确定所述图片信息数据字段中每个字段的重要度;以及determining the importance of each field in the picture information data fields; and
所述基于所述一个或多个图片信息数据字段构建自定义摘要,并在搜索结果中显示,进一步包括:根据所述重要度在搜索结果列表中显示所述图片信息数据字段。The constructing a custom summary based on the one or more picture information data fields and displaying it in the search results further includes: displaying the picture information data fields in the search result list according to the importance.
其中,所述基于所述一个或多个图片信息数据字段构建自定义摘要并在搜索结果中显示,具体为:Wherein, the self-defined abstract is constructed based on the one or more picture information data fields and displayed in the search results, specifically:
在搜索结果列表中的预定区域内显示所述自定义摘要。The custom summary is displayed in a predetermined area in the search result list.
其中,所述基于所述一个或多个图片信息数据字段构建自定义摘要并在搜索结果中显示,具体为:Wherein, the self-defined abstract is constructed based on the one or more picture information data fields and displayed in the search results, specifically:
在搜索结果列表中的预定区域的左侧显示图片。Displays pictures to the left of a predetermined area in the search results list.
优选地,所述基于所述一个或多个图片信息数据字段构建自定义摘要并在搜索结果中显示,具体为:Preferably, said constructing a custom abstract based on said one or more image information data fields and displaying it in the search results is specifically:
根据预定版面设计将各个图片信息数据字段在预定位置进行显示。Each image information data field is displayed at a predetermined position according to a predetermined layout design.
其中,所述第三方为网站管理者或者网页提供者。Wherein, the third party is a website manager or a web page provider.
根据本发明的另一个方面,提供一种搜索结果中图片网页自定义摘要的显示系统,包括:According to another aspect of the present invention, there is provided a system for displaying custom summaries of image webpages in search results, including:
网页源代码数据获取单元,用于获取待生成自定义摘要的网页源代码数据;A webpage source code data acquisition unit, configured to acquire webpage source code data to be generated from a custom summary;
网页类型判断单元,用于提取所述网页源代码数据中第一meta标签的type字段,确定网页类型是否为image;A webpage type judging unit, used to extract the type field of the first meta tag in the webpage source code data, to determine whether the webpage type is image;
图片信息数据字段提取单元,用于当所述网页类型是image时,进一步提取其余meta标签中的一个或多个图片信息数据字段;Image information data field extraction unit, used for further extracting one or more image information data fields in the remaining meta tags when the webpage type is image;
自定义摘要生成单元,用于基于所述一个或多个图片信息数据字段构建自定义摘要,并在搜索结果中显示。A custom abstract generation unit, configured to construct a custom abstract based on the one or more image information data fields, and display it in the search results.
优选地,所述系统还包括:Preferably, the system also includes:
数据字段添加单元,用于通过第三方在所述网页源代码数据的头部部分添加以meta标签标识的自定义数据字段。The data field adding unit is used to add a custom data field identified by a meta tag to the head part of the web page source code data through a third party.
其中,所述图片信息数据字段提取单元包括:URL字段提取子单元,用于根据字段标识信息从所述网页源代码数据中识别图片的URL字段。Wherein, the picture information data field extraction unit includes: a URL field extraction subunit, configured to identify the URL field of the picture from the webpage source code data according to the field identification information.
其中,所述自定义摘要生成单元根据所述字段标识信息将所述一个或多个图片信息数据字段的内容嵌入在与上述字段标识信息对应的预设区域,构建成自定义摘要。Wherein, the self-defined abstract generation unit embeds the content of the one or more picture information data fields in a preset area corresponding to the above-mentioned field identification information according to the field identification information to construct a self-defined abstract.
优选地,所述系统还包括:Preferably, the system also includes:
重要度确定单元,用于确定所述图片信息数据字段中每个字段的重要度。An importance determining unit, configured to determine the importance of each field in the picture information data fields.
其中,所述自定义摘要生成单元在搜索结果列表中的预定区域内显示所述自定义摘要。Wherein, the custom abstract generation unit displays the custom abstract in a predetermined area in the search result list.
其中,所述自定义摘要生成单元在搜索结果列表中的预定区域的左侧显示图片。Wherein, the custom summary generating unit displays pictures on the left side of a predetermined area in the search result list.
优选地,所述自定义摘要生成单元根据预定版面设计将各个图片信息数据字段在预定位置进行显示。Preferably, the self-defined abstract generation unit displays each image information data field at a predetermined position according to a predetermined layout.
其中,所述第三方为网站管理者或者网页提供者。Wherein, the third party is a website manager or a web page provider.
本发明所述的搜索结果中图片网页自定义摘要的显示方法及系统,通过在网页源代码数据的头部部分添加辅助标签,从而可以识别网页源代码数据中的字段数据类型,同时通过对数据字段进行分类并嵌入在预定的位置,可以使生成的摘要能够通过预先的设定显示需要的内容,并可以通过设置不同字段数据的重要度来选择显示的数据内容以及顺序,同时,本发明的摘要可以将显示的数据通过图文并茂的形式显示出来,从而使得摘要信息更加全面丰富。The method and system for displaying the self-defined summary of picture web pages in the search results according to the present invention can identify the field data types in the web page source code data by adding auxiliary tags to the head part of the web page source code data, and at the same time through the data The fields are classified and embedded in the predetermined position, so that the generated summary can display the required content through the preset setting, and the displayed data content and order can be selected by setting the importance of different field data. At the same time, the present invention The summary can display the displayed data in the form of pictures and texts, so that the summary information is more comprehensive and rich.
上述说明仅是本发明技术方案的概述,为了能够更清楚了解本发明的技术手段,而可依照说明书的内容予以实施,并且为了让本发明的上述和其它目的、特征和优点能够更明显易懂,以下特举本发明的具体实施方式。The above description is only an overview of the technical solution of the present invention. In order to better understand the technical means of the present invention, it can be implemented according to the contents of the description, and in order to make the above and other purposes, features and advantages of the present invention more obvious and understandable , the specific embodiments of the present invention are enumerated below.
附图说明Description of drawings
通过阅读下文优选实施方式的详细描述,各种其他的优点和益处对于本领域普通技术人员将变得清楚明了。附图仅用于示出优选实施方式的目的,而并不认为是对本发明的限制。而且在整个附图中,用相同的参考符号表示相同的部件。在附图中:Various other advantages and benefits will become apparent to those of ordinary skill in the art upon reading the following detailed description of the preferred embodiment. The drawings are only for the purpose of illustrating a preferred embodiment and are not to be considered as limiting the invention. Also throughout the drawings, the same reference numerals are used to designate the same components. In the attached picture:
图1示意性地示出了本发明实施例的搜索结果中图片网页自定义摘要的显示方法的流程图。Fig. 1 schematically shows a flowchart of a method for displaying a custom summary of a picture web page in a search result according to an embodiment of the present invention.
图2示意性地示出了本发明实施例的搜索结果中图片网页自定义摘要的显示系统的流程图。FIG. 2 schematically shows a flow chart of a system for displaying custom abstracts of picture webpages in search results according to an embodiment of the present invention.
具体实施方式detailed description
下面详细描述本发明的实施例,所述实施例的示例在附图中示出,其中自始至终相同或类似的标号表示相同或类似的元件或具有相同或类似功能的元件。下面通过参考附图描述的实施例是示例性的,仅用于解释本发明,而不能解释为对本发明的限制。Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary only for explaining the present invention and should not be construed as limiting the present invention.
本技术领域技术人员可以理解,除非另外定义,这里使用的所有术语(包括技术术语和科学术语),具有与本发明所属领域中的普通技术人员的一般理解相同的意义。还应该理解的是,诸如通用字典中定义的那些术语,应该被理解为具有与现有技术的上下文中的意义一致的意义,并且除非像这里一样被特定定义,否则不会用理想化或过于正式的含义来解释。Those skilled in the art can understand that, unless otherwise defined, all terms (including technical terms and scientific terms) used herein have the same meaning as commonly understood by those of ordinary skill in the art to which this invention belongs. It should also be understood that terms, such as those defined in commonly used dictionaries, should be understood to have meanings consistent with their meaning in the context of the prior art, and unless specifically defined as herein, are not intended to be idealized or overly Formal meaning to explain.
在本发明的一个实施例中,提供了一种一种搜索结果中图片网页自定义摘要的显示方法。In one embodiment of the present invention, a method for displaying a custom summary of a picture web page in a search result is provided.
图1示意性地示出了本发明实施例的搜索结果中图片网页自定义摘要的显示方法的流程图。Fig. 1 schematically shows a flowchart of a method for displaying a custom summary of a picture web page in a search result according to an embodiment of the present invention.
参照图1,本发明实施例的搜索结果中图片网页自定义摘要的显示方法包括步骤:With reference to Fig. 1, the display method of the self-defined abstract of the picture webpage in the search result of the embodiment of the present invention comprises the steps:
S1、通过第三方在所述网页源代码数据的头部部分添加以meta标签标识的自定义数据字段,第三方可以根据该图片网页的需要添加对应的字段,可以提供网页内容的准确度。S1. Through the third party adding a custom data field identified by a meta tag to the head part of the webpage source code data, the third party can add corresponding fields according to the needs of the image webpage, which can provide the accuracy of the webpage content.
在本发明的实施例中,添加的自定义数据字段以meta进行标识,从而提高了数据字段的可读性。同时,为了在提取数据字段时可以选择需要数据字段,确定添加的每个每个数据字段的重要度。In the embodiment of the present invention, the added custom data field is identified by meta, thereby improving the readability of the data field. At the same time, in order to select the required data fields when extracting the data fields, the importance of each added data field is determined.
在网页源代码中添加数据字段的代码示例如下:A code example for adding a data field in the source code of a webpage is as follows:
<metaproperty="og:type"content="image"/><metaproperty="og:type"content="image"/>
<metaproperty="og:image”content=”http://e.com/e.GIF"/><metaproperty="og:image"content="http://e.com/e.GIF"/>
在上述代码中,metaproperty为辅助标签,用于对各个字段进行标识;og:type标识网页的类型,image表示当前的网页为图片网页;og:image表示图片的地址,在网页中,图片的地址可以有多个,因此该字段可以有多个。In the above code, metaproperty is an auxiliary tag used to identify each field; og:type identifies the type of webpage, image indicates that the current webpage is a picture webpage; og:image indicates the address of the picture, and in the webpage, the address of the picture There can be more than one, so this field can have more than one.
上述图片类数据字段的代码中,meta是html程序语言head区的一个辅助标签,用于提高html程序语言的可读性,property表明权限为私有权限,即网站管理者或者网页提供者可以对网页进行自定义,提高了生成网页摘要的灵活性与适用性。In the code of the image data field above, meta is an auxiliary tag in the head area of the html programming language, which is used to improve the readability of the html programming language, and property indicates that the authority is a private authority, that is, the website administrator or webpage provider can modify Customization improves the flexibility and applicability of generating web page snippets.
当然,本发明并不限定于必须使用meta标签,或必须使用property来标注结构化字段,本领域技术人员可以使用其他的标签或字段标识也能实现生成自定义摘要的技术方案,当然也落在本发明的保护范围之内。Of course, the present invention is not limited to the need to use meta tags, or must use property to label structured fields. Those skilled in the art can also use other tags or field identifiers to realize the technical solution of generating custom summaries, which of course also falls within within the protection scope of the present invention.
S2、获取待生成自定义摘要的网页源代码数据;S2. Obtaining the source code data of the webpage to be generated with a custom abstract;
S3、提取所述网页源代码数据中第一meta标签的type字段,确定网页类型是否为image;S3, extracting the type field of the first meta tag in the webpage source code data, and determining whether the webpage type is image;
在上述实施例中,源代码中第一标签是<metaproperty="og:type"content="image"/>,从该标签中可以看出,该网页类型为图片网页,因此,通过第一个meta标签的type字段,可以判断提取的网页源代码数据是否为图片类型的数据。In the above embodiment, the first tag in the source code is <metaproperty="og:type" content="image"/>, it can be seen from this tag that the type of the webpage is an image webpage, therefore, through the first The type field of the meta tag can determine whether the extracted web page source code data is image type data.
在生成自定义摘要时,为了判断该网页是否是图片网页,因此,网页源代码数据中的第一meta标签的type字段,是必须提取的数据字段,通过提取该网页类型字段,可以判断该网页是否为图片类网页。When generating a custom summary, in order to determine whether the webpage is a picture webpage, the type field of the first meta tag in the webpage source code data must be extracted. By extracting the webpage type field, the webpage can be judged Whether it is an image page.
S4、如果所述网页类型是image时,进一步提取其余meta标签中的一个或多个图片信息数据字段;S4. If the webpage type is image, further extract one or more image information data fields in the remaining meta tags;
在生成自定义摘要时,在图片网页类的源代码数据中提取需要的图片信息数据字段数据,在上述实施例的字段中,可以根据字段标识信息从网页源代码数据中提取图片的URL字段,以及其他的如对图盘进行描述的字段以及图片类型字段等。When generating a custom summary, extract the required picture information data field data in the source code data of the picture web page class, in the fields of the above-mentioned embodiments, the URL field of the picture can be extracted from the web page source code data according to the field identification information, And other fields such as the description of the drawing board and the picture type field.
S5、基于所述一个或多个图片信息数据字段构建自定义摘要,并在搜索结果中显示。S5. Construct a custom abstract based on the one or more image information data fields, and display it in the search result.
在构件自定义摘要时,根据所述字段标识信息将提取的一个或多个图片信息数据字段的内容嵌入在与上述字段标识信息对应的预设区域,构建成自定义摘要。When constructing a custom abstract, the content of one or more extracted picture information data fields is embedded in a preset area corresponding to the above-mentioned field identification information according to the field identification information to construct a custom abstract.
另外,由于在源代码数据中确定了每个字段的重要度,因此在显示图片信息数据字段时,根据重要度在搜索结果列表中显示所述图片信息数据字段。In addition, since the importance of each field is determined in the source code data, when the image information data field is displayed, the image information data field is displayed in the search result list according to the importance.
在另一个实施例中,在显示自定义摘要时,在搜索结果列表中的预定区域内显示该自定义摘要。In another embodiment, when the custom summary is displayed, the custom summary is displayed in a predetermined area in the search result list.
在搜索结果中显示该自定义摘要时,如果自定义摘要中包括了该图片网页的缩略图、图片的地址以及相关的描述信息等图盘信息数据字段,则在搜索结果列表中的预定区域的一侧显示图片,如在左侧显示图片。When the custom summary is displayed in the search results, if the custom summary includes the thumbnail image of the picture web page, the address of the picture, and related description information and other disk information data fields, then in the predetermined area of the search result list Displaying pictures on one side, such as displaying pictures on the left.
另外,可以通过第三方自适应地设计显示版面,根据预定版面设计将各个图片信息数据字段在预定位置进行显示。In addition, the display layout can be adaptively designed by a third party, and each image information data field can be displayed at a predetermined position according to the predetermined layout design.
在上述实施例中,可以在网页中不同的数据字段中增加重要度的标识字段。In the foregoing embodiment, an identification field of importance may be added to different data fields in the webpage.
进一步地,根据不同重要度的数据字段与用户搜索查询词的相关性,来确定对应自定义摘要数据嵌入在搜索结果列表中的排序位置。如果重要度高的数据字段与用户搜索词相关,则确定对应自定义摘要数据整体与用户搜索词相关度高,嵌入在搜索结果列表中的优先位置;如果重要度低的字段与用户搜索词相关,则确定对应自定义摘要数据整体与用户搜索词相关度低,嵌入在搜索结果列表中的非优先位置。Further, according to the correlation between the data fields of different importance and the user's search query words, the sorting position of the corresponding custom abstract data embedded in the search result list is determined. If the data field with high importance is related to the user's search term, it is determined that the corresponding custom summary data is highly related to the user's search term as a whole, and is embedded in the search result list in the priority position; if the field with low importance is related to the user's search term , it is determined that the corresponding custom summary data has a low correlation with the user's search term as a whole and is embedded in a non-preferred position in the search result list.
进一步地,当用户通过移动终端输入搜索查询词时,查找是否存在与所述搜索查询词匹配的网页自定义摘要数据,如果存在,优先返回自定义摘要数据给移动终端;Further, when the user inputs a search query word through the mobile terminal, it is searched whether there is web page custom summary data matching the search query word, and if it exists, the custom summary data is preferentially returned to the mobile terminal;
进一步地,所述优先返回自定义摘要数据给移动终端,基于移动终端的屏幕尺寸绘制基于所述接自定义摘要数据的整屏交互页面并返回给移动终端进行显示。Further, the user-defined abstract data is returned to the mobile terminal first, and a full-screen interactive page based on the user-defined abstract data is drawn based on the screen size of the mobile terminal and returned to the mobile terminal for display.
进一步地,返回给移动终端的整屏交互页面不仅提供摘要数据信息,而且提供一个或多个可供用户交互的操作对象。Further, the full-screen interactive page returned to the mobile terminal not only provides summary data information, but also provides one or more operation objects available for user interaction.
进一步地,所述操作对象也通过自定义标签来标识,生成自定义摘要中嵌入可调用所述操作对象的脚本控件。基于此,更增加了自定义摘要提供的交互性功能,使得摘要数据更加智能化。Further, the operation object is also identified by a custom label, and a script control that can call the operation object is embedded in the generated custom summary. Based on this, the interactive function provided by the custom summary is added to make the summary data more intelligent.
通过上述方法生成的图片网页自定义摘要,通过在网页源代码中添加辅助标签将每个图片数据字段数据进行标识,然后选择需要的字段数据生成自定义摘要,同时,在将摘要进行显示时,可以根据摘要中的字段数据自适应地设置每个字段的相应位置,从而可以在搜索结果中将该图片类网页的重要内容结构化地展现出来。For the custom summary of the picture webpage generated by the above method, add auxiliary tags to the source code of the webpage to identify the field data of each picture data, and then select the required field data to generate a custom summary. At the same time, when the summary is displayed, The corresponding position of each field can be adaptively set according to the field data in the abstract, so that the important content of the image web page can be displayed in a structured manner in the search results.
在本发明的另一个实施例中,提供一种搜索结果中图片网页自定义摘要的显示系统。In another embodiment of the present invention, a system for displaying custom abstracts of image webpages in search results is provided.
图2示意性地示出了本发明实施例的搜索结果中图片网页自定义摘要的显示系统的结构图。Fig. 2 schematically shows a structural diagram of a system for displaying custom abstracts of picture webpages in search results according to an embodiment of the present invention.
参照图2,本发明实施例的搜索结果中图片网页自定义摘要的显示系统包括:With reference to Fig. 2, the display system of the self-defined abstract of the picture webpage in the search result of the embodiment of the present invention comprises:
网页源代码数据获取单元10,用于获取待生成自定义摘要的网页源代码数据;Webpage source code data acquisition unit 10, used to acquire the webpage source code data to be generated from the custom summary;
网页类型判断单元20,用于提取所述网页源代码数据中第一meta标签的type字段,确定网页类型是否为image;The web page type judging unit 20 is used to extract the type field of the first meta tag in the web page source code data to determine whether the web page type is image;
图片信息数据字段提取单元30,用于当所述网页类型是image时,进一步提取其余meta标签中的一个或多个图片信息数据字段;Picture information data field extracting unit 30, for when described webpage type is image, further extract one or more picture information data fields in the remaining meta tags;
自定义摘要生成单元40,用于基于所述一个或多个图片信息数据字段构建自定义摘要,并在搜索结果中显示。A custom abstract generating unit 40, configured to construct a custom abstract based on the one or more image information data fields, and display it in the search results.
在另一个实施例中,该系统还包括:In another embodiment, the system also includes:
数据字段添加单元,用于通过第三方在所述网页源代码数据的头部部分添加以meta标签标识的自定义数据字段。The data field adding unit is used to add a custom data field identified by a meta tag to the head part of the web page source code data through a third party.
另外,为了在提取字段数据时便于选择需要的字段数据,该系统还包括重要度确定单元,用于确定所述图片信息数据字段中每个字段的重要度。如网页类型是用于判断该网页是否是图片类网页,因此该数据字段为必须字段,而图片的地址是用于链接该图片的地址,因此属于图片类网页中的重要数据字段,在选择数据字段时,可以根据重要度选择数据字段,避免选择的数据字段的重要性不高,造成自定义摘要的准确度降低。In addition, in order to facilitate selection of required field data when extracting field data, the system further includes an importance determination unit, configured to determine the importance of each field in the picture information data fields. For example, the webpage type is used to judge whether the webpage is a picture webpage, so this data field is a required field, and the address of the picture is the address used to link the picture, so it is an important data field in the picture webpage. When selecting a field, you can select the data field according to the importance, so as to avoid the low importance of the selected data field, resulting in a decrease in the accuracy of the custom summary.
另外,图片信息数据字段提取单元30可以包括URL字段提取子单元,用于根据字段标识信息从所述网页源代码数据中识别图片的URL字段。In addition, the picture information data field extraction unit 30 may include a URL field extraction subunit, configured to identify the URL field of the picture from the webpage source code data according to the field identification information.
在构建自定义摘要时,自定义摘要生成单元40根据所述字段标识信息将所述一个或多个图片信息数据字段的内容嵌入在与上述字段标识信息对应的预设区域,构建成自定义摘要。When constructing a custom abstract, the custom abstract generation unit 40 embeds the content of the one or more picture information data fields in the preset area corresponding to the above-mentioned field identification information according to the field identification information, and constructs a custom abstract .
在显示自定义摘要时,自定义摘要生成单元40在搜索结果列表中的预定区域内显示该自定义摘要。When displaying the custom abstract, the custom abstract generation unit 40 displays the custom abstract in a predetermined area in the search result list.
另外,在搜索结果中显示该自定义摘要时,如果自定义摘要中包括了该图片网页的缩略图、图片的地址以及相关的描述信息等图盘信息数据字段,则自定义摘要生成单元40在搜索结果列表中的预定区域的左侧显示图片。In addition, when the custom abstract is displayed in the search results, if the custom abstract includes thumbnails of the picture web page, the address of the picture, and related description information and other disk information data fields, then the custom abstract generating unit 40 The image is displayed to the left of the predetermined area in the search results list.
另一个实施例中,可以通过第三方自适应地设计显示版面,自定义摘要生成单元40根据预定版面设计将各个图片信息数据字段在预定位置进行显示。In another embodiment, the display layout can be adaptively designed by a third party, and the custom abstract generation unit 40 displays each image information data field at a predetermined position according to a predetermined layout design.
在上述实施例中,第三方为网站管理者或者网页提供者。In the above embodiments, the third party is a website manager or a web page provider.
本发明所述的搜索结果中图片网页自定义摘要的显示方法及系统,通过在网页源代码数据的头部部分添加辅助标签,从而可以识别网页源代码数据中的字段数据类型,同时通过对数据字段进行分类并嵌入在预定的位置,可以使生成的摘要能够通过预先的设定显示需要的内容,并可以通过设置不同字段数据的重要度来选择显示的数据内容以及顺序,同时,本发明的摘要可以将显示的数据通过图文并茂的形式显示出来,从而使得摘要信息更加全面丰富。The method and system for displaying the self-defined summary of picture web pages in the search results according to the present invention can identify the field data types in the web page source code data by adding auxiliary tags to the head part of the web page source code data, and at the same time, by The fields are classified and embedded in the predetermined position, so that the generated summary can display the required content through the preset setting, and the displayed data content and order can be selected by setting the importance of different field data. At the same time, the present invention The summary can display the displayed data in the form of pictures and texts, so that the summary information is more comprehensive and rich.
需要说明的是:It should be noted:
在此提供的算法和显示不与任何特定计算机、虚拟系统或者其它设备固有相关。各种通用系统也可以与基于在此的示教一起使用。根据上面的描述,构造这类系统所要求的结构是显而易见的。此外,本发明也不针对任何特定编程语言。应当明白,可以利用各种编程语言实现在此描述的本发明的内容,并且上面对特定语言所做的描述是为了披露本发明的最佳实施方式。The algorithms and displays presented herein are not inherently related to any particular computer, virtual system, or other device. Various generic systems can also be used with the teachings based on this. The structure required to construct such a system is apparent from the above description. Furthermore, the present invention is not specific to any particular programming language. It should be understood that various programming languages can be used to implement the content of the present invention described herein, and the above description of specific languages is for disclosing the best mode of the present invention.
在此处所提供的说明书中,说明了大量具体细节。然而,能够理解,本发明的实施例可以在没有这些具体细节的情况下实践。在一些实例中,并未详细示出公知的方法、结构和技术,以便不模糊对本说明书的理解。In the description provided herein, numerous specific details are set forth. However, it is understood that embodiments of the invention may be practiced without these specific details. In some instances, well-known methods, structures and techniques have not been shown in detail in order not to obscure the understanding of this description.
类似地,应当理解,为了精简本公开并帮助理解各个发明方面中的一个或多个,在上面对本发明的示例性实施例的描述中,本发明的各个特征有时被一起分组到单个实施例、图、或者对其的描述中。然而,并不应将该公开的方法解释成反映如下意图:即所要求保护的本发明要求比在每个权利要求中所明确记载的特征更多的特征。更确切地说,如下面的权利要求书所反映的那样,发明方面在于少于前面公开的单个实施例的所有特征。因此,遵循具体实施方式的权利要求书由此明确地并入该具体实施方式,其中每个权利要求本身都作为本发明的单独实施例。Similarly, it should be appreciated that in the foregoing description of exemplary embodiments of the invention, in order to streamline this disclosure and to facilitate an understanding of one or more of the various inventive aspects, various features of the invention are sometimes grouped together in a single embodiment, figure, or its description. This method of disclosure, however, is not to be interpreted as reflecting an intention that the claimed invention requires more features than are expressly recited in each claim. Rather, as the following claims reflect, inventive aspects lie in less than all features of a single foregoing disclosed embodiment. Thus, the claims following the Detailed Description are hereby expressly incorporated into this Detailed Description, with each claim standing on its own as a separate embodiment of this invention.
本领域那些技术人员可以理解,可以对实施例中的设备中的模块进行自适应性地改变并且把它们设置在与该实施例不同的一个或多个设备中。可以把实施例中的模块或单元或组件组合成一个模块或单元或组件,以及此外可以把它们分成多个子模块或子单元或子组件。除了这样的特征和/或过程或者单元中的至少一些是相互排斥之外,可以采用任何组合对本说明书(包括伴随的权利要求、摘要和附图)中公开的所有特征以及如此公开的任何方法或者设备的所有过程或单元进行组合。除非另外明确陈述,本说明书(包括伴随的权利要求、摘要和附图)中公开的每个特征可以由提供相同、等同或相似目的的替代特征来代替。Those skilled in the art can understand that the modules in the device in the embodiment can be adaptively changed and arranged in one or more devices different from the embodiment. Modules or units or components in the embodiments may be combined into one module or unit or component, and furthermore may be divided into a plurality of sub-modules or sub-units or sub-assemblies. All features disclosed in this specification (including accompanying claims, abstract and drawings) and any method or method so disclosed may be used in any combination, except that at least some of such features and/or processes or units are mutually exclusive. All processes or units of equipment are combined. Each feature disclosed in this specification (including accompanying claims, abstract and drawings) may be replaced by alternative features serving the same, equivalent or similar purpose, unless expressly stated otherwise.
此外,本领域的技术人员能够理解,尽管在此所述的一些实施例包括其它实施例中所包括的某些特征而不是其它特征,但是不同实施例的特征的组合意味着处于本发明的范围之内并且形成不同的实施例。例如,在下面的权利要求书中,所要求保护的实施例的任意之一都可以以任意的组合方式来使用。Furthermore, those skilled in the art will understand that although some embodiments described herein include some features included in other embodiments but not others, combinations of features from different embodiments are meant to be within the scope of the invention. and form different embodiments. For example, in the following claims, any of the claimed embodiments may be used in any combination.
本发明的各个部件实施例可以以硬件实现,或者以在一个或者多个处理器上运行的软件模块实现,或者以它们的组合实现。本领域的技术人员应当理解,可以在实践中使用微处理器或者数字信号处理器(DSP)来实现根据本发明实施例的服务器、客户端,和系统等设备中的一些或者全部部件的一些或者全部功能。本发明还可以实现为用于执行这里所描述的方法的一部分或者全部的设备或者装置程序(例如,计算机程序和计算机程序产品)。这样的实现本发明的程序可以存储在计算机可读介质上,或者可以具有一个或者多个信号的形式。这样的信号可以从因特网网站上下载得到,或者在载体信号上提供,或者以任何其他形式提供。The various component embodiments of the present invention may be implemented in hardware, or in software modules running on one or more processors, or in a combination thereof. Those skilled in the art should understand that a microprocessor or a digital signal processor (DSP) can be used in practice to implement some or all of some or all of the components in the server, client, and system devices according to the embodiments of the present invention. Full functionality. The present invention can also be implemented as an apparatus or an apparatus program (for example, a computer program and a computer program product) for performing a part or all of the methods described herein. Such a program for realizing the present invention may be stored on a computer-readable medium, or may be in the form of one or more signals. Such a signal may be downloaded from an Internet site, or provided on a carrier signal, or provided in any other form.
应该注意的是上述实施例对本发明进行说明而不是对本发明进行限制,并且本领域技术人员在不脱离所附权利要求的范围的情况下可设计出替换实施例。在权利要求中,不应将位于括号之间的任何参考符号构造成对权利要求的限制。单词“包含”不排除存在未列在权利要求中的元件或步骤。位于元件之前的单词“一”或“一个”不排除存在多个这样的元件。本发明可以借助于包括有若干不同元件的硬件以及借助于适当编程的计算机来实现。在列举了若干装置的单元权利要求中,这些装置中的若干个可以是通过同一个硬件项来具体体现。单词第一、第二、以及第三等的使用不表示任何顺序。可将这些单词解释为名称。It should be noted that the above-mentioned embodiments illustrate rather than limit the invention, and that those skilled in the art will be able to design alternative embodiments without departing from the scope of the appended claims. In the claims, any reference signs placed between parentheses shall not be construed as limiting the claim. The word "comprising" does not exclude the presence of elements or steps not listed in a claim. The word "a" or "an" preceding an element does not exclude the presence of a plurality of such elements. The invention can be implemented by means of hardware comprising several distinct elements, and by means of a suitably programmed computer. In a unit claim enumerating several means, several of these means can be embodied by one and the same item of hardware. The use of the words first, second, and third, etc. does not indicate any order. These words can be interpreted as names.
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201410806619.1ACN105786835A (en) | 2014-12-22 | 2014-12-22 | Method and system for displaying user-defined abstract of picture webpage in search result |
| Application Number | Priority Date | Filing Date | Title |
|---|---|---|---|
| CN201410806619.1ACN105786835A (en) | 2014-12-22 | 2014-12-22 | Method and system for displaying user-defined abstract of picture webpage in search result |
| Publication Number | Publication Date |
|---|---|
| CN105786835Atrue CN105786835A (en) | 2016-07-20 |
| Application Number | Title | Priority Date | Filing Date |
|---|---|---|---|
| CN201410806619.1APendingCN105786835A (en) | 2014-12-22 | 2014-12-22 | Method and system for displaying user-defined abstract of picture webpage in search result |
| Country | Link |
|---|---|
| CN (1) | CN105786835A (en) |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20120011432A1 (en)* | 2009-08-19 | 2012-01-12 | Vitrue, Inc. | Systems and methods for associating social media systems and web pages |
| US20120303722A1 (en)* | 2011-05-26 | 2012-11-29 | Michael Judd Richter | Social Data Inputs |
| CN103324622A (en)* | 2012-03-21 | 2013-09-25 | 北京百度网讯科技有限公司 | Method and device for automatic generating of front page abstract |
| CN104077388A (en)* | 2014-06-27 | 2014-10-01 | 北京奇虎科技有限公司 | Summary information extraction method and device based on search engine and search engine |
| CN104156452A (en)* | 2014-08-18 | 2014-11-19 | 中国人民解放军国防科学技术大学 | Method and device for generating webpage text summarization |
| Publication number | Priority date | Publication date | Assignee | Title |
|---|---|---|---|---|
| US20120011432A1 (en)* | 2009-08-19 | 2012-01-12 | Vitrue, Inc. | Systems and methods for associating social media systems and web pages |
| US20120303722A1 (en)* | 2011-05-26 | 2012-11-29 | Michael Judd Richter | Social Data Inputs |
| CN103324622A (en)* | 2012-03-21 | 2013-09-25 | 北京百度网讯科技有限公司 | Method and device for automatic generating of front page abstract |
| CN104077388A (en)* | 2014-06-27 | 2014-10-01 | 北京奇虎科技有限公司 | Summary information extraction method and device based on search engine and search engine |
| CN104156452A (en)* | 2014-08-18 | 2014-11-19 | 中国人民解放军国防科学技术大学 | Method and device for generating webpage text summarization |
| Title |
|---|
| 怀志和: "《Android移动网站开发详解》", 31 December 2013, 清华大学出版社* |
| 朱印宏: "《中文版Dreamweaver CS6标准教程》", 31 January 2014, 中国电力出版社* |
| 王爱华,等: "《HTML+CSS+JavaScript网页制作简明教程》", 30 September 2014, 清华大学出版社* |
| Publication | Publication Date | Title |
|---|---|---|
| KR102006396B1 (en) | Identifying matching applications based on browsing activity | |
| CN103514299B (en) | Information search method and device | |
| CN104077387B (en) | A kind of web page contents display methods and browser device | |
| CN103258057B (en) | The method and apparatus for showing point of interest POI in electronic map interface | |
| WO2016101754A1 (en) | Method and device for web page switching, and device for providing integrated page | |
| CN105786847A (en) | Method and system for displaying structured abstracts of commodity web page in e-commerce website | |
| CN104063489A (en) | Method and device for determining webpage image relevancy and displaying retrieved result | |
| CN103838862B (en) | Video searching method, device and terminal | |
| JP2021512415A (en) | Backdrop rendering of digital components | |
| CN104462512A (en) | Chinese information search method and device based on knowledge graph | |
| CN105574174B (en) | Searching method and device based on Search Hints | |
| CN105117205A (en) | Web page editing method and web page editing device | |
| CN105608170A (en) | Display method and device of search result page | |
| CN104036003A (en) | Search result integration method and device | |
| CN104598571A (en) | Method and device for playing multimedia resource | |
| WO2015024522A1 (en) | Search method and system, search engine and client | |
| CN104699836A (en) | Multi-keyword search prompting method and multi-keyword search prompting device | |
| CN105653581A (en) | Processing method and device for increasing video result satisfaction | |
| CN105786849A (en) | Method and system for generating document web page custom abstract | |
| CN106951405B (en) | Data processing method and device based on typesetting engine | |
| CN103064943B (en) | A kind of client device | |
| CN105447191A (en) | Intelligent abstracting method for providing graphic guidance steps and corresponding device | |
| CN105786853A (en) | Display method and system for smart abstract of forum post | |
| CN105786834A (en) | Method and system for generating structured abstract of social webpage | |
| CN105808623A (en) | Search-based page access event association method and device |
| Date | Code | Title | Description |
|---|---|---|---|
| C06 | Publication | ||
| PB01 | Publication | ||
| C10 | Entry into substantive examination | ||
| SE01 | Entry into force of request for substantive examination | ||
| RJ01 | Rejection of invention patent application after publication | ||
| RJ01 | Rejection of invention patent application after publication | Application publication date:20160720 |