Movatterモバイル変換


[0]ホーム

URL:


CN103186615B - A kind of Search Hints method and system - Google Patents

A kind of Search Hints method and system
Download PDF

Info

Publication number
CN103186615B
CN103186615BCN201110457125.3ACN201110457125ACN103186615BCN 103186615 BCN103186615 BCN 103186615BCN 201110457125 ACN201110457125 ACN 201110457125ACN 103186615 BCN103186615 BCN 103186615B
Authority
CN
China
Prior art keywords
dictionary
search
prompt
cue
candidate
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201110457125.3A
Other languages
Chinese (zh)
Other versions
CN103186615A (en
Inventor
刘志超
于晓明
杨建武
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Peking University
Peking University Founder Group Co Ltd
Beijing Founder Electronics Co Ltd
Original Assignee
Peking University
Peking University Founder Group Co Ltd
Beijing Founder Electronics Co Ltd
Filing date
Publication date
Application filed by Peking University, Peking University Founder Group Co Ltd, Beijing Founder Electronics Co LtdfiledCriticalPeking University
Priority to CN201110457125.3ApriorityCriticalpatent/CN103186615B/en
Publication of CN103186615ApublicationCriticalpatent/CN103186615A/en
Application grantedgrantedCritical
Publication of CN103186615BpublicationCriticalpatent/CN103186615B/en
Expired - Fee Relatedlegal-statusCriticalCurrent
Anticipated expirationlegal-statusCritical

Links

Abstract

Translated fromChinese

本发明公开了一种搜索提示方法及系统,其中,所述搜索提示方法,包括:获得用户输入的搜索信息;判断所述搜索信息是否合法;在所述搜索信息合法时,基于生成的搜索提示词典,并返回与所述搜索信息对应的搜索提示词;在所述搜索信息不合法时,基于生成的拼写错误提示词典,并返回与所述搜索信息对应的至少两条错误提示信息。采用上述方案,由于基于双数组构建了提示词典,在搜索提示查询时,提高了查询效率,解决了现有技术中存在的搜索提示信息和拼写错误提示信息不准确或者选项很少的问题。

The present invention discloses a search prompt method and system, wherein the search prompt method includes: obtaining search information input by a user; judging whether the search information is legal; when the search information is legal, based on the generated search prompt dictionary, and return the search prompt word corresponding to the search information; when the search information is illegal, prompt the dictionary based on the generated spelling error, and return at least two error prompt information corresponding to the search information. With the above solution, since the prompt dictionary is constructed based on the double array, the query efficiency is improved when searching for prompt queries, and the problem of inaccurate search prompt information and misspelling prompt information or few options in the prior art is solved.

Description

Translated fromChinese
一种搜索提示方法及系统A search prompt method and system

技术领域technical field

本发明涉及计算机互联网技术领域,尤其涉及一种搜索提示方法及系统。The invention relates to the technical field of computer internet, in particular to a search prompt method and system.

背景技术Background technique

随着因特网在全球范围内的飞速发展,网络媒体已被公认为是继报纸、广播、电视之后的“第四媒体”,网络成为人们日常生活、学习和工作必不可少的一部分。网络信息传播迅速,而且具有海量数据的特点。而用户迫切需要的是如何能在这海量数据中快速地检索到有用的信息。With the rapid development of the Internet in the world, network media has been recognized as the "fourth media" after newspapers, radio and television, and the network has become an indispensable part of people's daily life, study and work. Network information disseminates rapidly and has the characteristics of massive data. What users urgently need is how to quickly retrieve useful information from this massive data.

用户输入提示也称搜索框提示,通过用户在搜索框中输入部分查询的信息给出相应完整的信息提示,是一种辅助用户体验工具,节省了用户大量的输入成本,提高了检索的效率。User input prompts are also called search box prompts, which give corresponding complete information prompts through users inputting part of the query information in the search box. It is an auxiliary user experience tool, which saves users a lot of input costs and improves retrieval efficiency.

发明人在发明过程中,发现目前的搜索提示和拼写错误提示存在提示信息不准,甚至出现提示内容为错误的情况,如搜索“舆情预警”对应的提示为“寓情于景”,而且对于拼写错误提示只给出一个候选项,使用户没有可选的余地,从而造成有的错误提示也是不准确的,失去了提示的意义。During the invention process, the inventor found that the current search prompts and spelling error prompts have inaccurate prompt information, and even the content of the prompts is wrong. The spelling error prompt only gives one candidate option, so that the user has no choice, and some error prompts are also inaccurate, losing the meaning of the prompt.

发明内容Contents of the invention

本申请实施例提供了一种搜索提示方法及系统,用于解决现有技术中存在的搜索提示信息和拼写错误提示信息不准确或者选项很少的问题。The embodiments of the present application provide a search prompt method and system, which are used to solve the problems in the prior art that the search prompt information and misspelling prompt information are inaccurate or there are few options.

本申请实施例一提供了一种构建提示词典的方法,具体包括:Embodiment 1 of the present application provides a method for constructing a prompt dictionary, which specifically includes:

基于原始词典,通过双数组算法生成原始双数组词典;Based on the original dictionary, the original double-array dictionary is generated through the double-array algorithm;

基于所述原始双数组词典,生成搜索提示词典;和/或generating a dictionary of search hints based on the original double array dictionary; and/or

基于所述原始双数组词典,生成拼写错误提示词典。Based on the original double-array dictionary, a spelling error prompt dictionary is generated.

其中,如实施例一提供的方法,所述基于所述原始双数组词典,生成搜索提示词典,具体包括:Wherein, as in the method provided in Embodiment 1, the generating a search prompt dictionary based on the original double-array dictionary specifically includes:

基于所述原始双数组词典,生成待排序搜索提示词典;Based on the original double-array dictionary, generate a search prompt dictionary to be sorted;

按第一打分标准,对所述待排序搜索提示词典中的候选提示词进行打分,获得每个候选提示词的分值;Scoring the candidate prompt words in the search prompt dictionary to be sorted according to the first scoring standard to obtain the score of each candidate prompt word;

按分值由高到低,排序所述候选提示词,获得搜索提示词典。The candidate prompt words are sorted according to the scores from high to low to obtain a search prompt dictionary.

所述按分值由高到低,排序所述候选提示词,获得搜索提示词典,具体包括:The described candidate prompt words are sorted from high to low according to the scores to obtain a search prompt dictionary, which specifically includes:

按分值由高到低,排序所述候选提示词;According to the score from high to low, sort the candidate prompt words;

从所述候选提示词中获取分值大于一预设分值的候选提示词;Obtaining candidate prompt words with a score greater than a preset score from the candidate prompt words;

将所述分值大于一预设分值的候选提示词作为所述搜索提示词典。The candidate prompt words whose scores are greater than a preset score are used as the search prompt dictionary.

如实施例一所述的方法,所述基于所述原始双数组词典,生成拼写错误提示词典,具体包括:According to the method described in Embodiment 1, the generation of a spelling error prompt dictionary based on the original double-array dictionary specifically includes:

基于所述原始双数组词典,生成与所述原始双数组词典平行的待排序拼写错误提示词典;Based on the original double-array dictionary, generate a misspelling prompt dictionary to be sorted parallel to the original double-array dictionary;

按第二打分标准,对所述待排序拼写错误词典中的候选提示词进行打分,获得每个候选提示词的分值;According to the second scoring standard, score the candidate prompt words in the misspelling dictionary to be sorted, and obtain the score of each candidate prompt word;

按分值由高到低,排序所述候选提示词,获得拼写错误提示词典。The candidate prompt words are sorted according to the scores from high to low to obtain a misspelling prompt dictionary.

其中,所述按分值由高到低,排序所述候选提示词,获得拼写错误提示词典,具体包括:Wherein, the said candidate prompting words are sorted from high to low according to the score value to obtain a spelling error prompting dictionary, which specifically includes:

按分值由高到低,排序所述候选提示词;According to the score from high to low, sort the candidate prompt words;

从所述候选提示词中获取分值大于一预设分值的候选提示词;Obtaining candidate prompt words with a score greater than a preset score from the candidate prompt words;

将所述分值大于一预设分值的候选提示词作为所述拼写错误提示词典。The candidate prompt words whose scores are greater than a preset score are used as the spelling error prompt dictionary.

在所述基于所述原始双数组词典,生成搜索提示词典之后,所述方法还包括:After generating the search prompt dictionary based on the original double array dictionary, the method further includes:

基于用户输入的搜索信息,更新所述搜索提示词典,以获得更新过的搜索提示词典。Based on the search information input by the user, the search prompt dictionary is updated to obtain an updated search prompt dictionary.

在所述基于所述原始双数组词典,生成拼写错误提示词典之后,所述方法还包括:After generating the typo prompt dictionary based on the original double-array dictionary, the method further includes:

基于用户输入的搜索信息,更新所述拼写错误提示词典,以获得更新过的拼写错误提示词典。Based on the search information input by the user, the spelling error prompting dictionary is updated to obtain an updated spelling error prompting dictionary.

本申请实施例二提供一种搜索提示方法,该方法具体包括:Embodiment 2 of the present application provides a search prompt method, which specifically includes:

获得用户输入的搜索信息;Obtain the search information entered by the user;

判断所述搜索信息是否合法;Determine whether the search information is legitimate;

在所述搜索信息合法时,基于实施例一中的方法生成的搜索提示词典,并返回与所述搜索信息对应的搜索提示词;When the search information is legal, based on the search prompt dictionary generated by the method in the first embodiment, and return the search prompt words corresponding to the search information;

在所述搜索信息不合法时,基于实施例一中的方法生成的拼写错误提示词典,并返回与所述搜索信息对应的至少两条错误提示信息。When the search information is invalid, the dictionary is generated based on the method in the first embodiment, and at least two pieces of error prompt information corresponding to the search information are returned.

本申请实施例三提供一种搜索提示系统,具体包括:Embodiment 3 of the present application provides a search prompt system, which specifically includes:

获得单元,用于获得用户输入的搜索信息;an obtaining unit, configured to obtain search information input by the user;

判断单元,用于判断所述搜索信息是否合法;a judging unit, configured to judge whether the search information is legal;

搜索提示词返回单元,用于在所述搜索信息合法时,基于实施例一中的方法生成的搜索提示词典,并返回与所述搜索信息对应的搜索提示词;A search prompt word returning unit, configured to return a search prompt word corresponding to the search information based on the search prompt dictionary generated by the method in Embodiment 1 when the search information is legal;

错误提示信息返回单元,用于在所述搜索信息不合法时,基于实施例一中的方法生成的拼写错误提示词典,并返回与所述搜索信息对应的至少两条错误提示信息。The error prompt information returning unit is configured to return at least two pieces of error prompt information corresponding to the search information based on the spelling error prompt dictionary generated by the method in Embodiment 1 when the search information is illegal.

本申请通过上述一个或多个实施例提供的技术方案,具有以下有益效果或者优点:The technical solutions provided by the present application through the above-mentioned one or more embodiments have the following beneficial effects or advantages:

由于基于双数组构建了提示词典,在搜索提示查询时,效率很高;Since the prompt dictionary is constructed based on the double array, it is very efficient when searching for prompt queries;

对搜索提示词和错误提示信息进行排序,显示的时候会显示出至少两个搜索提示词或者至少两条错误提示信息,方便于用户进行选择;Sort the search prompt words and error prompt information, and at least two search prompt words or at least two error prompt messages will be displayed when displaying, which is convenient for users to choose;

系统会记录用户输入的搜索词以及频度,定时自动更新后台数据,对搜索提示词和错误提示信息进行重新排序,以便提示出与用户搜索信息更加对应的搜索提示词以及错误提示信息。The system will record the search terms and frequency entered by the user, automatically update the background data regularly, and reorder the search prompt words and error prompt information, so as to prompt the search prompt words and error prompt information that are more corresponding to the user's search information.

附图说明Description of drawings

图1为实施例一中构建提示词典的方法流程图;Fig. 1 is the flow chart of the method for constructing prompt dictionary in embodiment one;

图2为实施例一中生成搜索提示词典的步骤流程图;Fig. 2 is a flow chart of steps for generating a search prompt dictionary in Embodiment 1;

图3为实施例一中生成拼写错误提示词典的步骤流程图;Fig. 3 is a flow chart of the steps of generating a spelling error prompt dictionary in Embodiment 1;

图4为实施例二中搜索提示方法流程图;FIG. 4 is a flow chart of the search prompt method in Embodiment 2;

图5为实施例二中搜索提示系统图。Fig. 5 is a diagram of the search prompt system in the second embodiment.

具体实施方式detailed description

本申请实施例提供了一种搜索提示方法及系统,用于解决现有技术中存在的搜索提示信息和拼写错误提示信息不准确或者选项很少的问题。The embodiments of the present application provide a search prompt method and system, which are used to solve the problems in the prior art that the search prompt information and misspelling prompt information are inaccurate or there are few options.

下面结合说明书附图以及具体的实施方式对上述技术方案进行详细的说明和阐述。The above technical solutions will be described and explained in detail below in conjunction with the accompanying drawings and specific implementation manners.

实施例1Example 1

如图1所示,本申请实施例一提供了一种构建提示词典的方法,具体包括:As shown in Figure 1, Embodiment 1 of the present application provides a method for constructing a prompt dictionary, which specifically includes:

步骤101,基于原始词典,通过双数组算法生成原始双数组词典;Step 101, based on the original dictionary, generate the original double-array dictionary through the double-array algorithm;

步骤102,基于原始双数组词典,生成搜索提示词典;和/或Step 102, generating a search prompt dictionary based on the original double array dictionary; and/or

步骤103,基于原始双数组词典,生成拼写错误提示词典。Step 103, generate a spelling error prompt dictionary based on the original double-array dictionary.

对于步骤101,基于原始词典,通过双数组算法生成原始双数组词典,其构建过程如下:For step 101, based on the original dictionary, the original double-array dictionary is generated through the double-array algorithm, and the construction process is as follows:

首先遍历整个词典将所有词转化为以字节为单位的Trie树(字典树)。First traverse the entire dictionary to convert all words into a Trie tree (dictionary tree) in bytes.

然后构建双数组,构建过程主要有以下几个步骤:Then build a double array. The construction process mainly has the following steps:

1、初始化活动节点列表,把第一层节点加入列表。将base[]数组和check[]数组的元素初始化为0,当base[i]=check[i]=0时表示该位置为空,当base[i]的值为负值时表示为一个可结束的状态,特别当base[i]<0且abs(base[i])=i时表示该节点为叶子节点。1. Initialize the list of active nodes, and add the first-level nodes to the list. Initialize the elements of the base[] array and check[] array to 0. When base[i]=check[i]=0, it means that the position is empty. When the value of base[i] is negative, it means that it can be The end state, especially when base[i]<0 and abs(base[i])=i, indicates that the node is a leaf node.

2、如果活动列表不为空,选取Trie树中直接子节点数最多的节点为当前节点,否则构建完成。2. If the activity list is not empty, select the node with the largest number of direct child nodes in the Trie tree as the current node, otherwise the construction is completed.

3、访问该节点决定其base值并确定直接子节点的下标位置,令各直接子节点在check数组中的值为当前节点的下标值。3. Visit the node to determine its base value and determine the subscript position of the direct child node, so that the value of each direct child node in the check array is the subscript value of the current node.

4、将当前的直接子节点加入活动列表,重复步骤2。4. Add the current direct child node to the active list, and repeat step 2.

如图2所示,如实施例一提供的方法,对于步骤102,基于原始双数组词典,生成搜索提示词典,具体包括:As shown in Figure 2, in the method provided in Embodiment 1, for step 102, based on the original double-array dictionary, a search prompt dictionary is generated, specifically including:

步骤s102a,基于原始双数组词典,生成待排序搜索提示词典;根据用户输入的词前缀给出完整的提示词,由于前缀相同的词会有很多,所以需要对前缀相同的词按照打分标准,由高到低进行排序,生成待排序搜索提示词典。Step s102a, based on the original double-array dictionary, generate a search prompt dictionary to be sorted; give a complete prompt word according to the prefix of the word input by the user. Since there are many words with the same prefix, it is necessary to score the words with the same prefix according to the scoring standard. Sort from high to low to generate a search prompt dictionary to be sorted.

步骤s102b,按第一打分标准,对待排序搜索提示词典中的候选提示词进行打分,获得每个候选提示词的分值;Step s102b, according to the first scoring standard, scoring the candidate prompt words in the prompt dictionary to be sorted and searched, and obtaining the score of each candidate prompt word;

第一打分标准包括以下几个因素:The first scoring criteria include the following factors:

1,该词在语料中的词频nCorpus,权重为CorpusWeight;1. The word frequency nCorpus of the word in the corpus, and the weight is CorpusWeight;

2,一小时内该词被用户输入的频度nHour,权重为HourWeight;2. The frequency nHour of the word being input by the user within one hour, and the weight is HourWeight;

3,一天内该词被用户输入的频度nDay,权重为DayWeight;3. The frequency nDay of the word being input by the user within a day, and the weight is DayWeight;

4,一个月内该词被用户输入的频度nMonth,权重为MonthWeight。4. The frequency nMonth of the word being input by the user within a month, and the weight is MonthWeight.

打分公式为:The scoring formula is:

Score 1=CorpusWeight*nCorpus+HourWeight*nHour+DayWeight*nDay+Score 1=CorpusWeight*nCorpus+HourWeight*nHour+DayWeight*nDay+

MonthWeight*nMonth。MonthWeight*nMonth.

步骤s102c,按分值由高到低,排序候选提示词,获得搜索提示词典。In step s102c, the candidate prompt words are sorted according to the scores from high to low to obtain a search prompt dictionary.

步骤s102c具体包括:Step s102c specifically includes:

按分值由高到低,排序所述候选提示词;According to the score from high to low, sort the candidate prompt words;

从所述候选提示词中获取分值大于一预设分值的候选提示词;Obtaining candidate prompt words with a score greater than a preset score from the candidate prompt words;

将所述分值大于一预设分值的候选提示词作为所述搜索提示词典。The candidate prompt words whose scores are greater than a preset score are used as the search prompt dictionary.

得到候选提示词的分数之后,假设该预设分值为X,则将分值大于X对应的候选提示词提取出来,作为搜索提示词典里的提示词。After the scores of the candidate prompt words are obtained, assuming that the preset score is X, the candidate prompt words whose scores are greater than X are extracted and used as prompt words in the search prompt dictionary.

如图3所示,如实施例一所述的方法,步骤103,基于所述原始双数组词典,生成拼写错误提示词典,拼写错误提示词典的实现原理同搜索提示词典的实现原理,基于双数组词典构建拼写错误提示词典,提示词典是由拼音--提示词相对应,将汉字内码对应的读音ID生成汉字-读音ID词典,其实现原理是申请一个长度为Length=65536(1<<16)的整数数组,其下标值即为汉字对应的内码,元素值为读音ID,其生成拼写错误提示词典的方法,具体包括:As shown in Figure 3, as in the method described in Embodiment 1, step 103, based on the original double array dictionary, generates a spelling error prompt dictionary, and the implementation principle of the spelling error prompt dictionary is the same as that of the search prompt dictionary, based on the double array The dictionary constructs a spelling error prompt dictionary. The prompt dictionary is corresponding to the pinyin-prompt word, and the pronunciation ID corresponding to the Chinese character internal code is generated into the Chinese character-pronunciation ID dictionary. The principle of implementation is to apply for a length of Length=65536 (1<<16 ), its subscript value is the internal code corresponding to the Chinese character, and the element value is the pronunciation ID. The method for generating a spelling error prompt dictionary specifically includes:

步骤s103a,基于所述原始双数组词典,生成与所述原始双数组词典平行的待排序拼写错误提示词典;Step s103a, based on the original double-array dictionary, generate a misspelling prompt dictionary to be sorted parallel to the original double-array dictionary;

步骤s103b,按第二打分标准,对所述待排序拼写错误词典中的候选提示词进行打分,获得每个候选提示词的分值;第二打分标准和第一打分标准相同,对每个词的打分标准包括以下几个因素:Step s103b, according to the second scoring standard, score the candidate prompt words in the misspelling dictionary to be sorted, and obtain the score value of each candidate prompt word; the second scoring standard is the same as the first scoring standard, and each word The scoring criteria include the following factors:

1,该词在语料中的词频nCorpus,权重为CorpusWeight;1. The word frequency nCorpus of the word in the corpus, and the weight is CorpusWeight;

2,一小时内该词被用户输入的频度nHour,权重为HourWeight;2. The frequency nHour of the word being input by the user within one hour, and the weight is HourWeight;

3,一天内该词被用户输入的频度nDay,权重为DayWeight;3. The frequency nDay of the word being input by the user within a day, and the weight is DayWeight;

4,一个月内该词被用户输入的频度nMonth,权重为MonthWeight。4. The frequency nMonth of the word being input by the user within a month, and the weight is MonthWeight.

打分公式为:The scoring formula is:

Score 2=CorpusWeight*nCorpus+HourWeight*nHour+DayWeight*nDay+Score 2=CorpusWeight*nCorpus+HourWeight*nHour+DayWeight*nDay+

MonthWeight*nMonth。MonthWeight*nMonth.

步骤s103c,按分值由高到低,排序所述候选提示词,获得拼写错误提示词典。Step s103c, sorting the candidate prompt words according to the scores from high to low to obtain a misspelling prompt dictionary.

拼写错误提示词典里的读音ID串是将读音转化为ID表示,这样做的好处是不会产生查找歧义,比如用户输入的信息为“Xian”,单凭拼音就不能够判定对应的汉字是什么,可以是“先,现,线......”,也可以是“西安”。为了不产生这类问题我们将单字的拼音转化为对应的ID,如“xi=1,an=2,xian=3”这样就能避免产生歧义问题。所有词的拼音转化为读音ID串后建立双数组,同时生成与之平行的第三数组拼写错误提示词典保存拼写错误提示信息。The pronunciation ID string in the spelling error prompt dictionary is to convert the pronunciation into an ID representation. The advantage of this is that there will be no search ambiguity. For example, the information entered by the user is "Xian", and the corresponding Chinese character cannot be determined by pinyin alone. , can be "first, now, line..." or "Xi'an". In order not to cause such problems, we convert the pinyin of a single character into the corresponding ID, such as "xi=1, an=2, xian=3" so as to avoid ambiguity problems. After the pinyin of all words is converted into the pronunciation ID string, a double array is established, and a third array of spelling error prompting dictionaries parallel to it is generated simultaneously to save the spelling error prompting information.

其中,所述按分值由高到低,排序所述候选提示词,获得拼写错误提示词典,具体包括:Wherein, the said candidate prompting words are sorted from high to low according to the score value to obtain a spelling error prompting dictionary, which specifically includes:

按分值由高到低,排序所述候选提示词;According to the score from high to low, sort the candidate prompt words;

从所述候选提示词中获取分值大于一预设分值的候选提示词;Obtaining candidate prompt words with a score greater than a preset score from the candidate prompt words;

将所述分值大于一预设分值的候选提示词作为所述拼写错误提示词典。The candidate prompt words whose scores are greater than a preset score are used as the spelling error prompt dictionary.

假如用户输入“百渡”,系统打分排序完成后,如果得到的由高到低的提示词顺序百度,摆渡,百度,那么系统会提示出:1、百度;2、摆渡;3拜读等提示词,以供用户选择,这样就避免了用户因错误输入而导致查询结果失败。If the user enters "Baidu", after the system scores and sorts, if the order of prompt words obtained from high to low is Baidu, Ferry, Baidu, then the system will prompt: 1. Baidu; 2. Ferry; 3. Read and other prompt words , for the user to choose, so as to avoid the failure of the query result caused by the user's wrong input.

在所述基于所述原始双数组词典,生成搜索提示词典之后,所述方法还包括:After generating the search prompt dictionary based on the original double array dictionary, the method further includes:

基于用户输入的搜索信息,更新所述搜索提示词典,以获得更新过的搜索提示词典。Based on the search information input by the user, the search prompt dictionary is updated to obtain an updated search prompt dictionary.

在所述基于所述原始双数组词典,生成拼写错误提示词典之后,所述方法还包括:After generating the typo prompt dictionary based on the original double-array dictionary, the method further includes:

基于用户输入的搜索信息,更新所述拼写错误提示词典,以获得更新过的拼写错误提示词典。Based on the search information input by the user, the spelling error prompting dictionary is updated to obtain an updated spelling error prompting dictionary.

将用户输入的信息记录到内存中,每隔十分钟写入文件一次,记录博鳌扩用户一小时内输入词频,一天内输入的词频,一个月输入的词频等。然后每隔一小时系统自动更新所有搜索提示相关词典文件,重新生成搜索提示词典,更新原内存数据区;且每隔一小时系统自动更新所有与拼写错误相关的词典文件,重新生成拼写错误提示词典,更新原内存数据区。Record the information entered by the user into the memory, write it into the file every ten minutes, and record the frequency of words input by Boao users within an hour, within a day, and within a month. Then every hour, the system automatically updates all dictionary files related to search prompts, regenerates search prompt dictionaries, and updates the original memory data area; and every hour, the system automatically updates all dictionary files related to spelling errors, and regenerates spelling error prompt dictionaries , to update the original memory data area.

实施例2Example 2

如图4所示,本申请实施例二提供一种搜索提示方法,该方法具体包括包括:As shown in Figure 4, Embodiment 2 of the present application provides a search prompt method, which specifically includes:

步骤201,获得用户输入的搜索信息;Step 201, obtaining the search information input by the user;

步骤202,判断所述搜索信息是否合法;Step 202, judging whether the search information is legal;

步骤203,在所述搜索信息合法时,基于生成的搜索提示词典,并返回与所述搜索信息对应的搜索提示词;Step 203, when the search information is legal, based on the generated search prompt dictionary, and return the search prompt words corresponding to the search information;

步骤204,在所述搜索信息不合法时,基于生成的拼写错误提示词典,并返回与所述搜索信息对应的至少两条错误提示信息。Step 204, when the search information is invalid, prompt a dictionary based on the generated spelling error, and return at least two pieces of error prompt information corresponding to the search information.

其搜索提示词典和拼写错误提示词典的生成方法,已在实施例1中介绍过,在此不再赘述。The methods for generating the search prompt dictionary and the misspelling prompt dictionary have been introduced in Embodiment 1, and will not be repeated here.

实施例3Example 3

如图5所示,本申请实施例三提供一种搜索提示系统,其特征在于,包括:As shown in Figure 5, Embodiment 3 of the present application provides a search prompt system, which is characterized in that it includes:

获得单元301,用于获得用户输入的搜索信息;Obtaining unit 301, configured to obtain search information input by the user;

判断单元302,用于判断所述搜索信息是否合法;A judging unit 302, configured to judge whether the search information is legal;

搜索提示词返回单元303,用于在所述搜索信息合法时,基于生成的搜索提示词典,并返回与所述搜索信息对应的搜索提示词;A search prompt word returning unit 303, configured to return a search prompt word corresponding to the search information based on the generated search prompt dictionary when the search information is legal;

错误提示信息返回单元304,用于在所述搜索信息不合法时,基于生成的拼写错误提示词典,并返回与所述搜索信息对应的至少两条错误提示信息。The error prompt information returning unit 304 is configured to prompt a dictionary based on the generated spelling error when the search information is invalid, and return at least two pieces of error prompt information corresponding to the search information.

同样,其搜索提示词典和拼写错误提示词典的生成方法,已在实施例1中介绍过,在此不再赘述。Likewise, the methods for generating the search prompt dictionary and the misspelling prompt dictionary have been introduced in Embodiment 1, and will not be repeated here.

本申请通过上述一个或多个实施例提供的技术方案,具有以下有益效果或者优点:The technical solutions provided by the present application through the above-mentioned one or more embodiments have the following beneficial effects or advantages:

由于基于双数组构建了提示词典,在搜索提示查询时,提高了查询效率;Since the prompt dictionary is constructed based on the double array, the query efficiency is improved when searching for prompt queries;

对搜索提示词和错误提示信息进行排序,显示的时候会显示出至少两个搜索提示词或者至少两条错误提示信息,方便于用户进行选择;Sort the search prompt words and error prompt information, and at least two search prompt words or at least two error prompt messages will be displayed when displaying, which is convenient for users to choose;

系统会记录用户输入的搜索词以及频度,定时自动更新后台数据,对搜索提示词和错误提示信息进行重新排序,以便提示出与用户搜索信息更加对应的搜索提示词以及错误提示信息。The system will record the search terms and frequency entered by the user, automatically update the background data regularly, and reorder the search prompt words and error prompt information, so as to prompt the search prompt words and error prompt information that are more corresponding to the user's search information.

尽管已描述了本发明的优选实施例,但本领域内的技术人员一旦得知了基本创造性概念,则可对这些实施例作出另外的变更和修改。所以,所附权利要求意欲解释为包括优选实施例以及落入本发明范围的所有变更和修改。While preferred embodiments of the invention have been described, additional changes and modifications to these embodiments can be made by those skilled in the art once the basic inventive concept is appreciated. Therefore, it is intended that the appended claims be construed to cover the preferred embodiment as well as all changes and modifications which fall within the scope of the invention.

显然,本领域的技术人员可以对本发明进行各种改动和变型而不脱离本发明的精神和范围。这样,倘若本发明的这些修改和变型属于本发明权利要求及其等同技术的范围之内,则本发明也意图包含这些改动和变型在内。Obviously, those skilled in the art can make various changes and modifications to the present invention without departing from the spirit and scope of the present invention. Thus, if these modifications and variations of the present invention fall within the scope of the claims of the present invention and equivalent technologies thereof, the present invention also intends to include these modifications and variations.

Claims (9)

CN201110457125.3A2011-12-30A kind of Search Hints method and systemExpired - Fee RelatedCN103186615B (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
CN201110457125.3ACN103186615B (en)2011-12-30A kind of Search Hints method and system

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
CN201110457125.3ACN103186615B (en)2011-12-30A kind of Search Hints method and system

Publications (2)

Publication NumberPublication Date
CN103186615A CN103186615A (en)2013-07-03
CN103186615Btrue CN103186615B (en)2016-11-30

Family

ID=

Citations (3)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN101101594A (en)*2007-06-122008-01-09无敌科技(西安)有限公司Real-time sentence-assisted writing method and system
CN101751430A (en)*2008-12-122010-06-23汉王科技股份有限公司Electronic dictionary fuzzy searching method
CN102063508A (en)*2011-01-102011-05-18浙江大学Generalized suffix tree based fuzzy auto-completion method for Chinese search engine

Patent Citations (3)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN101101594A (en)*2007-06-122008-01-09无敌科技(西安)有限公司Real-time sentence-assisted writing method and system
CN101751430A (en)*2008-12-122010-06-23汉王科技股份有限公司Electronic dictionary fuzzy searching method
CN102063508A (en)*2011-01-102011-05-18浙江大学Generalized suffix tree based fuzzy auto-completion method for Chinese search engine

Similar Documents

PublicationPublication DateTitle
Mendes et al.DBpedia spotlight: shedding light on the web of documents
US9201931B2 (en)Method for obtaining search suggestions from fuzzy score matching and population frequencies
CN100565515C (en)A kind of Chinese auto-answer method and system
US10409803B1 (en)Domain name generation and searching using unigram queries
CN104850574B (en)A kind of filtering sensitive words method of text-oriented information
Kothari et al.SMS based interface for FAQ retrieval
US20150154316A1 (en)Search suggestions of related entities based on co-occurrence and/or fuzzy-score matching
US20160041986A1 (en)Smart Search Engine
WO2008022581A1 (en)Method and device for obtaining the new words and input method system
TW201131402A (en)Enabling faster full-text searching using a structured data store
US20120323905A1 (en)Ranking data utilizing attributes associated with semantic sub-keys
WO2008014702A1 (en)Method and system of extracting new words
US10380248B1 (en)Acronym identification in domain names
WO2010015068A1 (en)Topic word generation method and system
BarrièreNatural language understanding in a semantic web context
CN102654866A (en)Method and device for establishing example sentence index and method and device for indexing example sentences
WO2014150155A1 (en)Systems and methods for analyzing failed and successful search queries
CN104462399B (en)The processing method and processing device of search result
WO2022134355A1 (en)Keyword prompt-based search method and apparatus, and electronic device and storage medium
CN101310277B (en) Method and system for obtaining a representation of text
De Wilde et al.Semantic enrichment of a multilingual archive with linked open data
CN102693320A (en)Searching method and device
US20120317141A1 (en)System and method for ordering of semantic sub-keys
CN103377224A (en)Method and device for recognizing problem types and method and device for establishing recognition models
CN111008285B (en)Author disambiguation method based on thesis key attribute network

Legal Events

DateCodeTitleDescription
PB01Publication
SE01Entry into force of request for substantive examination
GR01Patent grant
TR01Transfer of patent right

Effective date of registration:20220616

Address after:3007, Hengqin international financial center building, No. 58, Huajin street, Hengqin new area, Zhuhai, Guangdong 519031

Patentee after:New founder holdings development Co.,Ltd.

Patentee after:Peking University

Patentee after:BEIJING FOUNDER ELECTRONICS Co.,Ltd.

Address before:100871, Beijing, Haidian District Cheng Fu Road 298, founder building, 9 floor

Patentee before:PEKING UNIVERSITY FOUNDER GROUP Co.,Ltd.

Patentee before:Peking University

Patentee before:BEIJING FOUNDER ELECTRONICS Co.,Ltd.

CF01Termination of patent right due to non-payment of annual fee

Granted publication date:20161130


[8]ページ先頭

©2009-2025 Movatter.jp