Movatterモバイル変換


[0]ホーム

URL:


CN105205139B - A kind of personalization document retrieval method - Google Patents

A kind of personalization document retrieval method
Download PDF

Info

Publication number
CN105205139B
CN105205139BCN201510592309.9ACN201510592309ACN105205139BCN 105205139 BCN105205139 BCN 105205139BCN 201510592309 ACN201510592309 ACN 201510592309ACN 105205139 BCN105205139 BCN 105205139B
Authority
CN
China
Prior art keywords
interest
user
keyword
degree
keywords database
Prior art date
Legal status (The legal status is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the status listed.)
Expired - Fee Related
Application number
CN201510592309.9A
Other languages
Chinese (zh)
Other versions
CN105205139A (en
Inventor
罗旭斌
Current Assignee (The listed assignees may be inaccurate. Google has not performed a legal analysis and makes no representation or warranty as to the accuracy of the list.)
Individual
Original Assignee
Individual
Priority date (The priority date is an assumption and is not a legal conclusion. Google has not performed a legal analysis and makes no representation as to the accuracy of the date listed.)
Filing date
Publication date
Application filed by IndividualfiledCriticalIndividual
Priority to CN201510592309.9ApriorityCriticalpatent/CN105205139B/en
Publication of CN105205139ApublicationCriticalpatent/CN105205139A/en
Application grantedgrantedCritical
Publication of CN105205139BpublicationCriticalpatent/CN105205139B/en
Expired - Fee Relatedlegal-statusCriticalCurrent
Anticipated expirationlegal-statusCritical

Links

Classifications

Landscapes

Abstract

The invention discloses a kind of personalized document retrieval method, it the steps include: a, construct user information static library for each user: including being not limited to identity information and research field, being input to searching system;B, the interest keywords database X of user is constructed: including multiple interest keywords and each corresponding interest-degree of interest keyword;C, information retrieval: when user carries out information retrieval, the keyword set for setting input is combined into Q, is retrieved, obtain search result R1, R2 ..., Rn, then each interest keyword is added in keyword set, then is retrieved, obtained search result as and R1, R2 ..., Rn has repeat element, then the ranking of these repeat elements is moved forward, mobile distance is determined according to interest-degree, finally obtains search result.Using this method, each information retrieval is adjusted as a result, being all based on user interest keywords database, so that the search result of user individual is exported, so that the search result of output is more accurate.

Description

A kind of personalization document retrieval method
Technical field
The present invention relates to documents, technical field of information retrieval, specifically relate to a kind of search method of personalized document.
Background technique
Literature search refers to the process of to be needed to obtain document according to study and work.Existing peek-a-boo is mostNumber is all based on the static informations such as the attribute, including keyword, author, bibliography of document itself and constructs, will not be literaryThe characteristic for offering demander or retrieval people is included in during literature search, that is to say, that anyone inputs same search keyWhen, obtained search result is identical.In the epoch of this information explosion, literature search equally faces the information retrieval of magnanimityAs a result, if it is possible to the identity speciality for retrieving people is included in retrieving, individualized fit is carried out to search result, it will helpObtain very useful search result.For example, the search result that is obtained at retrieval " network " of personnel of research logistics andThe researcher of one research fiber optic communication input the search result that is obtained when same keyword should different from, to reflect themThe research achievement of respective research field, i.e., carry out personalized document retrieval according to its identity.
Publication No. CN 101373486, publication date are that on 2 25th, 2009 Chinese patent literatures disclose a kind of baseIn the personalized summary system of user interest model, the personalized summary system is by WEB information retrieval unit, user interest unitIt is formed with personalized summary unit.The personalized summary system is built by analysis user search log using conceptual clustering methodThe user interest model that vertical and/or update is described with level concept structure;Then according to the user interest model and search resultThe parsing for carrying out sentence similarity in user interest and search result, to obtain the personalized summary for meeting user.It usesThe personalized summary that personalized sentence scoring is handled has fully considered the Characteristic of Interest of user, makes the generating process root of abstractIt is matched according to the interest of user, the validity of abstract and the satisfaction of user can be improved.
Using above patent document as the prior art of representative, although it is emerging also to carry out user with search result using interest modelThe parsing of interest and sentence similarity in search result, to obtain the abstract for meeting user individual, but it is needed to sentence phaseIt is parsed like degree, the personalized summary system accuracy rate shown after parsing is simultaneously not high enough, and retrieval mode is complicated.TogetherWhen, since the user of peek-a-boo is mostly the researcher of profession, the content of retrieval is also mainly professional Research Literature, gainedThe result is that automatic abstract, and it is not good enough for the matching of professional Research Literature search result.
Summary of the invention
The present invention is directed to provide a kind of personalized literature search side for defect and deficiency present in the above-mentioned prior artMethod when being retrieved using this method, increases the interest keyword and corresponding interest-degree of user, for each information retrievalIt is adjusted as a result, being all based on user interest keywords database, so that the search result of user individual is exported, so that outputSearch result is more accurate, and search method is simple.
The present invention is realized by using following technical proposals:
A kind of personalization document retrieval method, it is characterised in that steps are as follows:
A, user information static library: identity information and research field including being not limited to user is constructed for each user,And searching system is input to by user;
B, the interest keywords database X of user is constructed for each user: crucial including multiple interest keywords and each interestThe corresponding interest-degree of word;Interest keywords database X-form is expressed as x1, x2 ..., xm, wherein m is non-zero natural number, forEach element x=(k, w), wherein k is interest keyword, and w is the corresponding interest-degree of interest keyword, interest keywords database XIt is initialized as the Focus Area that user inputs in step a, and assigns interest-degree unification to a quiescent value;
C, information retrieval: when user carries out information retrieval, the keyword set for setting input is combined into Q, is retrieved, and is examinedHitch fruit R1, R2 ..., Rn, n is non-zero natural number;Each interest keyword in the interest keywords database X of user is added againEnter into keyword set, then retrieved, obtained search result is such as and R1, R2 ..., Rn have repeat element, then by thisThe ranking of a little repeat elements moves forward, and mobile distance is determined according to the interest-degree of this interest keyword;
If having m interest keyword in user interest keywords database X, needs to do m information retrieval movement, finally adjustThe search result of whole completion is exported as final result.
The update of interest keywords database X: when each user inputs search key, search key is added to interest and is closedIn the X of keyword library, a new interest keyword is formed, and the corresponding interest-degree of interest keyword is initialized as a static stateValue;If some search key t has existed in interest keywords database X, then the corresponding interest-degree w of the interest keyword is added1。
Meanwhile the interest level of be interested in keyword is done into attenuation operations after retrieval every time, that is, reduce by a numerical value e.This numerical value reflects the speed of interest attenuation, can be a fixed value, such as 0.01, can also be related to the retrieval habit of user,Adaptive study is done to determine.If interest-degree is decayed to less than or equal to 0, then by its corresponding interest keyword from interest keywordIt is deleted in the X of library, to keep the fresh and alive property of interest keywords database.
It include interest keyword and search key in the keyword set.
Compared with prior art, the beneficial effects obtained by the present invention are as follows it is as follows:
1, interest first is constructed for each user when carrying out information retrieval using tri- steps of abc of the present inventionKeywords database X is first to carry out retrieval acquisition as a result, being further added by the interest keyword of user using search key in retrievalSearch result is obtained into keyword set, finally the ranking of duplicate element moves forward, mobile distance is according to interest keyThe interest-degree of word determines.Such mode adjusts each information retrieval as a result, being all based on user interest keywords databaseIt is whole, the search result of user individual is exported, search result is made more to match the demand of user.
2, this method is using being updated interest keywords database X, be according to the Information Retrieval Behaviors of each user, toFamily interest keywords database carries out dynamic adjustment, so that system constantly deepens the understanding to user, so that future retrieval resultIts interest is more matched, search result is more accurate.
Specific embodiment
As best implementation of the invention, it discloses a kind of personalized document retrieval methods, and its step are as follows:
A, user information static library: identity information and research field including being not limited to user is constructed for each user,And searching system is input to by user;
B, the interest keywords database X of user is constructed for each user: crucial including multiple interest keywords and each interestThe corresponding interest-degree of word;Interest keywords database X-form is expressed as x1, x2 ..., xm, wherein m is non-zero natural number, forEach element x=(k, w), wherein k is interest keyword, and w is the corresponding interest-degree of interest keyword, interest keywords database XIt is initialized as the Focus Area that user inputs in step a, and assigns interest-degree unification to a quiescent value;
C, information retrieval: when user carries out information retrieval, the keyword set for setting input is combined into Q, is retrieved, and is examinedHitch fruit R1, R2 ..., Rn, n is non-zero natural number;Each interest keyword in the interest keywords database X of user is added againEnter into keyword set, then retrieved, obtained search result is such as and R1, R2 ..., Rn have repeat element, then by thisThe ranking of a little repeat elements moves forward, and mobile distance is pressed linear scale according to the interest-degree w of this interest keyword and determined;
If having m interest keyword in user interest keywords database X, needs to do m information retrieval movement, finally adjustThe search result of whole completion is exported as final result.
The update of interest keywords database X: when each user inputs search key, search key is added to interest and is closedIn the X of keyword library, a new interest keyword is formed, and its corresponding interest-degree is initialized as a quiescent value;Such as someSearch key t has existed in interest keywords database X, then the corresponding interest-degree w of the interest keyword is added 1.
Meanwhile the interest level of be interested in keyword is done into attenuation operations after retrieval every time, that is, reduce by a numerical value e.This numerical value reflects the speed of interest attenuation, can be a fixed value, such as 0.01, can also be related to the retrieval habit of user,Adaptive study is done to determine.If interest-degree is decayed to less than or equal to 0, then by its corresponding interest keyword from interest keywordIt is deleted in the X of library, to keep the fresh and alive property of interest keywords database.
It include interest keyword and search key in keyword set in the present embodiment.
This method in actual application, dynamic user interest keywords database X, interest keyword including user andCorresponding interest-degree, to each information retrieval as a result, being adjusted based on user interest keywords database, to export user personalityThe search result of change;Meanwhile according to the Information Retrieval Behaviors of each user, dynamic adjustment is carried out to user interest keywords database,So that system constantly deepens the understanding to user, so that future retrieval result more matches its interest, search result is moreAccurately.

Claims (3)

CN201510592309.9A2015-09-172015-09-17A kind of personalization document retrieval methodExpired - Fee RelatedCN105205139B (en)

Priority Applications (1)

Application NumberPriority DateFiling DateTitle
CN201510592309.9ACN105205139B (en)2015-09-172015-09-17A kind of personalization document retrieval method

Applications Claiming Priority (1)

Application NumberPriority DateFiling DateTitle
CN201510592309.9ACN105205139B (en)2015-09-172015-09-17A kind of personalization document retrieval method

Publications (2)

Publication NumberPublication Date
CN105205139A CN105205139A (en)2015-12-30
CN105205139Btrue CN105205139B (en)2019-06-14

Family

ID=54952822

Family Applications (1)

Application NumberTitlePriority DateFiling Date
CN201510592309.9AExpired - Fee RelatedCN105205139B (en)2015-09-172015-09-17A kind of personalization document retrieval method

Country Status (1)

CountryLink
CN (1)CN105205139B (en)

Families Citing this family (2)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN110046243A (en)*2019-04-232019-07-23北京恒冠网络数据处理有限公司A kind of patent personalized retrieval analysis system based on big data
CN115203391A (en)*2022-06-242022-10-18润联软件系统(深圳)有限公司 An information retrieval method, device, computer equipment and storage medium

Citations (4)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN101080709A (en)*2004-03-292007-11-28咕果公司Variable personalization of search results in a search engine
CN102156728A (en)*2011-03-312011-08-17河南理工大学Improved personalized summary system based on user interest model
CN102819575A (en)*2012-07-202012-12-12南京大学Personalized search method for Web service recommendation
US8463810B1 (en)*2006-06-012013-06-11Monster Worldwide, Inc.Scoring concepts for contextual personalized information retrieval

Family Cites Families (1)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
US20070233672A1 (en)*2006-03-302007-10-04Coveo Inc.Personalizing search results from search engines

Patent Citations (4)

* Cited by examiner, † Cited by third party
Publication numberPriority datePublication dateAssigneeTitle
CN101080709A (en)*2004-03-292007-11-28咕果公司Variable personalization of search results in a search engine
US8463810B1 (en)*2006-06-012013-06-11Monster Worldwide, Inc.Scoring concepts for contextual personalized information retrieval
CN102156728A (en)*2011-03-312011-08-17河南理工大学Improved personalized summary system based on user interest model
CN102819575A (en)*2012-07-202012-12-12南京大学Personalized search method for Web service recommendation

Non-Patent Citations (1)

* Cited by examiner, † Cited by third party
Title
基于用户模型的个性化网络文献检索系统的研究与设计;谭利文;《基于用户模型的个性化网络文献检索系统的研究与设计》;20041215;第24-39页

Also Published As

Publication numberPublication date
CN105205139A (en)2015-12-30

Similar Documents

PublicationPublication DateTitle
US20210209109A1 (en)Method, apparatus, device, and storage medium for intention recommendation
CN112214593B (en)Question-answering processing method and device, electronic equipment and storage medium
Snell et al.Learning by distilling context
US10019484B2 (en)Third party search applications for a search system
WO2019236360A1 (en)Taxonomy enrichment using ensemble classifiers
CN110582762A (en) Automatic response server device, terminal device, response system, response method and program
CN102693309A (en)Candidate phrase querying method and aided translation system for computer aided translation
CN106815252A (en)A kind of searching method and equipment
CN105760417A (en)Cognitive Interactive Searching Method And System Based On Personalized User Model And Context
CN105068661A (en)Man-machine interaction method and system based on artificial intelligence
WO2016044028A1 (en)Query rewriting using session information
CN105447080B (en)A kind of inquiry complementing method in community's question and answer search
CN110866093A (en)Machine question-answering method and device
Alexander et al.Natural language web interface for database (NLWIDB)
CN105243149B (en)A kind of semantic-based web query recommended method and system
CN107992513B (en)Information processing system and method for realizing information processing
CN110851584B (en)Legal provision accurate recommendation system and method
CN106776808A (en)Information data offering method and device based on artificial intelligence
CN106599215A (en)Question generation method and question generation system based on deep learning
CN117473034A (en)Interactive text processing method and device, electronic equipment and storage medium
CN103927339B (en)Knowledge Reorganizing system and method for knowledge realignment
CN110110218B (en)Identity association method and terminal
CN105205139B (en)A kind of personalization document retrieval method
JP2016177359A (en) Search device and program
CN119294499A (en) Expert matching method and system based on large language model knowledge expansion and fusion

Legal Events

DateCodeTitleDescription
C06Publication
PB01Publication
C10Entry into substantive examination
SE01Entry into force of request for substantive examination
GR01Patent grant
GR01Patent grant
CF01Termination of patent right due to non-payment of annual fee

Granted publication date:20190614

Termination date:20200917

CF01Termination of patent right due to non-payment of annual fee

[8]ページ先頭

©2009-2025 Movatter.jp