A kind of high value patent automatically obtains method and apparatusTechnical field
Method and dress are automatically obtained the present invention relates to intellectual property technical field more particularly to a kind of high value patentIt sets.
Background technique
Currently, when carrying out high value patent retrieval using patent retrieval platform generally being intended to that some keys are manually enteredVocabulary and use with or wait logical relations field and other fields to constitute a retrieval type, these fields include: the patent No.,Patent name, abstract, international classification number, inventor, applicant, publication date etc., then carried out by computer denoising and artificial denoisingIt screens layer by layer, finally obtains high value patent.
But present inventor during inventive technique scheme, has found above-mentioned technology extremely in realizing the embodiment of the present applicationIt has the following technical problems less:
When retrieving high value patent in the prior art, it is excessive to be easy to appear range of search, denoises heavy workload, and high value is specialBenefit retrieval inaccuracy, or there is the technical issues of missing inspection not to keyword expansion or upper in retrieving.
Summary of the invention
The embodiment of the invention provides a kind of method and apparatus that automatically obtain of high value patent, solution is examined in the prior artWhen holdup value patent, it is excessive to be easy to appear range of search, denoises heavy workload, high value patent retrieval inaccuracy, Huo ZheThere is not the technical issues of missing inspection to keyword expansion or upper in retrieving, it is convenient to have reached operation, recall precision is high,Speed is fast, and the accurate technical effect of search result.
In view of the above problems, propose the embodiment of the present application in order to provide a kind of high value patent automatically obtain method andDevice.
In a first aspect, automatically obtaining method the present invention provides a kind of high value patent, which comprises obtain oneTarget literature, the target literature include the first keyword;First object patent database is obtained according to first keyword;The first document is obtained from the first object patent database;According to first document, the power of first document is obtainedSharp requested number, claim number of words and specification number of words;According to the claim quantity of first document, claim wordSeveral and specification number of words obtains the first weighted value, the second weighted value and the third weighted value of first document, and described in determinationFirst value assessment score of the first document;Judge whether the first value assessment score is greater than the first predetermined threshold;Work as instituteWhen stating the first value assessment score greater than the first predetermined threshold, prompt information is sent to first object patent database, wherein instituteStating prompt information is first document.
Preferably, the method also includes: according to the patented power people's information of the first document, wherein pass through the patentPower people's information judges the property of patentee;When the property of the patentee meets the first predetermined condition, to first objectPatent database sends prompt information, wherein the prompt information is first document.
Preferably, described after obtaining the first document in the first object patent database, comprising: according to the first inspectionSuo Pingtai obtains the License Info of first document;Judge whether first document has patent grant;If describedOne document has patent grant, obtains the second value scoring of first document.
Preferably, described after obtaining the first document in the first object patent database, further includes: according to firstSearching platform obtains the actionable information of first document;According to the actionable information, the third valence of first document is obtainedValue scoring.
Preferably, the method also includes: according to the patent licensing information of first document, actionable information obtain described inThe 4th weighted value, the 5th weighted value of first document;It is scored according to the first to five weighted value, first value, is describedSecond value scoring and third value scoring, determine the 4th value assessment score of first document;Judge describedWhether four value assessment scores are greater than the first predetermined threshold;When the 4th value assessment score is greater than the first predetermined threshold,Prompt information is sent to first object patent database, wherein the prompt information is first document.
Second aspect automatically obtains device the present invention provides a kind of high value patent, and described device includes:
First obtains unit, the first obtains unit obtain a target literature, and the target literature includes first crucialWord;
Second obtaining unit, second obtaining unit obtain first object patent data according to first keywordLibrary;
Third obtaining unit, the third obtaining unit obtain the first document from the first object patent database;
4th obtaining unit, the 4th obtaining unit obtain the right of first document according to first documentRequested number, claim number of words and specification number of words;
5th obtaining unit, the 5th obtaining unit is according to claim quantity, the claim of first documentNumber of words and specification number of words obtain the first weighted value, the second weighted value and the third weighted value of first document, and determine instituteState the first value assessment score of the first document;
First judging unit, it is predetermined that first judging unit judges whether the first value assessment score is greater than firstThreshold value;
First processing units, when the first value assessment score is greater than the first predetermined threshold, first processing is singleMember sends prompt information to first object patent database, wherein the prompt information is first document.
Preferably, described device further include:
6th obtaining unit, the 6th obtaining unit is according to the patented power people's information of the first document, wherein passes through instituteState the property that patentee's information judges patentee;
The second processing unit, when the property of the patentee meets the first predetermined condition, described the second processing unitPrompt information is sent to first object patent database, wherein the prompt information is first document.
Preferably, the third obtaining unit further include:
7th obtaining unit, the 7th obtaining unit obtain the license of first document according to the first searching platformInformation;
Second judgment unit, the second judgment unit judge whether first document has patent grant;
8th obtaining unit, if first document has a patent grant, the 8th obtaining unit obtains described theSecond value scoring of one document.
Preferably, the third obtaining unit further include:
9th obtaining unit, the 9th obtaining unit obtain the lawsuit of first document according to the first searching platformInformation;
Tenth obtaining unit, the tenth obtaining unit obtain the third of first document according to the actionable informationValue scoring.
Preferably, described device further include:
11st obtaining unit, patent licensing information of the 11st obtaining unit according to first document, lawsuitThe 4th weighted value, the 5th weighted value of first document described in information acquisition;
First determination unit, first determination unit according to the first to five weighted value, it is described first value scoring,The second value scoring and third value scoring, determine the 4th value assessment score of first document;
Third judging unit, it is predetermined that the third judging unit judges whether the 4th value assessment score is greater than firstThreshold value;
Third processing unit, when the 4th value assessment score is greater than the first predetermined threshold, the third processing is singleMember sends prompt information to first object patent database, wherein the prompt information is first document.
The third aspect, the present invention provides a kind of device that automatically obtains of high value patent, including memory, processor andThe computer program that can be run on a memory and on a processor is stored, the processor is realized following when executing described programStep: obtaining a target literature, and the target literature includes the first keyword;First object is obtained according to first keywordPatent database;The first document is obtained from the first object patent database;According to first document, described the is obtainedClaim quantity, claim number of words and the specification number of words of one document;According to the claim quantity of first document,Claim number of words and specification number of words obtain the first weighted value, the second weighted value and the third weighted value of first document,And determine the first value assessment score of first document;It is predetermined to judge whether the first value assessment score is greater than firstThreshold value;When the first value assessment score is greater than the first predetermined threshold, prompt letter is sent to first object patent databaseBreath, wherein the prompt information is first document.
Said one or multiple technical solutions in the embodiment of the present application at least have following one or more technology effectsFruit:
1. a kind of high value patent provided by the embodiments of the present application automatically obtains method and apparatus, which comprisesA target literature is obtained, the target literature includes the first keyword;First object patent is obtained according to first keywordDatabase;The first document is obtained from the first object patent database;According to first document, first text is obtainedClaim quantity, claim number of words and the specification number of words offered;According to claim quantity, the right of first documentIt is required that number of words and specification number of words, obtain the first weighted value, the second weighted value and the third weighted value of first document, and reallyFirst value assessment score of fixed first document;Judge whether the first value assessment score is greater than the first predetermined thresholdValue;When the first value assessment score is greater than the first predetermined threshold, prompt information is sent to first object patent database,Wherein the prompt information is first document.Through the invention, when solving high value patent retrieval in the prior art, easilyOccur range of search it is excessive, denoise heavy workload, retrieval inaccuracy, or in retrieving not to keyword expansion or on, there is the technical issues of missing inspection in position, and it is convenient to have reached operation, and recall precision is high, the accurate technical effect of search result.
2. the embodiment of the present application after obtaining the first document in the first object patent database, is wrapped by describedIt includes: according to the first searching platform, obtaining the License Info of first document;Judge whether first document is permitted with patentIt can;If first document has patent grant, the second value scoring of first document is obtained.It realizes by license timeSeveral pairs of high value patents are screened, and the technical effect that patent value is precisely judged from permitted number has further been reached.
3. the embodiment of the present application after obtaining the first document in the first object patent database, is also wrapped by describedIt includes: according to the first searching platform, obtaining the actionable information of first document;According to the actionable information, described first is obtainedThe third of document is worth scoring.The technical effect that patent value is precisely judged from patent stability is further reached.
The above description is only an overview of the technical scheme of the present invention, in order to better understand the technical means of the present invention,And it can be implemented in accordance with the contents of the specification, and in order to allow above and other objects of the present invention, feature and advantage canIt is clearer and more comprehensible, the followings are specific embodiments of the present invention.
Detailed description of the invention
Fig. 1 is a kind of flow diagram for automatically obtaining method of high value patent in the embodiment of the present invention;
Fig. 2 is a kind of structural schematic diagram for automatically obtaining device of high value patent in the embodiment of the present invention;
Fig. 3 is the structural schematic diagram for automatically obtaining device of another high value patent in the embodiment of the present invention.
Specific embodiment
The embodiment of the invention provides a kind of method and apparatus that automatically obtain of high value patent, technologies provided by the inventionScheme general thought is as follows: by obtaining a target literature, the target literature includes the first keyword;It is closed according to described firstKeyword obtains first object patent database;The first document is obtained from the first object patent database;According to describedOne document obtains claim quantity, claim number of words and the specification number of words of first document;According to first textClaim quantity, claim number of words and the specification number of words offered obtain the first weighted value, the second power of first documentWeight values and third weighted value, and determine the first value assessment score of first document;Judge first value assessment pointWhether number is greater than the first predetermined threshold;It is special to first object when the first value assessment score is greater than the first predetermined thresholdSharp database sends prompt information, wherein the prompt information is first document.It is special to solve high value in the prior artWhen benefit retrieval, it is excessive range of search easily occur, denoises heavy workload, retrieval inaccuracy, or not to key in retrieving, there is the technical issues of missing inspection in word extension or upper, and it is convenient to have reached operation, and recall precision is high, the accurate technology of search resultEffect.
Technical solution of the present invention is described in detail below by attached drawing and specific embodiment, it should be understood that the applicationSpecific features in embodiment and embodiment are the detailed description to technical scheme, rather than to present techniquesThe restriction of scheme, in the absence of conflict, the technical characteristic in the embodiment of the present application and embodiment can be combined with each other.
Embodiment one
Fig. 1 is a kind of flow diagram of high value patent automatically obtained in the embodiment of the present invention.As shown in Figure 1, instituteThe method of stating includes:
Step 110: obtaining a target literature, the target literature includes the first keyword.
Specifically, user can according to actual needs, searching one or more documents and materials or patent document are as baseThe target literature of plinth can be retrieved by the target literature and obtain target patent.In target literature Automatic sieve select one orMultiple keywords, the keyword can be the information such as some keyword, inventive point, the field in the target literature.
Step 120: first object patent database is obtained according to first keyword.
Specifically, being retrieved in searching platform or database according to the first keyword obtained, pass through retrievalCorresponding patent database is obtained, which may include all patent documents announced, can also be by artificialOperation, screens the patent document of authorization, unauthorized and failure.
Step 130: obtaining the first document from the first object patent database.
Further, described after obtaining the first document in the first object patent database, comprising: according to firstSearching platform obtains the License Info of first document;Judge whether first document has patent grant;If describedFirst document has patent grant, obtains the second value scoring of first document.Further, described from first meshIt marks after obtaining the first document in patent database, further includes: according to the first searching platform, obtain the lawsuit of first documentInformation;According to the actionable information, the third for obtaining first document is worth scoring.
Specifically, every patent document is selected from target patent database, it is special by searching platform automatically retrieval everyThe patent licensing information and actionable information of sharp document, pass through patent licensing information if patent has License Info and are sentenced automaticallyIt is disconnected, obtain the second value scoring of the patent.According to the actionable information of patent, the third value scoring of the document is obtained, automaticallyJudge the stability of the patent.
Step 140: according to first document, obtaining claim quantity, the claim number of words of first documentAnd specification number of words.
Step 150: according to the claim quantity, claim number of words and specification number of words of first document, obtainingThe first weighted value, the second weighted value and the third weighted value of first document, and determine the first value of first documentAssess score.
Specifically, passing through the quantity and claim and specification of retrieving the claim for automatically obtaining the patent documentNumber of words, the first weighted value, first weighted value are as follows: the power of target patent are determined by the claim quantity of target patentSharp requested number × shared score value ratio determines the second weight of target patent by the number of words of target patent claimsValue, second weighted value are as follows: the number of words of target patent claims × shared score value ratio passes through target patent specificationNumber of words determine the third weighted value of target patent, the third weighted value are as follows: the number of words of target patent specification × shared pointValue ratio obtains the first value assessment point of target patent according to first weighted value, the second weighted value and third weighted valueNumber.
Step 160: judging whether the first value assessment score is greater than the first predetermined threshold.
Step 170: when the first value assessment score is greater than the first predetermined threshold, to first object patent databasePrompt information is sent, wherein the prompt information is first document.
Specifically, one predetermined threshold of setting, when the first value assessment score of target patent is greater than the predetermined threshold,The patent document is sent to the first object patent database, determines that this patent document is qualified document.TogetherWhen, the second keyword is obtained in the retrieval history of patent retrieval platform according to user, it will the first valence relevant to the second keywordFor the high patent of value assessment score to user's pushed information, pushed information includes the patentee of the patent, abstract of description, patentThe information such as License Info and actionable information.
Further, the method also includes: according to the patented power people's information of the first document, wherein by it is described speciallyLi quanren's information judges the property of patentee;When the property of the patentee meets the first predetermined condition, to the first meshIt marks patent database and sends prompt information, wherein the prompt information is first document.
Specifically, this method can also pass through the information of patentee by the information of the patented power people of the first documentPatented power human nature matter determines that the patentee is school, research institute, enterprise or individual etc., when the property of the patenteeWhen matter meets preset condition, the patent is sent to the first object patent database automatically.
Further, the method also includes: according to the patent licensing information of first document, actionable information obtain instituteState the 4th weighted value, the 5th weighted value of the first document;According to the first to five weighted value, the first value scoring, instituteThe scoring of the second value and third value scoring are stated, determines the 4th value assessment score of first document;Described in judgementWhether the 4th value assessment score is greater than the first predetermined threshold;When the 4th value assessment score is greater than the first predetermined thresholdWhen, prompt information is sent to first object patent database, wherein the prompt information is first document.
Specifically, the 4th weighted value are as follows: the patent licensing information of first document × shared score value ratio, it is described5th weighted value are as follows: the actionable information of first document × shared score value ratio passes through all weighted values and all score values ratioExample carries out comprehensive descision to first document, obtains the 4th value assessment score, and a scheduled threshold value is arranged, when described theWhen 4th value scoring of one document is greater than the threshold value, first document is sent to the first object patent database.
Further, the method also includes: from first searching database obtain the first document;Judge describedSimilarity between one document and target retrieval document;When the similarity meets the first predetermined condition, described first is examinedRope database column is target database.Further, similar between the judgement first document and target retrieval documentDegree, further includes: semantic analysis is carried out according to the claim of first document and the target retrieval document, obtains the first phaseLike paragraph;Determine the first ratio of the number of words of the claim of the described first similar paragraph and the target retrieval document;JudgementWhether first ratio is greater than the first predetermined threshold;When first ratio is greater than the first predetermined threshold, described the is obtainedThe second similarity between one document and target retrieval document.
Specifically, the keyword more than the wherein frequency of occurrences is found in the literature content by semantic analysis, thenThe keyword more than the wherein frequency of occurrences is found in the target retrieval document, the keyword of the two is compared, and is obtained whereinSimilarity, the similarity be the first similarity, if the keyword is identical, or be synonym first similarity valueIt is just big.Other than being compared to keyword, also further the claim content of the two is compared, makes search result moreIt is accurate to add, and implements process are as follows: the claim of the document and the target retrieval document is subjected to semantic analysis respectively,It therefrom searches and contrasts the high paragraph of content similarity, then the higher paragraph of the similarity is subjected to number of words comparison, obtain describedSecond similarity of the high paragraph of similarity, if number of words is also close, the second similarity ratio is big, finally judges described firstSimilarity and which numerical value of the second similarity are bigger, and choosing is wherein biggish to be used as the document and the target retrieval documentFinal similarity degree.The similarity value obtained by comparison is preset similarity with searching system to compare, is judgedWhether the literature content retrieved and the target retrieval document are close, finally will acquire the target retrieval by searching for automaticallyThe target retrieval content of document, system automatically retrieval is more comprehensive, and missing inspection, false retrieval caused by avoiding human factor from being added etc. is askedTopic, to solve in the prior art, retrieving is manually operated, and carries out manual search according to title or keyword, then willSearch result carries out finishing analysis, and there is retrieval, time-consuming, and the technical issues of be easy to appear missing inspection, has reached and is automaticallySystem retrieval, retrieval comparison is more careful, and search result is more acurrate, avoids occurring because the unstable factor being artificially introduced missing inspection and showsAs improving the technical effect of recall precision.
Further, the method also includes: according to the target retrieval document, obtain expansion word range;From describedThe first expansion word is obtained according to the first rule in one searching database, wherein first expansion word is in the expansion word rangeIt is interior;The second searching database is obtained according to first expansion word;According to second searching database and first retrievalDatabase obtains target database.
Specifically, by judging the full text text meaning of word and description, determining the mesh according to target retrieval documentMark technical field locating for search file.The technological know-how that fields are used is judged by the technical field, so that it is determined thatTechnical tool dictionary.Then the range of the keyword of the core technology in patent document is determined by the technical tool dictionary,That is expansion word range.Multiple patent documents are retrieved from first searching database by the first term, it will be described moreA patent document carries out semantic analysis, mainly judges the keyword of the core technology in patent document, from the keyword reallyFixed multiple expansion words to patent searching, such as denomination of invention, technical field, abstract of description.Judge word in multiple expansion wordsIt anticipates same or similar word, and the highest expansion word of multiplicity expands as the first expansion word, described first in multiple expansion wordsWord is opened up within the scope of the expansion word.Wherein, first expansion word is similar word, e.g., polyethylene with first termWith thermoplastic resin etc..The first expansion word is judged whether within the scope of the expansion word, when first expansion word is in the expansionIt can be the second searching database according to the database of the first expansion word patent searching document when opening up within the scope of word.Pass through the first inspectionThe intersection of first searching database that rope word determines and second searching database determined by the first expansion word canTo obtain the target database of target retrieval document, retrieved by second searching database and first searching databasePatent document accuracy it is high.The weighting of the target database is calculated according to first weighted value and second weighted valueValue, the accuracy of the target database is determined by the weighted value.
Further, the method also includes: according to target retrieval document, obtain skill locating for the target retrieval documentArt field;Technical tool dictionary is obtained according to the technical field;It is obtained according to the technical tool dictionary and the first keywordFirst expansion word;First, which is obtained, according to the target retrieval document, the first keyword and the first expansion word compares document;According to instituteIt states the first searching database and obtains the first document;Judge that first document and first compares the similarity of document;When the phaseWhen meeting the first predetermined condition like degree, first document is stored in target database.
Specifically, being obtained described in the target retrieval document by the content analysis of the target retrieval documentParticular content belong to a certain technical field, the high data of the degree of correlation can be further searched for by determining technical field and believedBreath excludes invalid information.The skill is found out accordingly according to particular technique field belonging to the target retrieval document judgedThe technical tool dictionary in art field, the technical tool dictionary are all related major terms in the technical field, proprietary spySign, technical term etc., i.e., include all core contents and the keyword in the technical field comprehensively.
Searched in the technical tool dictionary with the synonym of first keyword or similar import, play identical workWith equal correlation words, which is the similar word of first keyword, the similar word be it is multiple, for example, if crucialWord is " nail ", can search similar word in related-art tool dictionary, such as screw, bolt it is multiple close orPerson acts on identical similar word.Then the multiple similar words found out are subjected to semantic analysis again, are found out and first keyWord looks like close multiple expansion words, finally by the number that the multiple expansion words determined by semantic analysis are carried out with frequency of occurrenceAmount statistics, using the highest expansion word of the most multiplicities of frequency of occurrence as the first expansion word, first expansion word be with it is describedThe high similar word of the close degree of first keyword.
It will be existed by first expansion word obtained in conjunction with the target retrieval document and first keywordIt is scanned in large database concept, finds the first of the condition of satisfaction and compare document, described first, which compares document, is and the targetThe higher document information of search file matching degree can be used as the destination document of classification reference.Crucial by described firstWord is retrieved in large database concept and recalls pertinent literature in first searching database obtained, and the document is to examine with targetRope document has certain relevance, includes the documents and materials of first keyword in content.To in first searching databaseThe first document compare document with described first and be compared, carry out semantic analysis in first document first, obtainThen plurality of first keyword out compares document content to described first and carries out semantic analysis, show that described first comparesMultiple second keywords occurred in document, finally to the multiple first keyword and the multiple second keyword successively intoRow semantic analysis obtains the similarity degree of the multiple first keyword and the multiple second keyword, to its similarity degreeQuantify to obtain the first similarity numerical value between the multiple first keyword and the multiple second keyword by calculating, thisThe similarity that value compares document with described first as first document.
Obtained first document and described first is compared preset first in the similarity and system of documentPredetermined condition is compared, and first predetermined condition can be preset similarity threshold.When first document withWhen described first similarity for comparing document meets the first predetermined condition, then first document is to compare document with described firstBelong to same technical field, the big documents and materials of content relevance, then using first document as target literature typing number of targetsAccording in library;If the similarity that first document compares document with described first is unsatisfactory for lower than first predetermined conditionWhen condition, first document is not to be inconsistent document, then does not enter in the target database, be deleted.
Further, the method also includes: the first keyword is obtained from automatically retrieval document;It is closed according to described firstKeyword determines the first searching database;The first document is determined from first searching database;Judge first document andThe similarity of target retrieval document;When the similarity meets predetermined condition, the second keyword is obtained from the first document,In the first keyword and second keyword belong to same technical field.
Specifically, by the searching system for the document typing automatically retrieval keyword for needing to retrieve, by system to instituteIt states target retrieval document content analysis and obtains keyword therein, as the first keyword, first keyword can be markThe more word of the frequency of occurrences in the subject or document of topic, or state word by the core effect that semantic analysis goes outEtc..After obtaining first keyword, first keyword is reaffirmed, first be examined according to the targetRope document determines the particular technique field of its content description, finds out the skill accordingly according to the particular technique field judgedThe technical tool dictionary in art field, the technical tool dictionary are all related major terms in the technical field, proprietary spySign, technical term etc., i.e., include all keywords in the technical field, then in the technical tool dictionary comprehensivelyAll keywords in the technical field where the target retrieval document are searched, with first keyword found outIt compares and analyzes, judges whether first keyword includes the keyword range found out in the described technical fieldInterior, if first keyword is within the scope of the keyword, first keyword is effective keyword, if not describedIt in keyword range, is then continued to search for invalid keyword needs, it is known that find effective first keyword, then use instituteIt states the first keyword to be retrieved in the large database concept of internet document, obtains all documents about first keywordSet forms the first searching database, and first searching database is all documents retrieved after keyword recognitionSet, ensure that the comprehensive and correctness of retrieval.
Phase is recalled being retrieved in first searching database obtained in large database concept by first keywordDocument is closed, the document is the documents and materials for having certain relevance with target retrieval document, from first searching databaseIn find out corresponding document, the document particular content high to the degree of association in first searching database carries out successively right respectivelyThan analysis, the similarity degree between the document and the target retrieval document in first searching database, the phase are judgedIt carries out being quantified as specific data like degree system.
Similarity threshold is preset in system, is compared according to the predetermined condition of obtained similarity and setting,When the similarity numerical value of document and the target retrieval document in first searching database meets predetermined condition, it is determined thatThe document is effective document.After effective documents have been determined, then the second keyword is searched from the document, it is describedSecond keyword is different keywords from first keyword, but belongs to same technical field, is all from determining technologyThe keyword obtained is analyzed in first searching database that field retrieves.
Further, the method also includes: the first classification number is determined according to the target retrieval document;According to describedOne document determines the second classification number;Judge whether first classification number and the second classification number are approximate classification number;When describedOne classification number and the second classification number are not approximate classification number, and first document is deleted from the first object database.
Further, described to judge whether first classification number and the second classification number are approximate classification number, comprising: according toFirst classification number determines the portion that the target retrieval document included, major class, group, big group, the first meaning of group;RootThe Secondary Meaning in the portion, major class, group, big group, group that first document included is determined according to second classification number;JudgementFirst meaning and the Secondary Meaning whether semantic similarity;When first meaning and the Secondary Meaning semanteme be not closeWhen, first classification number and the second classification number are not approximate classification number.
Specifically, obtaining technical field locating for the target retrieval document, then root according to target retrieval document firstTechnical tool dictionary is obtained according to the technical field, and then obtains the range of keyword, then judges that first keyword isIt is no in the range of the keyword, when first keyword is within the scope of the keyword, on patent retrieval websiteFirst keyword is inputted to scan for, so that the first object database comprising first keyword is obtained,In, patent document largely comprising first keyword is had collected in the first object database.Obtaining described theAfter one target database, in several patent documents comprising first keyword in the first object database,The patent document comprising first keyword is arbitrarily picked out as first document;At the same time, according to describedThe technical field that target retrieval document is determined, and then determine first classification number, it then opens selectFirst document, and then determine the second classification number of first document.Again by first classification number and described secondClassification number is compared, and analyzes and determines out whether first classification number and second classification number are approximate classification number.TrueIt makes first classification number and when the second classification number is not approximate classification number, that is, can determine first document and the targetThe semanteme of search file is not close, it may also be said to which first document is uncorrelated to the content of the target retrieval document, at this timeWith regard to first document is deleted from the first object database.
Further, the method also includes: the first classification number is determined according to first document;According to described first pointClass-mark determines the portion that first document included, major class, group, big group, the first meaning of group;To first meaning withThe target retrieval document carries out semantic analysis, wherein when first meaning and the target retrieval document semantic are kept off,First document is deleted from the first object database.
Specifically, first being contained by the classification number middle part of the first determining document, major class, group, big group, groupJustice so that whether judge the first document identical as the semanteme of the target retrieval document, and then reaches the denoising of the first documentPurpose.
Further, which comprises determine that patent document quantity is arranged according to classification number according to first object databaseName;Obtain least first classification number of patent quantity of document in the classification number;From the patent document of first classification numberObtain the first document;Judge the first similarity of first document Yu target patent document;When first similarity is less thanWhen predetermined threshold, the patent document that first classification number includes is deleted from first object database.
Specifically, the target patent document is the patent document that user wants retrieval, the first object databaseFor the database comprising the target patent document, the Q for the patent document for including in the first object database is then determinedA classification number, wherein Q is positive integer, all special by include in the first object database according still further to the Q classification numberSharp document is sorted out, to obtain the corresponding patent document quantity of the Q classification number, and to the Q classification number pairThe patent document quantity answered carries out ranking by ascending order, and then obtains patent quantity of document least first in the Q classification numberClassification number, wherein first classification number is included in the Q classification number, is one of classification of the Q classification numberNumber, and the corresponding patent document minimum number of first classification number.It is retrieved from the patent document of first classification numberThe first document is obtained, the first similarity of first document and the target patent document is analyzed and determined, that is, is exactly right respectivelyThe title of first document and the target patent document, description carry out semantic analysis, determine first textFirst similarity with the target patent document is offered, when first similarity is less than predetermined threshold, by described theThe patent document that one classification number includes is deleted from first object database.
Further, which comprises according to the patented power people's information of the first document, wherein pass through the patentPower people's information judges the property of patentee;It is special to first object when patentee's information meets the first predetermined conditionSharp database sends prompt information, wherein the prompt information is first document.
Specifically, obtaining every patent text by the retrieval to every patent document in patent database obtainedThe patentee's information and transfer history offered, preset a threshold value, when patent transfer the possession of number be higher than the threshold value when, to the patent intoRow scoring obtains the first value scoring of the patent.The patentee or applicant of the patent are obtained by searching platformProperty and the number being cited, then judge that the second value of the patent scores by citation times.When the first document meetsWhen the second value assessment score, the first document is sent to the first object patent database, the document is saved, and mentionsShow that user's document meets retrieval and requires.Meanwhile the second keyword is obtained in the retrieval history of patent retrieval platform according to user,By the high patent of the second value assessment score relevant to the second keyword to user's pushed information, pushed information includes the patentThe information such as patentee, abstract of description, patentee's transfer history.
Embodiment 2
Based on the same inventive concept of method is automatically obtained with high value patent a kind of in previous embodiment, the present invention is alsoThere is provided a kind of high value patent automatically obtains device, as shown in Fig. 2, described device includes:
First obtains unit, the first obtains unit obtain a target literature, and the target literature includes first crucialWord;
Second obtaining unit, second obtaining unit obtain first object patent data according to first keywordLibrary;
Third obtaining unit, the third obtaining unit obtain the first document from the first object patent database;
4th obtaining unit, the 4th obtaining unit obtain the right of first document according to first documentRequested number, claim number of words and specification number of words;
5th obtaining unit, the 5th obtaining unit is according to claim quantity, the claim of first documentNumber of words and specification number of words obtain the first weighted value, the second weighted value and the third weighted value of first document, and determine instituteState the first value assessment score of the first document;
First judging unit, it is predetermined that first judging unit judges whether the first value assessment score is greater than firstThreshold value;
First processing units, when the first value assessment score is greater than the first predetermined threshold, first processing is singleMember sends prompt information to first object patent database, wherein the prompt information is first document.
Further, described device further include:
6th obtaining unit, the 6th obtaining unit is according to the patented power people's information of the first document, wherein passes through instituteState the property that patentee's information judges patentee;
The second processing unit, when the property of the patentee meets the first predetermined condition, described the second processing unitPrompt information is sent to first object patent database, wherein the prompt information is first document.
Further, the third obtaining unit further include:
7th obtaining unit, the 7th obtaining unit obtain the license of first document according to the first searching platformInformation;
Second judgment unit, the second judgment unit judge whether first document has patent grant;
8th obtaining unit, if first document has a patent grant, the 8th obtaining unit obtains described theSecond value scoring of one document.
Further, the third obtaining unit further include:
9th obtaining unit, the 9th obtaining unit obtain the lawsuit of first document according to the first searching platformInformation;
Tenth obtaining unit, the tenth obtaining unit obtain the third of first document according to the actionable informationValue scoring.
Further, described device further include:
11st obtaining unit, patent licensing information of the 11st obtaining unit according to first document, lawsuitThe 4th weighted value, the 5th weighted value of first document described in information acquisition;
First determination unit, first determination unit according to the first to five weighted value, it is described first value scoring,The second value scoring and third value scoring, determine the 4th value assessment score of first document;
Third judging unit, it is predetermined that the third judging unit judges whether the 4th value assessment score is greater than firstThreshold value;
Third processing unit, when the 4th value assessment score is greater than the first predetermined threshold, the third processing is singleMember sends prompt information to first object patent database, wherein the prompt information is first document.
The various change mode for automatically obtaining method of one of 1 embodiment 1 of earlier figures high value patent and specific realityA kind of high value patent that example is equally applicable to the present embodiment automatically obtains device, by aforementioned to a kind of high value patentAutomatically obtain the detailed description of method, those skilled in the art are clear that a kind of high value patent in the present embodimentThe implementation method of device is automatically obtained, so this will not be detailed here in order to illustrate the succinct of book.
Embodiment 3
Based on the same inventive concept of method is automatically obtained with high value patent a kind of in previous embodiment, the present invention is alsoThere is provided another high value patent automatically obtains device, computer program is stored thereon with, when which is executed by processorRealize a kind of the step of automatically obtaining either method method of high value patent described previously.
Wherein, in Fig. 3, bus architecture (is represented) with bus 300, and bus 300 may include any number of interconnectionBus and bridge, bus 300 will include the one or more processors represented by processor 302 and what memory 304 represented depositsThe various circuits of reservoir link together.Bus 300 can also will peripheral equipment, voltage-stablizer and management circuit etc. itVarious other circuits of class link together, and these are all it is known in the art, therefore, no longer carry out further to it hereinDescription.Bus interface 306 provides interface between bus 300 and receiver 301 and transmitter 303.Receiver 301 and transmitter303 can be the same element, i.e. transceiver, provide the unit for communicating over a transmission medium with various other devices.
Processor 302 is responsible for management bus 300 and common processing, and memory 304 can be used for storage processor302 when executing operation used data.
Said one or multiple technical solutions in the embodiment of the present application at least have following one or more technology effectsFruit:
1. a kind of high value patent provided by the embodiments of the present application automatically obtains method and apparatus, which comprisesA target literature is obtained, the target literature includes the first keyword;First object patent is obtained according to first keywordDatabase;The first document is obtained from the first object patent database;According to first document, first text is obtainedClaim quantity, claim number of words and the specification number of words offered;According to claim quantity, the right of first documentIt is required that number of words and specification number of words, obtain the first weighted value, the second weighted value and the third weighted value of first document, and reallyFirst value assessment score of fixed first document;Judge whether the first value assessment score is greater than the first predetermined thresholdValue;When the first value assessment score is greater than the first predetermined threshold, prompt information is sent to first object patent database,Wherein the prompt information is first document.Through the invention, when solving high value patent retrieval in the prior art, easilyOccur range of search it is excessive, denoise heavy workload, retrieval inaccuracy, or in retrieving not to keyword expansion or on, there is the technical issues of missing inspection in position, and it is convenient to have reached operation, and recall precision is high, the accurate technical effect of search result.
2. the embodiment of the present application after obtaining the first document in the first object patent database, is wrapped by describedIt includes: according to the first searching platform, obtaining the License Info of first document;Judge whether first document is permitted with patentIt can;If first document has patent grant, the second value scoring of first document is obtained.It realizes by license timeSeveral pairs of high value patents are screened, and the technical effect that patent value is precisely judged from permitted number has further been reached.
3. the embodiment of the present application after obtaining the first document in the first object patent database, is also wrapped by describedIt includes: according to the first searching platform, obtaining the actionable information of first document;According to the actionable information, described first is obtainedThe third of document is worth scoring.The technical effect that patent value is precisely judged from patent stability is further reached.
It should be understood by those skilled in the art that, the embodiment of the present invention can provide as method, system or computer programProduct.Therefore, complete hardware embodiment, complete software embodiment or reality combining software and hardware aspects can be used in the present inventionApply the form of example.Moreover, it wherein includes the computer of computer usable program code that the present invention, which can be used in one or more,The computer program implemented in usable storage medium (including but not limited to magnetic disk storage, CD-ROM, optical memory etc.) producesThe form of product.
The present invention be referring to according to the method for the embodiment of the present invention, the process of equipment (system) and computer program productFigure and/or block diagram describe.It should be understood that every one stream in flowchart and/or the block diagram can be realized by computer program instructionsThe combination of process and/or box in journey and/or box and flowchart and/or the block diagram.It can provide these computer programsInstruct the processor of general purpose computer, special purpose computer, Embedded Processor or other programmable data processing devices to produceA raw machine, so that being generated by the instruction that computer or the processor of other programmable data processing devices execute for realThe device for the function of being specified in present one or more flows of the flowchart and/or one or more blocks of the block diagram.
These computer program instructions, which may also be stored in, is able to guide computer or other programmable data processing devices with spyDetermine in the computer-readable memory that mode works, so that it includes referring to that instruction stored in the computer readable memory, which generates,Enable the manufacture of device, the command device realize in one box of one or more flows of the flowchart and/or block diagram orThe function of being specified in multiple boxes.
These computer program instructions also can be loaded onto a computer or other programmable data processing device, so that countingSeries of operation steps are executed on calculation machine or other programmable devices to generate computer implemented processing, thus in computer orThe instruction executed on other programmable devices is provided for realizing in one or more flows of the flowchart and/or block diagram oneThe step of function of being specified in a box or multiple boxes.
Obviously, various changes and modifications can be made to the invention without departing from essence of the invention by those skilled in the artMind and range.In this way, if these modifications and changes of the present invention belongs to the range of the claims in the present invention and its equivalent technologiesWithin, then the present invention is also intended to include these modifications and variations.