A kind of conventional medicament patent information searching system and search methodTechnical field
The present invention relates to a kind of conventional medicament patent information searching system and search method, particularly a kind of system and method for retrieving the similarity prescription in the conventional medicament patent database.That is, in a conventional medicament patent database through prescription index processing, the patent No. of one piece of given patent is retrieved, can retrieve to this piece patent in the similar patent of prescription information.
Background technology
The described conventional medicament of this patent is meant self-sow, naturally occurring, unartificial synthetic medicament categories, and each similar drug that conventional medicament forms after artificial all belongs to the conventional medicament field.
The conventional medicament that with the Chinese medicine is representative has the applicating history in more than 5000 year, is the product of putting into practice science, having brought into play important role aspect the treatment human diseases, compares with synthetic drug, has advantages such as safety, low toxicity.The contained unartificial synthetic ingredient of natural drug more helps absorption by human body and utilization; especially be suitable for treating the comprehensive pathology of multisystem, many organs, many tissues; to spirit nerve, endocrine, immune system, viral disease and functional, agnogenic illness etc.; as tumour, acquired immune deficiency syndrome (AIDS) etc.; also can bring into play better curative effect; often can replenish the defective and the deficiency of modern biomedical; sometimes even play irreplaceable effect; and meet the requirement of new century " protection ecological, back to nature ".At present, develop a kind of new chemical synthetic drug, particularly develop a kind of new antibiotic, often to spend more than one hundred million dollars fund, and the inevitable bigger toxic and side effect of synthetic drug existence, therefore, people begin the diversion of new drug development to the conventional medicament aspect.Because various countries more and more pay attention to the research of conventional medicament, the amount of the application for patent of conventional medicament aspect all had very big growth every year in recent years, about 15% as the annual increment of the U.S., and the year increment of the conventional medicament big country China in east, Japan, Korea S is more than 30%.
Enlivening of conventional medicament technical research, the increase day by day of amount of the application for patent makes the retrieval to conventional medicament information require also corresponding raising, the prescription prescription information that occurs in the patent is carried out similarity retrieval become important retrieval requirement.Because the conventional medicament title is extremely complicated; identical conventional medicament prescription may adopt multiple different medicine name, keyword and indication to carry out patent protection; therefore there is a large amount of of the same name different sides in the conventional medicament patent or with square different name situation; this type of patent information particularly recall ratio and the precision ratio of prescription information retrieval is all very poor, causes a large amount of the repeat research and development and the wastings of resources.
The retrieval technique of at present relevant patent information is mainly at the generic patent information database, and is rarer particularly open at the retrieval technique of conventional medicament database at specialized database.The patent of Chinese patent CN03101527 " method of Patent document data retrieval " by name, this patent is at the generic patent database, set up a key word classification summary table related with technology category, search key and key word classification summary table are compared, to determine its technology category, from the correlation technique classification, retrieve the target patent.
The patent of Chinese patent CN 200410069014 " method of intellectual property search management, system and Storage Medias " by name, this patent provides a kind of method of intellectual property search management, also, be used for an intellecture property database that comprises patent data and carry out search management at the generic patent database.
The patent of Chinese patent CN 200610024619 " a kind of methods of utilizing the IPC sorted patent searching " by name, this patent adopts the mode of IPC classification number and key problem in technology word association dictionary, retrieval generic patent database.
Chinese patent CN 200610040349 by name " to patent genes or gene patent retrieve, note and data mining method " patent, this patent is to patent genes object such as patent sequence, patent microarray, patent single nucleotide polymorphism (SNP), patent motif and gene patent is retrieved, note and data mining method.
The patent of Chinese patent CN 200510032977 " Grammatical transformation methods of patent information retrieval " by name, this patent is at the generic patent database, the Grammatical transformation method of patent information retrieval is provided, is used for the difference retrieval grammer that the retrieval grammer is automatically converted on the patent information website, various countries is retrieved.
Because the specialized database particularly foundation of conventional medicament patent information database and the foundation of retrieval technique has suitable difficulty, the therefore at present relevant conventional medicament patent information database particularly retrieval technique of prescription information of traditional medicine patents database still belongs to blank.
In order to overcome the difficulty and the shortcoming of conventional medicament patent information and prescription information retrieval thereof, realize the prescription similarity retrieval of conventional medicament patent, a kind of searching system and search method of mating by retrieval precision requirements that the conventional medicament prescription is carried out need be provided.
Summary of the invention
For reaching above-mentioned purpose, the invention provides a kind of conventional medicament patent information searching system and search method, it can carry out patent prescription similarity retrieval by the patent No. in the conventional medicament patent database through index processing.
The present invention at first provides a kind of conventional medicament patent information searching system.This system comprises a plurality of client computers, an application server and a database storing equipment.
Application server links to each other with database storing equipment by connecting, and carries out the exchange of search instruction and data message.Client computer links to each other with application server by network, and interactive user interface is provided, and by the access to netwoks application server, sends retrieval and requires and check result for retrieval.The content of database storing device storage comprises conventional medicament patent information database and conventional medicament prescription information database, the supply server calls.
Application server is used for the patent of conventional medicament patent information database is retrieved, and it also comprises Query Information receiver module, patent information extraction module, prescription information extraction modules, conventional medicament information extraction modules, prescription information inquiry module and patent information enquiry module.The Query Information receiver module is used to receive user's Query Information and matching precision information is set, and offers database retrieval system.The patent information extraction module is used for system and searches corresponding patent information by user's search request at the conventional medicament patent information database.The prescription information extraction modules is used for according to the patent information that detects then from the corresponding prescription information of prescription information database extraction.The conventional medicament information extraction modules is used for the instruction according to system, extracts conventional medicament information from the conventional medicament registered database, forms the conventional medicament tabulation.Prescription information inquiry module is used for searching whole other prescriptions that contain this drug information according to the drug information of the prescription information of extracting in the prescription information database, and chooses satisfactory prescription subclass by matching precision.The patent information enquiry module is used for searching the patent that comprises this prescription subset information according to the prescription subclass of choosing at the conventional medicament patent database, for showing output.
The present invention also provides a kind of conventional medicament patent information search method, this method may further comprise the steps: (1) carries out index processing to gyp world conventional medicament patent bibliographic database in advance, form the conventional medicament database cluster, comprise conventional medicament patent information database, prescription information database and conventional medicament registered database; (2) each database in the related conventional medicament database cluster; (3) user is needed the patent No. of inquiry by the client computer input; (4) in the conventional medicament patent information database, search corresponding patent information according to the patent No. of input; (5) the prescription information of the described patent information of extraction in the prescription information database; (6) according to described prescription information, in the conventional medicament registered database, extract conventional medicament information, form the conventional medicament tabulation; (7) described conventional medicament name list is provided with prescription match retrieval accuracy value; (8) in the prescription information database, search with described tabulation in whole other prescriptions of medicine prescription coupling; (9) matching precision of determining by the user in the prescription information database is chosen the prescription subclass under the medicine with described medicine list match; (10) in the conventional medicament patent database, find out the patent subclass that comprises this prescription subset information; (11) show the described patent information subclass that meets matching precision at client computer.
In order to realize the retrieval of prescription information, need carry out the preprocessing index to patent information, formation can be used for the conventional medicament database cluster of prescription similarity retrieval, and this conventional medicament database cluster comprises conventional medicament patent information database, prescription information database and conventional medicament registered database.Wherein the conventional medicament patent information database is the deep processing specialized database that the conventional medicament patent bibliographic database index of the gyp world is processed to form, and its database contains the content of following field: go into to hide number (LDN), piece of writing name (TI), digest (AB), number of patent application (AP), patent announcement number (PN), patented claim day (AD), patent announcement day (PD), applicant's title (PA), address of the applicant (ADDR), applicant country/provinces and cities' code (PAC), inventor's title (INR), priority number (PRN), international monopoly Main classification number (IC1), subsidiary classification number (IC2), quote patent (CT) as proof, cited literature 2 (CP), international monopoly is specified country (DS), patented subject matter (IT), subject code (ITC), therapeutic action (EFF), patent accompanying drawing (IMG).Wherein the prescription information database is the deep processing specialized database that the conventional medicament patent bibliographic database index of the gyp world is processed to form, and its database contains the content of following field: number of patent application (AP), medicine name, medication amount, drug ratio.Wherein the conventional medicament registered database is the database of all conventional medicaments that occur in the patent documentation being registered formation, the registration field contents comprises fields such as the English name of conventional medicament, English another name, Chinese, Chinese another name, latin name, Latin plant name, Chinese phonetic alphabet name, property of medicine Gui Jing, and according to the only code system of conventional medicament it is classified.In above-mentioned three databases, the conventional medicament patent information database is related with the prescription information database by number of patent application (AP), the prescription information database is related with the conventional medicament registered database by medicine name, by these related cross search that realizes between database of the present invention.
Utilize database cluster of the present invention, system and method, can the patent retrieval similar to the prescription information that provides in this piece patent be come out by importation patent number.
Description of drawings
Fig. 1 is the hardware structure figure of the searching system and the search method of conventional medicament patent information of the present invention.
Fig. 2 is the searching system of conventional medicament patent information of the present invention and the application server functionality module map of search method.
Fig. 3 is the operation process chart of the searching system and the search method of conventional medicament patent information of the present invention.
Embodiment
As shown in Figure 1, be the hardware structure figure of the searching system of conventional medicament patent information of the present invention.This system comprisesapplication server 10,database storing equipment 12,connection 14, a plurality ofclient computer 13, network 15.Application server 10 links to each other withdatabase storing equipment 12 by connecting 14, and a plurality ofclient computers 13 pass throughnetwork 15 and link to each other with application server 10.Connecting 14 is that a kind of database connects, and can be ODBC or other database connected modes.Network 15 is a kind of virtual network or physical network, can be LAN (Local Area Network) or the Internet.
Client computer 13 is used to provide interactive interface, is convenient to the user and carries out corresponding operation, input search instruction and demonstration result forretrieval.Application server 10 is nucleus equipments of the searching system search operaqtion of this conventional medicament patent information, it receives the patent No. ofclient computer 13 inputs, in the conventional medicamentpatent information database 121 ofdatabase storing equipment 12, find corresponding with it patent, and turn toprescription information database 122 in theapplication server 10 to extract the prescription information of these patents, according to the medicament categories of extracting in the prescription, turn to conventional medicament registereddatabase 123 to extract the related drugs title, the tabulation of formation conventional medicament patent information, the user is provided with corresponding Accuracy Matching parameter at this conventional medicament tabulation, and this Accuracy Matching parameter is to contain the part of some conventional medicament in the prescription, all, at most, minimum flavor number is as parameter is set.Application server 10 is searched the prescription information of the Accuracy Matching parameter that meets setting inprescription information database 122, turn to conventional medicamentpatent information database 121 to inquire about the patent that contains this prescription information according to this prescription information again, and patent information is offered client computer 13.Database storing equipment 12 is used for the required database information of memory scan, its content comprises conventional medicamentpatent information database 121,prescription information database 122 and conventional medicament registereddatabase 123, and its storage medium can be hard disk, floppy disk, tape, light storage device and other data storage device.
As shown in Figure 2, be the functional block diagram of theapplication server 10 of conventional medicament patent information searching system of the present invention and search method.
Thisapplication server 10 comprisescentral processing unit 20, QueryInformation receiver module 21, patentinformation extraction module 22, prescriptioninformation extraction modules 23, conventional medicamentinformation extraction modules 24, prescriptioninformation inquiry module 25, patent information enquiry module 26.Wherein,central processing unit 20 is used for the manipulation control of each module, coordinates the action of each module.QueryInformation receiver module 21 is used to receive information inquiry instruction and the precision setting instruction thatclient computer 13 sends.Patentinformation extraction module 22 is used for extracting relevant patent information according to query statement from conventional medicament patent information database 121.The patent No. that prescriptioninformation extraction modules 23 is extracted according to patentinformation extraction module 22 is extracted the prescription information of relevant patent from prescription information database 122.Conventional medicamentinformation extraction modules 24 is changeed from conventional medicament registereddatabase 123 extraction related drugs titles according to the prescription information of extracting, the tabulation of formation conventional medicament patent information, output toclient computer 13, prescription retrieval precision parameter is set for the client operation personnel.After the client operation personnel are provided with prescription retrieval precision parameter, prescriptioninformation inquiry module 25 is inquired about whole other prescriptions that mate with above-mentioned tabulation Chinese traditional medicine inprescription information database 122 according to parameter, and compare with the parameter value that is provided with, choose satisfactory prescription subclass.Patentinformation enquiry module 26 is searched the patent information that contains these prescriptions according to the prescription subclass of choosing from the conventional medicament patent information database, export toclient computer 13.
As shown in Figure 3, be the operation process chart of conventional medicament patent information searching system of the present invention and search method.Its process is as follows: (1) user is needed the patent No. (step S31) of retrieval byclient computer 13 inputs; (2) corresponding conventional medicament patent information (step S32) is searched and extracted to the patentinformation extraction module 22 ofapplication server 10; (3) prescriptioninformation extraction modules 23 is changeed extraction contained prescription information (step S33) fromprescription information database 122 according to the patent No. of above-mentioned patent information; (4) conventional medicamentinformation extraction modules 24 is extracted the conventional medicament information (step S34) in this prescription from conventional medicament registereddatabase 123; The conventional medicament information that (5) will occur in prescription information forms tabulation (step S35); (6)client computer 13 is provided with retrieval prescription matching precision according to the medicine of tabulation, and method is the medicament categories in the selective listing, be provided with all, partly, various parameters (step S36) such as maximum, minimum; (7) prescriptioninformation inquiry module 25 is found out the whole prescriptions (step S37) that contain the medicine of tabulating according to the medicament categories that parameter is provided with inprescription information database 122; (8) prescription that retrieves and the prescription that matching condition is set are compared, meet when parameter matching condition threshold value is set, judge that then the prescription information of retrieval is effective prescription, form prescription subclass (step S38, step S39 and step S40); (9) patentinformation enquiry module 26 is inquired about the patent information that conforms to above-mentioned prescription subclass in conventional medicamentpatent information database 121, delivers to client computer 13 (step S41).
By above-mentioned steps, realized using the number of patent application retrieval relevant to have the process of similar prescription information patent with conventional medicament.
Embodiment described above gives an example or illustrative description, be not be intended to be exhaustive or the restriction the present invention, to those skilled in the art, it is conspicuous carrying out many modifications, variation or replacement within the spirit and scope of the present invention.The embodiment that selects and describe only is in order to explain principle of the present invention better.