Summary of the invention
The invention provides a kind of auto-check system and method based on symptom coupling and machine learning, in order to offer user's disease/disease diagnostic result accurately according to known symptom.
For achieving the above object, the invention provides a kind of auto-check system based on symptom coupling and machine learning, comprising:
Disease/disease database, for preserving known every kind of disease/disease and corresponding symptom thereof;
User interactive module, for receiving the symptom keyword set of user's input;
Symptom matching module, for mating with the symptom of described disease/disease database according to the described symptom keyword set of user's input, calculates the matching degree of described symptom keyword set and every kind of disease/disease;
Diagnostic module, for determining corresponding disease/disease according to described symptom keyword set with the matching degree of every kind of disease/disease.
Further, said system also comprises:
Vocabulary builds module, for building symptom degree of correlation vocabulary, is specially:
Obtain symptom data, wherein said symptom data comprise the symptom obtained from textbook, dictionary with the symptom set of, near synonym table, every disease/disease obtaining from described disease/disease database and ask every effectively symptom set of request of obtaining record from user;
For obtained symptom data, suppose to have two symptom x and y, the degree of association μ (x, y) of these two symptom x and y is
Wherein ρ (P) represents the judgement weight of data source P, artificially sets ρ (with near synonym table) > ρ (disease/syndrome storehouse) >=ρ (user asks record) according to expertise; R, p, q represents the each syndrome set in data source P;
wherein | p| represents the number of the symptom containing in symptom set p,
Two symptoms that the degree of association is greater than to degree of association threshold value are saved in the symptom degree of correlation vocabulary of establishment.
Further, described symptom matching module comprises:
Weight calculation unit, for calculate the weights W (d, x) of symptom x at disease/disease d according to following formula:
Wherein, ρ (S) represents the weight of data source S, and e represents each description unit information of diseases related/disease in data source S;
Matching degree computing unit, for calculating every disease/disease of described disease/disease database with respect to the matching degree of described symptom keyword set, is specially:
Suppose that the described symptom keyword set that user provides is combined into A, travel through each disease/disease d and corresponding symptom set σ (d) thereof in described disease/disease database;
Matching degree M (A, d) with following formula calculating disease/disease d with respect to described symptom keyword set A:
Wherein, | A| and | σ (d) | represent respectively the element number in set A and set σ (d);
Press M (A, d) descending order sorts to corresponding disease/disease, the result that sequence is obtained represents with R and presents to user, wherein R={d|M (A, d) > 0 and r (d) < N}, r (d) represents press the sequence number of disease/disease corresponding after the descending sequence of M (A, d), and N is the constant of artificial setting.
Further, said system also comprises:
Update module, for supplementing and upgrading described disease/disease database.
Further, the symptom set of described effective request refers in the matching result of this request and contains disease/disease that matching degree is greater than the constant C of setting.
For achieving the above object, the present invention also provides a kind of automatic diagnosis method based on symptom coupling and machine learning, comprises the following steps:
Receive the symptom keyword set of user's input;
Mate with the symptom in disease/disease database according to the symptom keyword set of user's input, calculate the matching degree of described symptom keyword set and every kind of disease/disease, wherein said disease/disease database is preserved known every kind of disease/disease and corresponding symptom thereof;
Determine corresponding disease/disease according to described symptom keyword set with the matching degree of every kind of disease/disease.
Further, further comprising the steps of before mating step according to the symptom keyword set of user's input with the symptom in disease/disease database:
Build symptom degree of correlation vocabulary, specifically comprise:
Obtain symptom data, wherein said symptom data comprise the symptom obtained from textbook, dictionary with the symptom set of, near synonym table, every disease/disease obtaining from described disease/disease database and ask every effectively symptom set of request of obtaining record from user;
For obtained symptom data, suppose to have two symptom x and y, the degree of association μ (x, y) of these two symptom x and y is
ρ (P) represents the judgement weight of data source P, artificially sets ρ (with near synonym table) > ρ (disease/syndrome storehouse) >=ρ (user asks record) according to expertise; R, p, q represents the each syndrome set in data source P;
wherein | p| represents the number of the symptom containing in symptom set p,
Two symptoms that the degree of association is greater than to degree of association threshold value are saved in the symptom degree of correlation vocabulary of establishment.
Further, the described symptom keyword set according to user's input is mated with the symptom in disease/disease database, and the matching degree step of calculating described symptom keyword set and every kind of disease/disease comprises:
Calculating the weights W (d, x) of symptom x in disease/disease d is
Wherein, ρ (S) represents the weight of data source S,
e represents each description unit information of diseases related/disease in data source S;
Suppose that the described symptom keyword set that user provides is combined into A, travel through each disease/disease d and corresponding symptom set σ (d) thereof in described disease/disease database;
Matching degree M (A, d) with following formula calculating disease/disease d with respect to described symptom keyword set A
Wherein, | A| and | σ (d) | represent respectively the element number in set A and set σ (d);
Press M (A, d) descending order sorts to corresponding disease/disease, the result that sequence is obtained represents with R and presents to user, wherein R={d|M (A, d) > 0 and r (d) < N}, r (d) represents press the sequence number of disease/disease corresponding after the descending sequence of M (A, d), and N is the constant of artificial setting.
Further, said method is further comprising the steps of:
Described disease/disease database is supplemented and upgraded.
Further, the symptom set of described effective request refers in the matching result of this request and contains disease/disease that matching degree is greater than the constant C of setting.
One group of symptom symptom corresponding with the disease of having included in system and syndrome that the present invention provides user is mated, automatically infer and cause possible disease and the syndrome of this group symptom by calculating matching degree, thereby provide relatively accurate diagnostic result to user.
Embodiment
Below in conjunction with the accompanying drawing in the embodiment of the present invention, the technical scheme in the embodiment of the present invention is clearly and completely described, obviously, described embodiment is only the present invention's part embodiment, instead of whole embodiment.Based on the embodiment in the present invention, those of ordinary skill in the art, not paying the every other embodiment obtaining under creative work prerequisite, belong to the scope of protection of the invention.
Fig. 1 is the auto-check system module map based on symptom coupling and machine learning of one embodiment of the invention; Fig. 2 is the auto-check system fundamental diagram based on symptom coupling and machine learning of a preferred embodiment of the invention.As shown in the figure, this auto-check system comprises:
Disease/disease database, for preserving known every kind of disease/disease and corresponding symptom thereof;
Wherein, in the time building disease/disease database, selected data source can be national standard (as " tcm clinical practice diagnosis and treatment term disease part "), modern Chinese medicine textbook, modern Chinese medicine dictionary etc., illness, case discussion etc. in traditional Chinese medical science ancient books, and Modern Medical Record data.
User interactive module, for receiving the symptom keyword set of user's input;
Symptom matching module, for mating with the symptom of described disease/disease database according to the described symptom keyword set of user's input, calculates the matching degree of described symptom keyword set and every kind of disease/disease;
Diagnostic module, for determining corresponding disease/disease according to described symptom keyword set with the matching degree of every kind of disease/disease.
Further, said system also comprises:
Vocabulary builds module, for building symptom degree of correlation vocabulary, is specially:
Obtain symptom data, wherein said symptom data comprise the symptom obtained from textbook, dictionary with the symptom set of, near synonym table, every disease/disease obtaining from described disease/disease database and ask every effectively symptom set of request of obtaining record from user; The textbook here, dictionary can be for example traditional Chinese medical science voluminous dictionary, tcm symptom antidiastole, Chinese near synonym dictionary etc., and the symptom set of effective request here refers in the matching result of this request and contains disease/disease that matching degree is greater than the constant C of setting;
For obtained symptom data, suppose to have two symptom x and y, the degree of association μ (x, y) of these two symptom x and y is
Wherein ρ (P) represents the judgement weight of data source P, artificially sets ρ (with near synonym table) > ρ (disease/syndrome storehouse) >=ρ (user asks record) according to expertise; R, p, q represents the each syndrome set in data source P;
wherein | p| represents the number of the symptom containing in symptom set p,
Two symptoms that the degree of association is greater than to degree of association threshold value are saved in the symptom degree of correlation vocabulary of establishment.
Further, described symptom matching module comprises:
Weight calculation unit, for calculate the weights W (d, x) of symptom x at disease/disease d according to following formula:
Wherein, ρ (S) represents the weight of data source S, artificially set according to expertise, meet ρ (national standard) > ρ (textbook, dictionary) >=ρ (traditional Chinese medical science ancient books) > ρ (modern case, e represents each description unit information of diseases related/disease in data source S;
Matching degree computing unit, for calculating every disease/disease of described disease/disease database with respect to the matching degree of described symptom keyword set, is specially:
Suppose that the described symptom keyword set that user provides is combined into A, travel through each disease/disease d and corresponding symptom set σ (d) thereof in described disease/disease database;
Matching degree M (A, d) with following formula calculating disease/disease d with respect to described symptom keyword set A:
Wherein, | A| and | σ (d) | represent respectively the element number in set A and set σ (d);
Press M (A, d) descending order sorts to corresponding disease/disease, the result that sequence is obtained represents with R and presents to user, wherein R={d|M (A, d) > 0 and r (d) < N}, r (d) represents press the sequence number of disease/disease corresponding after the descending sequence of M (A, d), and N is the constant of artificial setting.
Further, said system also comprises:
Update module, for supplementing and upgrading described disease/disease database.
Wherein, can be regularly the opportunity of renewal, as carried out once weekly; Also can be the immediate updating being triggered by accident, as new national standard is promulgated, World Health Organization announces new pandemic information etc.
Example:
User's input: sympotomatic set A: diarrhoea, stomach-ache, anus heat, slippery and rapid pulse
Disease/the disease of including
D1: the damp and hot wound of having loose bowels
σ (d1): the stomachache of having loose bowels, rush down that lower urgent, stool being yellowish-brown, burning sensation of the anus, dysphoria with smothery sensation are thirsty, oliguria with yellow urine, slippery and rapid pulse
D2: cholera humidifier fever
σ (d2): if the profit of telling, fever and chills bodily pain, depressed, the n and V of fever and chills abdominal pain, ambition, active borhorygmus are not had a pain, are gushed soft stool around navel and continue and rush down clear water or swill juice, peripheral coldness, lower leg contraction, slippery and rapid pulse
Suppose according to data with existing source, by μ (x, y) formula calculates μ (diarrhoea, the stomachache of having loose bowels)=0.14, μ (diarrhoea is rushed down lower urgent)=0.43, μ (stomach-ache, the stomachache of having loose bowels)=0.31, μ (anus heat, burning sensation of the anus)=0.61, μ (slippery and rapid pulse, slippery and rapid pulse)=1.00, μ (stomach-ache, active borhorygmus is had a pain around navel)=0.14, μ (diarrhoea, if gush soft stool continue rush down clear water or swill juice)=0.07, other μ (x, y) is 0;
And according to data with existing source, calculate W (d by the formula of W (d, y)1, the stomachache of having loose bowels)=0.71, W (d1, rush down lower urgent)=0.65, W (d1, burning sensation of the anus)=0.57, W (d1, slippery and rapid pulse)=0.41, W (d2, active borhorygmus is had a pain around navel)=0.31, W (d2if, gush soft stool continue rush down clear water or swill juice)=0.57,, other W (d, y) is 0;
Can calculate d by the formula of M (A, d)1and d2as follows with the matching degree of the symptom set of user input respectively:
M (A, d1μ)=[(diarrhoea, the stomachache of having loose bowels) × W (d1, the stomachache of having loose bowels) and+μ (diarrhoea is rushed down lower urgent) × W (d1, rush down lower urgent) and+μ (stomach-ache, the stomachache of having loose bowels) × W (d1, the stomachache of having loose bowels) and+μ (anus heat, burning sensation of the anus) × W (d1, burning sensation of the anus) and+μ (slippery and rapid pulse, slippery and rapid pulse) × W (d1, slippery and rapid pulse)]/(| A|| σ (d1) |)=0.0485
M (A, d2μ)=[(stomach-ache, active borhorygmus is had a pain around navel) × W (d2, active borhorygmus is had a pain around navel)+μ (diarrhoea, if gush soft stool continue rush down clear water or swill juice) × W (d2if, gush soft stool continue rush down clear water or swill juice)]/(| A|| σ (d2) |)=0.0021
Due to M (A, d1) > M (A, d2), therefore in the diagnostic result R that returns to user, M (A, d1) be placed on before.Embodiment adapts with said system, is below the automatic diagnosis method embodiment based on symptom coupling and machine learning of one embodiment of the invention, comprises the following steps:
Receive the symptom keyword set of user's input;
Mate with the symptom in disease/disease database according to the symptom keyword set of user's input, calculate the matching degree of described symptom keyword set and every kind of disease/disease, wherein said disease/disease database is preserved known every kind of disease/disease and corresponding symptom thereof;
Determine corresponding disease/disease according to described symptom keyword set with the matching degree of every kind of disease/disease.
Further, further comprising the steps of before mating step according to the symptom keyword set of user's input with the symptom in disease/disease database:
Build symptom degree of correlation vocabulary, specifically comprise:
Obtain symptom data, wherein said symptom data comprise the symptom obtained from textbook, dictionary with the symptom set of, near synonym table, every disease/disease obtaining from described disease/disease database and ask every effectively symptom set of request of obtaining record from user;
For obtained symptom data, suppose to have two symptom x and y, the degree of association μ (x, y) of these two symptom x and y is
ρ (P) represents the judgement weight of data source P, wherein
Two symptoms that the degree of association is greater than to degree of association threshold value are saved in the symptom degree of correlation vocabulary of establishment.
Further, the described symptom keyword set according to user's input is mated with the symptom in disease/disease database, and the matching degree step of calculating described symptom keyword set and every kind of disease/disease comprises:
Calculating the weights W (d, x) of symptom x in disease/disease d is
Wherein, ρ (S) represents the weight of data source S, and e represents each description unit information of diseases related/disease in data source S;
Suppose that the described symptom keyword set that user provides is combined into A, travel through each disease/disease d and corresponding symptom set σ (d) thereof in described disease/disease database;
Matching degree M (A, d) with following formula calculating disease/disease d with respect to described symptom keyword set A
Press M (A, d) descending order sorts to corresponding disease/disease, the result that sequence is obtained represents with R and presents to user, wherein R={d|M (A, d) > 0 and r (d) < N}, r (d) represents press the sequence number of disease/disease corresponding after the descending sequence of M (A, d), and N is the constant of artificial setting.
Further, said method is further comprising the steps of:
Described disease/disease database is supplemented and upgraded.
Further, the symptom set of described effective request refers in the matching result of this request and contains disease/disease that matching degree is greater than the constant C of setting.
One group of symptom symptom corresponding with the disease of having included in system and syndrome that the present invention provides user is mated, automatically infer and cause possible disease and the syndrome of this group symptom by calculating matching degree, thereby provide relatively accurate diagnostic result to user.
One of ordinary skill in the art will appreciate that: accompanying drawing is the schematic diagram of an embodiment, the module in accompanying drawing or flow process might not be that enforcement the present invention is necessary.
One of ordinary skill in the art will appreciate that: the module in the device in embodiment can be described and be distributed in the device of embodiment according to embodiment, also can carry out respective change and be arranged in the one or more devices that are different from the present embodiment.The module of above-described embodiment can be merged into a module, also can further split into multiple submodules.
Finally it should be noted that: above embodiment only, in order to technical scheme of the present invention to be described, is not intended to limit; Although the present invention is had been described in detail with reference to previous embodiment, those of ordinary skill in the art is to be understood that: its technical scheme that still can record previous embodiment is modified, or part technical characterictic is wherein equal to replacement; And these amendments or replacement do not make the essence of appropriate technical solution depart from the spirit and scope of embodiment of the present invention technical scheme.