MYCIN was an earlybackward chainingexpert system that usedartificial intelligence to identify bacteria causing severe infections, such asbacteremia andmeningitis, and to recommendantibiotics, with the dosage adjusted for patient's body weight — the name derived from the antibiotics themselves, as many antibiotics have the suffix "-mycin". The Mycin system was also used for the diagnosis of blood clotting diseases.MYCIN was developed over five or six years in the early 1970s atStanford University. It was written inLisp as the doctoral dissertation ofEdward Shortliffe under the direction of Bruce G. Buchanan,Stanley N. Cohen and others.
MYCIN emerged from theStanford Heuristic Programming Project.[1] MYCIN demonstrated the potential for expert systems in building high-performance medical reasoning programs. MYCIN is often viewed as a pioneer in the field of expert systems, even being referred to as the "grandaddy of them all-the one that launched the field" by Dr.Allen Newell.[2]
MYCIN led to the EMYCINexpert system shell ("essential MYCIN") for acquiring knowledge, reasoning with it, and explaining the results, without the specific medical knowledge. It can be described as "EMYCIN =Prolog + uncertainty + caching + questions + explanations + contexts - variables". An introduction is in Chapter 16 ofParadigms of Artificial Intelligence Programming (PAIP).[3]
MYCIN operated using a fairly simpleinference engine and a knowledge base of ~600 rules by obtaining individual inferential facts identified by experts and encoding such facts as individual production rules. No other AI program at the time contained as much domain-specific knowledge clearly separated from its inference procedures as MYCIN. It would query the physician running the program via a long series of simple yes/no or textual questions. At the end, it provided a list of possible culprit bacteria ranked from high to low based on the probability of each diagnosis, itsconfidence in each diagnosis' probability, the reasoning behind each diagnosis (that is, MYCIN would also list the questions and rules which led it to rank a diagnosis a particular way), and its recommended course of drug treatment. MYCIN could additionally respond to queries by physicians related to why it asked the user a certain question, how it arrived at a conclusion, and why it did not consider certain factors.[4]
The developers performed studies showing that MYCIN's performance was minimally affected by perturbations in the uncertainty metrics associated with individual rules, suggesting that the power in the system was related more to its knowledge representation and reasoning scheme than to the details of its numerical uncertainty model. Some observers felt that it should have been possible to use classicalBayesian statistics. MYCIN's developers argued that this would require either unrealistic assumptions ofprobabilistic independence, or require the experts to provide estimates for an unfeasibly large number ofconditional probabilities.[5][6]
Subsequent studies later showed that the certainty factor model could indeed be interpreted in a probabilistic sense, and highlighted problems with the implied assumptions of such a model. However the modular structure of the system would prove very successful, leading to the development ofgraphical models such asBayesian networks.[7]
A context in MYCIN determines what types of objects can be reasoned about. They are similar to variables in Prolog, orenvironment variables in operating systems.[3]
(defunmycin()"Determine what organism is infecting a patient."(emycin(list(defcontextpatient(namesexage)())(defcontextculture(sitedays-old)())(defcontextorganism()(identity)))))
In MYCIN it was possible that two or more rules might draw conclusions about a parameter with different weights of evidence. For example, one rule may conclude that the organism in question isE. Coli with a certainty of 0.8 whilst another concludes that it isE. Coli with a certainty of 0.5 or even -0.8. In the event the certainty is less than zero the evidence is actually against the hypothesis. In order to calculate the certainty factor MYCIN combined these weights using the formula below to yield a single certainty factor:
Where X and Y are the certainty factors.[8] This formula can be applied more than once if more than two rules draw conclusions about the same parameter. It iscommutative, so it does not matter in which order the weights were combined.
The combination formula was designed to have the following desirable properties:[3]
The following examples come from Chapter 16 of PAIP, which contains an implementation inCommon Lisp of a modified and simplified version of MYCIN for pedagogical purposes.
A rule, and an English paraphrase generated by the system:
(defrule52if(sitecultureisblood)(gramorganismisneg)(morphologyorganismisrod)(burnpatientisserious)then.4(identityorganismispseudomonas))Rule52:If1)THESITEOFTHECULTUREISBLOOD2)THEGRAMOFTHEORGANISMISNEG3)THEMORPHOLOGYOFTHEORGANISMISROD4)THEBURNOFTHEPATIENTISSERIOUSThenthereisweaklysuggestiveevidence(0.4)that1)THEIDENTITYOFTHEORGANISMISPSEUDOMONAS
An evaluation of MYCIN was conducted at theStanford Medical School. The first phase of the evaluation consisted of 10 test cases of diverse origin, chosen by a physician who was not acquainted with MYCIN's methods or knowledge base. These cases were presented to 7 physicians and 1 senior medical student. 10 prescriptions were compiled for each of the cases, 1 recommended by MYCIN, 1 prescribed by the treating physician at the county hospital, and 8 by the aforementioned individuals. The second phase of the evaluation consisted of eight infectious disease specialists being provided the clinical summary and set of 10 prescriptions for each of the 10 cases and tasked to provide their own recommendations for each case and assess the 10 prescriptions. MYCIN received an acceptability rating of 65%, which was comparable to the 42.5% to 62.5% rating of five faculty members.[9] This study is often cited as showing the potential for disagreement about therapeutic decisions, even among experts, when there is no "gold standard" for correct treatment.[citation needed]
This sectiondoes notcite anysources. Please helpimprove this section byadding citations to reliable sources. Unsourced material may be challenged andremoved.(May 2018) (Learn how and when to remove this message) |
MYCIN was never actually used in practice. This wasn't because of any weakness in its performance. Some observers raised ethical and legal issues related to the use of computers in medicine, regarding the responsibility of the physicians in case the system gave wrong diagnosis.[10] However, the greatest problem, and the reason that MYCIN was not used in routine practice, was the state of technologies for system integration, especially at the time it was developed. MYCIN was a stand-alone system that required a user to enter all relevant information about a patient by typing in responses to questions MYCIN posed. MYCIN ran on the DEC KI10 PDP-10, supporting a large time-shared system available over the early Internet (ARPANet), before personal computers were developed.[11][12]
MYCIN's greatest influence was accordingly its demonstration of the power of its representation and reasoning approach. Rule-based systems in many non-medical domains were developed in the years that followed MYCIN's introduction of the approach. In the 1980s, expert system "shells" were introduced (including one based on MYCIN, known as E-MYCIN (followed byKnowledge Engineering Environment - KEE))[13]and supported the development of expert systems in a wide variety of application areas.A difficulty that rose to prominence during the development of MYCIN and subsequent complex expert systems has been theextraction of the necessary knowledge for the inference engine to use from the human expert in the relevant fields into the rule base (the so-called "knowledge acquisition bottleneck").[14]