| IBM Watson | |
|---|---|
| Operators | IBM |
| Location | Thomas J. Watson Research Center, New York, USA |
| Architecture | 2,880 POWER7 processor threads |
| Memory | 16 terabytes of RAM |
| Speed | 80 teraFLOPS |
| Website | IBM Watson |
IBM Watson is a computer system capable of answering questions posed in natural language.[1] It was developed as a part of IBM's DeepQA project by a research team led by principal investigator David Ferrucci.[2] Watson was named after IBM's founder and first CEO, industrialist Thomas J. Watson.[3][4]
The computer system was initially developed to answer questions on the popular quiz show Jeopardy!,[5] and in 2011 the Watson computer system competed on Jeopardy! against champions Brad Rutter and Ken Jennings,[3][6] winning the first-place prize of US$1 million.[7]
In February 2013, IBM announced that Watson's first commercial application would be for utilization management decisions in lung cancer treatment at Memorial Sloan Kettering Cancer Center, New York City, in conjunction with WellPoint (now Elevance Health).[8]

Watson was created as a question answering (QA) computing system that IBM built to apply advanced natural language processing, information retrieval, knowledge representation, automated reasoning, and machine learning technologies to the field of open-domain question answering. The system is named DeepQA (though it did not involve the use of deep neural networks).[1]
IBM stated that Watson uses "more than 100 different techniques to analyze natural language, identify sources, find and generate hypotheses, find and score evidence, and merge and rank hypotheses."[10]
In recent years,[when?] Watson's capabilities have been extended and the way in which Watson works has been changed to take advantage of new deployment models (Watson on IBM Cloud), evolved machine learning capabilities, and optimized hardware available to developers and researchers.[citation needed]
Watson uses IBM's DeepQA software and the Apache UIMA (Unstructured Information Management Architecture) framework implementation. The system was written in various languages, including Java, C++, and Prolog, and runs on the SUSE Linux Enterprise Server 11 operating system, using the Apache Hadoop framework to provide distributed computing.[11][12][13]
Beyond the DeepQA system, Watson contained several strategy modules. For example, one module calculated the amount to bet for Final Jeopardy, based on the confidence score for getting the answer right and the current scores of all contestants. Another module used Bayes' rule to calculate the probability that each unrevealed clue might be the Daily Double, using historical data from the J! Archive as the prior. If a Daily Double was found, the amount to wager was computed by a two-layer neural network of the same kind as those used by TD-Gammon, a neural network that played backgammon, developed by Gerald Tesauro in the 1990s.[14] The parameters in the strategy modules were tuned by benchmarking candidate settings against a statistical model of human contestants fitted to data from the J! Archive and selecting the best one.[15][16][17]
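To make the Bayes-rule idea concrete, the following Python sketch shows one way such a Daily Double locator could work. It is an illustration only, not IBM's code, and the prior row frequencies are made-up placeholders rather than actual J! Archive statistics.

```python
# Illustrative sketch only (not IBM's implementation): estimate where the
# Daily Double hides, combining a positional prior with the observation that
# cells already revealed did not contain it.

# Hypothetical prior: how often the Daily Double historically lands in each
# row (row 1 = cheapest clue, row 5 = most valuable). Placeholder numbers.
ROW_PRIOR = {1: 0.01, 2: 0.09, 3: 0.26, 4: 0.39, 5: 0.25}

def daily_double_posterior(unrevealed_cells):
    """P(cell hides the Daily Double | revealed cells did not contain it).

    With a positional prior, Bayes' rule here reduces to renormalizing the
    prior mass over the cells that remain unrevealed.
    """
    weights = {(col, row): ROW_PRIOR[row] for (col, row) in unrevealed_cells}
    total = sum(weights.values())
    return {cell: w / total for cell, w in weights.items()}

if __name__ == "__main__":
    board = [(col, row) for col in range(6) for row in range(1, 6)]
    already_played = {(0, 1), (0, 2), (3, 4)}   # revealed, no Daily Double found
    remaining = [cell for cell in board if cell not in already_played]
    posterior = daily_double_posterior(remaining)
    best = max(posterior, key=posterior.get)
    print(f"Most likely Daily Double cell: {best} (p = {posterior[best]:.3f})")
```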
The system is workload-optimized, integrating massively parallel POWER7 processors, and built on IBM's DeepQA technology,[18] which it uses to generate hypotheses, gather evidence on a massive scale, and analyze data.[1] Watson employs a cluster of ninety IBM Power 750 servers, each of which uses a 3.5 GHz POWER7 eight-core processor with four threads per core. In total, the system uses 2,880 POWER7 processor threads and 16 terabytes of RAM.[18]
According to John Rennie, Watson can process 500 gigabytes (the equivalent of a million books) per second.[19] IBM master inventor and senior consultant Tony Pearson estimated Watson's hardware cost at about three million dollars.[20] Its Linpack performance stands at 80 teraFLOPS, which is about half as fast as the cut-off line for the Top 500 Supercomputers list.[21] According to Rennie, all content was stored in Watson's RAM for the Jeopardy! game because data stored on hard drives would be too slow to compete with human Jeopardy! champions.[19]
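The cluster figures quoted above are internally consistent, as a quick back-of-the-envelope check shows (the per-server RAM figure below is derived here and is only approximate):

```python
# Sanity-check the hardware numbers cited in this section.
servers = 90            # IBM Power 750 servers in the cluster
cores_per_server = 8    # one eight-core POWER7 processor per server (as described above)
threads_per_core = 4    # simultaneous multithreading, four threads per core

total_threads = servers * cores_per_server * threads_per_core
print(total_threads)                    # 2880, matching "2,880 POWER7 processor threads"

total_ram_tb = 16
print(total_ram_tb * 1024 / servers)    # ~182 GiB of RAM per server (derived estimate)
```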
The sources of information for Watson include encyclopedias, dictionaries, thesauri, newswire articles, and literary works. Watson also used databases, taxonomies, and ontologies, including DBpedia, WordNet, and YAGO.[22] The IBM team provided Watson with millions of documents, including dictionaries, encyclopedias, and other reference material, that it could use to build its knowledge.[23]
Watson parses questions into different keywords and sentence fragments in order to find statistically related phrases.[23] Watson's main innovation was not in the creation of a new algorithm for this operation, but rather its ability to quickly execute hundreds of proven language-analysis algorithms simultaneously.[23][24] The more algorithms that find the same answer independently, the more likely Watson is to be correct. Once Watson has a small number of potential solutions, it is able to check against its database to ascertain whether the solution makes sense or not.[23]
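As a toy illustration of this ensemble idea, the sketch below merges candidate answers proposed by several independent scoring algorithms, so that candidates found by more scorers accumulate more support. The scorers and scores are invented for the example; they are not actual DeepQA components.

```python
from collections import defaultdict

def merge_candidates(scorer_outputs):
    """Combine candidate answers proposed by many independent scorers.

    scorer_outputs: list of dicts mapping candidate answer -> score in [0, 1].
    Candidates proposed independently by more scorers accumulate more support,
    mirroring the idea that agreement across algorithms raises confidence.
    """
    support = defaultdict(float)
    for output in scorer_outputs:
        for candidate, score in output.items():
            support[candidate] += score
    total = sum(support.values()) or 1.0
    return sorted(((c, s / total) for c, s in support.items()),
                  key=lambda pair: pair[1], reverse=True)

if __name__ == "__main__":
    # Hypothetical outputs of three analysis algorithms for one clue.
    outputs = [
        {"Chicago": 0.7, "Toronto": 0.4},
        {"Chicago": 0.6},
        {"Toronto": 0.5, "Omaha": 0.2},
    ]
    for candidate, confidence in merge_candidates(outputs):
        print(f"{candidate}: {confidence:.2f}")
```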

Watson's basic working principle is to parse keywords in a clue while searching for related terms as responses. This gives Watson some advantages and disadvantages compared with human Jeopardy! players.[25] Watson can read, analyze, and learn from natural language, which gives it the ability to make human-like decisions,[26] but it has deficiencies in understanding the context of the clues. As a result, human players usually generate responses faster than Watson, especially to short clues.[23] Watson's programming prevents it from using the popular tactic of buzzing before it is sure of its response.[23] However, Watson has consistently better reaction time on the buzzer once it has generated a response, and it is immune to human players' psychological tactics, such as jumping between categories on every clue.[23][27]
In a sequence of 20 mock games of Jeopardy!, human participants were able to use the six to seven seconds that Watson needed to hear the clue and decide whether to signal for responding.[23] During that time, Watson also has to evaluate the response and determine whether it is sufficiently confident in the result to signal.[23] Part of the system used to win the Jeopardy! contest was the electronic circuitry that receives the "ready" signal and then examines whether Watson's confidence level is great enough to activate the buzzer. Given the speed of this circuitry compared with human reaction times, Watson's reaction time was faster than that of the human contestants except when the human anticipated (instead of reacted to) the ready signal.[28] After signaling, Watson speaks with an electronic voice and gives the responses in Jeopardy!'s question format.[23] Watson's voice was synthesized from recordings that actor Jeff Woodman made for an IBM text-to-speech program in 2004.[29]
The Jeopardy! staff used different means to notify Watson and the human players when to buzz,[28] which was critical in many rounds.[27] The humans were notified by a light, which took them tenths of a second to perceive.[30][31] Watson was notified by an electronic signal and could activate the buzzer within about eight milliseconds.[32] The humans tried to compensate for the perception delay by anticipating the light,[33] but the variation in their anticipation time was generally too great to fall within Watson's response time.[27] Watson did not attempt to anticipate the notification signal.[31][33]
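A small simulation makes the timing gap concrete. It assumes the numbers given in this section: Watson buzzes about 8 ms after the electronic signal, while a human either reacts to the light (a few tenths of a second) or gambles on anticipating it. The distribution parameters are illustrative assumptions, not measured values.

```python
import random

WATSON_DELAY = 0.008   # seconds from the "ready" signal to Watson's buzz (cited above)

def human_delay(anticipating):
    """Assumed human buzz time relative to the ready light, in seconds."""
    if anticipating:
        # Anticipation: centered on the light but with a wide spread, reflecting
        # the "variation ... generally too great" observation above. Negative
        # values model buzzing too early, which does not win the buzz.
        return random.gauss(0.0, 0.12)
    # Pure reaction: perception plus motor response, a few tenths of a second.
    return random.gauss(0.25, 0.05)

def human_win_rate(trials=100_000, anticipating=False):
    wins = sum(1 for _ in range(trials)
               if 0 <= human_delay(anticipating) < WATSON_DELAY)
    return wins / trials

if __name__ == "__main__":
    print(f"Human beats Watson when reacting:     {human_win_rate(anticipating=False):.1%}")
    print(f"Human beats Watson when anticipating: {human_win_rate(anticipating=True):.1%}")
```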
Since Deep Blue's victory over Garry Kasparov in chess in 1997, IBM had been on the hunt for a new challenge. In 2004, IBM Research manager Charles Lickel, over dinner with coworkers, noticed that the restaurant they were in had fallen silent. He soon discovered the cause of the lull: Ken Jennings, who was then in the middle of his successful 74-game run on Jeopardy!. Nearly the entire restaurant had piled toward the televisions, mid-meal, to watch Jeopardy!. Intrigued by the quiz show as a possible challenge for IBM, Lickel passed the idea on, and in 2005, IBM Research executive Paul Horn backed Lickel, pushing for someone in his department to take up the challenge of playing Jeopardy! with an IBM system. Though he initially had trouble finding any research staff willing to take on what looked to be a much more complex challenge than the wordless game of chess, David Ferrucci eventually took him up on the offer.[34] In competitions run by the United States government, Watson's predecessor, a system named Piquant, was usually able to respond correctly to only about 35% of clues and often required several minutes to respond.[35][36][37] To compete successfully on Jeopardy!, Watson would need to respond in no more than a few seconds, and at that time, the problems posed by the game show were deemed impossible to solve.[23]
In initial tests run during 2006 by David Ferrucci, the senior manager of IBM's Semantic Analysis and Integration department, Watson was given 500 clues from past Jeopardy! programs. While the best real-life competitors buzzed in half the time and responded correctly to as many as 95% of clues, Watson's first pass could get only about 15% correct. During 2007, the IBM team was given three to five years and a staff of 15 people to solve the problems.[23] John E. Kelly III succeeded Paul Horn as head of IBM Research in 2007.[38] InformationWeek described Kelly as "the father of Watson" and credited him for encouraging the system to compete against humans on Jeopardy!.[39] By 2008, the developers had advanced Watson such that it could compete with Jeopardy! champions.[23] By February 2010, Watson could beat human Jeopardy! contestants on a regular basis.[40]
During the game, Watson had access to 200 million pages of structured and unstructured content consuming four terabytes of disk storage,[11] including the full text of the 2011 edition of Wikipedia,[41] but was not connected to the Internet.[42][23] For each clue, Watson's three most probable responses were displayed on the television screen. Watson consistently outperformed its human opponents on the game's signaling device, but had trouble in a few categories, notably those having short clues containing only a few words.[citation needed]
Although the system is primarily an IBM effort, Watson's development involved faculty and graduate students from Rensselaer Polytechnic Institute, Carnegie Mellon University, the University of Massachusetts Amherst, the University of Southern California's Information Sciences Institute, the University of Texas at Austin, the Massachusetts Institute of Technology, and the University of Trento,[9] as well as students from New York Medical College.[43] Among the team of IBM programmers who worked on Watson was 2001 Who Wants to Be a Millionaire? top-prize winner Ed Toutant, who had himself appeared on Jeopardy! in 1989 (winning one game).[44]

In 2008, IBM representatives communicated with Jeopardy! executive producer Harry Friedman about the possibility of having Watson compete against Ken Jennings and Brad Rutter, two of the most successful contestants on the show, and the program's producers agreed.[23][45] Watson's differences from human players had generated conflicts between IBM and Jeopardy! staff during the planning of the competition.[25] IBM repeatedly expressed concerns that the show's writers would exploit Watson's cognitive deficiencies when writing the clues, thereby turning the game into a Turing test. To address that concern, a third party randomly picked the clues from previously written shows that were never broadcast.[25]

Jeopardy! staff also had concerns over Watson's reaction time on the buzzer. Originally Watson signaled electronically, but show staff requested that it press a button physically, as the human contestants would.[46] Even with a robotic "finger" pressing the buzzer, Watson remained faster than its human competitors. Ken Jennings noted, "If you're trying to win on the show, the buzzer is all", and that Watson "can knock out a microsecond-precise buzz every single time with little or no variation. Human reflexes can't compete with computer circuits in this regard."[27][33][47] Stephen Baker, a journalist who chronicled Watson's development in his book Final Jeopardy, reported that the conflict between IBM and Jeopardy! became so serious in May 2010 that the competition was almost cancelled.[25]

As part of the preparation, IBM constructed a mock set in a conference room at one of its technology sites to model the one used on Jeopardy!. Human players, including former Jeopardy! contestants, also participated in mock games against Watson, with Todd Alan Crain of The Onion playing host.[23] About 100 test matches were conducted, with Watson winning 65% of the games.[48]
To provide a physical presence in the televised games, Watson was represented by an "avatar" of a globe, inspired by the IBM "smarter planet" symbol. Jennings described the computer's avatar as a "glowing blue ball crisscrossed by 'threads' of thought—42 threads, to be precise",[49] and stated that the number of thought threads in the avatar was an in-joke referencing the significance of the number 42 in Douglas Adams' The Hitchhiker's Guide to the Galaxy.[49] Joshua Davis, the artist who designed the avatar for the project, explained to Stephen Baker that there were 36 triggerable states that Watson could use throughout the game to show its confidence in responding to a clue correctly; he had hoped to find 42, to add another level to the Hitchhiker's Guide reference, but he was unable to pinpoint enough game states.[50]
A practice match was recorded on January 13, 2011, and the official matches were recorded on January 14, 2011. All participants maintained secrecy about the outcome until the match was broadcast in February.[51]
In a practice match before the press on January 13, 2011, Watson won a 15-question round against Ken Jennings and Brad Rutter with a score of $4,400 to Jennings's $3,400 and Rutter's $1,200, though Jennings and Watson were tied before the final $1,000 question. None of the three players responded incorrectly to a clue.[52]
The first round was broadcast February 14, 2011, and the second round, on February 15, 2011. The right to choose the first category had been determined by a draw won by Rutter.[53] Watson, represented by a computer monitor display and artificial voice, responded correctly to the second clue and then selected the fourth clue of the first category, a deliberate strategy to find the Daily Double as quickly as possible.[54] Watson's guess at the Daily Double location was correct. At the end of the first round, Watson was tied with Rutter at $5,000; Jennings had $2,000.[53]
Watson's performance was characterized by some quirks. In one instance, Watson repeated a reworded version of an incorrect response offered by Jennings. (Jennings said "What are the '20s?" in reference to the 1920s; Watson then said "What is 1920s?") Because Watson could not recognize other contestants' responses, it did not know that Jennings had already given the same response. In another instance, Watson was initially given credit for a response of "What is a leg?" after Jennings incorrectly responded "What is: he only had one hand?" to a clue about George Eyser (the correct response was "What is: he's missing a leg?"). Because Watson, unlike a human, could not have been responding to Jennings's mistake, it was decided that this response was incorrect. The broadcast version of the episode was edited to omit Trebek's original acceptance of Watson's response.[55] Watson also demonstrated complex wagering strategies on the Daily Doubles, with one bet at $6,435 and another at $1,246.[56] Gerald Tesauro, one of the IBM researchers who worked on Watson, explained that Watson's wagers were based on its confidence level for the category and a complex regression model called the Game State Evaluator.[17]
Watson took a commanding lead in Double Jeopardy!, correctly responding to both Daily Doubles. Watson responded to the second Daily Double correctly with a 32% confidence score.[56]
However, during the Final Jeopardy! round, Watson was the only contestant to miss the clue in the category U.S. Cities ("Its largest airport was named for a World War II hero; its second largest, for a World War II battle"). Rutter and Jennings gave the correct response of Chicago, but Watson's response was "What is Toronto?????", with five question marks appended indicating a lack of confidence.[56][57][58] Ferrucci offered reasons why Watson would appear to have guessed a Canadian city: categories only weakly suggest the type of response desired, the phrase "U.S. city" did not appear in the question, there are cities named Toronto in the U.S., and Toronto in Ontario has an American League baseball team.[59] Chris Welty, who also worked on Watson, suggested that it may not have been able to correctly parse the second part of the clue, "its second largest, for a World War II battle" (which was not a standalone clause despite following a semicolon, and required context to understand that it was referring to a second-largest airport).[60] Eric Nyberg, a professor at Carnegie Mellon University and a member of the development team, stated that the error occurred because Watson does not possess the comparative knowledge to discard that potential response as not viable.[58] Although not displayed to the audience as with non-Final Jeopardy! questions, Watson's second choice was Chicago. Both Toronto and Chicago were well below Watson's confidence threshold, at 14% and 11% respectively. Watson wagered only $947 on the question.[61]
The game ended with Jennings with $4,800, Rutter with $10,400, and Watson with $35,734.[56]
During the introduction, Trebek (a Canadian native) joked that he had learned Toronto was a U.S. city, and Watson's error in the first match prompted an IBM engineer to wear a Toronto Blue Jays jacket to the recording of the second match.[62]
In the first round, Jennings was finally able to choose a Daily Double clue,[63] while Watson responded to one Daily Double clue incorrectly for the first time in the Double Jeopardy! round.[64] After the first round, Watson placed second for the first time in the competition, after Rutter and Jennings were briefly successful in increasing their dollar values before Watson could respond.[64][65] Nonetheless, the final result was a victory for Watson with a score of $77,147, besting Jennings, who scored $24,000, and Rutter, who scored $21,600.[66]
The prizes for the competition were $1 million for first place (Watson), $300,000 for second place (Jennings), and $200,000 for third place (Rutter). As promised, IBM donated 100% of Watson's winnings to charity, with 50% of those winnings going to World Vision and 50% going to World Community Grid.[67] Similarly, Jennings and Rutter donated 50% of their winnings to their respective charities.[68]
In acknowledgement of IBM and Watson's achievements, Jennings made an additional remark in his Final Jeopardy! response: "I for one welcome our new computer overlords", paraphrasing a joke from The Simpsons.[69][70] Jennings later wrote an article for Slate, in which he stated:
IBM has bragged to the media that Watson's question-answering skills are good for more than annoying Alex Trebek. The company sees a future in which fields like medical diagnosis, business analytics, and tech support are automated by question-answering software like Watson. Just as factory jobs were eliminated in the 20th century by new assembly-line robots, Brad and I were the first knowledge-industry workers put out of work by the new generation of 'thinking' machines. 'Quiz show contestant' may be the first job made redundant by Watson, but I'm sure it won't be the last.[49]
Philosopher John Searle argues that Watson, despite impressive capabilities, cannot actually think.[71] Drawing on his Chinese room thought experiment, Searle claims that Watson, like other computational machines, is capable only of manipulating symbols, but has no ability to understand the meaning of those symbols; however, Searle's experiment has its detractors.[72]
On February 28, 2011, Watson played an untelevised exhibition match of Jeopardy! against members of the United States House of Representatives. In the first round, Rush D. Holt Jr. (D-NJ, a former Jeopardy! contestant), who was challenging the computer with Bill Cassidy (R-LA, later a senator from Louisiana), led with Watson in second place. However, combining the scores from all matches, the final score was $40,300 for Watson and $30,000 for the congressional players combined.[73]
IBM's Christopher Padilla said of the match, "The technology behind Watson represents a major advancement in computing. In the data-intensive environment of government, this type of technology can help organizations make better decisions and improve how government helps its citizens."[73]
After the national press attention gained by the 2011 Jeopardy! appearance, IBM sought out partnerships in fields ranging from education to weather and from cancer care to retail chatbots, in order to convince businesses of Watson's alleged capabilities. These efforts ultimately failed to find a profit-making product for the company.[74]
In 2011, IBM's general counsel wrote in The National Law Review arguing that the legal profession would become more efficient and better with Watson.[75] After the national attention Jeopardy! afforded it, IBM began an ultimately unsuccessful and expensive project in which Memorial Sloan Kettering Cancer Center tried to use Watson to help doctors diagnose and treat cancer patients. The division ultimately cost $4 billion to develop but was sold for a quarter of that, $1 billion, in 2022.[76] By 2023, Watson had contributed to IBM losing 10% of its stock value, costing four times more than what it brought to the company and resulting in mass layoffs.[74]
From 2012 through the late 2010s, Watson's technology was used to create applications, most of them since discontinued,[77] to help people make decisions in a variety of areas, among them:
In 2021, Steve Lohr, a technology reporter at The New York Times, explained:
The company's missteps with Watson began with its early emphasis on big and difficult initiatives intended to generate both acclaim and sizable revenue for the company, according to many of the more than a dozen current and former IBM managers and scientists interviewed for this article. Several of those people asked not to be named because they had not been authorized to speak or still had business ties to IBM.
— Steve Lohr, "What Ever Happened to IBM's Watson?", The New York Times[77]
Writing in The Atlantic in 2023, Mac Schwerin argued that IBM's leadership fundamentally did not understand the technology, which led to the project's hardship and strain:
But the suits in charge went after the bigger and more technically challenging game of feeding the machine entirely different types of material. They viewed Watson as a generational meal ticket.
— Mac Schwerin, "America Forgot About IBM Watson. Is ChatGPT Next?", The Atlantic[90]
In the end, IBM's initial vision of Watson as a transformative technology capable of revolutionizing industries did not materialize as anticipated.[91] Watson's capabilities were primarily suited to specific tasks, like natural language processing for trivia games, rather than generalized commercial problem-solving.[92] The mismatch between Watson's capabilities and IBM's marketing contributed significantly to its commercial struggles and eventual decline. The overstated claims about Watson's abilities also caused public sentiment to turn against the idea of Watson and artificial intelligence.[77]
Between 2019 and 2023, IBM shifted focus to a separate initiative, IBM Watsonx, distinct from Watson and aimed at narrower, industry-targeted technology within IBM's cloud computing and platform-based strategies.[77][74]
IBM's Watson was used to analyze medical datasets to provide physicians with guidance on diagnoses and cancer treatment decisions.[93][94] When a physician submitted a query to Watson, the system started a multi-step process: parsing the input to identify key information, examining patient data to uncover relevant medical and hereditary history, and finally comparing various data sources to form and test hypotheses.[95][94]
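The multi-step flow described above can be pictured as a simple pipeline. The sketch below is purely schematic: the stage names, data fields, and matching logic are invented placeholders for illustration and do not reflect the Watson for Oncology product's actual interfaces.

```python
from dataclasses import dataclass

@dataclass
class PatientRecord:
    diagnosis: str
    history: list  # relevant medical and hereditary history items

def parse_query(query: str) -> list:
    """Stage 1: pull key terms out of the physician's free-text question."""
    return [word.lower().strip("?,.") for word in query.split()]

def rank_hypotheses(record: PatientRecord, sources: dict) -> list:
    """Stages 2-3: match patient attributes against guideline/literature sources
    and rank candidate treatments by how many source items support them."""
    ranked = []
    for treatment, indications in sources.items():
        support = sum(1 for item in indications
                      if item == record.diagnosis or item in record.history)
        if support:
            ranked.append((treatment, support))
    return sorted(ranked, key=lambda pair: pair[1], reverse=True)

if __name__ == "__main__":
    # Invented example data, for illustration only.
    record = PatientRecord(diagnosis="non-small-cell lung cancer",
                           history=["former smoker", "hypertension"])
    sources = {"targeted therapy": ["non-small-cell lung cancer"],
               "chemotherapy": ["non-small-cell lung cancer", "former smoker"]}
    print(parse_query("What treatment options fit this patient?"))
    for treatment, support in rank_hypotheses(record, sources):
        print(f"{treatment}: supported by {support} source item(s)")
```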
IBM claimed that Watson could draw from a wide range of sources, including treatment guidelines, electronic medical records, and research materials.[94] However, company executives would later blame the project's ultimate failure in part on a lack of data.[76]
Notably, Watson was not involved in the actual diagnosis process, but rather assisted doctors in identifying suitable treatment options for patients who had already been diagnosed.[96] A study of 1,000 challenging patient cases found that Watson's recommendations matched those of human doctors in 99% of cases.[97]
IBM established partnerships with the Cleveland Clinic,[98] the MD Anderson Cancer Center, and Memorial Sloan Kettering Cancer Center to further its mission in healthcare. In 2011, IBM entered into a research partnership with Nuance Communications and physicians at the University of Maryland and Harvard to develop a commercial product using Watson's clinical decision support capabilities. IBM partnered with WellPoint (now Anthem) in 2011 to utilize Watson in suggesting treatment options to physicians,[99] and in 2013, Watson was deployed in its first commercial application for utilization management decisions in lung cancer treatment at Memorial Sloan Kettering Cancer Center.[8] The Cleveland Clinic collaboration aimed to enhance Watson's health expertise and support medical professionals in treating patients more effectively. However, the MD Anderson Cancer Center pilot program, initiated in 2013, ultimately failed to meet its goals and was discontinued after $65 million in investment.[100][101][98]
In 2016, IBM launched "IBM Watson for Oncology", a product designed to provide personalized, evidence-based cancer care options to physicians and patients.[91] This initiative marked a significant milestone in the adoption of Watson's technology in the healthcare industry. Additionally, IBM partnered with Manipal Hospitals in India to offer Watson's expertise to patients online.[102][103]
The company ultimately faced challenges in the healthcare market, with no profit and increased competition.[91] In 2022, IBM announced the sell-off of its Watson Health unit to Francisco Partners, marking a significant shift in the company's approach to the healthcare industry.[91][76]
On January 9, 2014, IBM announced it was creating a business unit around Watson.[104] IBM Watson Group would be headquartered in New York City's Silicon Alley and would employ 2,000 people; IBM invested $1 billion to get the division going. Watson Group planned to develop three new cloud-delivered services: Watson Discovery Advisor, Watson Engagement Advisor, and Watson Explorer. Watson Discovery Advisor would focus on research and development projects in the pharmaceutical industry, publishing, and biotechnology; Watson Engagement Advisor would focus on self-service applications using insights drawn from natural language questions posed by business users; and Watson Explorer would focus on helping enterprise users more easily uncover and share data-driven insights based on federated search.[104] The company also launched a $100 million venture fund to spur application development for "cognitive" applications. According to IBM, the cloud-delivered, enterprise-ready Watson had seen its speed increase 24-fold (a 2,300 percent improvement in performance) while its physical size shrank by 90 percent, from the size of a master bedroom to three stacked pizza boxes.[104] IBM CEO Virginia Rometty said she wanted Watson to generate $10 billion in annual revenue within ten years.[105]

In 2017, IBM and MIT established a new joint research venture in artificial intelligence. IBM invested $240 million to create the MIT–IBM Watson AI Lab in partnership with MIT, which brings together researchers in academia and industry to advance AI research, with projects ranging from computer vision and NLP to devising new ways to ensure that AI systems are fair, reliable, and secure.[106] In March 2018, IBM CEO Ginni Rometty proposed "Watson's Law", the "use of and application of business, smart cities, consumer applications and life in general".[107]