This article is part of the series on |
Chess programming |
---|
![]() |
Computer chess includes both hardware (dedicated computers) andsoftware capable of playingchess. Computer chess provides opportunities for players to practice even in the absence of human opponents, and also provides opportunities for analysis, entertainment and training. Computer chess applications that play at the level of achess grandmaster or higher are available on hardware fromsupercomputers tosmart phones. Standalone chess-playing machines are also available.Stockfish,Leela Chess Zero,GNU Chess,Fruit, and other free open source applications are available for various platforms.
Computer chess applications, whether implemented in hardware or software, use different strategies than humans to choose their moves: they useheuristic methods to build, search and evaluatetrees representing sequences of moves from the current position and attempt to execute the best such sequence during play. Such trees are typically quite large, thousands to millions of nodes. The computational speed of modern computers, capable of processing tens of thousands to hundreds of thousands of nodes or more per second, along with extension and reduction heuristics that narrow the tree to mostly relevant nodes, make such an approach effective.
The first chess machines capable of playing chess or reduced chess-like games were software programs running on digital computers early in thevacuum-tube computer age (1950s). The early programs played so poorly that even a beginner could defeat them. Within 40 years, in 1997,chess engines running on super-computers or specialized hardware were capable ofdefeating even the best human players. By 2006,programs running on desktop PCs had attained the same capability. In 2006,Monty Newborn, Professor of Computer Science atMcGill University, declared: "the science has been done". Nevertheless,solving chess is not currently possible for modern computers due to thegame's extremely large number of possible variations.[1]
Computer chess was once considered the "Drosophila of AI", the edge ofknowledge engineering. The field is now considered a scientifically completed paradigm, and playing chess is a mundane computing activity.[2]
In the past, stand-alone chess machines (usually microprocessors running software chess programs; occasionally specialized hardware) were sold. Today,chess engines may be installed as software on ordinary devices likesmartphones andPCs,[3] either alone or alongsideGUI programs such asChessbase and themobile apps forChess.com andLichess (both primarily websites).[4] Examples offree and open source engines includeStockfish[5] andLeela Chess Zero[6] (Lc0). Chess.com maintains its ownproprietary engine named Torch.[7] Some chess engines, including Stockfish, have web versions made in languages likeWebAssembly andJavaScript.[8] Most chess programs and sites offer the ability to analyze positions and games using chess engines, and some offer the ability to play against engines (which can be set to play at custom levels of strength) as though they were normal opponents.
Hardware requirements for chess engines are minimal, but performance will vary with processor speed, and memory, needed to hold largetransposition tables. Most modern chess engines, such as Stockfish, rely onefficiently updatable neural networks, tailored to be run exclusively onCPUs,[9][10] but Lc0 uses networks reliant onGPU performance.[11][12] Top engines such as Stockfish can be expected to beat the world's best players reliably, even when running on consumer-grade hardware.[13]
Perhaps the most common type of chess software are programs that simply play chess. A human player makes a move on the board, the AI calculates and plays a subsequent move, and the human and AI alternate turns until the game ends. Thechess engine, which calculates the moves, and thegraphical user interface (GUI) are sometimes separate programs. Different engines can be connected to the GUI, permitting play against different styles of opponent. Engines often have a simple textcommand-line interface, while GUIs may offer a variety of piece sets, board styles, or even 3D or animated pieces. Because recent engines are so capable, engines or GUIs may offer some way of handicapping the engine's ability, to improve the odds for a win by the human player.Universal Chess Interface (UCI) engines such asFritz orRybka may have a built-in mechanism for reducing theElo rating of the engine (via UCI's uci_limitstrength and uci_elo parameters). Some versions ofFritz have a Handicap and Fun mode for limiting the current engine or changing the percentage of mistakes it makes or changing its style.Fritz also has a Friend Mode where during the game it tries to match the level of the player.
Chess databases allow users to search through a large library of historical games, analyze them, check statistics, and formulate an opening repertoire.Chessbase (for PC) is a common program for these purposes amongst professional players, but there are alternatives such asShane's Chess Information Database (Scid)[14] for Windows, Mac or Linux,Chess Assistant[15] for PC,[16] Gerhard Kalab's Chess PGN Master for Android[17] or Giordano Vicoli's Chess-Studio for iOS.[18]
Programs such asPlaychess allow players to play against one another over the internet.
Chess training programs teach chess.Chessmaster had playthrough tutorials by IMJosh Waitzkin and GMLarry Christiansen.Stefan Meyer-Kahlen offersShredder Chess Tutor based on the Step coursebooks of Rob Brunia and Cor Van Wijgerden. FormerWorld ChampionMagnus Carlsen'sPlay Magnus company released aMagnus Trainer app for Android and iOS.Chessbase hasFritz and Chesster for children. Convekta provides a large number of training apps such as CT-ART and its Chess King line based on tutorials by GM Alexander Kalinin and Maxim Blokh.
There is alsosoftware for handling chess problems.
After discovering refutation screening—the application ofalpha–beta pruning to optimizing move evaluation—in 1957, a team atCarnegie Mellon University predicted that a computer would defeat the world human champion by 1967.[19] It did not anticipate the difficulty of determining the right order to evaluate moves. Researchers worked to improve programs' ability to identifykiller heuristics, unusually high-scoring moves to reexamine when evaluating other branches, but into the 1970s most top chess players believed that computers would not soon be able to play at aMaster level.[20] In 1968,International MasterDavid Levy made a famous bet that no chess computer would be able to beat him within ten years,[21] and in 1976Senior Master and professor of psychologyEliot Hearst ofIndiana University wrote that "the only way a current computer program could ever win a single game against a master player would be for the master, perhaps in a drunken stupor while playing 50 games simultaneously, to commit some once-in-a-year blunder".[20]
In the late 1970s chess programs began defeating highly skilled human players.[20] The year of Hearst's statement,Northwestern University'sChess 4.5 at thePaul Masson American Chess Championship'sClass B level became the first to win a human tournament. Levy won his bet in 1978 by beatingChess 4.7, but it achieved the first computer victory against a Master-class player at the tournament level by winning one of the six games.[21] In 1980,Belle began often defeating Masters. By 1982 two programs played at Master level and three were slightly weaker.[20]
The sudden improvement without a theoretical breakthrough was unexpected, as many did not expect that Belle's ability to examine 100,000 positions a second—about eight plies—would be sufficient. The Spracklens, creators of the successful microcomputer programSargon, estimated that 90% of the improvement came from faster evaluation speed and only 10% from improved evaluations.New Scientist stated in 1982 that computers "playterrible chess ... clumsy, inefficient, diffuse, and just plain ugly", but humans lost to them by making "horrible blunders, astonishing lapses, incomprehensible oversights, gross miscalculations, and the like" much more often than they realized; "in short, computers win primarily through their ability to find and exploit miscalculations in human initiatives".[20]
By 1982, microcomputer chess programs could evaluate up to 1,500 moves a second and were as strong as mainframe chess programs of five years earlier, able to defeat a majority of amateur players. While only able to look ahead one or two plies more than at their debut in the mid-1970s, doing so improved their play more than experts expected; seemingly minor improvements "appear to have allowed the crossing of a psychological threshold, after which a rich harvest of human error becomes accessible",New Scientist wrote.[20] While reviewingSPOC in 1984,BYTE wrote that "Computers—mainframes, minis, and micros—tend to play ugly, inelegant chess", but notedRobert Byrne's statement that "tactically they are freer from error than the average human player". The magazine describedSPOC as a "state-of-the-art chess program" for the IBM PC with a "surprisingly high" level of play, and estimated its USCF rating as 1700 (Class B).[22]
At the 1982North American Computer Chess Championship,Monroe Newborn predicted that a chess program could become world champion within five years; tournament director and International MasterMichael Valvo predicted ten years; the Spracklens predicted 15;Ken Thompson predicted more than 20; and others predicted that it would never happen. The most widely held opinion, however, stated that it would occur around the year 2000.[23] In 1989, Levy was defeated byDeep Thought in an exhibition match. Deep Thought, however, was still considerably below World Championship level, as the reigning world champion,Garry Kasparov, demonstrated in two strong wins in 1989. It was not until a 1996 match withIBM'sDeep Blue that Kasparov lost his first game to a computer at tournament time controls inDeep Blue versus Kasparov, 1996, game 1. This game was, in fact, the first time a reigning world champion had lost to a computer using regular time controls. However, Kasparov regrouped to win three anddraw two of the remaining five games of the match, for a convincing victory.
In May 1997, an updated version of Deep Blue defeated Kasparov 3½–2½ in a return match. A documentary mainly about the confrontation was made in 2003, titledGame Over: Kasparov and the Machine.
a | b | c | d | e | f | g | h | ||
8 | ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() ![]() | 8 | |||||||
7 | 7 | ||||||||
6 | 6 | ||||||||
5 | 5 | ||||||||
4 | 4 | ||||||||
3 | 3 | ||||||||
2 | 2 | ||||||||
1 | 1 | ||||||||
a | b | c | d | e | f | g | h |
With increasing processing power and improved evaluation functions, chess programs running on commercially available workstations began to rival top-flight players. In 1998,Rebel 10 defeatedViswanathan Anand, who at the time was ranked second in the world, by a score of 5–3. However, most of those games were not played at normal time controls. Out of the eight games, four wereblitz games (five minutes plus five secondsFischer delay for each move); these Rebel won 3–1. Two were semi-blitz games (fifteen minutes for each side) that Rebel won as well (1½–½). Finally, two games were played as regular tournament games (forty moves in two hours, one hour sudden death); here it was Anand who won ½–1½.[24] In fast games, computers played better than humans, but at classical time controls – at which a player's rating is determined – the advantage was not so clear.
In the early 2000s, commercially available programs such asJunior andFritz were able to draw matches against former world champion Garry Kasparov and classical world championVladimir Kramnik.
In October 2002, Vladimir Kramnik and Deep Fritz competed in the eight-gameBrains in Bahrain match, which ended in a draw. Kramnik won games 2 and 3 by "conventional"anti-computer tactics – play conservatively for a long-term advantage the computer is not able to see in itsgame tree search. Fritz, however, won game 5 after a severe blunder by Kramnik. Game 6 was described by the tournament commentators as "spectacular". Kramnik, in a better position in the earlymiddlegame, tried a piece sacrifice to achieve a strong tactical attack, a strategy known to be highly risky against computers who are at their strongest defending against such attacks. True to form, Fritz found a watertight defense and Kramnik's attack petered out leaving him in a bad position. Kramnik resigned the game, believing the position lost. However, post-game human and computer analysis has shown that the Fritz program was unlikely to have been able to force a win and Kramnik effectively sacrificed a drawn position. The final two games were draws. Given the circumstances, most commentators still rate Kramnik the stronger player in the match.[citation needed]
In January 2003, Kasparov playedJunior, another chess computer program, in New York City. The match ended 3–3.
In November 2003, Kasparov playedX3D Fritz. The match ended 2–2.
In 2005,Hydra, a dedicated chess computer with custom hardware and sixty-four processors and also winner of the 14thIPCCC in 2005, defeated seventh-rankedMichael Adams 5½–½ in a six-game match (though Adams' preparation was far less thorough than Kramnik's for the 2002 series).[25]
In November–December 2006, World Champion Vladimir Kramnik played Deep Fritz. This time the computer won; the match ended 2–4. Kramnik was able to view the computer's opening book. In the first five games Kramnik steered the game into a typical "anti-computer" positional contest. He lost one game (overlooking a mate in one), and drew the next four. In the final game, in an attempt to draw the match, Kramnik played the more aggressiveSicilian Defence and was crushed.
There was speculation that interest in human–computer chess competition would plummet as a result of the 2006 Kramnik-Deep Fritz match.[26] According to Newborn, for example, "the science is done".[27]
Human–computer chess matches showed the best computer systems overtaking human chess champions in the late 1990s. For the 40 years prior to that, the trend had been that the best machines gained about 40 points per year in theElo rating while the best humans only gained roughly 2 points per year.[28] The highest rating obtained by a computer in human competition was Deep Thought's USCF rating of 2551 in 1988 and FIDE no longer accepts human–computer results in their rating lists. Specialized machine-only Elo pools have been created for rating machines, but such numbers, while similar in appearance, are not directly compared.[29] In 2016, theSwedish Chess Computer Association rated computer programKomodo at 3361.
Chess engines continue to improve. In 2009, chess engines running on slower hardware reached thegrandmaster level. Amobile phone won acategory 6 tournament with a performance rating 2898: chess engineHiarcs 13 running insidePocket Fritz 4 on the mobile phoneHTC Touch HD won the Copa Mercosur tournament inBuenos Aires, Argentina with 9 wins and 1 draw on August 4–14, 2009.[30] Pocket Fritz 4 searches fewer than 20,000 positions per second.[31] This is in contrast to supercomputers such as Deep Blue that searched 200 million positions per second.
Advanced Chess is a form of chess developed in 1998 by Kasparov where a human plays against another human, and both have access to computers to enhance their strength. The resulting "advanced" player was argued by Kasparov to be stronger than a human or computer alone. This has been proven in numerous occasions, such as at Freestyle Chess events.
Players today are inclined to treat chess engines as analysis tools rather than opponents.[32] Chess grandmasterAndrew Soltis stated in 2016 "The computers are just much too good" and that world championMagnus Carlsen won't play computer chess because "he just loses all the time and there's nothing more depressing than losing without even being in the game."[33]
Since the era of mechanical machines that played rook and king endings and electrical machines that played other games likehex in the early years of the 20th century, scientists and theoreticians have sought to develop a procedural representation of how humans learn, remember, think and apply knowledge, and the game of chess, because of its daunting complexity, became the "Drosophila of artificial intelligence (AI)".[Note 1] The procedural resolution of complexity became synonymous with thinking, and early computers, even before the chess automaton era, were popularly referred to as "electronic brains". Several different schema were devised starting in the latter half of the 20th century to represent knowledge and thinking, as applied to playing the game of chess (and other games like checkers):
Using "ends-and-means" heuristics a human chess player can intuitively determine optimal outcomes and how to achieve them regardless of the number of moves necessary, but a computer must be systematic in its analysis. Most players agree thatlooking at least five moves ahead (tenplies) when necessary is required to play well. Normal tournament rules give each player an average of three minutes per move. On average there are more than 30 legal moves per chess position, so a computer must examine a quadrillion possibilities to look ahead ten plies (five full moves); one that could examine a million positions a second would require more than 30 years.[20]
The earliest attempts at procedural representations of playing chess predated the digital electronic age, but it was the stored program digital computer that gave scope to calculating such complexity. Claude Shannon, in 1949, laid out the principles of algorithmic solution of chess. In that paper, the game is represented by a "tree", or digital data structure of choices (branches) corresponding to moves. The nodes of the tree were positions on the board resulting from the choices of move. The impossibility of representing an entire game of chess by constructing a tree from first move to last was immediately apparent: there are an average of 36 moves per position in chess and an average game lasts about 35 moves to resignation (60-80 moves if played to checkmate, stalemate, or other draw). There are 400 positions possible after the first move by each player, about 200,000 after two moves each, and nearly 120 million after just 3 moves each.
So a limited lookahead (search) to some depth, followed by using domain-specific knowledge to evaluate the resulting terminal positions was proposed. A kind of middle-ground position, given good moves by both sides, would result, and its evaluation would inform the player about the goodness or badness of the moves chosen. Searching and comparing operations on the tree were well suited to computer calculation; the representation of subtle chess knowledge in the evaluation function was not. The early chess programs suffered in both areas: searching the vast tree required computational resources far beyond those available, and what chess knowledge was useful and how it was to be encoded would take decades to discover.
The developers of a chess-playing computer system must decide on a number of fundamental implementation issues. These include:
Adriaan de Groot interviewed a number of chess players of varying strengths, and concluded that bothmasters and beginners look at around forty to fifty positions before deciding which move to play. What makes the former much better players is that they usepattern recognition skills built from experience. This enables them to examine some lines in much greater depth than others by simply not considering moves they can assume to be poor. More evidence for this being the case is the way that good human players find it much easier to recall positions from genuine chess games, breaking them down into a small number of recognizable sub-positions, rather than completely random arrangements of the same pieces. In contrast, poor players have the same level of recall for both.
The equivalent of this in computer chess areevaluation functions for leaf evaluation, which correspond to the human players' pattern recognition skills, and the use of machine learning techniques in training them, such as Texel tuning,stochastic gradient descent, andreinforcement learning, which corresponds to building experience in human players. This allows modern programs to examine some lines in much greater depth than others by using forwards pruning and other selective heuristics to simply not consider moves the program assume to be poor through their evaluation function, in the same way that human players do. The only fundamental difference between a computer program and a human in this sense is that a computer program can search much deeper than a human player could, allowing it to search more nodes and bypass thehorizon effect to a much greater extent than is possible with human players.
Computer chess programs usually support a number of commonde facto standards. Nearly all of today's programs can read and write game moves asPortable Game Notation (PGN), and can read and write individual positions asForsyth–Edwards Notation (FEN). Older chess programs often only understoodlong algebraic notation, but today users expect chess programs to understand standardalgebraic chess notation.
Starting in the late 1990s, programmers began to develop separatelyengines (with acommand-line interface which calculates which moves are strongest in a position) or agraphical user interface (GUI) which provides the player with a chessboard they can see, and pieces that can be moved. Engines communicate their moves to the GUI using a protocol such as the Chess Engine Communication Protocol (CECP) orUniversal Chess Interface (UCI). By dividing chess programs into these two pieces, developers can write only the user interface, or only the engine, without needing to write both parts of the program. (See alsochess engine.)
Developers have to decide whether to connect the engine to an opening book and/or endgametablebases or leave this to the GUI.
Thedata structure used to represent each chess position is key to the performance of move generation andposition evaluation. Methods include pieces stored in an array ("mailbox" and "0x88"), piece positions stored in a list ("piece list"), collections of bit-sets for piece locations ("bitboards"), andhuffman coded positions for compact long-term storage.
Computer chess programs consider chess moves as agame tree. In theory, they examine all moves, then all counter-moves to those moves, then all moves countering them, and so on, where each individual move by one player is called a "ply". This evaluation continues until a certain maximum search depth or the program determines that a final "leaf" position has been reached (e.g. checkmate).
One particular type of search algorithm used in computer chess areminimax search algorithms, where at each ply the "best" move by the player is selected; one player is trying to maximize the score, the other to minimize it. By this alternating process, one particular terminal node whose evaluation represents the searched value of the position will be arrived at. Its value is backed up to the root, and that evaluation becomes the valuation of the position on the board. This search process is called minimax.
A naive implementation of the minimax algorithm can only search to a small depth in a practical amount of time, so various methods have been devised to greatly speed the search for good moves.Alpha–beta pruning, a system of defining upper and lower bounds on possible search results and searching until the bounds coincided, is typically used to reduce the search space of the program.
In addition, various selective search heuristics, such asquiescence search, forward pruning, search extensions and search reductions, are also used as well. These heuristics are triggered based on certain conditions in an attempt to weed out obviously bad moves (history moves) or to investigate interesting nodes (e.g. check extensions,passed pawns on seventhrank, etc.). These selective search heuristics have to be used very carefully however. If the program overextends, it wastes too much time looking at uninteresting positions. If too much is pruned or reduced, there is a risk of cutting out interesting nodes.
Monte Carlo tree search (MCTS) is a heuristic search algorithm which expands the search tree based on random sampling of the search space. A version of Monte Carlo tree search commonly used in computer chess is PUCT, Predictor and Upper Confidence bounds applied to Trees.
DeepMind'sAlphaZero andLeela Chess Zero uses MCTS instead of minimax. Such engines usebatching ongraphics processing units in order to calculate theirevaluation functions and policy (move selection), and therefore require aparallel search algorithm as calculations on the GPU are inherently parallel. The minimax and alpha-beta pruning algorithms used in computer chess are inherently serial algorithms, so would not work well with batching on the GPU. On the other hand, MCTS is a good alternative, because the random sampling used in Monte Carlo tree search lends itself well to parallel computing, and is why nearly all engines which support calculations on the GPU use MCTS instead of alpha-beta.
Many other optimizations can be used to make chess-playing programs stronger. For example,transposition tables are used to record positions that have been previously evaluated, to save recalculation of them.Refutation tables record key moves that "refute" what appears to be a good move; these are typically tried first in variant positions (since a move that refutes one position is likely to refute another). The drawback is that transposition tables at deep ply depths can get quite large – tens to hundreds of millions of entries. IBM's Deep Blue transposition table in 1996, for example was 500 million entries. Transposition tables that are too small can result in spending more time searching for non-existent entries due to threshing than the time saved by entries found. Many chess engines usepondering, searching to deeper levels on the opponent's time, similar to human beings, to increase their playing strength.
Of course, faster hardware and additional memory can improve chess program playing strength. Hyperthreaded architectures can improve performance modestly if the program is running on a single core or a small number of cores. Most modern programs are designed to take advantage of multiple cores to do parallel search. Other programs are designed to run on a general purpose computer and allocate move generation, parallel search, or evaluation to dedicated processors or specialized co-processors.
Thefirst paper on chess search was byClaude Shannon in 1950.[34] He predicted the two main possible search strategies which would be used, which he labeled "Type A" and "Type B",[35] before anyone had programmed a computer to play chess.
Type A programs would use a "brute force" approach, examining every possible position for a fixed number of moves using a pure naiveminimax algorithm. Shannon believed this would be impractical for two reasons.
First, with approximately thirty moves possible in a typical real-life position, he expected that searching the approximately 109 positions involved in looking three moves ahead for both sides (sixplies) would take about sixteen minutes, even in the "very optimistic" case that the chess computer evaluated a million positions every second. (It took about forty years to achieve this speed.) A later search algorithm calledalpha–beta pruning, a system of defining upper and lower bounds on possible search results and searching until the bounds coincided, reduced the branching factor of the game tree logarithmically, but it still was not feasible for chess programs at the time to exploit the exponential explosion of the tree.
Second, it ignored the problem of quiescence, trying to only evaluate a position that is at the end of anexchange of pieces or other important sequence of moves ('lines'). He expected that adapting minimax to cope with this would greatly increase the number of positions needing to be looked at and slow the program down still further. He expected that adapting type A to cope with this would greatly increase the number of positions needing to be looked at and slow the program down still further.
This led naturally to what is referred to as "selective search" or "type B search", using chess knowledge (heuristics) to select a few presumably good moves from each position to search, and prune away the others without searching. Instead of wasting processing power examining bad or trivial moves, Shannon suggested that type B programs would use two improvements:
This would enable them to look further ahead ('deeper') at the most significant lines in a reasonable time. However, early attempts at selective search often resulted in the best move or moves being pruned away. As a result, little or no progress was made for the next 25 years dominated by this first iteration of the selective search paradigm. The best program produced in this early period was Mac Hack VI in 1967; it played at the about the same level as the average amateur (C class on the United States Chess Federation rating scale).
Meanwhile, hardware continued to improve, and in 1974, brute force searching was implemented for the first time in the Northwestern University Chess 4.0 program. In this approach, all alternative moves at a node are searched, and none are pruned away. They discovered that the time required to simply search all the moves was much less than the time required to apply knowledge-intensive heuristics to select just a few of them, and the benefit of not prematurely or inadvertently pruning away good moves resulted in substantially stronger performance.
In the 1980s and 1990s, progress was finally made in the selective search paradigm, with the development ofquiescence search, null move pruning, and other modern selective search heuristics. These heuristics had far fewer mistakes than earlier heuristics did, and was found to be worth the extra time it saved because it could search deeper and widely adopted by many engines. While many modern programs do usealpha-beta search as a substrate for their search algorithm, these additional selective search heuristics used in modern programs means that the program no longer does a "brute force" search. Instead they heavily rely on these selective search heuristics to extend lines the program considers good and prune and reduce lines the program considers bad, to the point where most of the nodes on the search tree are pruned away, enabling modern programs to search very deep.
In 2006,Rémi Coulom createdMonte Carlo tree search, another kind of type B selective search. In 2007, an adaption of Monte Carlo tree search called Upper Confidence bounds applied to Trees or UCT for short was created by Levente Kocsis and Csaba Szepesvári. In 2011, Chris Rosin developed a variation of UCT called Predictor + Upper Confidence bounds applied to Trees, or PUCT for short. PUCT was then used inAlphaZero in 2017, and later inLeela Chess Zero in 2018.
In the 1970s, most chess programs ran on super computers like Control Data Cyber 176s or Cray-1s, indicative that during that developmental period for computer chess, processing power was the limiting factor in performance. Most chess programs struggled to search to a depth greater than 3 ply. It was not until the hardware chess machines of the 1980s, that a relationship between processor speed and knowledge encoded in the evaluation function became apparent.
It has been estimated that doubling the computer speed gains approximately fifty to seventyElo points in playing strength (Levy & Newborn 1991:192).
For most chess positions, computers cannot look ahead to all possible final positions. Instead, they must look ahead a fewplies and compare the possible positions, known as leaves. The algorithm that evaluates leaves is termed the "evaluation function", and these algorithms are often vastly different between different chess programs. Evaluation functions typically evaluate positions in hundredths of a pawn (called a centipawn), where by convention, a positive evaluation favors White, and a negative evaluation favors Black. However, some evaluation function output win/draw/loss percentages instead of centipawns.
Historically, handcrafted evaluation functions consider material value along with other factors affecting the strength of each side. When counting up the material for each side, typical values for pieces are 1 point for apawn, 3 points for aknight orbishop, 5 points for arook, and 9 points for aqueen. (SeeChess piece relative value.) Theking is sometimes given an arbitrarily high value such as 200 points (Shannon's paper) to ensure that a checkmate outweighs all other factors (Levy & Newborn 1991:45). In addition to points for pieces, most handcrafted evaluation functions take many factors into account, such as pawn structure, the fact that a pair of bishops are usually worth more, centralized pieces are worth more, and so on. The protection of kings is usually considered, as well as the phase of the game (opening, middle or endgame).Machine learning techniques such as Texel turning,stochastic gradient descent, orreinforcement learning are usually used to optimise handcrafted evaluation functions.
Most modern evaluation functions make use ofneural networks. The most common evaluation function in use today is theefficiently updatable neural network, which is a shallow neural network whose inputs arepiece-square tables. Piece-square tables are a set of 64 values corresponding to the squares of the chessboard, and there typically exists a piece-square table for every piece and colour, resulting in 12 piece-square tables and thus 768 inputs into the neural network. In addition, some engines usedeep neural networks in their evaluation function. Neural networks are usually trained using somereinforcement learning algorithm, in conjunction withsupervised learning orunsupervised learning.
The output of the evaluation function is a single scalar, quantized in centipawns or other units, which is, in the case of handcrafted evaluation functions, a weighted summation of the various factors described, or in the case of neural network based evaluation functions, the output of the head of the neural network. The evaluation putatively represents or approximates the value of the subtree below the evaluated node as if it had been searched to termination, i.e. the end of the game. During the search, an evaluation is compared against evaluations of other leaves, eliminating nodes that represent bad or poor moves for either side, to yield a node which by convergence, represents the value of the position with best play by both sides.
Endgame play had long been one of the great weaknesses of chess programs because of the depth of search needed. Some otherwise master-level programs were unable to win in positions where even intermediate human players could force a win.
To solve this problem, computers have been used to analyze somechess endgame positions completely, starting withking andpawn against king. Such endgame tablebases are generated in advance using a form ofretrograde analysis, starting with positions where the final result is known (e.g., where one side has been mated) and seeing which other positions are one move away from them, then which are one move from those, etc.Ken Thompson was a pioneer in this area.
The results of the computer analysis sometimes surprised people. In 1977 Thompson's Belle chess machine used the endgame tablebase for a king androok against king andqueen and was able to draw that theoretically lost ending against several masters (seePhilidor position#Queen versus rook). This was despite not following the usual strategy to delay defeat by keeping the defending king and rook close together for as long as possible. Asked to explain the reasons behind some of the program's moves, Thompson was unable to do so beyond saying the program's database simply returned the best moves.
Most grandmasters declined to play against the computer in the queen versus rook endgame, butWalter Browne accepted the challenge. A queen versus rook position was set up in which the queen can win in thirty moves, with perfect play. Browne was allowed 2½ hours to play fifty moves, otherwise a draw would be claimed under thefifty-move rule. After forty-five moves, Browne agreed to a draw, being unable to force checkmate or win the rook within the next five moves. In the final position, Browne was still seventeen moves away from checkmate, but not quite that far away from winning the rook. Browne studied the endgame, and played the computer again a week later in a different position in which the queen can win in thirty moves. This time, he captured the rook on the fiftieth move, giving him a winning position.[36][37]
Other positions, long believed to be won, turned out to take more moves against perfect play to actually win than were allowed by chess's fifty-move rule. As a consequence, for some years the official FIDE rules of chess were changed to extend the number of moves allowed in these endings. After a while, the rule reverted to fifty moves in all positions – more such positions were discovered, complicating the rule still further, and it made no difference in human play, as they could not play the positions perfectly.
Over the years, otherendgame database formats have been released including the Edward Tablebase, the De Koning Database and theNalimov Tablebase which is used by many chess programs such asRybka,Shredder andFritz. Tablebases for all positions with six pieces are available.[38] Some seven-piece endgames have been analyzed by Marc Bourzutschky and Yakov Konoval.[39] Programmers using the Lomonosov supercomputers in Moscow have completed a chess tablebase for all endgames with seven pieces or fewer (trivial endgame positions are excluded, such as six white pieces versus a lone blackking).[40][41] In all of these endgame databases it is assumed that castling is no longer possible.
Many tablebases do not consider the fifty-move rule, under which a game where fifty moves pass without a capture or pawn move can be claimed to be a draw by either player. This results in the tablebase returning results such as "Forced mate in sixty-six moves" in some positions which would actually be drawn because of the fifty-move rule. One reason for this is that if the rules of chess were to be changed once more, giving more time to win such positions, it will not be necessary to regenerate all the tablebases. It is also very easy for the program using the tablebases to notice and take account of this 'feature' and in any case if using an endgame tablebase will choose the move that leads to the quickest win (even if it would fall foul of the fifty-move rule with perfect play). If playing an opponent not using a tablebase, such a choice will give good chances of winning within fifty moves.
The Nalimov tablebases, which use state-of-the-artcompression techniques, require 7.05GB of hard disk space for all five-piece endings. To cover all the six-piece endings requires approximately 1.2TB. It is estimated that a seven-piece tablebase requires between 50 and 200TB of storage space.[42]
Endgame databases featured prominently in 1999, when Kasparov played anexhibition match on the Internet against the rest of the world. A seven pieceQueen andpawn endgame was reached with the World Team fighting to salvage a draw.Eugene Nalimov helped by generating the six piece ending tablebase where both sides had two Queens which was used heavily to aid analysis by both sides.
The most popular endgame tablebase is syzygy which is used by most top computer programs likeStockfish,Leela Chess Zero, andKomodo. It is also significantly smaller in size than other formats, with 7-piece tablebases taking only 18.4 TB.[43]
For a current state-of-the art chess engine like Stockfish, a table base only provides a very minor increase in playing strength (approximately 3 Elo points for syzygy 6men as of Stockfish 15).[44]
Chess engines, like human beings, may save processing time as well as select variations known to be strong via referencing anopening book stored in a database. Opening books cover the opening moves of a game to variable depth, depending on opening and variation, but usually to the first 10-12 moves (20-24 ply). In the early eras of computer chess, trusting variations studied in-depth by human grandmasters for decades was superior to the weak performance of mid-20th-century engines. And even in the contemporary era, allowing computer engines to extensively analyze various openings at their leisure beforehand, and then simply consult the results when in a game, speeds up their play.
In the 1990s, some theorists believed that chess engines of the day had much of their strength in memorized opening books and knowledge dedicated to known positions, and thus believed a validanti-computer tactic would be to intentionally play some out-of-book moves in order to force the chess program to think for itself. This seems to have been a dubious assumption even then; Garry Kasparov tried it via using the non-standardMieses Opening at the 1997Deep Blue versus Garry Kasparov Game 1 match, but lost. This tactic became even weaker as time passed; the opening books stored in computer databases can be far more extensive than even the best prepared humans, meaning computers will be well-prepared for even rare variations and know the correct play. More generally, the play of engines even in fully unknown situations (as comes up in variants such asChess960) is still exceptionally strong, so the lack of an opening book isn't even a major disadvantage for tactically sharp chess engines, who can discover strong moves in unfamiliar board variations accurately.
In contemporary engine tournaments, engines are often told to play situations from a variety of openings, including unbalanced ones, to reduce the draw rate and to add more variety to the games.[45]
CEGT,[46] CSS,[47]SSDF,[48] WBEC,[49]REBEL,[50] FGRL,[51] and IPON[52] maintain rating lists allowing fans to compare the strength of engines. Various versions ofStockfish,Komodo,Leela Chess Zero, andFat Fritz dominate the rating lists in the early 2020s.
CCRL (Computer Chess Rating Lists) is an organisation that tests computerchess engines'strength by playing the programs against each other. CCRL was founded in 2006 to promote computer-computer competition and tabulate results on a rating list.[53]
The organisation runs three different lists: 40/40 (40 minutes for every 40 moves played), 40/4 (4 minutes for every 40 moves played), and 40/4FRC (same time control but Chess960).[Note 2] Pondering (orpermanent brain) is switched off and timing is adjusted to the AMD64 X2 4600+ (2.4 GHz)CPU by usingCrafty 19.17 BH as a benchmark. Generic, neutralopening books are used (as opposed to the engine's own book) up to a limit of 12 moves into the game alongside 4 or 5 mantablebases.[53][54][55]
The idea of creating a chess-playing machine dates back to the eighteenth century. Around 1769, the chess playingautomaton calledThe Turk, created byHungarian inventorFarkas Kempelen, became famous before being exposed as a hoax. Before the development ofdigital computing, serious trials based on automata such asEl Ajedrecista of 1912, built by Spanish engineerLeonardo Torres Quevedo, which played a king and rook versus king ending, were too complex and limited to be useful for playing full games of chess. The field of mechanical chess research languished until the advent of the digital computer in the 1950s.
Since then, chess enthusiasts andcomputer engineers have built, with increasing degrees of seriousness and success, chess-playing machines and computer programs. One of the few chess grandmasters to devote himself seriously to computer chess was formerWorld Chess ChampionMikhail Botvinnik, who wrote several works on the subject. Botvinnik's interest in Computer Chess started in the 50s, favouring chess algorithms based on Shannon's selective type B strategy, as discussed along with Max Euwe 1958 in Dutch Television. Working with relatively primitive hardware available in theSoviet Union in the early 1960s, Botvinnik had no choice but to investigate software move selection techniques; at the time only the most powerful computers could achieve much beyond a three-ply full-width search, and Botvinnik had no such machines. In 1965 Botvinnik was a consultant to the ITEP team in a US-Soviet computer chess match which won a correspondence chess match against the Kotok-McCarthy-Program led by John McCarthy in 1967.(seeKotok-McCarthy). Later he advised the team that created the chess program Kaissa at Moscow's Institute of Control Sciences. Botvinnik had his own ideas to model a Chess Master's Mind. After publishing and discussing his early ideas on attack maps and trajectories at Moscow Central Chess Clubin 1966, he found Vladimir Butenko as supporter and collaborator. Butenko first implemented the 15x15 vector attacks board representation on a M-20 computer, determining trajectories. After Botvinnik introduced the concept of Zones in 1970, Butenko refused further cooperation and began to write his own program, dubbed Eureka. In the 70s and 80s, leading a team around Boris Stilman, Alexander Yudin, Alexander Reznitskiy, Michael Tsfasman and Mikhail Chudakov, Botvinnik worked on his own project 'Pioneer' - which was an Artificial Intelligence based chess project. In the 90s, Botvinnik already in his 80s, he worked on the new project 'CC Sapiens'.
One developmental milestone occurred when the team fromNorthwestern University, which was responsible for theChess series of programs and won the first threeACMComputer Chess Championships (1970–72), abandoned type B searching in 1973. The resulting program, Chess 4.0, won that year's championship and its successors went on to come in second in both the 1974 ACM Championship and that year's inauguralWorld Computer Chess Championship, before winning the ACM Championship again in 1975, 1976 and 1977. The type A implementation turned out to be just as fast: in the time it used to take to decide which moves were worthy of being searched, it was possible just to search all of them. In fact, Chess 4.0 set the paradigm that was and still is followed essentially by all modern Chess programs today, and that had been successfully started by the Russian ITEP in 1965.
In 1978, an early rendition of Ken Thompson's hardware chess machineBelle, entered and won the North American Computer Chess Championship over the dominant Northwestern University Chess 4.7.
Technological advances by orders of magnitude in processing power have made the brute force approach far more incisive than was the case in the early years. The result is that a very solid, tactical AI player aided by some limited positional knowledge built in by the evaluation function and pruning/extension rules began to match the best players in the world. It turned out to produce excellent results, at least in the field of chess, to let computers do what they do best (calculate) rather than coax them into imitating human thought processes and knowledge. In 1997Deep Blue, a brute-force machine capable of examining 500 million nodes per second, defeated World Champion Garry Kasparov, marking the first time a computer has defeated a reigning world chess champion in standard time control.
In 2016,NPR asked experts to characterize the playing style of computer chess engines.Murray Campbell of IBM stated that "Computers don't have any sense of aesthetics... They play what they think is the objectively best move in any position, even if it looks absurd, and they can play any move no matter how ugly it is." Grandmasters Andrew Soltis andSusan Polgar stated that computers are more likely to retreat than humans are.[33]
Whileneural networks have been used in theevaluation functions of chess engines since the late 1980s, with programs such as NeuroChess, Morph, Blondie25, Giraffe,AlphaZero, andMuZero,[56][57][58][59][60] neural networks did not become widely adopted by chess engines until the arrival ofefficiently updatable neural networks in the summer of 2020. Efficiently updatable neural networks were originally developed incomputer shogi in 2018 by Yu Nasu,[61][62] and had to be first ported to a derivative of Stockfish called Stockfish NNUE on 31 May 2020,[63] and integrated into the official Stockfish engine on 6 August 2020,[64][65] before other chess programmers began to adopt neural networks into their engines.
Some people, such as theRoyal Society'sVenki Ramakrishnan, believe thatAlphaZero led to the widespread adoption of neural networks in chess engines.[66] However, AlphaZero influenced very few engines to begin using neural networks, and those tended to be new experimental engines such asLeela Chess Zero, which began specifically to replicate the AlphaZero paper. Thedeep neural networks used in AlphaZero's evaluation function required expensivegraphics processing units, which were not compatible with existing chess engines. The vast majority of chess engines only usecentral processing units, and computing and processing information on the GPUs require speciallibraries in the backend such asNvidia'sCUDA, which none of the engines had access to. Thus the vast majority of chess engines such asKomodo andStockfish continued to use handcrafted evaluation functions untilefficiently updatable neural networks were ported to computer chess fromcomputer shogi in 2020, which did not require either the use of GPUs or libraries like CUDA at all. Even then, the neural networks used in computer chess are fairly shallow, and thedeep reinforcement learning methods pioneered by AlphaZero are still extremely rare in computer chess.
a | b | c | d | e | f | |||
6 | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | 6 | |
5 | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | 5 | |
4 | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | 4 | |
3 | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | 3 | |
2 | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | 2 | |
1 | ![]() | ![]() | ![]() | ![]() | ![]() | ![]() | 1 | |
a | b | c | d | e | f |
These chess playing systems include custom hardware with approx. dates of introduction (excluding dedicated microcomputers):
In the late 1970s to early 1990s, there was a competitive market for dedicated chess computers. This market changed in the mid-1990s when computers with dedicated processors could no longer compete with the fast processors in personal computers.
Recently, some hobbyists have been using theMulti Emulator Super System to run the chess programs created for Fidelity or Hegener & Glaser's Mephisto computers on modern 64-bit operating systems such asWindows 10.[87] The author ofRebel, Ed Schröder has also adapted three of the Hegener & Glaser Mephisto's he wrote to work as UCI engines.[88]
These programs can be run on MS-DOS, and can be run on 64-bit Windows 10 via emulators such asDOSBox orQemu:[89]
Well-known computer chess theorists include:
The prospects of completelysolving chess are generally considered to be rather remote. It is widely conjectured that no computationally inexpensive method to solve chess exists even in the weak sense of determining with certainty the value of the initial position, and hence the idea of solving chess in the stronger sense of obtaining a practically usable description of a strategy for perfect play for either side seems unrealistic today. However, it has not been proven that no computationally cheap way of determining the best move in a chess position exists, nor even that a traditionalalpha–beta searcher running on present-day computing hardware could not solve the initial position in an acceptable amount of time. The difficulty in proving the latter lies in the fact that, while the number of board positions that could happen in the course of a chess game is huge (on the order of at least 1043[91] to 1047), it is hard to rule out with mathematical certainty the possibility that the initial position allows either side to force a mate or athreefold repetition after relatively few moves, in which case the search tree might encompass only a very small subset of the set of possible positions. It has been mathematically proven thatgeneralized chess (chess played with an arbitrarily large number of pieces on an arbitrarily large chessboard) isEXPTIME-complete,[92] meaning that determining the winning side in an arbitrary position of generalized chess provably takes exponential time in the worst case; however, this theoretical result gives no lower bound on the amount of work required to solve ordinary 8x8 chess.
Martin Gardner'sMinichess, played on a 5×5 board with approximately 1018 possible board positions, has been solved; its game-theoretic value is 1/2 (i.e. a draw can be forced by either side), and the forcing strategy to achieve that result has been described.
Progress has also been made from the other side: as of 2012, all 7 and fewer pieces (2 kings and up to 5 other pieces) endgames have been solved.
A "chess engine" is software that calculates and orders which moves are the strongest to play in a given position. Engine authors focus on improving the play of their engines, often just importing the engine into agraphical user interface (GUI) developed by someone else. Engines communicate with the GUI by standardized protocols such as the nowadays ubiquitousUniversal Chess Interface developed byStefan Meyer-Kahlen and Franz Huber. There are others, like the Chess Engine Communication Protocol developed by Tim Mann forGNU Chess andWinboard.Chessbase has its own proprietary protocol, and at one time Millennium 2000 had another protocol used forChessGenius. Engines designed for one operating system and protocol may be ported to other OS's or protocols. Chess engines are regularly matched against each other at dedicatedchess engine tournaments.
In 1997, theInternet Chess Club released its first Java client for playing chess online against other people inside one's webbrowser.[93] This was probably one of the first chess web apps.Free Internet Chess Server followed soon after with a similar client.[94] In 2004,International Correspondence Chess Federation opened up a web server to replace their email-based system.[95]Chess.com started offering Live Chess in 2007.[96]Chessbase/Playchess has long had a downloadable client, and added a web-based client in 2013.[97]
Another popular web app is tactics training. The now defunct Chess Tactics Server opened its site in 2006,[98] followed by Chesstempo the next year,[99] andChess.com added its Tactics Trainer in 2008.[100]Chessbase added a tactics trainer web app in 2015.[101]
Chessbase took their chess game database online in 1998.[102] Another early chess game databases was Chess Lab, which started in 1999.[103]New In Chess had initially tried to compete withChessbase by releasing a NICBase program forWindows 3.x, but eventually, decided to give up on software, and instead focus on their online database starting in 2002.[104]
One could play against the engineShredder online from 2006.[105] In 2015,Chessbase added a play Fritz web app,[106] as well as My Games for storing one's games.[107]
Starting in 2007,Chess.com offered the content of the training program, Chess Mentor, to their customers online.[108] Top GMs such asSam Shankland andWalter Browne have contributed lessons.
The introduction of artificial intelligence transformed the game of chess, particularly at the elite levels. AI greatly influenced defensive strategies. It has the capacity to compute every potential move without concern, unlike human players who are bound to emotional and psychological impacts from factors such as stress or tiredness. As a result, many positions once considered not defensible are now recognized as defensible.
After studying millions of games, chess engines made new analysis and improved the existing theories of opening. These improvements led to the creation of new ideas and changed the way players think throughout all parts of the game.[109]In classical chess, elite players commonly initiate games by making 10 to 15 opening moves that align with established analyses or leading engine recommendations.[110]
Unlike traditional over-the-board tournaments where handheld metal detectors are employed in order to counter players attempts at using electronic assistance, fair-play monitoring in online chess is much more challenging.
During the 2020 European Online Chess Championship, which saw a record participation of nearly 4000 players over 80 participants were disqualified for cheating—most from beginner and youth categories.[111] The event underscored the growing need for advanced detection methods in online competitions.
In response to these issues, chess platforms such asChess.com developed AI-based statistical models which track improbable moves by a player and compare them to moves that could be made by an engine. Expert examination is conducted for all suspected cases, and the findings are published on a regular basis. FIDE introduced AI behavior-tracking technology to strengthenanti-cheating measures in online events.[112]
AI-based detection systems use a combination of machine learning to track suspicious player actions in different games. This is done by measuring discrepancies between the real moves and the predicted moves derived from the available statistics. Players of unusually high skill level or unusual strategies that can imitate moves characteristic of automated chess systems. Each case is examined by a human expert to ensure that the decision is correct before any actions are made to guarantee fairness and accuracy.[112]
The Maia Chess project was began in 2020 by theUniversity of Toronto,Cornell University, andMicrosoft Research. Maia Chess is aneural network constructed to impersonate a human’s manner of playing chess based on skill. Each Maia models was tested on 9 sets of 500,000 positions each, covering rating levels from 1100 to 1900. They perform best when predicting moves made by players at their targeted rating level, with lower Maias accurately predicting moves from lower-rated players (around 1100) and higher Maias doing the same for higher-rated players (around 1900). The primary goal of Maia is to develop an AI chess engine that imitates human decision-making rather than focusing on optimal moves. Through personalization across different skill levels, Maia is able to simulate game styles typical for each level more accurately.[113][114]
While considered something done more for entertainment than for serious play, people have discovered thatlarge language models (LLMs) of the type created in 2018 and beyond such asGPT-3 can be prompted into producing chess moves given proper language prompts. While inefficient compared to native chess engines, the fact that LLMs can track the board state at all beyond the opening rather than simply recite chess-like phrases in adreamlike state was considered greatly surprising. LLM play has a number of quirks compared to engine play; for example, engines don't generally "care" how a board state was arrived at. However, LLMs seem to produce different quality moves for a chess position reached via strong play compared to the same board state produced via a set of strange preceding moves (which will generally produce weaker and more random moves).[115]
{{cite web}}
: CS1 maint: bot: original URL status unknown (link){{cite web}}
: CS1 maint: archived copy as title (link) This article incorporatestext by Chess Programming Wiki available under theCC BY-SA 3.0 license.