Extensive-form game

From Wikipedia, the free encyclopedia

Wide-ranging representation of a game in game theory

In game theory, an extensive-form game is a specification of a game allowing for the explicit representation of a number of key aspects, like the sequencing of players' possible moves, their choices at every decision point, the (possibly imperfect) information each player has about the other player's moves when they make a decision, and their payoffs for all possible game outcomes. Extensive-form games also allow for the representation of incomplete information in the form of chance events modeled as "moves by nature". Extensive-form representations differ from normal-form in that they provide a more complete description of the game in question, whereas normal-form simply boils down the game into a payoff matrix.

Finite extensive-form games

Some authors, particularly in introductory textbooks, initially define the extensive-form game as being just a game tree with payoffs (no imperfect or incomplete information), and add the other elements in subsequent chapters as refinements. Whereas the rest of this article follows this gentle approach with motivating examples, we present upfront the finite extensive-form games as (ultimately) constructed here. This general definition was introduced by Harold W. Kuhn in 1953, who extended an earlier definition of von Neumann from 1928. Following the presentation from Hart (1992), an n-player extensive-form game thus consists of the following:

A finite set of n (rational) players
A rooted tree, called the game tree
Each terminal (leaf) node of the game tree has an n-tuple of payoffs, meaning there is one payoff for each player at the end of every possible play
A partition of the non-terminal nodes of the game tree into n+1 subsets, one for each (rational) player, and with a special subset for a fictitious player called Chance (or Nature). Each player's subset of nodes is referred to as the "nodes of the player". (A game of complete information thus has an empty set of Chance nodes.)
Each node of the Chance player has a probability distribution over its outgoing edges.
Each set of nodes of a rational player is further partitioned into information sets, which make certain choices indistinguishable for the player when making a move, in the sense that:
- there is a one-to-one correspondence between the outgoing edges of any two nodes of the same information set, so the set of all outgoing edges of an information set is partitioned into equivalence classes, each class representing a possible choice for a player's move at some point, and
- every (directed) path in the tree from the root to a terminal node can cross each information set at most once
The complete description of the game specified by the above parameters is common knowledge among the players

A play is thus a path through the tree from the root to a terminal node. At any given non-terminal node belonging to Chance, an outgoing branch is chosen according to the probability distribution. At any rational player's node, the player must choose one of the equivalence classes for the edges, which determines precisely one outgoing edge, except that (in general) the player doesn't know which one is being followed. (An outside observer knowing every other player's choices up to that point, and the realization of Nature's moves, can determine the edge precisely.) A pure strategy for a player thus consists of a selection: choosing precisely one class of outgoing edges for each of that player's information sets. In a game of perfect information, the information sets are singletons. It's less evident how payoffs should be interpreted in games with Chance nodes. It is assumed that each player has a von Neumann–Morgenstern utility function defined for every game outcome; this assumption entails that every rational player will evaluate an a priori random outcome by its expected utility.
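The definition of a pure strategy as one choice per information set can be illustrated with a short sketch. The dictionary encoding and the labels `h1`, `h2`, `a`, `b` are assumptions made for this illustration and are not part of the formal definition.

```python
from itertools import product

def pure_strategies(info_sets):
    """Enumerate pure strategies: one action chosen for every information set.

    `info_sets` maps a (hypothetical) information-set label to the list of
    action classes available there.
    """
    labels = sorted(info_sets)
    choices = product(*(info_sets[label] for label in labels))
    return [dict(zip(labels, choice)) for choice in choices]

# A player with two singleton information sets (they can tell the two nodes apart).
two_singletons = {"h1": ["a", "b"], "h2": ["a", "b"]}

# The same two nodes merged into one information set (they cannot tell them apart).
one_merged_set = {"h1_or_h2": ["a", "b"]}

print(len(pure_strategies(two_singletons)))  # 4 pure strategies
print(len(pure_strategies(one_merged_set)))  # 2 pure strategies
```

Merging nodes into a single information set shrinks the strategy set, because the player can no longer condition the choice on which node was reached.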

The above presentation, while precisely defining the mathematical structure over which the game is played, elides however the more technical discussion of formalizing statements about how the game is played like "a player cannot distinguish between nodes in the same information set when making a decision". These can be made precise using epistemic modal logic; see Shoham & Leyton-Brown (2009, chpt. 13) for details.

A perfect information two-player game over a game tree (as defined in combinatorial game theory and artificial intelligence) can be represented as an extensive form game with outcomes (i.e. win, lose, or draw). Examples of such games include tic-tac-toe, chess, and infinite chess.[1][2] A game over an expectiminimax tree, like that of backgammon, has no imperfect information (all information sets are singletons) but has moves of chance. For example, poker has both moves of chance (the cards being dealt) and imperfect information (the cards secretly held by other players). (Binmore 2007, chpt. 2)

Perfect and complete information

A complete extensive-form representation specifies:

the players of a game
for every player, every opportunity they have to move
what each player can do at each of their moves
what each player knows for every move
the payoffs received by every player for every possible combination of moves

The game on the right has two players: 1 and 2. The number by every non-terminal node indicates to which player that decision node belongs. The numbers by every terminal node represent the payoffs to the players (e.g. 2,1 represents a payoff of 2 to player 1 and a payoff of 1 to player 2). The label by every edge of the graph is the name of the action that edge represents.

The initial node belongs to player 1, indicating that player 1 moves first. Play according to the tree is as follows: player 1 chooses between U and D; player 2 observes player 1's choice and then chooses between U' and D'. The payoffs are as specified in the tree. There are four outcomes represented by the four terminal nodes of the tree: (U,U'), (U,D'), (D,U') and (D,D'). The payoffs associated with each outcome respectively are as follows: (0,0), (2,1), (1,2) and (3,1).

If player 1 plays D, player 2 will play U' to maximise their payoff and so player 1 will only receive 1. However, if player 1 plays U, player 2 maximises their payoff by playing D' and player 1 receives 2. Player 1 prefers 2 to 1 and so will play U and player 2 will play D'. This is the subgame perfect equilibrium.
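The reasoning above is backward induction, and it can be sketched in a few lines. The nested-dictionary tree encoding and the function name `backward_induction` are assumptions made for this illustration; the payoffs are those of the example game. Ties, which do not occur here, would be broken in favour of the first listed action.

```python
def backward_induction(node, history=()):
    """Return (payoffs, plan) for the subtree rooted at `node`.

    A terminal node is a tuple of payoffs; a decision node is a dict with the
    index of the moving player and a mapping from action labels to children.
    `plan` maps each history of actions to the action chosen after it.
    """
    if isinstance(node, tuple):  # terminal node: nothing left to decide
        return node, {}
    plan = {}
    best_action, best_payoffs = None, None
    mover = node["player"]
    for action, child in node["moves"].items():
        payoffs, sub_plan = backward_induction(child, history + (action,))
        plan.update(sub_plan)
        # The mover keeps the action giving them the strictly highest payoff.
        if best_payoffs is None or payoffs[mover] > best_payoffs[mover]:
            best_action, best_payoffs = action, payoffs
    plan[history] = best_action
    return best_payoffs, plan

# Example game: player 1 (index 0) moves first, player 2 (index 1) observes and replies.
game = {"player": 0, "moves": {
    "U": {"player": 1, "moves": {"U'": (0, 0), "D'": (2, 1)}},
    "D": {"player": 1, "moves": {"U'": (1, 2), "D'": (3, 1)}},
}}

payoffs, plan = backward_induction(game)
print(payoffs)        # (2, 1)
print(plan[()])       # U
print(plan[("U",)])   # D'
print(plan[("D",)])   # U'
```

The plan records player 2's reply after both U and D, not only along the path actually played, which is what makes the result a strategy profile for every subgame.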

Imperfect information

An advantage of representing the game in this way is that it is clear what the order of play is. The tree shows clearly that player 1 moves first and player 2 observes this move. However, in some games play does not occur like this. One player does not always observe the choice of another (for example, moves may be simultaneous or a move may be hidden). An information set is a set of decision nodes such that:

Every node in the set belongs to one player.
When the game reaches the information set, the player who is about to move cannot differentiate between nodes within the information set; i.e. if the information set contains more than one node, the player to whom that set belongs does not know which node in the set has been reached.

In extensive form, an information set is indicated by a dotted line connecting all nodes in that set or sometimes by a loop drawn around all the nodes in that set.

A game with imperfect information represented in extensive form

If a game has an information set with more than one member, that game is said to have imperfect information. A game with perfect information is such that at any stage of the game, every player knows exactly what has taken place earlier in the game; i.e. every information set is a singleton set.[1][2] Any game without perfect information has imperfect information.

The game on the right is the same as the above game except that player 2 does not know what player 1 does when they come to play. The first game described has perfect information; the game on the right does not. If both players are rational and both know that both players are rational and everything that is known by any player is known to be known by every player (i.e. player 1 knows player 2 knows that player 1 is rational and player 2 knows this, etc. ad infinitum), play in the first game will be as follows: player 1 knows that if they play U, player 2 will play D' (because for player 2 a payoff of 1 is preferable to a payoff of 0) and so player 1 will receive 2. However, if player 1 plays D, player 2 will play U' (because to player 2 a payoff of 2 is better than a payoff of 1) and player 1 will receive 1. Hence, in the first game, the equilibrium will be (U,D') because player 1 prefers to receive 2 to 1 and so will play U and so player 2 will play D'.

In the second game it is less clear: player 2 cannot observe player 1's move. Player 1 would like to fool player 2 into thinking they have played U when they have actually played D so that player 2 will play D' and player 1 will receive 3. In fact in the second game there is a perfect Bayesian equilibrium where player 1 plays D and player 2 plays U' and player 2 holds the belief that player 1 will definitely play D. In this equilibrium, every strategy is rational given the beliefs held and every belief is consistent with the strategies played. Notice how the imperfection of information changes the outcome of the game.

To more easily solve this game for the Nash equilibrium,[3] it can be converted to the normal form.[4] Because player 2 moves without observing player 1's choice, the game can be treated as a simultaneous game in which player 1 and player 2 each have two strategies.[5]

Player 1's strategies: {U, D}
Player 2's strategies: {U', D'}

Player 1 \ Player 2	Up' (U')	Down' (D')
Up (U)	(0,0)	(2,1)
Down (D)	(1,2)	(3,1)

This gives a two-by-two matrix with one payoff pair for each combination of moves. Using the normal-form game, it is now possible to solve the game by identifying each player's best response to every strategy of the other player (and hence any dominant strategy).

If player 1 plays Up (U), player 2 prefers to play Down' (D') (payoff 1 > 0)
If player 1 plays Down (D), player 2 prefers to play Up' (U') (payoff 2 > 1)
If player 2 plays Up' (U'), player 1 prefers to play Down (D) (payoff 1 > 0)
If player 2 plays Down' (D'), player 1 prefers to play Down (D) (payoff 3 > 2)

These preferences can be marked within the matrix, and any cell that is a best response for both players is a Nash equilibrium. This particular game has a single solution, (D,U'), with a payoff of (1,2).
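The cell-marking procedure can be sketched as a search over the payoff matrix for cells in which neither player gains by deviating alone. The function name `pure_nash_equilibria` and the dictionary layout are assumptions made for this illustration; the payoffs are those of the matrix above.

```python
def pure_nash_equilibria(payoffs, rows, cols):
    """Return every cell that is a mutual best response in pure strategies."""
    equilibria = []
    for r in rows:
        for c in cols:
            u1, u2 = payoffs[(r, c)]
            # Player 1 compares rows while player 2's column stays fixed.
            row_is_best = all(u1 >= payoffs[(other, c)][0] for other in rows)
            # Player 2 compares columns while player 1's row stays fixed.
            col_is_best = all(u2 >= payoffs[(r, other)][1] for other in cols)
            if row_is_best and col_is_best:
                equilibria.append((r, c))
    return equilibria

rows = ["U", "D"]
cols = ["U'", "D'"]
payoffs = {
    ("U", "U'"): (0, 0), ("U", "D'"): (2, 1),
    ("D", "U'"): (1, 2), ("D", "D'"): (3, 1),
}

print(pure_nash_equilibria(payoffs, rows, cols))  # [('D', "U'")]
```

This sketch only looks for pure-strategy equilibria; it does not search for mixed-strategy equilibria.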

In games with infinite action spaces and imperfect information, non-singleton information sets are represented, if necessary, by inserting a dotted line connecting the (non-nodal) endpoints behind the arc described in the section on infinite action spaces, or by dashing the arc itself. In the Stackelberg competition described in that section, if the second player had not observed the first player's move, the game would no longer fit the Stackelberg model; it would be Cournot competition.

Incomplete information

It may be the case that a player does not know exactly what the payoffs of the game are or of what type their opponents are. This sort of game has incomplete information. In extensive form it is represented as a game with complete but imperfect information using the so-called Harsanyi transformation. This transformation introduces to the game the notion of nature's choice or God's choice. Consider a game consisting of an employer considering whether to hire a job applicant. The job applicant's ability might be one of two things: high or low. Their ability level is random; they either have low ability with probability 1/3 or high ability with probability 2/3. In this case, it is convenient to model nature as another player of sorts who chooses the applicant's ability according to those probabilities. Nature however does not have any payoffs. Nature's choice is represented in the game tree by a non-filled node. Edges coming from a nature's choice node are labelled with the probability of the event it represents occurring.
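As a minimal sketch of how a move by nature enters a player's evaluation, the employer's decision can be scored by expected utility over nature's choice. The probabilities 1/3 and 2/3 come from the example above; the employer's payoffs and the action names `hire` and `reject` are hypothetical numbers chosen only for illustration, since the text does not specify them.

```python
from fractions import Fraction

# Nature's move: probabilities taken from the example in the text.
nature = {"low": Fraction(1, 3), "high": Fraction(2, 3)}

# HYPOTHETICAL payoffs to the employer (not given in the text).
employer_payoff = {
    ("hire", "high"): 3, ("hire", "low"): -3,
    ("reject", "high"): 0, ("reject", "low"): 0,
}

def expected_payoff(action):
    """Expected utility of an action when the employer cannot observe ability."""
    return sum(p * employer_payoff[(action, ability)] for ability, p in nature.items())

best = max(["hire", "reject"], key=expected_payoff)
print(expected_payoff("hire"), expected_payoff("reject"), best)  # 1 0 hire
```

Because the employer's two nodes lie in one information set, the same action has to be evaluated against both of nature's choices at once, which is what the weighted sum does.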

A game with incomplete and imperfect information represented in extensive form

The game on the left is one of complete information (all the players and payoffs are known to everyone) but of imperfect information (the employer doesn't know what nature's move was). The initial node is in the centre and it is not filled, so nature moves first. Nature selects with the same probability the type of player 1 (which in this game is tantamount to selecting the payoffs in the subgame played), either t1 or t2. Player 1 has distinct information sets for these; i.e. player 1 knows what type they are (this need not be the case). However, player 2 does not observe nature's choice. They do not know the type of player 1; however, in this game they do observe player 1's actions; i.e. there is perfect information. Indeed, it is now appropriate to alter the above definition of perfect information: at every stage in the game, every player knows what has been played by the other players. In the case of private information, every player knows what has been played by nature. Information sets are represented as before by broken lines.

In this game, if nature selects t1 as player 1's type, the game played will be like the very first game described, except that player 2 does not know it (and the very fact that this cuts through their information sets disqualifies it from subgame status). There is one separating perfect Bayesian equilibrium; i.e. an equilibrium in which different types do different things.

If both types play the same action (pooling), an equilibrium cannot be sustained. If both play D, player 2 can only form the belief that they are on either node in the information set with probability 1/2 (because this is the chance of seeing either type). Player 2 maximises their payoff by playing D'. However, if they play D', type 2 would prefer to play U. This cannot be an equilibrium. If both types play U, player 2 again forms the belief that they are at either node with probability 1/2. In this case player 2 plays D', but then type 1 prefers to play D.

If type 1 plays U and type 2 plays D, player 2 will play D' whatever action they observe, but then type 1 prefers D. The only equilibrium hence is with type 1 playing D, type 2 playing U and player 2 playing U' if they observe D and randomising if they observe U. Through their actions, player 1 has signalled their type to player 2.

Formal definition

Formally, a finite game in extensive form is a structure $\Gamma =\langle {\mathcal {K}},\mathbf {H} ,[(\mathbf {H} _{i})_{i\in {\mathcal {I}}}],\{A(H)\}_{H\in \mathbf {H} },a,\rho ,u\rangle$ where:

${\mathcal {K}}=\langle V,v^{0},T,p\rangle$ is a finite tree with a set of nodes $V$, a unique initial node $v^{0}\in V$, a set of terminal nodes $T\subset V$ (let $D=V\setminus T$ be the set of decision nodes) and an immediate predecessor function $p:V\rightarrow D$ on which the rules of the game are represented,
$\mathbf {H}$ is a partition of $D$ called an information partition,
$A(H)$ is a set of actions available for each information set $H\in \mathbf {H}$ which forms a partition on the set of all actions ${\mathcal {A}}$,
$a:V\setminus \{v^{0}\}\rightarrow {\mathcal {A}}$ is an action partition associating each node $v$ to a single action $a(v)$, fulfilling:

$\forall H\in \mathbf {H} ,\forall v\in H$, the restriction $a_{v}:s(v)\rightarrow A(H)$ of $a$ on $s(v)$ is a bijection, with $s(v)$ the set of successor nodes of $v$.

${\mathcal {I}}=\{1,...,I\}$ is a finite set of players, $0$ is (a special player called) nature, and $(\mathbf {H} _{i})_{i\in {\mathcal {I}}\cup \{0\}}$ is a player partition of the information sets in $\mathbf {H}$. Let $\iota (v)=\iota (H)$ be the single player that makes a move at node $v\in H$,
$\rho =\{\rho _{H}:A(H)\rightarrow [0,1]|H\in \mathbf {H} _{0}\}$ is a family of probabilities of the actions of nature, and
$u=(u_{i})_{i\in {\mathcal {I}}}:T\rightarrow \mathbb {R} ^{\mathcal {I}}$ is a payoff profile function.
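The bijection condition on $a$ can be checked mechanically on a small tree. The sketch below encodes the imperfect-information example from earlier in the article; the node names (`v0`, `v1`, `v2`, `t1` to `t4`), the information-set labels, and the function name `labels_are_bijective` are assumptions made for this illustration.

```python
def labels_are_bijective(successors, action_of, info_sets, actions):
    """Check that, at every node v of every information set H, the action
    labels on the successors of v match A(H) exactly, with no label repeated."""
    for H, nodes in info_sets.items():
        for v in nodes:
            labels = [action_of[s] for s in successors[v]]
            no_repeats = len(labels) == len(set(labels))
            covers_all = set(labels) == set(actions[H])
            if not (no_repeats and covers_all):
                return False
    return True

# s(v): successor nodes of each decision node.
successors = {"v0": ["v1", "v2"], "v1": ["t1", "t2"], "v2": ["t3", "t4"]}
# a(v): the action leading into each non-initial node.
action_of = {"v1": "U", "v2": "D", "t1": "U'", "t2": "D'", "t3": "U'", "t4": "D'"}
# Information partition: player 2 cannot distinguish v1 from v2.
info_sets = {"H1": ["v0"], "H2": ["v1", "v2"]}
# A(H): actions available at each information set.
actions = {"H1": ["U", "D"], "H2": ["U'", "D'"]}

print(labels_are_bijective(successors, action_of, info_sets, actions))  # True

# Relabelling one edge so that v2 offers U' twice violates the condition.
broken = dict(action_of, t4="U'")
print(labels_are_bijective(successors, broken, info_sets, actions))     # False
```

The check is what guarantees that every node in an information set offers the same menu of actions, so a single choice per information set is well defined.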

Infinite action space

It may be that a player has an infinite number of possible actions to choose from at a particular decision node. The device used to represent this is an arc joining two edges protruding from the decision node in question. If the action space is a continuum between two numbers, the lower and upper delimiting numbers are placed at the bottom and top of the arc respectively, usually with a variable that is used to express the payoffs. The infinite number of decision nodes that could result are represented by a single node placed in the centre of the arc. A similar device is used to represent action spaces that, whilst not infinite, are large enough to prove impractical to represent with an edge for each action.

A game with infinite action spaces represented in extensive form

The tree on the left represents such a game, either with infinite action spaces (any real number between 0 and 5000) or with very large action spaces (perhaps any integer between 0 and 5000). This would be specified elsewhere. Here, it will be supposed that it is the former and, for concreteness, it will be supposed it represents two firms engaged in Stackelberg competition. The payoffs to the firms are represented on the left, with $q_{1}$ and $q_{2}$ as the strategies they adopt and $c_{1}$ and $c_{2}$ as some constants (here marginal costs to each firm). The subgame perfect Nash equilibria of this game can be found by taking the first partial derivative of the follower's (firm 2) payoff function with respect to its strategy variable ($q_{2}$), setting it to zero, and solving for its best response function, $q_{2}(q_{1})={\tfrac {5000-q_{1}-c_{2}}{2}}$. The same process can be done for the leader except that in calculating its profit, it knows that firm 2 will play the above response and so this can be substituted into its maximisation problem. It can then solve for $q_{1}$ by taking the first derivative, yielding $q_{1}^{*}={\tfrac {5000+c_{2}-2c_{1}}{2}}$. Feeding this into firm 2's best response function gives $q_{2}^{*}={\tfrac {5000+2c_{1}-3c_{2}}{4}}$, and $(q_{1}^{*},q_{2}^{*})$ is the subgame perfect Nash equilibrium.
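The closed-form quantities can be checked numerically. The payoff functions appear only in the figure, so the sketch below assumes a linear inverse demand, price = 5000 - q1 - q2, with profit q_i (price - c_i); this assumption is consistent with the best response function quoted above but is not stated in the text. The marginal costs c1 = c2 = 1000 are hypothetical values chosen for the check.

```python
from fractions import Fraction

A = 5000  # demand intercept, read off the best response formula in the text

def follower_best_response(q1, c2):
    """Firm 2's best response q2(q1) = (5000 - q1 - c2) / 2, floored at zero."""
    return max(Fraction(0), Fraction(A - q1 - c2, 2))

def leader_profit(q1, c1, c2):
    """Firm 1's profit when it anticipates firm 2's best response."""
    q2 = follower_best_response(q1, c2)
    price = A - q1 - q2  # ASSUMPTION: linear inverse demand
    return q1 * (price - c1)

c1, c2 = 1000, 1000  # HYPOTHETICAL marginal costs

# Closed-form equilibrium quantities from the text.
q1_closed = Fraction(A + c2 - 2 * c1, 2)
q2_closed = Fraction(A + 2 * c1 - 3 * c2, 4)

# Brute-force search over integer quantities as an independent check.
q1_grid = max(range(0, A + 1), key=lambda q: leader_profit(q, c1, c2))

print(q1_closed, q2_closed)                         # 2000 1000
print(q1_grid, follower_best_response(q1_grid, c2)) # 2000 1000
```

With these costs the grid search and the closed-form expressions agree, which matches solving the leader's problem after substituting the follower's response.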

References

[1] Khomskii, Yurii (2010). Infinite Games (section 1.1). https://www.math.uni-hamburg/Infinite Games
[2] "Infinite Chess". PBS Infinite Series. Perfect information defined at 0:25, with academic sources arXiv:1302.4377 and arXiv:1510.08155.
[3] Watson, Joel (2013-05-09). Strategy: An Introduction to Game Theory. pp. 97–100. ISBN 978-0-393-91838-0. OCLC 1123193808.
[4] Watson, Joel (2013-05-09). Strategy: An Introduction to Game Theory. pp. 26–28. ISBN 978-0-393-91838-0. OCLC 1123193808.
[5] Watson, Joel (2013-05-09). Strategy: An Introduction to Game Theory. pp. 22–26. ISBN 978-0-393-91838-0. OCLC 1123193808.

Hart, Sergiu (1992). "Games in extensive and strategic forms". In Aumann, Robert; Hart, Sergiu (eds.). Handbook of Game Theory with Economic Applications. Vol. 1. Elsevier. ISBN 978-0-444-88098-7.
Binmore, Kenneth (2007). Playing for Real: A Text on Game Theory. Oxford University Press US. ISBN 978-0-19-530057-4.
Dresher, M. (1961). The Mathematics of Games of Strategy: Theory and Applications (Ch. 4: Games in extensive form, pp. 74–78). Rand Corp. ISBN 0-486-64216-X.
Fudenberg, D.; Tirole, J. (1991). Game Theory (Ch. 3: Extensive form games, pp. 67–106). MIT Press. ISBN 0-262-06141-4.
Leyton-Brown, Kevin; Shoham, Yoav (2008). Essentials of Game Theory: A Concise, Multidisciplinary Introduction. San Rafael, CA: Morgan & Claypool Publishers. ISBN 978-1-59829-593-1. An 88-page mathematical introduction; see Chapters 4 and 5. Free online (archived 2000-08-15 at the Wayback Machine) at many universities.
Luce, R. D.; Raiffa, H. (1957). Games and Decisions: Introduction and Critical Survey (Ch. 3: Extensive and Normal Forms, pp. 39–55). Wiley, New York. ISBN 0-486-65943-7.
Osborne, M. J.; Rubinstein, A. (1994). A Course in Game Theory (Ch. 6: Extensive game with perfect information, pp. 89–115). MIT Press. ISBN 0-262-65040-1.
Shoham, Yoav; Leyton-Brown, Kevin (2009). Multiagent Systems: Algorithmic, Game-Theoretic, and Logical Foundations. New York: Cambridge University Press. ISBN 978-0-521-89943-7. A comprehensive reference from a computational perspective; see Chapter 5. Downloadable free online.

Neumann, J. (1928). "Zur Theorie der Gesellschaftsspiele". Mathematische Annalen. 100: 295–320. doi:10.1007/BF01448847. S2CID 122961988.
Kuhn, Harold William (2003). Lectures on the Theory of Games. Princeton University Press. ISBN 978-0-691-02772-2. Contains Kuhn's lectures at Princeton from 1952 (officially unpublished previously, but in circulation as photocopies).
