Movatterモバイル変換

[0]ホーム

Jump to content

Best response

Edit links

From Wikipedia, the free encyclopedia

Concept in game theory

"Reaction function" redirects here. For the economic concept, seemonetary policy reaction function.

Ingame theory, thebest response is thestrategy (or strategies) which produces the most favorableoutcome for a player, taking other players' strategies as given.^[1] The concept of a best response is central toJohn Nash's best-known contribution, theNash equilibrium, the point at which each player in a game has selected the best response (or one of the best responses) to the other players' strategies.^[2]

Correspondence

[edit]

Reactioncorrespondences, also known as best response correspondences, are used in the proof of the existence ofmixed strategy Nash equilibria.^[3]^[4] Reaction correspondences are not "reaction functions" sincefunctions must only have one value per argument, and many reaction correspondences will be undefined, i.e., a vertical line, for some opponent strategy choice. One constructs a correspondenceb(·), for each player from the set of opponent strategy profiles into the set of the player's strategies. So, for any given set of opponent's strategiesσ_−i,b_i(σ_−i) represents playeri's best responses toσ_−i.

Figure 2. Reaction correspondence for player X in the Stag Hunt game.

Response correspondences for all2 × 2normal form games can be drawn with aline for each player in aunit square strategyspace. Figures 1 to 3 graphs the best response correspondences for thestag hunt game. The dotted line in Figure 1 shows theoptimal probability that player Y plays 'Stag' (in they-axis), as a function of the probability that player X plays Stag (shown in thex-axis). In Figure 2 the dotted line shows the optimal probability that player X plays 'Stag' (shown in thex-axis), as a function of the probability that player Y plays Stag (shown in they-axis). Note that Figure 2 plots theindependent andresponse variables in the opposite axes to those normally used, so that it may be superimposed onto the previous graph, to show theNash equilibria at the points where the two player's best responses agree in Figure 3.

There are three distinctive reaction correspondence shapes, one for each of the three types ofsymmetric2 × 2 games: coordination games, discoordination games, and games with dominated strategies(the trivial fourth case in which payoffs are always equal for both moves is not really a game theoretical problem). Any payoff symmetric2 × 2 game will take one of these three forms.

Coordination games

[edit]

Figure 3. Reaction correspondence for both players in the Stag Hunt game. Nash equilibria shown with points, where the two player's correspondences agree, i.e. cross

Games in which players score highest when both players choose the same strategy, such as thestag hunt andbattle of the sexes, are calledcoordination games. These games have reaction correspondences of the same shape as Figure 3, where there is one Nash equilibrium in the bottom left corner, another in the top right, and a mixing Nash somewhere along the diagonal between the other two.

Anti-coordination games

[edit]

Figure 4. Reaction correspondence for both players in the hawk-dove game. Nash equilibria shown with points, where the two player's correspondences agree, i.e. cross

Games such as thegame of chicken andhawk-dove game in which players score highest when they choose opposite strategies, i.e., discoordinate, are called anti-coordination games. They have reaction correspondences (Figure 4) that cross in the opposite direction to coordination games, with three Nash equilibria, one in each of the top left and bottom right corners, where one player chooses one strategy, the other player chooses the opposite strategy. The third Nash equilibrium is amixed strategy which lies along the diagonal from the bottom left to top right corners. If the players do not know which one of them is which, then the mixed Nash is anevolutionarily stable strategy (ESS), as play is confined to the bottom left to top right diagonal line. Otherwise anuncorrelated asymmetry is said to exist, and the corner Nash equilibria areESSes.

Games with dominated strategies

[edit]

Figure 5. Reaction correspondence for a game with a dominated strategy.

Games withdominated strategies have reaction correspondences which only cross at one point, which will be in either the bottom left, or top right corner in payoff symmetric2 × 2 games. For instance, in the single-playprisoner's dilemma, the "Cooperate" move is not optimal for any probability of opponent Cooperation. Figure 5 shows the reaction correspondence for such a game, where the dimensions are "Probability play Cooperate", the Nash equilibrium is in the lower left corner where neither player plays Cooperate. If the dimensions were defined as "Probability play Defect", then both players best response curves would be 1 for all opponent strategy probabilities and the reaction correspondences would cross (and form a Nash equilibrium) at the top right corner.

Other (payoff asymmetric) games

[edit]

A wider range of reaction correspondences shapes is possible in2 × 2 games with payoff asymmetries. For each player there are five possible best response shapes, shown in Figure 6. From left to right these are: dominated strategy (always play 2), dominated strategy (always play 1), rising (play strategy 2 if probability that the other player plays 2 is above threshold), falling (play strategy 1 if probability that the other player plays 2 is above threshold), and indifferent (both strategies play equally well under all conditions).

Figure 6 - The five possible reaction correspondences for a player in a2 × 2 game. The axes are assumed to show the probability that the player plays their strategy 1. From left to right: A) Always play 2, strategy 1 is dominated, B) Always play 1, strategy 2 is dominated, C) Strategy 1 best when opponent plays his strategy 1 and 2 best when opponent plays his 2, D) Strategy 1 best when opponent plays his strategy 2 and 2 best when opponent plays his 1, E) Both strategies play equally well no matter what the opponent plays.

While there are only four possible types of payoff symmetric2 × 2 games (of which one is trivial), the five different best response curves per player allow for a larger number of payoff asymmetric game types. Many of these are not truly different from each other. The dimensions may be redefined (exchange names of strategies 1 and 2) to produce symmetrical games which are logically identical.

Matching pennies

[edit]

One well-known game with payoff asymmetries is thematching pennies game. In this game one player, the row player (graphed on the y dimension) wins if the players coordinate (both choose heads or both choose tails) while the other player, the column player (shown in thex-axis) wins if the players discoordinate. Player Y's reaction correspondence is that of a coordination game, while that of player X is a discoordination game. The only Nash equilibrium is the combination of mixed strategies where both players independently choose heads and tails with probability 0.5 each.

Figure 7. Reaction correspondences for players in thematching pennies game. The leftmost mapping is for the coordinating player, the middle shows the mapping for the discoordinating player. The sole Nash equilibrium is shown in the right hand graph.

Dynamics

[edit]

Inevolutionary game theory,best response dynamics represents a class of strategy updating rules, where players strategies in the next round are determined by their best responses to some subset of the population. Some examples include:

In a large population model, players choose their next action probabilistically based on which strategies are best responses to the population as a whole.
In a spatial model, players choose (in the next round) the action that is the best response to all of their neighbors.^[5]

Importantly, in these models players only choose the best response on the next round that would give them the highest payoffon the next round. Players do not consider the effect that choosing a strategy on the next round would have on future play in the game. This constraint results in the dynamical rule often being calledmyopic best response.

In the theory ofpotential games,best response dynamics refers to a way of finding aNash equilibrium by computing the best response for every player:

Theorem—In any finite potential game, best response dynamics always converge to a Nash equilibrium.^[6]

Smoothed

[edit]

Instead of best response correspondences, some models usesmoothed best response functions. These functions are similar to the best response correspondence, except that the function does not "jump" from one pure strategy to another. The difference is illustrated in Figure 8, where black represents the best response correspondence and the other colors each represent different smoothed best response functions. In standard best response correspondences, even the slightest benefit to one action will result in the individual playing that action with probability 1. In smoothed best response as the difference between two actions decreases the individual's play approaches 50:50.

There are many functions that represent smoothed best response functions. The functions illustrated here are several variations on the following function:

${\frac {e^{E(1)/\gamma }}{e^{E(1)/\gamma }+e^{E(2)/\gamma }}}$

whereE(x) represents the expected payoff of actionx, andγ is a parameter that determines the degree to which the function deviates from the true best response (a largerγ implies that the player is more likely to make 'mistakes').

There are several advantages to using smoothed best response, both theoretical and empirical. First, it is consistent with psychological experiments; when individuals are roughly indifferent between two actions they appear to choose more or less at random. Second, the play of individuals is uniquely determined in all cases, since it is acorrespondence that is also afunction. Finally, using smoothed best response with some learning rules (as inFictitious play) can result in players learning to playmixed strategy Nash equilibria.^[7]

References

[edit]

^Fudenberg & Tirole (1991), p. 29;Gibbons (1992), pp. 33–49.
^Nash (1950).
^Fudenberg & Tirole (1991), Section 1.3.B.
^Osborne & Rubinstein (1994), Section 2.2.
^Ellison (1993).
^Nisan et al. (2007), Section 19.3.2.
^Fudenberg & Levine (1998).

Bibliography

[edit]

Ellison, G. (1993),"Learning, Local Interaction, and Coordination"(PDF),Econometrica,61 (5):1047–1071,doi:10.2307/2951493,JSTOR 2951493
Fudenberg, D.;Levine, David K. (1998),The Theory of Learning in Games, Cambridge, Massachusetts:MIT Press
Fudenberg, Drew;Tirole, Jean (1991),Game Theory, Cambridge, Massachusetts:MIT Press,ISBN 9780262061414Book preview.
Gibbons, R. (1992),A Primer in Game Theory, Harvester-Wheatsheaf,S2CID 10248389
Nash, John F. (1950), "Equilibrium points inn-person games",Proceedings of the National Academy of Sciences of the United States of America,36 (1):48–49,Bibcode:1950PNAS...36...48N,doi:10.1073/pnas.36.1.48,PMC 1063129,PMID 16588946
Nisan, N.; Roughgarden, T.; Tardos, É.; Vazirani, V. V. (2007),Algorithmic Game Theory(PDF), New York:Cambridge University Press
Osborne, M. J.;Rubinstein, Ariel (1994),A Course in Game Theory, Cambridge, Massachusetts:MIT Press
Young, H. P. (2005),Strategic Learning and Its Limits,Oxford University Press

Game theory

Traditionalgame theory

Definitions	Asynchrony Bayesian regret Best response Bounded rationality Cheap talk Coalition Complete contract Complete information Complete mixing Conjectural variation Contingent cooperator Coopetition Cooperative game theory Dynamic inconsistency Escalation of commitment Farsightedness Game semantics Hierarchy of beliefs Imperfect information Incomplete information Information set Move by nature Mutual knowledge Non-cooperative game theory Non-credible threat Outcome Perfect information Perfect recall Ply Preference Rationality Sequential game Simultaneous action selection Spite Strategic complements Strategic dominance Strategic form Strategic interaction Strategic move Strategy Subgame Succinct game Topological game Tragedy of the commons Uncorrelated asymmetry
Equilibrium concepts	Backward induction Bayes correlated equilibrium Bayesian efficiency Bayesian game Bayesian Nash equilibrium Berge equilibrium Bertrand–Edgeworth model Coalition-proof Nash equilibrium Core Correlated equilibrium Cursed equilibrium Edgeworth price cycle Epsilon-equilibrium Gibbs equilibrium Incomplete contracts Inequity aversion Individual rationality Iterated elimination of dominated strategies Markov perfect equilibrium Mertens-stable equilibrium Nash equilibrium Open-loop model Pareto efficiency Payoff dominance Perfect Bayesian equilibrium Price of anarchy Program equilibrium Proper equilibrium Quantal response equilibrium Quasi-perfect equilibrium Rational agent Rationalizability Rationalizable strategy Satisfaction equilibrium Self-confirming equilibrium Sequential equilibrium Shapley value Strong Nash equilibrium Subgame perfect equilibrium Trembling hand equilibrium
Strategies	Appeasement Bid shading Cheap talk Collusion Commitment device De-escalation Deterrence Escalation Fictitious play Focal point Grim trigger Hobbesian trap Markov strategy Max-dominated strategy Mixed strategy Pure strategy Tit for tat Win–stay, lose–switch
Games	All-pay auction Battle of the sexes Nash bargaining game Bertrand competition Blotto game Centipede game Coordination game Cournot competition Deadlock Dictator game Trust game Diner's dilemma Dollar auction El Farol Bar problem Electronic mail game Gift-exchange game Guess 2/3 of the average Keynesian beauty contest Kuhn poker Lewis signaling game Matching pennies Obligationes Optional prisoner's dilemma Pirate game Prisoner's dilemma Public goods game Rendezvous problem Rock paper scissors Stackelberg competition Stag hunt Traveler's dilemma Ultimatum game Volunteer's dilemma War of attrition
Theorems	Arrow's impossibility theorem Aumann's agreement theorem Brouwer fixed-point theorem Competitive altruism Folk theorem Gibbard–Satterthwaite theorem Gibbs lemma Glicksberg's theorem Kakutani fixed-point theorem Kuhn's theorem One-shot deviation principle Prim–Read theory Rational ignorance Rational irrationality Sperner's lemma Zermelo's theorem
Subfields	Algorithmic game theory Behavioral game theory Behavioral strategy Compositional game theory Confrontation analysis Contract theory Drama theory Graphical game theory Heresthetic Mean-field game theory Negotiation theory Quantum game theory Social software
Key people	Albert W. Tucker Alvin E. Roth Amos Tversky Antoine Augustin Cournot Ariel Rubinstein David Gale David K. Levine David M. Kreps Donald B. Gillies Drew Fudenberg Eric Maskin Harold W. Kuhn Herbert Simon Herbert Scarf Hervé Moulin Jean Tirole Jean-François Mertens Jennifer Tour Chayes Ken Binmore Kenneth Arrow Leonid Hurwicz Lloyd Shapley Martin Shubik Melvin Dresher Merrill M. Flood Olga Bondareva Oskar Morgenstern Paul Milgrom Peyton Young Reinhard Selten Robert Aumann Robert Axelrod Robert B. Wilson Roger Myerson Samuel Bowles Suzanne Scotchmer Thomas Schelling William Vickrey

Combinatorial game theory

Core concepts	Combinatorial explosion Determinacy Disjunctive sum First-player and second-player win Game complexity Game tree Impartial game Misère Partisan game Solved game Sprague–Grundy theorem Strategy-stealing argument Zugzwang
Games	Chess Chomp Clobber Cram Domineering Hackenbush Nim Notakto Subtract a square Sylver coinage Toads and Frogs
Mathematical tools	Mex Nimber On Numbers and Games Star Surreal number Winning Ways for Your Mathematical Plays
Search algorithms	Alpha–beta pruning Expectiminimax Minimax Monte Carlo tree search Negamax Paranoid algorithm Principal variation search
Key people	Claude Shannon John Conway John von Neumann

Evolutionary game theory

Core concepts	Bishop–Cannings theorem Evolution and the Theory of Games Evolutionarily stable set Evolutionarily stable state Evolutionarily stable strategy Replicator equation Risk dominance Stochastically stable equilibrium Weak evolutionarily stable strategy
Games	Chicken Stag hunt
Applications	Cultural group selection Fisher's principle Mobbing Terminal investment hypothesis
Key people	John Maynard Smith Robert Axelrod

Mechanism design

Core concepts	Algorithmic mechanism design Bayesian-optimal mechanism Incentive compatibility Market design Myerson ironing Monotonicity Participation constraint Revelation principle Strategyproofness Vickrey–Clarke–Groves mechanism Virtual valuation
Theorems	Myerson–Satterthwaite theorem Revenue equivalence Border's theorem
Applications	Digital goods auction Knapsack auction Truthful cake-cutting