Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Epsilon-equilibrium

From Wikipedia, the free encyclopedia
Situation where players have only a small incentive to change strategies
Epsilon-equilibrium
Solution concept ingame theory
Relationship
Superset ofNash Equilibrium
Significance
Used forstochastic games

Ingame theory, anepsilon-equilibrium, or near-Nash equilibrium, is astrategy profile that approximatelysatisfies the condition ofNash equilibrium. In a Nash equilibrium, no player has an incentive to change hisbehavior. In an approximate Nash equilibrium, this requirement is weakened to allow the possibility that aplayer may have a small incentive to do something different. This may still be considered an adequatesolution concept, assuming for examplestatus quo bias. This solution concept may be preferred to Nashequilibrium due to being easier to compute, or alternatively due to the possibility that in games of morethan 2 players, the probabilities involved in an exact Nash equilibrium need not berational numbers.[1]

Definition

[edit]

There is more than one alternative definition.

The standard definition

[edit]

Given a game and a real non-negative parameterε{\displaystyle \varepsilon }, astrategy profile is said to be anε{\displaystyle \varepsilon }-equilibrium if it is not possible for any player to gain more thanε{\displaystyle \varepsilon } inexpected payoff by unilaterally deviating from hisstrategy.[2]: 45  EveryNash Equilibrium is equivalent to anε{\displaystyle \varepsilon }-equilibrium whereε=0{\displaystyle \varepsilon =0}.

Formally, letG=(N,A=A1××AN,u:ARN){\displaystyle G=(N,A=A_{1}\times \dotsb \times A_{N},u\colon A\to R^{N})}be anN{\displaystyle N}-player game with action setsAi{\displaystyle A_{i}} for each playeri{\displaystyle i} and utility functionu{\displaystyle u}.Letui(s){\displaystyle u_{i}(s)} denote the payoff to playeri{\displaystyle i} whenstrategy profiles{\displaystyle s} is played.LetΔi{\displaystyle \Delta _{i}} be the space of probability distributions overAi{\displaystyle A_{i}}.A vector of strategiesσΔ=Δ1××ΔN{\displaystyle \sigma \in \Delta =\Delta _{1}\times \dotsb \times \Delta _{N}} is anε{\displaystyle \varepsilon }-Nash Equilibrium forG{\displaystyle G} if

ui(σ)ui(σi,σi)ε{\displaystyle u_{i}(\sigma )\geq u_{i}(\sigma _{i}^{'},\sigma _{-i})-\varepsilon } for allσiΔi,iN.{\displaystyle \sigma _{i}^{'}\in \Delta _{i},i\in N.}

Note that the utilities of all players are normalized to [0,1],[3] so this is actually amultiplicative approximation: the gain cannot be more thanε{\displaystyle \varepsilon } times the highest utility.

Well-supported approximate equilibrium

[edit]

The following definition[4]imposes the stronger requirement that a player may only assign positive probability to a pure strategya{\displaystyle a} if the payoff ofa{\displaystyle a} has expected payoff at mostε{\displaystyle \varepsilon } less than the best response payoff.Letxs{\displaystyle x_{s}} be the probability that strategy profiles{\displaystyle s} is played. For playerp{\displaystyle p} letSp{\displaystyle S_{-p}} be strategy profiles of players other thanp{\displaystyle p}; forsSp{\displaystyle s\in S_{-p}} and a pure strategyj{\displaystyle j} ofp{\displaystyle p} letjs{\displaystyle js} be the strategy profile wherep{\displaystyle p} playsj{\displaystyle j} and other players plays{\displaystyle s}.Letup(s){\displaystyle u_{p}(s)} be the payoff top{\displaystyle p} when strategy profiles{\displaystyle s} is used.The requirement can be expressed by the formula

sSpup(js)xs>ε+sSpup(js)xsxjp=0.{\displaystyle \sum _{s\in S_{-p}}u_{p}(js)x_{s}>\varepsilon +\sum _{s\in S_{-p}}u_{p}(j's)x_{s}\Longrightarrow x_{j'}^{p}=0.}

Results

[edit]

The existence of apolynomial-time approximation scheme (PTAS) for ε-Nash equilibria isequivalent to the question of whether there exists one for ε-well-supportedapproximate Nash equilibria,[5] but the existence of a PTAS remains an open problem.For constant values of ε, polynomial-time algorithms for approximate equilibriaare known for lower values of ε than are known for well-supportedapproximate equilibria.For games with payoffs in the range [0,1] and ε=0.3393, ε-Nash equilibria canbe computed in polynomial time.[6]For games with payoffs in the range [0,1] and ε=2/3, ε-well-supported equilibria canbe computed in polynomial time.[7]

Example

[edit]

The notion of ε-equilibria is important in the theory ofstochastic games of potentially infinite duration. There are simple examples of stochastic games with noNash equilibrium but with an ε-equilibrium for any ε strictly bigger than 0.

Perhaps the simplest such example is the following variant ofMatching Pennies, suggested by Everett. Player 1 hides a penny andPlayer 2 must guess if it is heads up or tails up. If Player 2 guesses correctly, hewins the penny from Player 1 and the game ends. If Player 2 incorrectly guesses that the penny is heads up,the game ends with payoff zero to both players. If he incorrectly guesses that it is tails up, the gamerepeats. If the play continues forever, the payoff to both players is zero.

Given a parameterε > 0, anystrategy profile where Player 2 guesses heads up withprobability ε and tails up with probability 1 − ε (at every stage of the game, and independentlyfrom previous stages) is anε-equilibrium for the game. The expected payoff of Player 2 insuch a strategy profile is at least 1 − ε. However, it is easy to see that there is nostrategy for Player 2 that can guarantee an expected payoff of exactly 1. Therefore, the gamehas noNash equilibrium.

Another simple example is the finitelyrepeated prisoner's dilemma for T periods, where the payoff is averaged over the T periods. The onlyNash equilibrium of this game is to choose Defect in each period. Now consider the two strategiestit-for-tat andgrim trigger. Although neithertit-for-tat norgrim trigger are Nash equilibria for the game, both of them areϵ{\displaystyle \epsilon }-equilibria for some positiveϵ{\displaystyle \epsilon }. The acceptable values ofϵ{\displaystyle \epsilon } depend on the payoffs of the constituent game and on the number T of periods.

In economics, the concept of apure strategyepsilon-equilibrium is used when the mixed-strategy approach is seen as unrealistic. In a pure-strategy epsilon-equilibrium, each player chooses a pure-strategy that is within epsilon of its best pure-strategy. For example, in theBertrand–Edgeworth model, where no pure-strategy equilibrium exists, a pure-strategy epsilon equilibrium may exist.

See also

[edit]

References

[edit]
Inline citations
  1. ^V. Bubelis (1979). "On equilibria in finite games".International Journal of Game Theory.8 (2):65–79.doi:10.1007/bf01768703.S2CID 122843303.
  2. ^Vazirani, Vijay V.;Nisan, Noam;Roughgarden, Tim;Tardos, Éva (2007).Algorithmic Game Theory(PDF). Cambridge, UK: Cambridge University Press.ISBN 0-521-87282-0.
  3. ^Tsaknakis, Haralampos; Spirakis, Paul G. (2007)."An Optimization Approach for Approximate Nash Equilibria". InDeng, Xiaotie; Graham, Fan Chung (eds.).Internet and Network Economics. Lecture Notes in Computer Science. Vol. 4858. Berlin, Heidelberg: Springer. pp. 42–56.doi:10.1007/978-3-540-77105-0_8.ISBN 978-3-540-77105-0.
  4. ^P.W. Goldberg andC.H. Papadimitriou (2006). "Reducibility Among Equilibrium Problems".38th Symposium on Theory of Computing. pp. 61–70.doi:10.1145/1132516.1132526.
  5. ^C. Daskalakis, P.W. Goldberg andC.H. Papadimitriou (2009). "The Complexity of Computing a Nash Equilibrium".SIAM Journal on Computing.39 (3):195–259.CiteSeerX 10.1.1.68.6111.doi:10.1137/070699652.
  6. ^H. Tsaknakis and Paul G. Spirakis (2008)."An optimisation approach for approximate Nash equilibria".Internet Mathematics.5 (4):365–382.doi:10.1080/15427951.2008.10129172.
  7. ^Spyros C. Kontogiannis and Paul G. Spirakis (2010). "Well Supported Approximate Equilibria in Bimatrix Games".Algorithmica.57 (4):653–667.doi:10.1007/s00453-008-9227-6.S2CID 15968419.
Sources
Traditionalgame theory
Definitions
Equilibrium
concepts
Strategies
Games
Theorems
Subfields
Key people
Core
concepts
Games
Mathematical
tools
Search
algorithms
Key people
Core
concepts
Games
Applications
Key people
Core
concepts
Theorems
Applications
Other topics
Retrieved from "https://en.wikipedia.org/w/index.php?title=Epsilon-equilibrium&oldid=1329678427"
Category:
Hidden categories:

[8]ページ先頭

©2009-2026 Movatter.jp