Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Continuous game

From Wikipedia, the free encyclopedia
Generalization of games used in game theory
This articleneeds additional citations forverification. Please helpimprove this article byadding citations to reliable sources. Unsourced material may be challenged and removed.
Find sources: "Continuous game" – news ·newspapers ·books ·scholar ·JSTOR
(March 2009) (Learn how and when to remove this message)

Acontinuous game is a mathematical concept, used ingame theory, that generalizes the idea of an ordinary game like tic-tac-toe (noughts and crosses) or checkers (draughts). In other words, it extends the notion of a discrete game, where the players choose from a finite set of pure strategies. The continuous game concepts allows games to include more general sets of pure strategies, which may beuncountably infinite.

In general, a game with uncountably infinite strategy sets will not necessarily have aNash equilibrium solution. If, however, the strategy sets are required to becompact and the utility functionscontinuous, then a Nash equilibrium will be guaranteed; this is by Glicksberg's generalization of theKakutani fixed point theorem. The class of continuous games is for this reason usually defined and studied as a subset of the larger class of infinite games (i.e. games with infinite strategy sets) in which the strategy sets are compact and the utility functions continuous.

Formal definition

[edit]

Define then-player continuous gameG=(P,C,U){\displaystyle G=(P,\mathbf {C} ,\mathbf {U} )} where

P=1,2,3,,n{\displaystyle P={1,2,3,\ldots ,n}} is the set ofn{\displaystyle n\,} players,
C=(C1,C2,,Cn){\displaystyle \mathbf {C} =(C_{1},C_{2},\ldots ,C_{n})} where eachCi{\displaystyle C_{i}\,} is acompact set, in ametric space, corresponding to thei{\displaystyle i\,}th player's set of pure strategies,
U=(u1,u2,,un){\displaystyle \mathbf {U} =(u_{1},u_{2},\ldots ,u_{n})} whereui:CR{\displaystyle u_{i}:\mathbf {C} \to \mathbb {R} } is the utility function of playeri{\displaystyle i\,}
We defineΔi{\displaystyle \Delta _{i}\,} to be the set of Borelprobability measures onCi{\displaystyle C_{i}\,}, giving us the mixed strategy space of playeri.
Define the strategy profileσ=(σ1,σ2,,σn){\displaystyle {\boldsymbol {\sigma }}=(\sigma _{1},\sigma _{2},\ldots ,\sigma _{n})} whereσiΔi{\displaystyle \sigma _{i}\in \Delta _{i}\,}

Letσi{\displaystyle {\boldsymbol {\sigma }}_{-i}} be a strategy profile of all players except for playeri{\displaystyle i}. As with discrete games, we can define abest responsecorrespondence for playeri{\displaystyle i\,},bi {\displaystyle b_{i}\ }.bi{\displaystyle b_{i}\,} is a relation from the set of all probability distributions over opponent player profiles to a set of playeri{\displaystyle i}'s strategies, such that each element of

bi(σi){\displaystyle b_{i}(\sigma _{-i})\,}

is a best response toσi{\displaystyle \sigma _{-i}}. Define

b(σ)=b1(σ1)×b2(σ2)××bn(σn){\displaystyle \mathbf {b} ({\boldsymbol {\sigma }})=b_{1}(\sigma _{-1})\times b_{2}(\sigma _{-2})\times \cdots \times b_{n}(\sigma _{-n})}.

A strategy profileσ{\displaystyle {\boldsymbol {\sigma }}*} is aNash equilibrium if and only ifσb(σ){\displaystyle {\boldsymbol {\sigma }}*\in \mathbf {b} ({\boldsymbol {\sigma }}*)}The existence of a Nash equilibrium for any continuous game with continuous utility functions can be proven usingIrving Glicksberg's generalization of theKakutani fixed point theorem.[1] In general, there may not be a solution if we allow strategy spaces,Ci{\displaystyle C_{i}\,}'s which are not compact, or if we allow non-continuous utility functions.

Separable games

[edit]

Aseparable game is a continuous game where, for any i, the utility functionui:CR{\displaystyle u_{i}:\mathbf {C} \to \mathbb {R} } can be expressed in the sum-of-products form:

ui(s)=k1=1m1kn=1mnai,k1knf1(s1)fn(sn){\displaystyle u_{i}(\mathbf {s} )=\sum _{k_{1}=1}^{m_{1}}\ldots \sum _{k_{n}=1}^{m_{n}}a_{i\,,\,k_{1}\ldots k_{n}}f_{1}(s_{1})\ldots f_{n}(s_{n})}, wheresC{\displaystyle \mathbf {s} \in \mathbf {C} },siCi{\displaystyle s_{i}\in C_{i}},ai,k1knR{\displaystyle a_{i\,,\,k_{1}\ldots k_{n}}\in \mathbb {R} }, and the functionsfi,k:CiR{\displaystyle f_{i\,,\,k}:C_{i}\to \mathbb {R} } are continuous.

Apolynomial game is a separable game where eachCi{\displaystyle C_{i}\,} is a compact interval onR{\displaystyle \mathbb {R} \,} and each utility function can be written as a multivariate polynomial.

In general, mixed Nash equilibria of separable games are easier to compute than non-separable games as implied by the following theorem:

For any separable game there exists at least one Nash equilibrium where playeri mixes at mostmi+1{\displaystyle m_{i}+1\,} pure strategies.[2]

Whereas an equilibrium strategy for a non-separable game may require anuncountably infinitesupport, a separable game is guaranteed to have at least one Nash equilibrium with finitely supported mixed strategies.

Examples

[edit]

Separable games

[edit]

A polynomial game

[edit]

Consider a zero-sum 2-player game between playersX andY, withCX=CY=[0,1]{\displaystyle C_{X}=C_{Y}=\left[0,1\right]}. Denote elements ofCX{\displaystyle C_{X}\,} andCY{\displaystyle C_{Y}\,} asx{\displaystyle x\,} andy{\displaystyle y\,} respectively. Define the utility functionsH(x,y)=ux(x,y)=uy(x,y){\displaystyle H(x,y)=u_{x}(x,y)=-u_{y}(x,y)\,} where

H(x,y)=(xy)2{\displaystyle H(x,y)=(x-y)^{2}\,}.

The pure strategy best response relations are:

bX(y)={1,if y[0,1/2)0 or 1,if y=1/20,if y(1/2,1]{\displaystyle b_{X}(y)={\begin{cases}1,&{\mbox{if }}y\in \left[0,1/2\right)\\0{\text{ or }}1,&{\mbox{if }}y=1/2\\0,&{\mbox{if }}y\in \left(1/2,1\right]\end{cases}}}
bY(x)=x{\displaystyle b_{Y}(x)=x\,}

bX(y){\displaystyle b_{X}(y)\,} andbY(x){\displaystyle b_{Y}(x)\,} do not intersect, so there is no pure strategy Nash equilibrium.However, there should be a mixed strategy equilibrium. To find it, express the expected value,v=E[H(x,y)]{\displaystyle v=\mathbb {E} [H(x,y)]} as alinear combination of the first and secondmoments of the probability distributions ofX andY:

v=μX22μX1μY1+μY2{\displaystyle v=\mu _{X2}-2\mu _{X1}\mu _{Y1}+\mu _{Y2}\,}

(whereμXN=E[xN]{\displaystyle \mu _{XN}=\mathbb {E} [x^{N}]} and similarly forY).

The constraints onμX1{\displaystyle \mu _{X1}\,} andμX2{\displaystyle \mu _{X2}} (with similar constraints fory,) are given byHausdorff as:

μX1μX2μX12μX2μY1μY2μY12μY2{\displaystyle {\begin{aligned}\mu _{X1}\geq \mu _{X2}\\\mu _{X1}^{2}\leq \mu _{X2}\end{aligned}}\qquad {\begin{aligned}\mu _{Y1}\geq \mu _{Y2}\\\mu _{Y1}^{2}\leq \mu _{Y2}\end{aligned}}}

Each pair of constraints defines a compact convex subset in the plane. Sincev{\displaystyle v\,} is linear, any extrema with respect to a player's first two moments will lie on the boundary of this subset. Player i's equilibrium strategy will lie on

μi1=μi2 or μi12=μi2{\displaystyle \mu _{i1}=\mu _{i2}{\text{ or }}\mu _{i1}^{2}=\mu _{i2}}

Note that the first equation only permits mixtures of 0 and 1 whereas the second equation only permits pure strategies. Moreover, if the best response at a certain point to player i lies onμi1=μi2{\displaystyle \mu _{i1}=\mu _{i2}\,}, it will lie on the whole line, so that both 0 and 1 are a best response.bY(μX1,μX2){\displaystyle b_{Y}(\mu _{X1},\mu _{X2})\,} simply gives the pure strategyy=μX1{\displaystyle y=\mu _{X1}\,}, sobY{\displaystyle b_{Y}\,} will never give both 0 and 1.Howeverbx{\displaystyle b_{x}\,} gives both 0 and 1 when y = 1/2.A Nash equilibrium exists when:

(μX1,μX2,μY1,μY2)=(1/2,1/2,1/2,1/4){\displaystyle (\mu _{X1}*,\mu _{X2}*,\mu _{Y1}*,\mu _{Y2}*)=(1/2,1/2,1/2,1/4)\,}

This determines one unique equilibrium where Player X plays a random mixture of 0 for 1/2 of the time and 1 the other 1/2 of the time. Player Y plays the pure strategy of 1/2. The value of the game is 1/4.

Non-Separable Games

[edit]

A rational payoff function

[edit]

Consider a zero-sum 2-player game between playersX andY, withCX=CY=[0,1]{\displaystyle C_{X}=C_{Y}=\left[0,1\right]}. Denote elements ofCX{\displaystyle C_{X}\,} andCY{\displaystyle C_{Y}\,} asx{\displaystyle x\,} andy{\displaystyle y\,} respectively. Define the utility functionsH(x,y)=ux(x,y)=uy(x,y){\displaystyle H(x,y)=u_{x}(x,y)=-u_{y}(x,y)\,} where

H(x,y)=(1+x)(1+y)(1xy)(1+xy)2.{\displaystyle H(x,y)={\frac {(1+x)(1+y)(1-xy)}{(1+xy)^{2}}}.}

This game has no pure strategy Nash equilibrium. It can be shown[3] that a unique mixed strategy Nash equilibrium exists with the following pair ofcumulative distribution functions:

F(x)=4πarctanxG(y)=4πarctany.{\displaystyle F^{*}(x)={\frac {4}{\pi }}\arctan {\sqrt {x}}\qquad G^{*}(y)={\frac {4}{\pi }}\arctan {\sqrt {y}}.}

Or, equivalently, the following pair ofprobability density functions:

f(x)=2πx(1+x)g(y)=2πy(1+y).{\displaystyle f^{*}(x)={\frac {2}{\pi {\sqrt {x}}(1+x)}}\qquad g^{*}(y)={\frac {2}{\pi {\sqrt {y}}(1+y)}}.}

The value of the game is4/π{\displaystyle 4/\pi }.

Requiring a Cantor distribution

[edit]

Consider a zero-sum 2-player game between playersX andY, withCX=CY=[0,1]{\displaystyle C_{X}=C_{Y}=\left[0,1\right]}. Denote elements ofCX{\displaystyle C_{X}\,} andCY{\displaystyle C_{Y}\,} asx{\displaystyle x\,} andy{\displaystyle y\,} respectively. Define the utility functionsH(x,y)=ux(x,y)=uy(x,y){\displaystyle H(x,y)=u_{x}(x,y)=-u_{y}(x,y)\,} where

H(x,y)=n=012n(2xn((1x3)n(x3)n))(2yn((1y3)n(y3)n)){\displaystyle H(x,y)=\sum _{n=0}^{\infty }{\frac {1}{2^{n}}}\left(2x^{n}-\left(\left(1-{\frac {x}{3}}\right)^{n}-\left({\frac {x}{3}}\right)^{n}\right)\right)\left(2y^{n}-\left(\left(1-{\frac {y}{3}}\right)^{n}-\left({\frac {y}{3}}\right)^{n}\right)\right)}.

This game has a unique mixed strategy equilibrium where each player plays a mixed strategy with theCantor singular function as thecumulative distribution function.[4]

Further reading

[edit]
  • H. W. Kuhn and A. W. Tucker, eds. (1950).Contributions to the Theory of Games: Vol. II. Annals of Mathematics Studies28. Princeton University Press.ISBN 0-691-07935-8.

See also

[edit]

References

[edit]
  1. ^I.L. Glicksberg. A further generalization of the Kakutani fixed point theorem, with application to Nash equilibrium points. Proceedings of the American Mathematical Society, 3(1):170–174, February 1952.
  2. ^N. Stein, A. Ozdaglar and P.A. Parrilo. "Separable and Low-Rank Continuous Games".International Journal of Game Theory, 37(4):475–504, December 2008.https://arxiv.org/abs/0707.3462
  3. ^Irving Leonard Glicksberg & Oliver Alfred Gross (1950). "Notes on Games over the Square." Kuhn, H.W. & Tucker, A.W. eds.Contributions to the Theory of Games: Volume II. Annals of Mathematics Studies28, p.173–183. Princeton University Press.
  4. ^Gross, O. (1952). "A rational payoff characterization of the Cantor distribution." TechnicalReport D-1349, The RAND Corporation.
Retrieved from "https://en.wikipedia.org/w/index.php?title=Continuous_game&oldid=1238790782"
Category:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp