Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Optimal stopping

From Wikipedia, the free encyclopedia
Class of mathematical problems
Not to be confused withOptional stopping theorem.
"Dynkin game" redirects here. For the coupling card trick, seeDynkin's card trick.

Inmathematics, the theory ofoptimal stopping[1][2] orearly stopping[3] is concerned with the problem of choosing a time to take a particular action, in order tomaximise an expected reward or minimise an expected cost. Optimal stopping problems can be found in areas ofstatistics,economics, andmathematical finance (related to the pricing ofAmerican options). A key example of an optimal stopping problem is thesecretary problem. Optimal stopping problems can often be written in the form of aBellman equation, and are therefore often solved usingdynamic programming.

Definition

[edit]

Discrete time case

[edit]

Stopping rule problems are associated with two objects:

  1. A sequence of random variablesX1,X2,{\displaystyle X_{1},X_{2},\ldots }, whose joint distribution is something assumed to be known
  2. A sequence of 'reward' functions(yi)i1{\displaystyle (y_{i})_{i\geq 1}} which depend on the observed values of the random variables in 1:
    yi=yi(x1,,xi){\displaystyle y_{i}=y_{i}(x_{1},\ldots ,x_{i})}

Given those objects, the problem is as follows:

  • You are observing the sequence of random variables, and at each stepi{\displaystyle i}, you can choose to either stop observing or continue
  • If you stop observing at stepi{\displaystyle i}, you will receive rewardyi{\displaystyle y_{i}}
  • You want to choose astopping rule to maximize your expected reward (or equivalently, minimize your expected loss)

Continuous time case

[edit]

Consider a gain processG=(Gt)t0{\displaystyle G=(G_{t})_{t\geq 0}} defined on afiltered probability space(Ω,F,(Ft)t0,P){\displaystyle (\Omega ,{\mathcal {F}},({\mathcal {F}}_{t})_{t\geq 0},\mathbb {P} )} and assume thatG{\displaystyle G} isadapted to the filtration. The optimal stopping problem is to find thestopping timeτ{\displaystyle \tau ^{*}} which maximizes the expected gain

VtT=EGτ=suptτTEGτ{\displaystyle V_{t}^{T}=\mathbb {E} G_{\tau ^{*}}=\sup _{t\leq \tau \leq T}\mathbb {E} G_{\tau }}

whereVtT{\displaystyle V_{t}^{T}} is called thevalue function. HereT{\displaystyle T} can take value{\displaystyle \infty }.

A more specific formulation is as follows. We consider an adapted strongMarkov processX=(Xt)t0{\displaystyle X=(X_{t})_{t\geq 0}} defined on a filtered probability space(Ω,F,(Ft)t0,Px){\displaystyle (\Omega ,{\mathcal {F}},({\mathcal {F}}_{t})_{t\geq 0},\mathbb {P} _{x})} wherePx{\displaystyle \mathbb {P} _{x}} denotes theprobability measure where thestochastic process starts atx{\displaystyle x}. Given continuous functionsM,L{\displaystyle M,L}, andK{\displaystyle K}, the optimal stopping problem is

V(x)=sup0τTEx(M(Xτ)+0τL(Xt)dt+sup0tτK(Xt)).{\displaystyle V(x)=\sup _{0\leq \tau \leq T}\mathbb {E} _{x}\left(M(X_{\tau })+\int _{0}^{\tau }L(X_{t})dt+\sup _{0\leq t\leq \tau }K(X_{t})\right).}

This is sometimes called the MLS (which stand for Mayer, Lagrange, and supremum, respectively) formulation.[4]

Solution methods

[edit]

There are generally two approaches to solving optimal stopping problems.[4] When the underlying process (or the gain process) is described by its unconditionalfinite-dimensional distributions, the appropriate solution technique is the martingale approach, so called because it usesmartingale theory, the most important concept being theSnell envelope. In the discrete time case, if the planning horizonT{\displaystyle T} is finite, the problem can also be easily solved bydynamic programming.

When the underlying process is determined by a family of (conditional) transition functions leading to a Markov family of transition probabilities, powerful analytical tools provided by the theory ofMarkov processes can often be utilized and this approach is referred to as the Markov method. The solution is usually obtained by solving the associatedfree-boundary problems (Stefan problems).

A jump diffusion result

[edit]

LetYt{\displaystyle Y_{t}} be aLévy diffusion inRk{\displaystyle \mathbb {R} ^{k}} given by theSDE

dYt=b(Yt)dt+σ(Yt)dBt+Rkγ(Yt,z)N¯(dt,dz),Y0=y{\displaystyle dY_{t}=b(Y_{t})dt+\sigma (Y_{t})dB_{t}+\int _{\mathbb {R} ^{k}}\gamma (Y_{t-},z){\bar {N}}(dt,dz),\quad Y_{0}=y}

whereB{\displaystyle B} is anm{\displaystyle m}-dimensionalBrownian motion,N¯{\displaystyle {\bar {N}}} is anl{\displaystyle l}-dimensional compensatedPoisson random measure,b:RkRk{\displaystyle b:\mathbb {R} ^{k}\to \mathbb {R} ^{k}},σ:RkRk×m{\displaystyle \sigma :\mathbb {R} ^{k}\to \mathbb {R} ^{k\times m}}, andγ:Rk×RkRk×l{\displaystyle \gamma :\mathbb {R} ^{k}\times \mathbb {R} ^{k}\to \mathbb {R} ^{k\times l}} are given functions such that a unique solution(Yt){\displaystyle (Y_{t})} exists. LetSRk{\displaystyle {\mathcal {S}}\subset \mathbb {R} ^{k}} be anopen set (the solvency region) and

τS=inf{t>0:YtS}{\displaystyle \tau _{\mathcal {S}}=\inf\{t>0:Y_{t}\notin {\mathcal {S}}\}}

be the bankruptcy time. The optimal stopping problem is:

V(y)=supττSJτ(y)=supττSEy[M(Yτ)+0τL(Yt)dt].{\displaystyle V(y)=\sup _{\tau \leq \tau _{\mathcal {S}}}J^{\tau }(y)=\sup _{\tau \leq \tau _{\mathcal {S}}}\mathbb {E} _{y}\left[M(Y_{\tau })+\int _{0}^{\tau }L(Y_{t})dt\right].}

It turns out that under some regularity conditions,[5] the following verification theorem holds:

If a functionϕ:S¯R{\displaystyle \phi :{\bar {\mathcal {S}}}\to \mathbb {R} } satisfies

thenϕ(y)V(y){\displaystyle \phi (y)\geq V(y)} for allyS¯{\displaystyle y\in {\bar {\mathcal {S}}}}. Moreover, if

Thenϕ(y)=V(y){\displaystyle \phi (y)=V(y)} for allyS¯{\displaystyle y\in {\bar {\mathcal {S}}}} andτ=inf{t>0:YtD}{\displaystyle \tau ^{*}=\inf\{t>0:Y_{t}\notin D\}} is an optimal stopping time.

These conditions can also be written is a more compact form (theintegro-variational inequality):

Examples

[edit]

Coin tossing

[edit]

(Example whereE(yi){\displaystyle \mathbb {E} (y_{i})} converges)

You have a fair coin and are repeatedly tossing it. Each time, before it is tossed, you can choose to stop tossing it and get paid (in dollars, say) the average number of heads observed.

You wish to maximise the amount you get paid by choosing a stopping rule.IfXi (fori ≥ 1) forms a sequence of independent, identically distributed random variables withBernoulli distribution

Bern(12),{\displaystyle {\text{Bern}}\left({\frac {1}{2}}\right),}

and if

yi=1ik=1iXk{\displaystyle y_{i}={\frac {1}{i}}\sum _{k=1}^{i}X_{k}}

then the sequences(Xi)i1{\displaystyle (X_{i})_{i\geq 1}}, and(yi)i1{\displaystyle (y_{i})_{i\geq 1}} are the objects associated with this problem.

House selling

[edit]

(Example whereE(yi){\displaystyle \mathbb {E} (y_{i})} does not necessarily converge)

You have a house and wish to sell it. Each day you are offeredXn{\displaystyle X_{n}} for your house, and payk{\displaystyle k} to continue advertising it. If you sell your house on dayn{\displaystyle n}, you will earnyn{\displaystyle y_{n}}, whereyn=(Xnnk){\displaystyle y_{n}=(X_{n}-nk)}.

You wish to maximise the amount you earn by choosing a stopping rule.

In this example, the sequence (Xi{\displaystyle X_{i}}) is the sequence of offers for your house, and the sequence of reward functions is how much you will earn.[6]

Secretary problem

[edit]
Three cases of the secretary problem with icon height denoting desirability:
  1. Too small an exploration set selects a suboptimal candidate before the best (*) is seen.
  2. An ideal set identifies the best.
  3. If a too-large set includes the best, the last candidate is selected.
Main article:Secretary problem

(Example where(Xi){\displaystyle (X_{i})} is a finite sequence)

You are observing a sequence of objects which can be ranked from best to worst. You wish to choose a stopping rule which maximises your chance of picking the best object.

Here, ifR1,,Rn{\displaystyle R_{1},\ldots ,R_{n}} (n is some large number) are the ranks of the objects, andyi{\displaystyle y_{i}} is the chance you pick the best object if you stop intentionally rejecting objects at step i, then(Ri){\displaystyle (R_{i})} and(yi){\displaystyle (y_{i})} are the sequences associated with this problem. This problem was solved in the early 1960s by several people. An elegant solution to the secretary problem and several modifications of this problem is provided by the more recentodds algorithm of optimal stopping (Bruss algorithm).

Search theory

[edit]
Main article:Search theory

Economists have studied a number of optimal stopping problems similar to the 'secretary problem', and typically call this type of analysis 'search theory'.Search theory has especially focused on a worker's search for a high-wage job, or a consumer's search for a low-priced good.

Parking problem

[edit]

A special example of an application of search theory is the task of optimal selection of parking space by a driver going to the opera (theater, shopping, etc.). Approaching the destination, the driver goes down the street along which there are parking spaces – usually, only some places in the parking lot are free. The goal is clearly visible, so the distance from the target is easily assessed. The driver's task is to choose a free parking space as close to the destination as possible without turning around so that the distance from this place to the destination is the shortest.[7]

Option trading

[edit]

In the trading ofoptions onfinancial markets, the holder of anAmerican option is allowed to exercise the right to buy (or sell) the underlying asset at a predetermined price at any time before or at the expiry date. Therefore, the valuation of American options is essentially an optimal stopping problem. Consider a classicalBlack–Scholes set-up and letr{\displaystyle r} be therisk-free interest rate andδ{\displaystyle \delta } andσ{\displaystyle \sigma } be the dividend rate and volatility of the stock. The stock priceS{\displaystyle S} follows geometric Brownian motion

St=S0exp{(rδσ22)t+σBt}{\displaystyle S_{t}=S_{0}\exp \left\{\left(r-\delta -{\frac {\sigma ^{2}}{2}}\right)t+\sigma B_{t}\right\}}

under therisk-neutral measure.

When the option is perpetual, the optimal stopping problem is

V(x)=supτEx[erτg(Sτ)]{\displaystyle V(x)=\sup _{\tau }\mathbb {E} _{x}\left[e^{-r\tau }g(S_{\tau })\right]}

where the payoff function isg(x)=(xK)+{\displaystyle g(x)=(x-K)^{+}} for a call option andg(x)=(Kx)+{\displaystyle g(x)=(K-x)^{+}} for a put option. The variational inequality is

max{12σ2x2V(x)+(rδ)xV(x)rV(x),g(x)V(x)}=0{\displaystyle \max \left\{{\frac {1}{2}}\sigma ^{2}x^{2}V''(x)+(r-\delta )xV'(x)-rV(x),g(x)-V(x)\right\}=0}

for allx(0,){b}{\displaystyle x\in (0,\infty )\setminus \{b\}}whereb{\displaystyle b} is the exercise boundary. The solution is known to be[8]

On the other hand, when the expiry date is finite, the problem is associated with a 2-dimensional free-boundary problem with no known closed-form solution. Various numerical methods can, however, be used. SeeBlack–Scholes model#American options for various valuation methods here, as well asFugit for a discrete,tree based, calculation of the optimal time to exercise.

See also

[edit]

References

[edit]

Citations

[edit]
  1. ^Chow, Y.S.;Robbins, H.; Siegmund, D. (1971).Great Expectations: The Theory of Optimal Stopping. Boston:Houghton Mifflin.
  2. ^Ferguson, Thomas S. (2007).Optimal Stopping and Applications. UCLA.
  3. ^Hill, Theodore P. (2009). "Knowing When to Stop".American Scientist.97 (2):126–133.doi:10.1511/2009.77.126.ISSN 1545-2786.S2CID 124798270.
    (For French translation, seecover story in the July issue ofPour la Science (2009).)
  4. ^abPeskir, Goran;Shiryaev, Albert (2006).Optimal Stopping and Free-Boundary Problems. Lectures in Mathematics. ETH Zürich.doi:10.1007/978-3-7643-7390-0.ISBN 978-3-7643-2419-3.
  5. ^Øksendal, B.;Sulem, A. (2007).Applied Stochastic Control of Jump Diffusions.doi:10.1007/978-3-540-69826-5.ISBN 978-3-540-69825-8.S2CID 123531718.
  6. ^Ferguson, Thomas S.;Klass, Michael J. (2010). "House-hunting without second moments".Sequential Analysis.29 (3):236–244.doi:10.1080/07474946.2010.487423.ISSN 0747-4946.
  7. ^MacQueen, J.; Miller Jr., R.G. (1960). "Optimal persistence policies".Operations Research.8 (3):362–380.doi:10.1287/opre.8.3.362.ISSN 0030-364X.
  8. ^Karatzas, Ioannis;Shreve, Steven E. (1998).Methods of Mathematical Finance. Stochastic Modelling and Applied Probability. Vol. 39.doi:10.1007/b98840.ISBN 978-0-387-94839-3.

Sources

[edit]
National
Other
Retrieved from "https://en.wikipedia.org/w/index.php?title=Optimal_stopping&oldid=1290094266"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp