Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Extreme value theorem

From Wikipedia, the free encyclopedia
This article is about the calculus concept. For the statistical concept, seeFisher–Tippett–Gnedenko theorem.
Continuous real function on a closed interval has a maximum and a minimum
This article includes a list ofgeneral references, butit lacks sufficient correspondinginline citations. Please help toimprove this article byintroducing more precise citations.(June 2012) (Learn how and when to remove this message)
A continuous functionf(x){\displaystyle f(x)} on the closed interval[a,b]{\displaystyle [a,b]} showing the absolute max (red) and the absolute min (blue).

Inreal analysis, a branch ofmathematics, theextreme value theorem states that if a real-valuedfunctionf{\displaystyle f} iscontinuous on theclosed andbounded interval[a,b]{\displaystyle [a,b]}, thenf{\displaystyle f} must attain amaximum and aminimum, each at least once.[1][2] That is, there exist numbersc{\displaystyle c} andd{\displaystyle d} in[a,b]{\displaystyle [a,b]} such that:f(c)f(x)f(d)x[a,b].{\displaystyle f(c)\leq f(x)\leq f(d)\quad \forall x\in [a,b].}

The extreme value theorem is more specific than the relatedboundedness theorem, which states merely that a continuous functionf{\displaystyle f} on the closed interval[a,b]{\displaystyle [a,b]} isbounded on that interval; that is, there exist real numbersm{\displaystyle m} andM{\displaystyle M} such that:mf(x)Mx[a,b].{\displaystyle m\leq f(x)\leq M\quad \forall x\in [a,b].}

This does not say thatM{\displaystyle M} andm{\displaystyle m} are necessarily the maximum and minimum values off{\displaystyle f} on the interval[a,b],{\displaystyle [a,b],} which is what the extreme value theorem stipulates must also be the case.

The extreme value theorem is used to proveRolle's theorem. In a formulation due toKarl Weierstrass, this theorem states that a continuous function from a non-emptycompact space to asubset of thereal numbers attains a maximum and a minimum.

History

[edit]

The extreme value theorem was originally proven byBernard Bolzano in the 1830s in a workFunction Theory but the work remained unpublished until 1930. Bolzano's proof consisted of showing that a continuous function on a closed interval was bounded, and then showing that the function attained a maximum and a minimum value. Both proofs involved what is known today as theBolzano–Weierstrass theorem.[3]

Functions to which the theorem does not apply

[edit]

The following examples show why the function domain must be closed and bounded in order for the theorem to apply. Each fails to attain a maximum on the given interval.

  1. f(x)=x{\displaystyle f(x)=x} defined over[0,){\displaystyle [0,\infty )} is not bounded from above.
  2. f(x)=x1+x{\displaystyle f(x)={\frac {x}{1+x}}} defined over[0,){\displaystyle [0,\infty )} is bounded from below but does not attain its least upper bound1{\displaystyle 1}.
  3. f(x)=1x{\displaystyle f(x)={\frac {1}{x}}} defined over(0,1]{\displaystyle (0,1]} is not bounded from above.
  4. f(x)=1x{\displaystyle f(x)=1-x} defined over(0,1]{\displaystyle (0,1]} is bounded but never attains its least upper bound1{\displaystyle 1}.

Definingf(0)=0{\displaystyle f(0)=0} in the last two examples shows that both theorems require continuity on[a,b]{\displaystyle [a,b]}.

Generalization to metric and topological spaces

[edit]

When moving from the real lineR{\displaystyle \mathbb {R} } tometric spaces and generaltopological spaces, the appropriate generalization of a closed bounded interval is acompact set. A setK{\displaystyle K} is said to be compact if it has the following property: from every collection ofopen setsUα{\displaystyle U_{\alpha }} such thatUαK{\textstyle \bigcup U_{\alpha }\supset K}, a finite subcollectionUα1,,Uαn{\displaystyle U_{\alpha _{1}},\ldots ,U_{\alpha _{n}}}can be chosen such thati=1nUαiK{\textstyle \bigcup _{i=1}^{n}U_{\alpha _{i}}\supset K}. This is usually stated in short as "every open cover ofK{\displaystyle K} has a finite subcover". TheHeine–Borel theorem asserts that a subset of the real line is compact if and only if it is both closed and bounded. Correspondingly, a metric space has theHeine–Borel property if every closed and bounded set is also compact.

The concept of a continuous function can likewise be generalized. Given topological spacesV, W{\displaystyle V,\ W}, a functionf:VW{\displaystyle f:V\to W} is said to be continuous if for every open setUW{\displaystyle U\subset W},f1(U)V{\displaystyle f^{-1}(U)\subset V} is also open. Given these definitions, continuous functions can be shown to preserve compactness:[4]

TheoremIfV, W{\displaystyle V,\ W} are topological spaces,f:VW{\displaystyle f:V\to W} is a continuous function, andKV{\displaystyle K\subset V} is compact, thenf(K)W{\displaystyle f(K)\subset W} is also compact.

In particular, ifW=R{\displaystyle W=\mathbb {R} }, then this theorem implies thatf(K){\displaystyle f(K)} is closed and bounded for any compact setK{\displaystyle K}, which in turn implies thatf{\displaystyle f} attains itssupremum andinfimum on any (nonempty) compact setK{\displaystyle K}. Thus, we have the following generalization of the extreme value theorem:[4]

TheoremIfK{\displaystyle K} is a nonempty compact set andf:KR{\displaystyle f:K\to \mathbb {R} } is a continuous function, thenf{\displaystyle f} is bounded and there existp,qK{\displaystyle p,q\in K} such thatf(p)=supxKf(x){\displaystyle f(p)=\sup _{x\in K}f(x)} andf(q)=infxKf(x){\displaystyle f(q)=\inf _{x\in K}f(x)}.

Slightly more generally, this is also true for an upper semicontinuous function. (seecompact space#Functions and compact spaces).

Proving the theorems

[edit]

We look at the proof for theupper bound and the maximum off{\displaystyle f}. By applying these results to the functionf{\displaystyle -f}, the existence of the lower bound and the result for the minimum off{\displaystyle f} follows. Also note that everything in the proof is done within the context of thereal numbers.

We first prove the boundedness theorem, which is a step in the proof of the extreme value theorem. The basic steps involved in the proof of the extreme value theorem are:

  1. Prove the boundedness theorem.
  2. Find a sequence so that itsimage converges to thesupremum off{\displaystyle f}.
  3. Show that there exists asubsequence that converges to a point in thedomain.
  4. Use continuity to show that the image of the subsequence converges to the supremum.

Proof of the boundedness theorem

[edit]

Boundedness TheoremIff(x){\displaystyle f(x)} is continuous on[a,b],{\displaystyle [a,b],} then it is bounded on[a,b].{\displaystyle [a,b].}

Proof

Suppose the functionf{\displaystyle f} is not bounded above on the interval[a,b]{\displaystyle [a,b]}. Pick asequence(xn)nN{\displaystyle (x_{n})_{n\in \mathbb {N} }} such thatxn[a,b]{\displaystyle x_{n}\in [a,b]} andf(xn)>n{\displaystyle f(x_{n})>n}. Because[a,b]{\displaystyle [a,b]} is bounded, theBolzano–Weierstrass theorem implies that there exists a convergent subsequence(xnk)kN{\displaystyle (x_{n_{k}})_{k\in \mathbb {N} }} of(xn){\displaystyle ({x_{n}})}. Denote its limit byx{\displaystyle x}. As[a,b]{\displaystyle [a,b]} is closed, it containsx{\displaystyle x}. Becausef{\displaystyle f} is continuous atx{\displaystyle x}, we know thatf(xnk){\displaystyle f(x_{{n}_{k}})} converges to the real numberf(x){\displaystyle f(x)} (asf{\displaystyle f} issequentially continuous atx{\displaystyle x}). Butf(xnk)>nkk{\displaystyle f(x_{{n}_{k}})>n_{k}\geq k} for everyk{\displaystyle k}, which implies thatf(xnk){\displaystyle f(x_{{n}_{k}})} diverges to+{\displaystyle +\infty }, a contradiction. Therefore,f{\displaystyle f} is bounded above on[a,b]{\displaystyle [a,b]}

Alternative proof

Consider the setB{\displaystyle B} of pointsp{\displaystyle p} in[a,b]{\displaystyle [a,b]} such thatf(x){\displaystyle f(x)} is bounded on[a,p]{\displaystyle [a,p]}. We note thata{\displaystyle a} is one such point, forf(x){\displaystyle f(x)} is bounded on[a,a]{\displaystyle [a,a]} by the valuef(a){\displaystyle f(a)}. Ife>a{\displaystyle e>a} is another point, then all points betweena{\displaystyle a} ande{\displaystyle e} also belong toB{\displaystyle B}. In other wordsB{\displaystyle B} is an interval closed at its left end bya{\displaystyle a}.

Nowf{\displaystyle f} is continuous on the right ata{\displaystyle a}, hence there existsδ>0{\displaystyle \delta >0} such that|f(x)f(a)|<1{\displaystyle |f(x)-f(a)|<1} for allx{\displaystyle x} in[a,a+δ]{\displaystyle [a,a+\delta ]}. Thusf{\displaystyle f} is bounded byf(a)1{\displaystyle f(a)-1} andf(a)+1{\displaystyle f(a)+1} on the interval[a,a+δ]{\displaystyle [a,a+\delta ]} so that all these points belong toB{\displaystyle B}.

So far, we know thatB{\displaystyle B} is an interval of non-zero length, closed at its left end bya{\displaystyle a}.

Next,B{\displaystyle B} is bounded above byb{\displaystyle b}. Hence the setB{\displaystyle B} has a supremum in[a,b]{\displaystyle [a,b]} ; let us call its{\displaystyle s}. From the non-zero length ofB{\displaystyle B} we can deduce thats>a{\displaystyle s>a}.

Supposes<b{\displaystyle s<b}. Nowf{\displaystyle f} is continuous ats{\displaystyle s}, hence there existsδ>0{\displaystyle \delta >0} such that|f(x)f(s)|<1{\displaystyle |f(x)-f(s)|<1} for allx{\displaystyle x} in[sδ,s+δ]{\displaystyle [s-\delta ,s+\delta ]} so thatf{\displaystyle f} is bounded on this interval. But it follows from the supremacy ofs{\displaystyle s} that there exists a point belonging toB{\displaystyle B},e{\displaystyle e} say, which is greater thansδ/2{\displaystyle s-\delta /2}. Thusf{\displaystyle f} is bounded on[a,e]{\displaystyle [a,e]} which overlaps[sδ,s+δ]{\displaystyle [s-\delta ,s+\delta ]} so thatf{\displaystyle f} is bounded on[a,s+δ]{\displaystyle [a,s+\delta ]}. This however contradicts the supremacy ofs{\displaystyle s}.

We must therefore haves=b{\displaystyle s=b}. Nowf{\displaystyle f} is continuous on the left ats{\displaystyle s}, hence there existsδ>0{\displaystyle \delta >0} such that|f(x)f(s)|<1{\displaystyle |f(x)-f(s)|<1} for allx{\displaystyle x} in[sδ,s]{\displaystyle [s-\delta ,s]} so thatf{\displaystyle f} is bounded on this interval. But it follows from the supremacy ofs{\displaystyle s} that there exists a point belonging toB{\displaystyle B},e{\displaystyle e} say, which is greater thansδ/2{\displaystyle s-\delta /2}. Thusf{\displaystyle f} is bounded on[a,e]{\displaystyle [a,e]} which overlaps[sδ,s]{\displaystyle [s-\delta ,s]} so thatf{\displaystyle f} is bounded on[a,s]{\displaystyle [a,s]}.  

Proofs of the extreme value theorem

[edit]
Proof of the Extreme Value Theorem

By the boundedness theorem,f is bounded from above, hence, by theDedekind-completeness of the real numbers, the least upper bound (supremum)M off exists. It is necessary to find a pointd in [a,b] such thatM =f(d). Letn be a natural number. AsM is theleast upper bound,M − 1/n is not an upper bound forf. Therefore, there existsdn in [a,b] so thatM − 1/n <f(dn). This defines a sequence {dn}. SinceM is an upper bound forf, we haveM − 1/n <f(dn) ≤M for alln. Therefore, the sequence {f(dn)} converges toM.

TheBolzano–Weierstrass theorem tells us that there exists a subsequence {dnk{\displaystyle d_{n_{k}}}}, which converges to somed and, as [a,b] is closed,d is in [a,b]. Sincef is continuous atd, the sequence {f(dnk{\displaystyle d_{n_{k}}})} converges tof(d). But {f(dnk)} is a subsequence of {f(dn)} that converges toM, soM =f(d). Therefore,f attains its supremumM atd

Alternative Proof of the Extreme Value Theorem

The set{yR :y =f(x) for somex ∈ [a,b]} is a bounded set. Hence, itsleast upper bound exists byleast upper bound property of the real numbers. LetM = sup(f(x)) on [a,b]. If there is no pointx on [ab] so thatf(x) = M, thenf(x) <M on [ab]. Therefore,1/(Mf(x)) is continuous on [a,b].

However, to every positive numberε, there is always somex in [ab] such thatMf(x) <ε becauseM is the least upper bound. Hence,1/(Mf(x)) > 1/ε, which means that1/(Mf(x)) is not bounded. Since every continuous function on [a,b] is bounded, this contradicts the conclusion that1/(Mf(x)) was continuous on [ab]. Therefore, there must be a pointx in [ab] such thatf(x) = M.

Proof using the hyperreals

[edit]
Proof

In the setting ofnon-standard calculus, letN  be an infinitehyperinteger. The interval [0, 1] has a natural hyperreal extension. Consider its partition intoN subintervals of equalinfinitesimal length 1/N, with partition pointsxi =i /N asi "runs" from 0 toN. The functionƒ  is also naturally extended to a functionƒ* defined on the hyperreals between 0 and 1. Note that in the standard setting (whenN  is finite), a point with the maximal value ofƒ can always be chosen among theN+1 pointsxi, by induction. Hence, by thetransfer principle, there is a hyperintegeri0 such that 0 ≤i0 ≤N andf(xi0)f(xi){\displaystyle f^{*}(x_{i_{0}})\geq f^{*}(x_{i})}  for alli = 0, ..., N. Consider the real pointc=st(xi0){\displaystyle c=\mathbf {st} (x_{i_{0}})}wherest is thestandard part function. An arbitrary real pointx lies in a suitable sub-interval of the partition, namelyx[xi,xi+1]{\displaystyle x\in [x_{i},x_{i+1}]}, so that st(xi) =x. Applyingst to the inequalityf(xi0)f(xi){\displaystyle f^{*}(x_{i_{0}})\geq f^{*}(x_{i})}, we obtainst(f(xi0))st(f(xi)){\displaystyle \mathbf {st} (f^{*}(x_{i_{0}}))\geq \mathbf {st} (f^{*}(x_{i}))}. By continuity ofƒ  we have

st(f(xi0))=f(st(xi0))=f(c){\displaystyle \mathbf {st} (f^{*}(x_{i_{0}}))=f(\mathbf {st} (x_{i_{0}}))=f(c)}.

Henceƒ(c) ≥ƒ(x), for all realx, provingc to be a maximum ofƒ.[5]

Proof from first principles

[edit]

Statement      Iff(x){\displaystyle f(x)} is continuous on[a,b]{\displaystyle [a,b]} then it attains its supremum on[a,b]{\displaystyle [a,b]}

Proof

By the Boundedness Theorem,f(x){\displaystyle f(x)} is bounded above on[a,b]{\displaystyle [a,b]} and by the completeness property of the real numbers has a supremum in[a,b]{\displaystyle [a,b]}. Let us call itM{\displaystyle M}, orM[a,b]{\displaystyle M[a,b]}. It is clear that the restriction off{\displaystyle f} to the subinterval[a,x]{\displaystyle [a,x]} wherexb{\displaystyle x\leq b} has a supremumM[a,x]{\displaystyle M[a,x]} which is less than or equal toM{\displaystyle M}, and thatM[a,x]{\displaystyle M[a,x]} increases fromf(a){\displaystyle f(a)} toM{\displaystyle M} asx{\displaystyle x} increases froma{\displaystyle a} tob{\displaystyle b}.

Iff(a)=M{\displaystyle f(a)=M} then we are done. Suppose therefore thatf(a)<M{\displaystyle f(a)<M} and letd=Mf(a){\displaystyle d=M-f(a)}. Consider the setL{\displaystyle L} of pointsx{\displaystyle x} in[a,b]{\displaystyle [a,b]} such thatM[a,x]<M{\displaystyle M[a,x]<M}.

ClearlyaL{\displaystyle a\in L} ; moreover ife>a{\displaystyle e>a} is another point inL{\displaystyle L} then all points betweena{\displaystyle a} ande{\displaystyle e} also belong toL{\displaystyle L} becauseM[a,x]{\displaystyle M[a,x]} is monotonic increasing. HenceL{\displaystyle L} is a non-empty interval, closed at its left end bya{\displaystyle a}.

Nowf{\displaystyle f} is continuous on the right ata{\displaystyle a}, hence there existsδ>0{\displaystyle \delta >0} such that|f(x)f(a)|<d/2{\displaystyle |f(x)-f(a)|<d/2} for allx{\displaystyle x} in[a,a+δ]{\displaystyle [a,a+\delta ]}. Thusf{\displaystyle f} is less thanMd/2{\displaystyle M-d/2} on the interval[a,a+δ]{\displaystyle [a,a+\delta ]} so that all these points belong toL{\displaystyle L}.

Next,L{\displaystyle L} is bounded above byb{\displaystyle b} and has therefore a supremum in[a,b]{\displaystyle [a,b]}: let us call its{\displaystyle s}. We see from the above thats>a{\displaystyle s>a}. We will show thats{\displaystyle s} is the point we are seeking i.e. the point wheref{\displaystyle f} attains its supremum, or in other wordsf(s)=M{\displaystyle f(s)=M}.

Suppose the contrary viz.f(s)<M{\displaystyle f(s)<M}. Letd=Mf(s){\displaystyle d=M-f(s)} and consider the following two cases:

  1. s<b{\displaystyle s<b}.   Asf{\displaystyle f} is continuous ats{\displaystyle s}, there existsδ>0{\displaystyle \delta >0} such that|f(x)f(s)|<d/2{\displaystyle |f(x)-f(s)|<d/2} for allx{\displaystyle x} in[sδ,s+δ]{\displaystyle [s-\delta ,s+\delta ]}. This means thatf{\displaystyle f} is less thanMd/2{\displaystyle M-d/2} on the interval[sδ,s+δ]{\displaystyle [s-\delta ,s+\delta ]}. But it follows from the supremacy ofs{\displaystyle s} that there exists a point,e{\displaystyle e} say, belonging toL{\displaystyle L} which is greater thansδ{\displaystyle s-\delta }. By the definition ofL{\displaystyle L},M[a,e]<M{\displaystyle M[a,e]<M}. Letd1=MM[a,e]{\displaystyle d_{1}=M-M[a,e]} then for allx{\displaystyle x} in[a,e]{\displaystyle [a,e]},f(x)Md1{\displaystyle f(x)\leq M-d_{1}}. Takingd2{\displaystyle d_{2}} to be the minimum ofd/2{\displaystyle d/2} andd1{\displaystyle d_{1}}, we havef(x)Md2{\displaystyle f(x)\leq M-d_{2}} for allx{\displaystyle x} in[a,s+δ]{\displaystyle [a,s+\delta ]}.
    HenceM[a,s+δ]<M{\displaystyle M[a,s+\delta ]<M} so thats+δL{\displaystyle s+\delta \in L}. This however contradicts the supremacy ofs{\displaystyle s} and completes the proof.
  2. s=b{\displaystyle s=b}.   Asf{\displaystyle f} is continuous on the left ats{\displaystyle s}, there existsδ>0{\displaystyle \delta >0} such that|f(x)f(s)|<d/2{\displaystyle |f(x)-f(s)|<d/2} for allx{\displaystyle x} in[sδ,s]{\displaystyle [s-\delta ,s]}. This means thatf{\displaystyle f} is less thanMd/2{\displaystyle M-d/2} on the interval[sδ,s]{\displaystyle [s-\delta ,s]}. But it follows from the supremacy ofs{\displaystyle s} that there exists a point,e{\displaystyle e} say, belonging toL{\displaystyle L} which is greater thansδ{\displaystyle s-\delta }. By the definition ofL{\displaystyle L},M[a,e]<M{\displaystyle M[a,e]<M}. Letd1=MM[a,e]{\displaystyle d_{1}=M-M[a,e]} then for allx{\displaystyle x} in[a,e]{\displaystyle [a,e]},f(x)Md1{\displaystyle f(x)\leq M-d_{1}}. Takingd2{\displaystyle d_{2}} to be the minimum ofd/2{\displaystyle d/2} andd1{\displaystyle d_{1}}, we havef(x)Md2{\displaystyle f(x)\leq M-d_{2}} for allx{\displaystyle x} in[a,b]{\displaystyle [a,b]}. This contradicts the supremacy ofM{\displaystyle M} and completes the proof.

Extension to semi-continuous functions

[edit]

If the continuity of the functionf is weakened tosemi-continuity,then the corresponding half of the boundedness theorem and the extreme value theorem hold and the values −∞ or +∞, respectively, from theextended real number line can be allowed as possible values.[clarification needed]

A functionf:[a,b][,){\displaystyle f:[a,b]\to [-\infty ,\infty )} is said to beupper semi-continuous iflim supyxf(y)f(x)x[a,b].{\displaystyle \limsup _{y\to x}f(y)\leq f(x)\quad \forall x\in [a,b].}

TheoremIf a functionf : [a,b] →[–∞, ∞) is upper semi-continuous, thenf is bounded above and attains its supremum.

Proof

Iff(x)={\displaystyle f(x)=-\infty } for allx in [a,b], then the supremum is also{\displaystyle -\infty } and the theorem is true. In all other cases, the proof is a slight modification of the proofs given above. In the proof of the boundedness theorem, the upper semi-continuity off atx only implies that thelimit superior of the subsequence {f(xnk)} is bounded above byf(x) < ∞, but that is enough to obtain the contradiction. In the proof of the extreme value theorem, upper semi-continuity off atd implies that the limit superior of the subsequence {f(dnk)} is bounded above byf(d), but this suffices to conclude thatf(d) =M


Applying this result to −f proves a similar result for the infimums of lower semicontinuous functions. A functionf:[a,b][,){\displaystyle f:[a,b]\to [-\infty ,\infty )} is said to belower semi-continuous iflim infyxf(y)f(x)x[a,b].{\displaystyle \liminf _{y\to x}f(y)\geq f(x)\quad \forall x\in [a,b].}

TheoremIf a functionf : [a,b] →(–∞, ∞] is lower semi-continuous, thenf is bounded below and attains itsinfimum.

A real-valued function is upper as well as lower semi-continuous, if and only if it is continuous in the usual sense. Hence these two theorems imply the boundedness theorem and the extreme value theorem.

References

[edit]
  1. ^Spivak, Michael (September 1994).Calculus. Publish or Perish publishing.ISBN 978-0-914098-89-8.
  2. ^Abbott, Stephen (2001).Understanding Analysis. Undergraduate Texts in Mathematics. New York: Springer-Verlag.ISBN 978-0387950600.
  3. ^Rusnock, Paul; Kerr-Lawson, Angus (2005). "Bolzano and Uniform Continuity".Historia Mathematica.32 (3):303–311.doi:10.1016/j.hm.2004.11.003.
  4. ^abRudin, Walter (1976).Principles of Mathematical Analysis. New York: McGraw Hill. pp. 89–90.ISBN 0-07-054235-X.
  5. ^Keisler, H. Jerome (1986).Elementary Calculus : An Infinitesimal Approach(PDF). Boston: Prindle, Weber & Schmidt. p. 164.ISBN 0-87150-911-3.

Further reading

[edit]

External links

[edit]
Precalculus
Limits
Differential calculus
Integral calculus
Vector calculus
Multivariable calculus
Sequences and series
Special functions
and numbers
History of calculus
Lists
Integrals
Miscellaneous topics
Retrieved from "https://en.wikipedia.org/w/index.php?title=Extreme_value_theorem&oldid=1300774617"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp