In probability theory, the continuous mapping theorem states that continuous functions preserve limits even if their arguments are sequences of random variables. A continuous function, in Heine's definition, is such a function that maps convergent sequences into convergent sequences: if xn → x then g(xn) → g(x). The continuous mapping theorem states that this will also be true if we replace the deterministic sequence {xn} with a sequence of random variables {Xn}, and replace the standard notion of convergence of real numbers "→" with one of the types of convergence of random variables.
This theorem was first proved by Henry Mann and Abraham Wald in 1943,[1] and it is therefore sometimes called the Mann–Wald theorem.[2] Meanwhile, Denis Sargan refers to it as the general transformation theorem.[3]
Let {Xn}, X be random elements defined on a metric space S. Suppose a function g: S → S′ (where S′ is another metric space) has the set of discontinuity points Dg such that Pr[X ∈ Dg] = 0. Then[4][5]

    Xn →d X   ⇒   g(Xn) →d g(X);
    Xn →p X   ⇒   g(Xn) →p g(X);
    Xn →a.s. X   ⇒   g(Xn) →a.s. g(X);

where the superscripts "d", "p", and "a.s." denote convergence in distribution, convergence in probability, and almost sure convergence respectively.
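To make the statement concrete, here is a small Monte Carlo sketch. The distributions, the map g(x) = x², and the sequence Xn = X + Z/n are illustrative assumptions, not part of the theorem: since Xn → X in probability, the theorem predicts that Pr[|g(Xn) − g(X)| > ε] should shrink to zero as n grows.

```python
import numpy as np

# Illustrative setup (assumed, not from the theorem itself):
# X ~ N(0,1), Xn = X + Z/n with fresh Z ~ N(0,1), so Xn -> X in probability.
# For the continuous map g(x) = x**2 the theorem then gives
# g(Xn) -> g(X) in probability as well.
rng = np.random.default_rng(0)
m = 100_000                     # Monte Carlo sample size
X = rng.standard_normal(m)
eps = 0.1

for n in (1, 10, 100, 1000):
    Xn = X + rng.standard_normal(m) / n
    p = np.mean(np.abs(Xn**2 - X**2) > eps)   # estimate of Pr[|g(Xn) - g(X)| > eps]
    print(n, p)
```

The estimated probabilities decrease as n grows, matching the "p" case of the theorem.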
Spaces S and S′ are equipped with certain metrics. For simplicity we will denote both of these metrics using the |x − y| notation, even though the metrics may be arbitrary and not necessarily Euclidean.
We will need a particular statement from the portmanteau theorem: that convergence in distribution Xn →d X is equivalent to

    E[f(Xn)] → E[f(X)] for every bounded continuous functional f.
So it suffices to prove that E[f(g(Xn))] → E[f(g(X))] for every bounded continuous functional f. For simplicity we assume g continuous. Note that f ∘ g is itself a bounded continuous functional, and so the claim follows from the statement above. The general case is slightly more technical.
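The composition step can be checked numerically. In this sketch the law of X, the sequence Xn, the map g, and the bounded continuous functional f are all assumed for illustration:

```python
import numpy as np

# Assumed setup: X ~ N(0,1), Xn = X + Z/n (so Xn -> X in distribution),
# g(x) = x**2 continuous, f = arctan bounded and continuous.
# Then f o g is bounded and continuous, so E[f(g(Xn))] -> E[f(g(X))].
rng = np.random.default_rng(1)
m = 200_000
X = rng.standard_normal(m)
target = np.mean(np.arctan(X**2))    # Monte Carlo estimate of E[f(g(X))]

for n in (1, 10, 100):
    Xn = X + rng.standard_normal(m) / n
    gap = abs(np.mean(np.arctan(Xn**2)) - target)   # |E[f(g(Xn))] - E[f(g(X))]|
    print(n, gap)
```

The gap shrinks as n grows, as the portmanteau criterion requires.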
Fix an arbitrary ε > 0. Then for any δ > 0 consider the set Bδ defined as

    Bδ = { x ∈ S \ Dg : there exists y ∈ S with |x − y| < δ and |g(x) − g(y)| > ε }.

This is the set of continuity points x of the function g(·) for which it is possible to find, within the δ-neighborhood of x, a point which maps outside the ε-neighborhood of g(x). By definition of continuity, this set shrinks as δ goes to zero, so that limδ→0 Bδ = ∅.
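For a concrete (assumed) example, take S = ℝ, g(x) = x² and X ~ N(0,1). The largest oscillation of g on the δ-neighborhood of x is δ(2|x| + δ), so x ∈ Bδ exactly when δ(2|x| + δ) > ε, and Pr[X ∈ Bδ] can be estimated directly:

```python
import numpy as np

# Assumed example: g(x) = x**2 on the real line, X ~ N(0,1).
# The sup over |y - x| < delta of |g(y) - g(x)| equals delta * (2|x| + delta),
# so x lies in B_delta exactly when delta * (2|x| + delta) > eps.
rng = np.random.default_rng(2)
X = rng.standard_normal(200_000)
eps = 0.1

for delta in (0.1, 0.01, 0.001):
    in_B = delta * (2 * np.abs(X) + delta) > eps
    print(delta, in_B.mean())     # estimate of Pr[X in B_delta]
```

As δ → 0 the set Bδ here escapes to infinity, so the estimated probability drops to zero; this is exactly the behavior the second term of the probability bound relies on.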
Now suppose that |g(X) − g(Xn)| > ε. This implies that at least one of the following is true: either |X − Xn| ≥ δ, or X ∈ Dg, or X ∈ Bδ. In terms of probabilities this can be written as

    Pr[ |g(Xn) − g(X)| > ε ] ≤ Pr[ |Xn − X| ≥ δ ] + Pr[ X ∈ Bδ ] + Pr[ X ∈ Dg ].
On the right-hand side, the first term converges to zero as n → ∞ for any fixed δ, by the definition of convergence in probability of the sequence {Xn}. The second term converges to zero as δ → 0, since the set Bδ shrinks to an empty set. And the last term is identically equal to zero by assumption of the theorem. Therefore, the conclusion is that

    limn→∞ Pr[ |g(Xn) − g(X)| > ε ] = 0,

which means that g(Xn) converges to g(X) in probability.
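The whole decomposition can be sanity-checked empirically. Everything here (the law of X, the sequence Xn, and g(x) = x², for which Dg is empty) is an assumed illustration:

```python
import numpy as np

# Assumed setup: X ~ N(0,1), Xn = X + Z/n, g(x) = x**2 (so Dg is empty and the
# third term of the bound vanishes). Membership in B_delta uses the criterion
# delta * (2|x| + delta) > eps, which is exact for this particular g.
rng = np.random.default_rng(3)
m = 200_000
X = rng.standard_normal(m)
eps, delta, n = 0.1, 0.01, 50
Xn = X + rng.standard_normal(m) / n

lhs = np.mean(np.abs(Xn**2 - X**2) > eps)               # Pr[|g(Xn) - g(X)| > eps]
term1 = np.mean(np.abs(Xn - X) >= delta)                # Pr[|Xn - X| >= delta]
term2 = np.mean(delta * (2 * np.abs(X) + delta) > eps)  # Pr[X in B_delta]
print(lhs, "<=", term1 + term2)
```

Because the inclusion of events holds sample by sample, the estimated left-hand side never exceeds the sum of the two estimated terms.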
By definition of the continuity of the function g(·),

    Xn(ω) → X(ω)   ⇒   g(Xn(ω)) → g(X(ω))

at each point X(ω) where g(·) is continuous. Therefore,

    Pr[ g(Xn) → g(X) ] ≥ Pr[ g(Xn) → g(X), X ∉ Dg ] ≥ Pr[ Xn → X, X ∉ Dg ] = 1,
because the intersection of two almost sure events is almost sure.
By definition, we conclude that g(Xn) converges to g(X) almost surely.
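For the almost sure case, a pathwise sketch helps: with assumed per-path dynamics Xn(ω) = X(ω) + (−1)ⁿ/n (so Xn(ω) → X(ω) on every path) and the continuous map g = exp, every sample path of g(Xn) converges to g(X):

```python
import numpy as np

# Assumed pathwise dynamics: Xn(omega) = X(omega) + (-1)**n / n, which
# converges to X(omega) on every sample path, so for the continuous map
# g(x) = exp(x) every path of g(Xn) converges to g(X) as well.
rng = np.random.default_rng(4)
X = rng.standard_normal(1000)            # one realization X(omega) per path
n = np.arange(1, 5001)
Xn = X[:, None] + (-1.0) ** n / n        # rows: paths, columns: index n
gap = np.abs(np.exp(Xn) - np.exp(X)[:, None])
print(gap[:, -1].max())                  # worst pathwise gap at n = 5000
```

The worst gap over all 1000 paths is already tiny at n = 5000, illustrating convergence on every path rather than merely in probability.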