Movatterモバイル変換

Jump to content

Total variation

From Wikipedia, the free encyclopedia

Measure of local oscillation behavior

Not to be confused withTotal variation distance of probability measures.

This articlerelies excessively onreferences toprimary sources. Please improve this article by addingsecondary or tertiary sources.
Find sources: "Total variation" – news ·newspapers ·books ·scholar ·JSTOR(February 2012) (Learn how and when to remove this message)

Inmathematics, thetotal variation identifies several slightly different concepts, related to the (local or global) structure of thecodomain of afunction or ameasure. For areal-valued continuous functionf, defined on aninterval [a,b] ⊂R, its total variation on the interval of definition is a measure of the one-dimensionalarclength of the curve with parametric equationx ↦f(x), forx ∈ [a,b]. Functions whose total variation is finite are calledfunctions of bounded variation.

Historical note

The concept of total variation for functions of one real variable was first introduced byCamille Jordan in the paper (Jordan 1881).^[1] He used the new concept in order to prove a convergence theorem forFourier series ofdiscontinuous periodic functions whose variation isbounded. The extension of the concept to functions of more than one variable however is not simple for various reasons.

Definitions

Total variation for functions of one real variable

Definition 1.1. Thetotal variation of areal-valued (or more generallycomplex-valued)function $f {\displaystyle f}$ , defined on aninterval $[a,b]\subset \mathbb {R}$ is the quantity

V_{a}^{b}(f)=\sup _{\mathcal {P}}\sum _{i=0}^{n_{P}-1}|f(x_{i+1})-f(x_{i})|,

where thesupremum runs over theset of allpartitions ${\mathcal {P}}=\left\{P=\{x_{0},\dots ,x_{n_{P}}\}\mid P{\text{ is a partition of }}[a,b]\right\}$ of the giveninterval. Which means that $a=x_{0}<x_{1}<...<x_{n_{P}}=b$ .

Total variation for functions ofn > 1 real variables

Definition 1.2.^[2] LetΩ be anopen subset ofRⁿ. Given a functionf belonging toL¹(Ω), thetotal variation off inΩ is defined as

V(f,\Omega ):=\sup \left\{\int _{\Omega }f(x)\operatorname {div} \phi (x)\,\mathrm {d} x\colon \phi \in C_{c}^{1}(\Omega ,\mathbb {R} ^{n}),\ \Vert \phi \Vert _{L^{\infty }(\Omega )}\leq 1\right\},

where

$C_{c}^{1}(\Omega ,\mathbb {R} ^{n})$ is theset ofcontinuously differentiable vector functions ofcompact support contained in $\Omega$ ,
$\Vert \;\Vert _{L^{\infty }(\Omega )}$ is theessential supremum norm, and
$\operatorname {div}$ is thedivergence operator.

This definitiondoes not require that thedomain $\Omega \subseteq \mathbb {R} ^{n}$ of the given function be abounded set.

Total variation in measure theory

Classical total variation definition

FollowingSaks (1937, p. 10), consider asigned measure $\mu$ on ameasurable space $(X,\Sigma )$ : then it is possible to define twoset functions ${\overline {\mathrm {W} }}(\mu ,\cdot )$ and ${\underline {\mathrm {W} }}(\mu ,\cdot )$ , respectively calledupper variation andlower variation, as follows

{\overline {\mathrm {W} }}(\mu ,E)=\sup \left\{\mu (A)\mid A\in \Sigma {\text{ and }}A\subset E\right\}\qquad \forall E\in \Sigma

{\underline {\mathrm {W} }}(\mu ,E)=\inf \left\{\mu (A)\mid A\in \Sigma {\text{ and }}A\subset E\right\}\qquad \forall E\in \Sigma

clearly

{\overline {\mathrm {W} }}(\mu ,E)\geq 0\geq {\underline {\mathrm {W} }}(\mu ,E)\qquad \forall E\in \Sigma

Definition 1.3. Thevariation (also calledabsolute variation) of the signed measure $\mu$ is the set function

|\mu |(E)={\overline {\mathrm {W} }}(\mu ,E)+\left|{\underline {\mathrm {W} }}(\mu ,E)\right|\qquad \forall E\in \Sigma

and itstotal variation is defined as the value of this measure on the whole space of definition, i.e.

\|\mu \|=|\mu |(X)

Modern definition of total variation norm

Saks (1937, p. 11) uses upper and lower variations to prove theHahn–Jordan decomposition: according to his version of this theorem, the upper and lower variation are respectively anon-negative and anon-positive measure. Using a more modern notation, define

\mu ^{+}(\cdot )={\overline {\mathrm {W} }}(\mu ,\cdot )\,,

\mu ^{-}(\cdot )=-{\underline {\mathrm {W} }}(\mu ,\cdot )\,,

Then $\mu ^{+}$ and $\mu ^{-}$ are two non-negativemeasures such that

\mu =\mu ^{+}-\mu ^{-}

|\mu |=\mu ^{+}+\mu ^{-}

The last measure is sometimes called, byabuse of notation,total variation measure.

Total variation norm of complex measures

If the measure $\mu$ iscomplex-valued i.e. is acomplex measure, its upper and lower variation cannot be defined and the Hahn–Jordan decomposition theorem can only be applied to its real and imaginary parts. However, it is possible to followRudin (1966, pp. 137–139) and define the total variation of the complex-valued measure $\mu$ as follows

Definition 1.4. Thevariation of the complex-valued measure $\mu$ is theset function

|\mu |(E)=\sup _{\pi }\sum _{A\in \pi }|\mu (A)|\qquad \forall E\in \Sigma

where thesupremum is taken over all partitions $\pi$ of ameasurable set $E {\displaystyle E}$ into a countable number of disjoint measurable subsets.

This definition coincides with the above definition $|\mu |=\mu ^{+}+\mu ^{-}$ for the case of real-valued signed measures.

Total variation norm of vector-valued measures

The variation so defined is apositive measure (seeRudin (1966, p. 139)) and coincides with the one defined by1.3 when $\mu$ is asigned measure: its total variation is defined as above. This definition works also if $\mu$ is avector measure: the variation is then defined by the following formula

|\mu |(E)=\sup _{\pi }\sum _{A\in \pi }\|\mu (A)\|\qquad \forall E\in \Sigma

where the supremum is as above. This definition is slightly more general than the one given byRudin (1966, p. 138) since it requires only to considerfinite partitions of the space $X {\displaystyle X}$ : this implies that it can be used also to define the total variation onfinite-additive measures.

Total variation of probability measures

This sectiondoes notcite anysources. Please helpimprove this section byadding citations to reliable sources. Unsourced material may be challenged andremoved.(May 2012) (Learn how and when to remove this message)

Main article:Total variation distance of probability measures

The total variation of anyprobability measure is exactly one, therefore it is not interesting as a means of investigating the properties of such measures. However, when μ and ν areprobability measures, thetotal variation distance of probability measures can be defined as $\|\mu -\nu \|$ where the norm is the total variation norm of signed measures. Using the property that $(\mu -\nu )(X)=0$ , we eventually arrive at the equivalent definition

\|\mu -\nu \|=|\mu -\nu |(X)=2\sup \left\{\,\left|\mu (A)-\nu (A)\right|:A\in \Sigma \,\right\}

^[3]

and its values are non-trivial. The factor $2 {\displaystyle 2}$ above is usually dropped (as is the convention in the articletotal variation distance of probability measures). Informally, this is the largest possible difference between the probabilities that the twoprobability distributions can assign to the same event. For acategorical distribution it is possible to write the total variation distance as follows

\delta (\mu ,\nu )=\sum _{x}\left|\mu (x)-\nu (x)\right|\;.

^[4]

It may also be normalized to values in $[0,1]$ by halving the previous definition as follows

\delta (\mu ,\nu )={\frac {1}{2}}\sum _{x}\left|\mu (x)-\nu (x)\right|

^[5]

Basic properties

Total variation of differentiable functions

The total variation of a $C^{1}({\overline {\Omega }})$ function $f {\displaystyle f}$ can be expressed as anintegral involving the given function instead of as thesupremum of thefunctionals of definitions1.1 and1.2.

The form of the total variation of a differentiable function of one variable

Theorem 1. Thetotal variation of adifferentiable function $f {\displaystyle f}$ , defined on aninterval $[a,b]\subset \mathbb {R}$ , has the following expression if $f^{'} {\displaystyle f'}$ is Riemann integrable

V_{a}^{b}(f)=\int _{a}^{b}|f'(x)|\mathrm {d} x

If $f {\displaystyle f}$ is differentiable andmonotonic, then the above simplifies to

V_{a}^{b}(f)=|f(a)-f(b)|

For any differentiable function $f {\displaystyle f}$ , we can decompose the domain interval $[a,b]$ , into subintervals $[a,a_{1}],[a_{1},a_{2}],\dots ,[a_{N},b]$ (with $a<a_{1}<a_{2}<\cdots <a_{N}<b$ ) in which $f {\displaystyle f}$ is locally monotonic, then the total variation of $f {\displaystyle f}$ over $[a,b]$ can be written as the sum of local variations on those subintervals:

{\begin{aligned}V_{a}^{b}(f)&=V_{a}^{a_{1}}(f)+V_{a_{1}}^{a_{2}}(f)+\,\cdots \,+V_{a_{N}}^{b}(f)\\[0.3em]&=|f(a)-f(a_{1})|+|f(a_{1})-f(a_{2})|+\,\cdots \,+|f(a_{N})-f(b)|\end{aligned}}

The form of the total variation of a differentiable function of several variables

Theorem 2. Given a $C^{1}({\overline {\Omega }})$ function $f {\displaystyle f}$ defined on abounded open set $\Omega \subseteq \mathbb {R} ^{n}$ , with $\partial \Omega$ of class $C^{1}$ , thetotal variation of $f {\displaystyle f}$ has the following expression

V(f,\Omega )=\int _{\Omega }\left|\nabla f(x)\right|\mathrm {d} x

.

Proof

The first step in the proof is to first prove an equality which follows from theGauss–Ostrogradsky theorem.

Lemma

Under the conditions of the theorem, the following equality holds:

\int _{\Omega }f\operatorname {div} \varphi =-\int _{\Omega }\nabla f\cdot \varphi

Proof of the lemma

From theGauss–Ostrogradsky theorem:

\int _{\Omega }\operatorname {div} \mathbf {R} =\int _{\partial \Omega }\mathbf {R} \cdot \mathbf {n}

by substituting $\mathbf {R} :=f\mathbf {\varphi }$ , we have:

\int _{\Omega }\operatorname {div} \left(f\mathbf {\varphi } \right)=\int _{\partial \Omega }\left(f\mathbf {\varphi } \right)\cdot \mathbf {n}

where $\mathbf {\varphi }$ is zero on the border of $\Omega$ by definition:

\int _{\Omega }\operatorname {div} \left(f\mathbf {\varphi } \right)=0

\int _{\Omega }\partial _{x_{i}}\left(f\mathbf {\varphi } _{i}\right)=0

\int _{\Omega }\mathbf {\varphi } _{i}\partial _{x_{i}}f+f\partial _{x_{i}}\mathbf {\varphi } _{i}=0

\int _{\Omega }f\partial _{x_{i}}\mathbf {\varphi } _{i}=-\int _{\Omega }\mathbf {\varphi } _{i}\partial _{x_{i}}f

\int _{\Omega }f\operatorname {div} \mathbf {\varphi } =-\int _{\Omega }\mathbf {\varphi } \cdot \nabla f

Proof of the equality

Under the conditions of the theorem, from the lemma we have:

\int _{\Omega }f\operatorname {div} \mathbf {\varphi } =-\int _{\Omega }\mathbf {\varphi } \cdot \nabla f\leq \left|\int _{\Omega }\mathbf {\varphi } \cdot \nabla f\right|\leq \int _{\Omega }\left|\mathbf {\varphi } \right|\cdot \left|\nabla f\right|\leq \int _{\Omega }\left|\nabla f\right|

in the last part $\mathbf {\varphi }$ could be omitted, because by definition its essential supremum is at most one.

On the other hand, we consider $\theta _{N}:=-\mathbb {I} _{\left[-N,N\right]}\mathbb {I} _{\{\nabla f\neq 0\}}{\frac {\nabla f}{\left|\nabla f\right|}}$ and $\theta _{N}^{*}$ which is the up to $\varepsilon$ approximation of $\theta _{N}$ in $C_{c}^{1}$ with the same integral. We can do this since $C_{c}^{1}$ is dense in $L^{1}$ . Now again substituting into the lemma:

{\begin{aligned}&\lim _{N\to \infty }\int _{\Omega }f\operatorname {div} \theta _{N}^{*}\\[4pt]&=\lim _{N\to \infty }\int _{\{\nabla f\neq 0\}}\mathbb {I} _{\left[-N,N\right]}\nabla f\cdot {\frac {\nabla f}{\left|\nabla f\right|}}\\[4pt]&=\lim _{N\to \infty }\int _{\left[-N,N\right]\cap {\{\nabla f\neq 0\}}}\nabla f\cdot {\frac {\nabla f}{\left|\nabla f\right|}}\\[4pt]&=\int _{\Omega }\left|\nabla f\right|\end{aligned}}

This means we have a convergent sequence of ${\textstyle \int _{\Omega }f\operatorname {div} \mathbf {\varphi } }$ that tends to ${\textstyle \int _{\Omega }\left|\nabla f\right|}$ as well as we know that ${\textstyle \int _{\Omega }f\operatorname {div} \mathbf {\varphi } \leq \int _{\Omega }\left|\nabla f\right|}$ .Q.E.D.

It can be seen from the proof that the supremum is attained when

\varphi \to {\frac {-\nabla f}{\left|\nabla f\right|}}.

Thefunction $f {\displaystyle f}$ is said to be ofbounded variation precisely if its total variation is finite.

Total variation of a measure

The total variation is anorm defined on the space of measures of bounded variation. The space of measures on a σ-algebra of sets is aBanach space, called theca space, relative to this norm. It is contained in the larger Banach space, called theba space, consisting offinitely additive (as opposed to countably additive) measures, also with the same norm. Thedistance function associated to the norm gives rise to the total variation distance between two measuresμ andν.

For finite measures onR, the link between the total variation of a measureμ and the total variation of a function, as described above, goes as follows. Givenμ, define a function $\varphi \colon \mathbb {R} \to \mathbb {R}$ by

\varphi (t)=\mu ((-\infty ,t])~.

Then, the total variation of the signed measureμ is equal to the total variation, in the above sense, of the function $\varphi$ . In general, the total variation of a signed measure can be defined usingJordan's decomposition theorem by

\|\mu \|_{TV}=\mu _{+}(X)+\mu _{-}(X)~,

for any signed measureμ on a measurable space $(X,\Sigma )$ .

Applications

Total variation can be seen as anon-negative real-valuedfunctional defined on the space ofreal-valued functions (for the case of functions of one variable) or on the space ofintegrable functions (for the case of functions of several variables). As a functional, total variation finds applications in several branches of mathematics and engineering, likeoptimal control,numerical analysis, andcalculus of variations, where the solution to a certain problem has tominimize its value. As an example, use of the total variation functional is common in the following two kind of problems

Numerical analysis of differential equations: it is the science of finding approximate solutions todifferential equations. Applications of total variation to these problems are detailed in the article "total variation diminishing"
Image denoising:^[6] inimage processing, denoising is a collection of methods used to reduce thenoise in animage reconstructed from data obtained by electronic means, for exampledata transmission orsensing. "Total variation denoising" is the name for the application of total variation to image noise reduction; further details can be found in the papers of (Rudin, Osher & Fatemi 1992) and (Caselles, Chambolle & Novaga 2007). A sensible extension of this model to colour images, called Colour TV, can be found in (Blomgren & Chan 1998).

See also

Notes

This article includes a list ofgeneral references, butit lacks sufficient correspondinginline citations. Please help toimprove this article byintroducing more precise citations.(February 2012) (Learn how and when to remove this message)

^According toGolubov & Vitushkin (2001).
^Ambrosio, Luigi; Fusco, Nicola; Pallara, Diego (2000).Functions of Bounded Variation and Free Discontinuity Problems. Oxford University Press. p. 119.doi:10.1093/oso/9780198502456.001.0001.ISBN 9780198502456.
^Billingsley, Patrick (1995).Probability and Measure. John Wiley & Sons. pp. 242–243.
^Le Cam, Lucien; Yang, Grace Lo (2000).Asymptotics in Statistics: Some Basic Concepts. Springer. pp. 16–18.
^Gibbs, Alison; Francis Edward Su (2002)."On Choosing and Bounding Probability Metrics"(PDF). p. 7. Retrieved8 April 2017.
^https://arxiv.org/pdf/1603.09599 Retrieved 12/15/2024

Historical references

Arzelà, Cesare (7 May 1905),"Sulle funzioni di due variabili a variazione limitata (On functions of two variables of bounded variation)",Rendiconto delle Sessioni della Reale Accademia delle Scienze dell'Istituto di Bologna, Nuova serie (in Italian),IX (4):100–107,JFM 36.0491.02, archived fromthe original on 2007-08-07.
Golubov, Boris I. (2001) [1994],"Arzelà variation",Encyclopedia of Mathematics,EMS Press.
Golubov, Boris I. (2001) [1994],"Fréchet variation",Encyclopedia of Mathematics,EMS Press.
Golubov, Boris I. (2001) [1994],"Hardy variation",Encyclopedia of Mathematics,EMS Press.
Golubov, Boris I. (2001) [1994],"Pierpont variation",Encyclopedia of Mathematics,EMS Press.
Golubov, Boris I. (2001) [1994],"Vitali variation",Encyclopedia of Mathematics,EMS Press.
Golubov, Boris I. (2001) [1994],"Tonelli plane variation",Encyclopedia of Mathematics,EMS Press.
Golubov, Boris I.;Vitushkin, Anatoli G. (2001) [1994],"Variation of a function",Encyclopedia of Mathematics,EMS Press
Jordan, Camille (1881),"Sur la série de Fourier",Comptes rendus hebdomadaires des séances de l'Académie des sciences (in French),92:228–230,JFM 13.0184.01 (available atGallica). This is, according to Boris Golubov, the first paper on functions of bounded variation.
Hahn, Hans (1921),Theorie der reellen Funktionen (in German), Berlin: Springer Verlag, pp. VII+600,JFM 48.0261.09.
Vitali, Giuseppe (1908) [17 dicembre 1907],"Sui gruppi di punti e sulle funzioni di variabili reali (On groups of points and functions of real variables)",Atti dell'Accademia delle Scienze di Torino (in Italian),43:75–92,JFM 39.0101.05, archived fromthe original on 2009-03-31. The paper containing the first proof ofVitali covering theorem.

References

Adams, C. Raymond; Clarkson, James A. (1933), "On definitions of bounded variation for functions of two variables",Transactions of the American Mathematical Society,35 (4):824–854,doi:10.1090/S0002-9947-1933-1501718-2,JFM 59.0285.01,MR 1501718,Zbl 0008.00602.
Cesari, Lamberto (1936),"Sulle funzioni a variazione limitata (On the functions of bounded variation)",Annali della Scuola Normale Superiore, II (in Italian),5 (3–4):299–313,JFM 62.0247.03,MR 1556778,Zbl 0014.29605. Available atNumdam.

Leoni, Giovanni (2017),A First Course in Sobolev Spaces: Second Edition, Graduate Studies in Mathematics, American Mathematical Society, pp. xxii+734,ISBN 978-1-4704-2921-8.
Saks, Stanisław (1937).Theory of the Integral. Monografie Matematyczne. Vol. 7 (2nd ed.). Warszawa–Lwów: G.E. Stechert & Co. pp. VI+347.JFM 63.0183.05.Zbl 0017.30004.. (available at thePolish Virtual Library of Science). English translation from the original French byLaurence Chisholm Young, with two additional notes byStefan Banach.
Rudin, Walter (1966),Real and Complex Analysis, McGraw-Hill Series in Higher Mathematics (1st ed.), New York: McGraw-Hill, pp. xi+412,MR 0210528,Zbl 0142.01701.

External links

One variable

"Total variation atPlanetMath.

One and more variables

Function of bounded variation atEncyclopedia of Mathematics

Measure theory

Rowland, Todd."Total Variation".MathWorld..
Jordan decomposition atPlanetMath..
Jordan decomposition atEncyclopedia of Mathematics

Applications

Caselles, Vicent; Chambolle, Antonin; Novaga, Matteo (2007),The discontinuity set of solutions of the TV denoising problem and some extensions,SIAM, Multiscale Modeling and Simulation, vol. 6 n. 3, archived fromthe original on 2011-09-27 (a work dealing with total variation application in denoising problems forimage processing).

Rudin, Leonid I.; Osher, Stanley; Fatemi, Emad (1992), "Nonlinear total variation based noise removal algorithms",Physica D: Nonlinear Phenomena,60 (1–4), Physica D: Nonlinear Phenomena 60.1: 259-268:259–268,Bibcode:1992PhyD...60..259R,doi:10.1016/0167-2789(92)90242-F.

Blomgren, Peter; Chan, Tony F. (1998), "Color TV: total variation methods for restoration of vector-valued images",IEEE Transactions on Image Processing,7 (3), Image Processing, IEEE Transactions on, vol. 7, no. 3: 304-309:304–309,Bibcode:1998ITIP....7..304B,doi:10.1109/83.661180,PMID 18276250.

Tony F. Chan and Jackie (Jianhong) Shen (2005),Image Processing and Analysis - Variational, PDE, Wavelet, and Stochastic Methods,SIAM,ISBN 0-89871-589-X (with in-depth coverage and extensive applications of Total Variations in modern image processing, as started by Rudin, Osher, and Fatemi).

Retrieved from "https://en.wikipedia.org/w/index.php?title=Total_variation&oldid=1324148072"

Mathematical analysis

Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp