Inmathematics, thetotal variation identifies several slightly different concepts, related to the (local or global) structure of thecodomain of afunction or ameasure. For areal-valuedcontinuous functionf, defined on aninterval [a,b] ⊂R, its total variation on the interval of definition is a measure of the one-dimensionalarclength of the curve with parametric equationx ↦f(x), forx ∈ [a,b]. Functions whose total variation is finite are calledfunctions of bounded variation.
The concept of total variation for functions of one real variable was first introduced byCamille Jordan in the paper (Jordan 1881).[1] He used the new concept in order to prove a convergence theorem forFourier series ofdiscontinuousperiodic functions whose variation isbounded. The extension of the concept to functions of more than one variable however is not simple for various reasons.
If the measure iscomplex-valued i.e. is acomplex measure, its upper and lower variation cannot be defined and the Hahn–Jordan decomposition theorem can only be applied to its real and imaginary parts. However, it is possible to followRudin (1966, pp. 137–139) and define the total variation of the complex-valued measure as follows
Definition 1.4. Thevariation of the complex-valued measure is theset function
where thesupremum is taken over all partitions of ameasurable set into a countable number of disjoint measurable subsets.
This definition coincides with the above definition for the case of real-valued signed measures.
The variation so defined is apositive measure (seeRudin (1966, p. 139)) and coincides with the one defined by1.3 when is asigned measure: its total variation is defined as above. This definition works also if is avector measure: the variation is then defined by the following formula
where the supremum is as above. This definition is slightly more general than the one given byRudin (1966, p. 138) since it requires only to considerfinite partitions of the space: this implies that it can be used also to define the total variation onfinite-additive measures.
The total variation of anyprobability measure is exactly one, therefore it is not interesting as a means of investigating the properties of such measures. However, when μ and ν areprobability measures, thetotal variation distance of probability measures can be defined as where the norm is the total variation norm of signed measures. Using the property that, we eventually arrive at the equivalent definition
and its values are non-trivial. The factor above is usually dropped (as is the convention in the articletotal variation distance of probability measures). Informally, this is the largest possible difference between the probabilities that the twoprobability distributions can assign to the same event. For acategorical distribution it is possible to write the total variation distance as follows
The total variation of a function can be expressed as anintegral involving the given function instead of as thesupremum of thefunctionals of definitions1.1 and1.2.
The form of the total variation of a differentiable function of one variable
Theorem 1. Thetotal variation of adifferentiable function, defined on aninterval, has the following expression if is Riemann integrable
If is differentiable andmonotonic, then the above simplifies to
For any differentiable function, we can decompose the domain interval, into subintervals (with) in which is locally monotonic, then the total variation of over can be written as the sum of local variations on those subintervals:
The form of the total variation of a differentiable function of several variables
Under the conditions of the theorem, from the lemma we have:
in the last part could be omitted, because by definition its essential supremum is at most one.
On the other hand, we consider and which is the up to approximation of in with the same integral. We can do this since is dense in. Now again substituting into the lemma:
This means we have a convergent sequence of that tends to as well as we know that.Q.E.D.
It can be seen from the proof that the supremum is attained when
The total variation is anorm defined on the space of measures of bounded variation. The space of measures on a σ-algebra of sets is aBanach space, called theca space, relative to this norm. It is contained in the larger Banach space, called theba space, consisting offinitely additive (as opposed to countably additive) measures, also with the same norm. Thedistance function associated to the norm gives rise to the total variation distance between two measuresμ andν.
For finite measures onR, the link between the total variation of a measureμ and the total variation of a function, as described above, goes as follows. Givenμ, define a function by
Then, the total variation of the signed measureμ is equal to the total variation, in the above sense, of the function. In general, the total variation of a signed measure can be defined usingJordan's decomposition theorem by
Total variation can be seen as anon-negativereal-valuedfunctional defined on the space ofreal-valuedfunctions (for the case of functions of one variable) or on the space ofintegrable functions (for the case of functions of several variables). As a functional, total variation finds applications in several branches of mathematics and engineering, likeoptimal control,numerical analysis, andcalculus of variations, where the solution to a certain problem has tominimize its value. As an example, use of the total variation functional is common in the following two kind of problems
Numerical analysis of differential equations: it is the science of finding approximate solutions todifferential equations. Applications of total variation to these problems are detailed in the article "total variation diminishing"
Jordan, Camille (1881),"Sur la série de Fourier",Comptes rendus hebdomadaires des séances de l'Académie des sciences (in French),92:228–230,JFM13.0184.01 (available atGallica). This is, according to Boris Golubov, the first paper on functions of bounded variation.
Leoni, Giovanni (2017),A First Course in Sobolev Spaces: Second Edition, Graduate Studies in Mathematics, American Mathematical Society, pp. xxii+734,ISBN978-1-4704-2921-8.
Rudin, Walter (1966),Real and Complex Analysis, McGraw-Hill Series in Higher Mathematics (1st ed.), New York: McGraw-Hill, pp. xi+412,MR0210528,Zbl0142.01701.
Rudin, Leonid I.; Osher, Stanley; Fatemi, Emad (1992), "Nonlinear total variation based noise removal algorithms",Physica D: Nonlinear Phenomena,60 (1–4), Physica D: Nonlinear Phenomena 60.1: 259-268:259–268,Bibcode:1992PhyD...60..259R,doi:10.1016/0167-2789(92)90242-F.
Blomgren, Peter; Chan, Tony F. (1998), "Color TV: total variation methods for restoration of vector-valued images",IEEE Transactions on Image Processing,7 (3), Image Processing, IEEE Transactions on, vol. 7, no. 3: 304-309:304–309,Bibcode:1998ITIP....7..304B,doi:10.1109/83.661180,PMID18276250.