Indescriptive statistics, therange of a set of data is size of the narrowestinterval which contains all the data.It is calculated as the difference between the largest and smallest values (also known as thesample maximum and minimum).[1] It is expressed in the sameunits as the data.
The range provides an indication ofstatistical dispersion. Closely related alternative measures are theInterdecile range and theInterquartile range.
Fornindependent and identically distributed continuous random variablesX1,X2, ...,Xn with thecumulative distribution function G(x) and aprobability density function g(x), let T denote the range of them, that is, T= max(X1,X2, ...,Xn)-min(X1,X2, ...,Xn).
The range, T, has the cumulative distribution function[2][3]
Gumbel notes that the "beauty of this formula is completely marred by the facts that, in general, we cannot expressG(x + t) byG(x), and that the numerical integration is lengthy and tiresome."[2]: 385
If the distribution of eachXi is limited to the right (or left) then the asymptotic distribution of the range is equal to the asymptotic distribution of the largest (smallest) value. For more general distributions the asymptotic distribution can be expressed as aBessel function.[2]
The mean range is given by[4]
wherex(G) is the inverse function. In the case where each of theXi has astandard normal distribution, the mean range is given by[5]
Please note that the following is an informal derivation of the result. It is a bit loose with the calculation of the probabilities.
Let denote respectively the min and max of the random variables.
The event that the range is smaller than can be decomposed into smaller events according to:
For a given index and minimum value, the probability of the joint event:
is:Summing over the indices and integrating over yields the total probability of the event: "the range is smaller than" which is exactly the cumulative density function of the range:which concludes the proof.
Outside of the IID case with continuous random variables, other cases have explicit formulas. These cases are of marginal interest.
The range is a specific example oforder statistics. In particular, the range is a linear function of order statistics, which brings it into the scope ofL-estimation.