Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Range (statistics)

From Wikipedia, the free encyclopedia
Concept in statistics
Not to be confused withMid-range.
For other uses, seeRange (disambiguation) § Mathematics.

Indescriptive statistics, therange of a set of data is size of the narrowestinterval which contains all the data.It is calculated as the difference between the largest and smallest values (also known as thesample maximum and minimum).[1] It is expressed in the sameunits as the data.

The range provides an indication ofstatistical dispersion. Closely related alternative measures are theInterdecile range and theInterquartile range.

Range of continuous IID random variables

[edit]

Fornindependent and identically distributed continuous random variablesX1,X2, ...,Xn with thecumulative distribution function G(x) and aprobability density function g(x), let T denote the range of them, that is, T= max(X1,X2, ...,Xn)-min(X1,X2, ...,Xn).

Distribution

[edit]

The range, T, has the cumulative distribution function[2][3]

F(t)=ng(x)[G(x+t)G(x)]n1dx.{\displaystyle F(t)=n\int _{-\infty }^{\infty }g(x)[G(x+t)-G(x)]^{n-1}\,{\text{d}}x.}

Gumbel notes that the "beauty of this formula is completely marred by the facts that, in general, we cannot expressG(x + t) byG(x), and that the numerical integration is lengthy and tiresome."[2]: 385 

If the distribution of eachXi is limited to the right (or left) then the asymptotic distribution of the range is equal to the asymptotic distribution of the largest (smallest) value. For more general distributions the asymptotic distribution can be expressed as aBessel function.[2]

Moments

[edit]

The mean range is given by[4]

n01x(G)[Gn1(1G)n1]dG{\displaystyle n\int _{0}^{1}x(G)[G^{n-1}-(1-G)^{n-1}]\,{\text{d}}G}

wherex(G) is the inverse function. In the case where each of theXi has astandard normal distribution, the mean range is given by[5]

(1(1Φ(x))nΦ(x)n)dx.{\displaystyle \int _{-\infty }^{\infty }(1-(1-\Phi (x))^{n}-\Phi (x)^{n})\,{\text{d}}x.}

Derivation of the distribution

[edit]

Please note that the following is an informal derivation of the result. It is a bit loose with the calculation of the probabilities.

Letm,M{\displaystyle m,M} denote respectively the min and max of the random variablesX1Xn{\displaystyle X_{1}\dots X_{n}}.

The event that the range is smaller thanT{\displaystyle T} can be decomposed into smaller events according to:

For a given indexi{\displaystyle i} and minimum valuex{\displaystyle x}, the probability of the joint event:

  1. Xi{\displaystyle X_{i}} is the minimum,
  2. andXi=x{\displaystyle X_{i}=x},
  3. and the range is smaller thanT{\displaystyle T},

is:g(x)[G(x+T)G(x)]n1{\displaystyle g(x)\left[G(x+T)-G(x)\right]^{n-1}}Summing over the indices and integrating overx{\displaystyle x} yields the total probability of the event: "the range is smaller thanT{\displaystyle T}" which is exactly the cumulative density function of the range:F(t)=ng(x)[G(t+x)G(x)]n1dx{\displaystyle F(t)=n\int _{-\infty }^{\infty }g(x)\left[G(t+x)-G(x)\right]^{n-1}\,{\text{d}}x}which concludes the proof.

The range in other models

[edit]

Outside of the IID case with continuous random variables, other cases have explicit formulas. These cases are of marginal interest.


Related quantities

[edit]

The range is a specific example oforder statistics. In particular, the range is a linear function of order statistics, which brings it into the scope ofL-estimation.

See also

[edit]

References

[edit]
  1. ^George Woodbury (2001).An Introduction to Statistics. Cengage Learning. p. 74.ISBN 0534377556.
  2. ^abcE. J. Gumbel (1947)."The Distribution of the Range".The Annals of Mathematical Statistics.18 (3):384–412.doi:10.1214/aoms/1177730387.JSTOR 2235736.
  3. ^abTsimashenka, I.; Knottenbelt, W.;Harrison, P. (2012). "Controlling Variability in Split-Merge Systems".Analytical and Stochastic Modeling Techniques and Applications(PDF). Lecture Notes in Computer Science. Vol. 7314. p. 165.doi:10.1007/978-3-642-30782-9_12.ISBN 978-3-642-30781-2.
  4. ^H. O. Hartley;H. A. David (1954)."Universal Bounds for Mean Range and Extreme Observation".The Annals of Mathematical Statistics.25 (1):85–99.doi:10.1214/aoms/1177728848.JSTOR 2236514.
  5. ^L. H. C. Tippett (1925). "On the Extreme Individuals and the Range of Samples Taken from a Normal Population".Biometrika.17 (3/4):364–387.doi:10.1093/biomet/17.3-4.364.JSTOR 2332087.
  6. ^Evans, D. L.; Leemis, L. M.; Drew, J. H. (2006). "The Distribution of Order Statistics for Discrete Random Variables with Applications to Bootstrapping".INFORMS Journal on Computing.18:19–30.doi:10.1287/ijoc.1040.0105.
  7. ^Irving W. Burr (1955)."Calculation of Exact Sampling Distribution of Ranges from a Discrete Population".The Annals of Mathematical Statistics.26 (3):530–532.doi:10.1214/aoms/1177728500.JSTOR 2236482.
Continuous data
Center
Dispersion
Shape
Count data
Summary tables
Dependence
Graphics
Study design
Survey methodology
Controlled experiments
Adaptive designs
Observational studies
Statistical theory
Frequentist inference
Point estimation
Interval estimation
Testing hypotheses
Parametric tests
Specific tests
Goodness of fit
Rank statistics
Bayesian inference
Correlation
Regression analysis (see alsoTemplate:Least squares and regression analysis
Linear regression
Non-standard predictors
Generalized linear model
Partition of variance
Categorical
Multivariate
Time-series
General
Specific tests
Time domain
Frequency domain
Survival
Survival function
Hazard function
Test
Biostatistics
Engineering statistics
Social statistics
Spatial statistics
Retrieved from "https://en.wikipedia.org/w/index.php?title=Range_(statistics)&oldid=1322629133"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp