Movatterモバイル変換

[0]ホーム

Jump to content

Variance

Edit links

From Wikipedia, the free encyclopedia

Statistical measure of how far values spread from their average

This article is about the mathematical concept. For other uses, seeVariance (disambiguation).

Example of samples from two populations with the same mean but different variances. The red population has mean 100 and variance 100 (SD=10) while the blue population has mean 100 and variance 2500 (SD=50) where SD stands for Standard Deviation.

Inprobability theory andstatistics,variance is theexpected value of thesquared deviation from the mean of arandom variable. Thestandard deviation (SD) is obtained as the square root of the variance. Variance is a measure ofdispersion, meaning it is a measure of how far a set of numbers are spread out from their average value. It is the secondcentral moment of adistribution, and thecovariance of the random variable with itself, and it is often represented by $\sigma ^{2}$ , $s^{2}$ , $\operatorname {Var} (X)$ , $V(X)$ , or $\mathbb {V} (X)$ .^[1]

An advantage of variance as a measure of dispersion is that it is more amenable to algebraic manipulation than other measures of dispersion such as theexpected absolute deviation; for example, the variance of a sum of uncorrelated random variables is equal to the sum of their variances. A disadvantage of the variance for practical applications is that, unlike the standard deviation, its units differ from the random variable, which is why the standard deviation is more commonly reported as a measure of dispersion once the calculation is finished. Another disadvantage is that the variance is not finite for many distributions.

There are two distinct concepts that are both called "variance". One, as discussed above, is part of a theoreticalprobability distribution and is defined by an equation. The other variance is a characteristic of a set of observations. When variance is calculated from observations, those observations are typically measured from a real-world system. If all possible observations of the system are present, then the calculated variance is called the population variance. Normally, however, only a subset is available, and the variance calculated from this is called the sample variance. The variance calculated from a sample is considered an estimate of the full population variance. There are multiple ways to estimate the population variance on the basis of the sample variance, as discussed in the section below.

The two kinds of variance are closely related. To see how, consider that a theoretical probability distribution can be used as a generator of hypothetical observations. If an infinite number of observations are generated using a distribution, then the sample variance calculated from that infinite set will match the value calculated using the distribution's equation for variance. Variance has a central role in statistics, where some ideas that use it includedescriptive statistics,statistical inference,hypothesis testing,goodness of fit, andMonte Carlo sampling.

Name of the probability distribution	Probability distribution function	Mean	Variance
Binomial distribution	$\Pr \,(X=k)={\binom {n}{k}}p^{k}(1-p)^{n-k}$	$n p {\displaystyle np}$	$np(1-p)$
Geometric distribution	$\Pr \,(X=k)=(1-p)^{k-1}p$	${\frac {1}{p}}$	${\frac {(1-p)}{p^{2}}}$
Normal distribution	$f\left(x\mid \mu ,\sigma ^{2}\right)={\frac {1}{\sqrt {2\pi \sigma ^{2}}}}e^{-{\frac {1}{2}}{\left({\frac {x-\mu }{\sigma }}\right)}^{2}}$	$\mu$	$\sigma ^{2}$
Uniform distribution (continuous)	$f(x\mid a,b)={\begin{cases}{\frac {1}{b-a}}&{\text{for }}a\leq x\leq b,\\[3pt]0&{\text{for }}x<a{\text{ or }}x>b\end{cases}}$	${\frac {a+b}{2}}$	${\frac {(b-a)^{2}}{12}}$
Exponential distribution	$f(x\mid \lambda )=\lambda e^{-\lambda x}$	${\frac {1}{\lambda }}$	${\frac {1}{\lambda ^{2}}}$
Poisson distribution	$f(k\mid \lambda )={\frac {e^{-\lambda }\lambda ^{k}}{k!}}$	$\lambda$	$\lambda$

v t e Theory ofprobability distributions
probability mass function (pmf) probability density function (pdf) cumulative distribution function (cdf) quantile function
raw moment central moment mean variance standard deviation skewness kurtosis L-moment
moment-generating function (mgf) characteristic function probability-generating function (pgf) cumulant combinant

Authority control databases
International	GND
National	Japan

Movatterモバイル変換

Definition

Discrete random variable

Absolutely continuous random variable

Examples

Exponential distribution

Fair die

Commonly used probability distributions

Properties

Basic properties

Issues of finiteness

Decomposition

Calculation from the CDF

Characteristic property

Units of measurement

Propagation

Addition and multiplication by a constant

Linear combinations

Matrix notation for the variance of a linear combination

Sum of variables

Sum of uncorrelated variables

Sum of correlated variables

Sum of correlated variables with fixed sample size

Sum of uncorrelated variables with random sample size

Weighted sum of variables

Product of variables

Product of independent variables

Product of statistically dependent variables

Arbitrary functions

Population variance and sample variance

Population variance

Sample variance

Biased sample variance

Unbiased sample variance

Example

Distribution of the sample variance

Samuelson's inequality

Relations with the harmonic and arithmetic means

Tests of equality of variances

Moment of inertia

Semivariance

Etymology

Generalizations

For complex variables

For vector-valued random variables

As a matrix

As a scalar

See also

Types of variance

References