masked_array.var(axis=None,dtype=None,out=None,ddof=0,keepdims=<no value>)[source]¶Compute the variance along the specified axis.
Returns the variance of the array elements, a measure of the spread of adistribution. The variance is computed for the flattened array bydefault, otherwise over the specified axis.
| Parameters: |
|
|---|---|
| Returns: |
|
Notes
The variance is the average of the squared deviations from the mean,i.e.,var=mean(abs(x-x.mean())**2).
The mean is normally calculated asx.sum()/N, whereN=len(x).If, however,ddof is specified, the divisorN-ddof is usedinstead. In standard statistical practice,ddof=1 provides anunbiased estimator of the variance of a hypothetical infinite population.ddof=0 provides a maximum likelihood estimate of the variance fornormally distributed variables.
Note that for complex numbers, the absolute value is taken beforesquaring, so that the result is always real and nonnegative.
For floating-point input, the variance is computed using the sameprecision the input has. Depending on the input data, this can causethe results to be inaccurate, especially forfloat32 (see examplebelow). Specifying a higher-accuracy accumulator using thedtypekeyword can alleviate this issue.
Examples
>>>a=np.array([[1,2],[3,4]])>>>np.var(a)1.25>>>np.var(a,axis=0)array([ 1., 1.])>>>np.var(a,axis=1)array([ 0.25, 0.25])
In single precision, var() can be inaccurate:
>>>a=np.zeros((2,512*512),dtype=np.float32)>>>a[0,:]=1.0>>>a[1,:]=0.1>>>np.var(a)0.20250003
Computing the variance in float64 is more accurate:
>>>np.var(a,dtype=np.float64)0.20249999932944759>>>((1-0.55)**2+(0.1-0.55)**2)/20.2025