In statistics, completeness is a property of a statistic computed on a sample dataset in relation to a parametric model of the dataset. It is opposed to the concept of an ancillary statistic. While an ancillary statistic contains no information about the model parameters, a complete statistic contains only information about the parameters, and no ancillary information. It is closely related to the concept of a sufficient statistic, which contains all of the information that the dataset provides about the parameters.[1]
Consider a random variable X whose probability distribution belongs to a parametric model Pθ parametrized by θ.
Say T is a statistic; that is, the composition of a measurable function with a random sample X1, ..., Xn.
The statistic T is said to be complete for the distribution of X if, for every measurable function g,[1]

Eθ(g(T)) = 0 for all θ   implies   Pθ(g(T) = 0) = 1 for all θ.
The statistic T is said to be boundedly complete for the distribution of X if this implication holds for every measurable function g that is also bounded.
The Bernoulli model admits a complete statistic.[1] Let X be a random sample of size n such that each Xi has the same Bernoulli distribution with parameter p. Let T be the number of 1s observed in the sample, i.e. T = X1 + ⋯ + Xn. T is a statistic of X which has a binomial distribution with parameters (n, p). If the parameter space for p is (0,1), then T is a complete statistic. To see this, note that

E(g(T)) = Σ_{t=0}^{n} g(t) C(n, t) p^t (1 − p)^(n−t),

where C(n, t) denotes the binomial coefficient.
Observe also that neither p nor 1 − p can be 0. Hence, after dividing by (1 − p)^n, E(g(T)) = 0 if and only if:

Σ_{t=0}^{n} g(t) C(n, t) (p/(1 − p))^t = 0.
On denoting p/(1 − p) by r, one gets:

Σ_{t=0}^{n} g(t) C(n, t) r^t = 0.
First, observe that the range of r is the positive reals. Also, the left-hand side of the last equation is a polynomial in r and, therefore, can be identically 0 on that range only if all of its coefficients are 0, that is, g(t) = 0 for all t.
It is important to notice that the result that all coefficients must be 0 was obtained because of the range of r. Had the parameter space been finite and with a number of elements less than or equal to n, it might be possible to solve the linear equations in g(t) obtained by substituting the values of r and get solutions different from 0. For example, if n = 1 and the parameter space is {0.5}, a single observation and a single parameter value, T is not complete. Observe that, with the definition:

g(t) = 2(t − 0.5), i.e. g(0) = −1 and g(1) = 1,
then E(g(T)) = 0 although g(t) is not 0 for t = 0 nor for t = 1.
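The role of the parameter space can be checked directly. The following Python sketch (the helper expected_g and the chosen values of p are illustrative, not from the source) evaluates E(g(T)) exactly for the binomial model: with g(t) = 2(t − 0.5) and n = 1 the expectation is 0 at p = 0.5 but nonzero at other values of p, so completeness fails only when the parameter space is restricted to {0.5}.

```python
from math import comb

def expected_g(g, n, p):
    """Exact E(g(T)) for T ~ Binomial(n, p)."""
    return sum(g(t) * comb(n, t) * p**t * (1 - p)**(n - t) for t in range(n + 1))

# g(t) = 2(t - 0.5) is not identically zero, yet for n = 1 its expectation vanishes at p = 0.5.
g = lambda t: 2 * (t - 0.5)

print(expected_g(g, n=1, p=0.5))  # 0.0  -> T is not complete over the parameter space {0.5}
print(expected_g(g, n=1, p=0.3))  # -0.4 -> nonzero, so no such g works over the whole of (0, 1)
```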
This example will show that, in a sample X1, X2 of size 2 from a normal distribution with known variance, the statistic X1 + X2 is complete and sufficient. Suppose X1, X2 are independent, identically distributed random variables, normally distributed with expectation θ and variance 1. The sum

T(X1, X2) = X1 + X2

is a complete statistic for θ.
To show this, it is sufficient to demonstrate that there is no non-zero function g such that the expectation of

g(T) = g(X1 + X2)

remains zero regardless of the value of θ.
That fact may be seen as follows. The probability distribution of X1 + X2 is normal with expectation 2θ and variance 2. Its probability density function in x is therefore proportional to

exp(−(x − 2θ)² / 4).
The expectation of g above would therefore be a constant times

∫ g(x) exp(−(x − 2θ)² / 4) dx,

with the integral taken over the whole real line.
A bit of algebra reduces this to

k(θ) ∫ h(x) e^(xθ) dx,
where k(θ) is nowhere zero and

h(x) = g(x) e^(−x²/4).
As a function of θ this is a two-sided Laplace transform of h, and cannot be identically zero unless h is zero almost everywhere.[2] The exponential is not zero, so this can only happen if g is zero almost everywhere.
By contrast, the statistic (X1, X2) is sufficient but not complete. It admits a non-zero unbiased estimator of zero, namely X1 − X2.
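A short simulation makes the contrast concrete. In the Python sketch below (the helper mean_of_difference and the simulation sizes are illustrative assumptions, not from the source), the function X1 − X2 is not identically zero yet has expectation 0 for every θ, which is exactly the failure of completeness for the pair (X1, X2).

```python
import numpy as np

rng = np.random.default_rng(0)

def mean_of_difference(theta, n_sim=1_000_000):
    """Monte Carlo estimate of E(X1 - X2) for X1, X2 i.i.d. N(theta, 1)."""
    x1 = rng.normal(theta, 1.0, n_sim)
    x2 = rng.normal(theta, 1.0, n_sim)
    return (x1 - x2).mean()

# X1 - X2 is not the zero function, but its expectation is 0 for every theta,
# so the pair (X1, X2) cannot be complete.
for theta in (-3.0, 0.0, 2.5):
    print(theta, round(mean_of_difference(theta), 3))  # all approximately 0
```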
Most parametric models have a sufficient statistic which is not complete. This is important because the Lehmann–Scheffé theorem cannot be applied to such models. Galili and Meilijson 2016[3] propose the following didactic example.
Consider n independent samples from the uniform distribution:

Xi ~ U((1 − k)θ, (1 + k)θ),    θ > 0,

where k ∈ (0, 1) is a known design parameter. This model is a scale family (a specific case of a location-scale family) model: scaling the samples by a multiplier λ multiplies the parameter θ by λ.
Galili and Meilijson show that the minimum and maximum of the samples are together a sufficient statistic: T = (X(1), X(n)) (using the usual notation for order statistics). Indeed, conditional on these two values, the distribution of the rest of the sample is simply uniform on the range they define: U(X(1), X(n)).
However, their ratio X(1)/X(n) has a distribution which does not depend on θ. This follows from the fact that this is a scale family: any change of scale impacts both variables identically. Subtracting the mean from that distribution, we obtain:

g(T) = X(1)/X(n) − E(X(1)/X(n)),    so that E(g(T)) = 0 for every θ.
We have thus shown that there exists a function g which is not everywhere 0 but which has expectation 0. The pair (X(1), X(n)) is thus not complete.
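The ancillarity of the ratio can also be checked numerically. In the following Python sketch (the choices k = 0.5, n = 5 and the Monte Carlo set-up are illustrative assumptions), the estimated mean of X(1)/X(n) is essentially the same for several values of θ, so subtracting that common mean gives a non-zero function of (X(1), X(n)) with expectation 0.

```python
import numpy as np

rng = np.random.default_rng(1)

def mean_min_max_ratio(theta, k=0.5, n=5, n_sim=200_000):
    """Monte Carlo estimate of E(X(1)/X(n)) for X_i ~ U((1-k)*theta, (1+k)*theta)."""
    x = rng.uniform((1 - k) * theta, (1 + k) * theta, size=(n_sim, n))
    return (x.min(axis=1) / x.max(axis=1)).mean()

# The mean ratio is (essentially) the same for every theta, because the model is a scale
# family; subtracting it from X(1)/X(n) gives a non-zero function with expectation 0.
for theta in (1.0, 3.0, 10.0):
    print(theta, round(mean_min_max_ratio(theta), 4))  # all approximately equal
```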
The notion of completeness has many applications in statistics, particularly in the following theorems of mathematical statistics.
Completeness occurs in the Lehmann–Scheffé theorem,[1] which states that if a statistic is unbiased, complete and sufficient for some parameter θ, then it is the best mean-unbiased estimator for θ. In other words, this statistic has minimal expected loss for any convex loss function; in many practical applications with the squared loss function, it has the smallest mean squared error among all estimators with the same expected value.
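As an illustrative sketch in the Bernoulli model above (the variable names and simulation set-up are my own assumptions), conditioning the crude unbiased estimator X1 on the complete sufficient statistic T gives the sample mean T/n, which by the Lehmann–Scheffé theorem is the minimum-variance unbiased estimator of p:

```python
import numpy as np

rng = np.random.default_rng(2)
p_true, n, n_sim = 0.3, 20, 100_000

x = rng.binomial(1, p_true, size=(n_sim, n))  # n Bernoulli(p) observations per replication
naive = x[:, 0]                               # X1: unbiased for p but high variance
umvue = x.mean(axis=1)                        # T/n = E(X1 | T): the Lehmann-Scheffe estimator

print(naive.mean(), umvue.mean())  # both approximately p_true (unbiased)
print(naive.var(), umvue.var())    # approximately p(1-p) versus p(1-p)/n
```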
Examples exist in which the minimal sufficient statistic is not complete; then several alternative statistics exist for unbiased estimation of θ, some of them with lower variance than others.[3]
See also minimum-variance unbiased estimator.
Bounded completeness occurs in Basu's theorem,[1] which states that a statistic that is both boundedly complete and sufficient is independent of any ancillary statistic.
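For example, in the normal model N(θ, 1) the sample mean is a boundedly complete sufficient statistic for θ while the sample variance is ancillary; the short Python sketch below (sample sizes chosen for illustration) is consistent with the independence that Basu's theorem guarantees.

```python
import numpy as np

rng = np.random.default_rng(3)
theta, n, n_sim = 2.0, 10, 200_000

x = rng.normal(theta, 1.0, size=(n_sim, n))
sample_mean = x.mean(axis=1)         # boundedly complete and sufficient for theta
sample_var = x.var(axis=1, ddof=1)   # ancillary: its distribution does not involve theta

# Basu's theorem implies the two statistics are independent, so their correlation is 0.
print(np.corrcoef(sample_mean, sample_var)[0, 1])  # approximately 0
```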
Bounded completeness also occurs in Bahadur's theorem. In the case where there exists at least one minimal sufficient statistic, a statistic which is sufficient and boundedly complete is necessarily minimal sufficient.[4]