Durbin–Watson statistic

From Wikipedia, the free encyclopedia

Test statistic

In statistics, the Durbin–Watson statistic is a test statistic used to detect the presence of autocorrelation at lag 1 in the residuals (prediction errors) from a regression analysis. It is named after James Durbin and Geoffrey Watson. The small sample distribution of this ratio was derived by John von Neumann (von Neumann, 1941). Durbin and Watson (1950, 1951) applied this statistic to the residuals from least squares regressions, and developed bounds tests for the null hypothesis that the errors are serially uncorrelated against the alternative that they follow a first-order autoregressive process. Note that the distribution of this test statistic does not depend on the estimated regression coefficients or the variance of the errors.^[1]

A similar assessment can also be carried out with the Breusch–Godfrey test and the Ljung–Box test.

Computing and interpreting the Durbin–Watson statistic

If $e_{t}$ is the residual given by $e_{t}=\rho e_{t-1}+\nu _{t}$, the Durbin–Watson test statistic is

$$d={\frac {\sum _{t=2}^{T}(e_{t}-e_{t-1})^{2}}{\sum _{t=1}^{T}e_{t}^{2}}},$$

where $T$ is the number of observations. For large $T$, $d$ is approximately equal to $2(1-{\hat {\rho }})$, where ${\hat {\rho }}$ is the sample autocorrelation of the residuals at lag 1.^[2] A value of $d=2$ therefore indicates no autocorrelation. The value of $d$ always lies between $0$ and $4$. If $d$ is substantially less than 2, there is evidence of positive serial correlation, that is, successive error terms are positively correlated. As a rough rule of thumb, a value of $d$ below 1.0 may be cause for alarm. If $d>2$, successive error terms are negatively correlated. In regressions, this can imply an underestimation of the level of statistical significance.
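The definition of $d$ and its large-sample link to ${\hat {\rho }}$ can be checked numerically. The following is a minimal sketch, not a reference implementation: the function names `durbin_watson` and `lag1_autocorrelation` are illustrative, the series is a simulated AR(1) process standing in for regression residuals, and ${\hat {\rho }}$ is computed without demeaning on the assumption that the residuals already have mean zero.

```python
import random


def durbin_watson(residuals):
    """Sum of squared successive differences divided by the sum of squares."""
    e = list(residuals)
    numerator = sum((e[t] - e[t - 1]) ** 2 for t in range(1, len(e)))
    denominator = sum(x * x for x in e)
    return numerator / denominator


def lag1_autocorrelation(residuals):
    """Lag-1 sample autocorrelation, assuming the residuals have mean zero."""
    e = list(residuals)
    cross = sum(e[t] * e[t - 1] for t in range(1, len(e)))
    return cross / sum(x * x for x in e)


# Simulated AR(1) errors with rho = 0.6 (an assumed value, for illustration only).
rng = random.Random(0)
rho = 0.6
series = [rng.gauss(0.0, 1.0)]
for _ in range(499):
    series.append(rho * series[-1] + rng.gauss(0.0, 1.0))

d = durbin_watson(series)
rho_hat = lag1_autocorrelation(series)
print(f"d = {d:.3f}, 2(1 - rho_hat) = {2 * (1 - rho_hat):.3f}")
```

The two printed quantities differ only by the end-point term $(e_{1}^{2}+e_{T}^{2})/\sum _{t}e_{t}^{2}$, which becomes negligible as $T$ grows.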

To test for positive autocorrelation at significance $\alpha$, the test statistic $d$ is compared to lower and upper critical values ($d_{L,\alpha }$ and $d_{U,\alpha }$):

If $d<d_{L,\alpha }$, there is statistical evidence that the error terms are positively autocorrelated.
If $d>d_{U,\alpha }$, there is no statistical evidence that the error terms are positively autocorrelated.
If $d_{L,\alpha }<d<d_{U,\alpha }$, the test is inconclusive.

Positive serial correlation is serial correlation in which a positive error for one observation increases the chances of a positive error for another observation.

To test for negative autocorrelation at significance $\alpha$, the test statistic $(4-d)$ is compared to lower and upper critical values ($d_{L,\alpha }$ and $d_{U,\alpha }$):

If $(4-d)<d_{L,\alpha }$, there is statistical evidence that the error terms are negatively autocorrelated.
If $(4-d)>d_{U,\alpha }$, there is no statistical evidence that the error terms are negatively autocorrelated.
If $d_{L,\alpha }<(4-d)<d_{U,\alpha }$, the test is inconclusive.

Negative serial correlation implies that a positive error for one observation increases the chance of a negative error for another observation and a negative error for one observation increases the chances of a positive error for another.
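The two bounds tests above share one decision rule, applied to $d$ for positive and to $4-d$ for negative autocorrelation. A small sketch of that rule follows; the function name `dw_bounds_test` and its return labels are hypothetical, and the critical values in the usage lines are placeholders, not entries from a published table. Values exactly equal to a bound are treated as inconclusive, which is an assumption, since the rules above use strict inequalities.

```python
def dw_bounds_test(d, d_lower, d_upper, alternative="positive"):
    """Apply the Durbin-Watson bounds rule.

    d_lower and d_upper are the tabulated critical values for the chosen
    significance level; they must be supplied by the caller.
    """
    if alternative == "positive":
        stat = d
    elif alternative == "negative":
        stat = 4.0 - d  # the negative test uses 4 - d against the same bounds
    else:
        raise ValueError("alternative must be 'positive' or 'negative'")

    if stat < d_lower:
        return "reject"          # evidence of autocorrelation
    if stat > d_upper:
        return "fail to reject"  # no evidence of autocorrelation
    return "inconclusive"


# Placeholder critical values, for illustration only (not from a table).
D_LOWER, D_UPPER = 1.2, 1.4
print(dw_bounds_test(0.9, D_LOWER, D_UPPER, alternative="positive"))
print(dw_bounds_test(3.2, D_LOWER, D_UPPER, alternative="negative"))
```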

The critical values, $d_{L,\alpha }$ and $d_{U,\alpha }$, vary by level of significance ($\alpha$) and the degrees of freedom in the regression equation. Their derivation is complex; statisticians typically obtain them from the appendices of statistical texts.

If the design matrix $\mathbf {X}$ of the regression is known, exact critical values for the distribution of $d$ under the null hypothesis of no serial correlation can be calculated. Under the null hypothesis $d$ is distributed as

$${\frac {\sum _{i=1}^{n-k}\nu _{i}\xi _{i}^{2}}{\sum _{i=1}^{n-k}\xi _{i}^{2}}},$$

where $n$ is the number of observations and $k$ is the number of regression variables; the $\xi _{i}$ are independent standard normal random variables; and the $\nu _{i}$ are the nonzero eigenvalues of $(\mathbf {I} -\mathbf {X} (\mathbf {X} ^{T}\mathbf {X} )^{-1}\mathbf {X} ^{T})\mathbf {A}$, where $\mathbf {A}$ is the matrix of the quadratic form in the numerator of $d$, i.e. $d=\mathbf {e} ^{T}\mathbf {A} \mathbf {e} /\mathbf {e} ^{T}\mathbf {e}$.^[3] A number of computational algorithms for finding percentiles of this distribution are available.^[4]
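The null distribution above can be approximated by simulation once $\mathbf {X}$ is known. The sketch below uses NumPy and plain Monte Carlo rather than any of the published algorithms; the function names, the default number of draws and the example design matrix (intercept plus linear trend) are all assumptions made for illustration. It relies on the fact that $\mathbf {M} \mathbf {A}$ and the symmetric matrix $\mathbf {M} \mathbf {A} \mathbf {M}$, with $\mathbf {M} =\mathbf {I} -\mathbf {X} (\mathbf {X} ^{T}\mathbf {X} )^{-1}\mathbf {X} ^{T}$, have the same nonzero eigenvalues.

```python
import numpy as np


def dw_null_eigenvalues(X):
    """Return the n - k eigenvalues nu_i for a design matrix X of shape (n, k)."""
    X = np.asarray(X, dtype=float)
    n, k = X.shape
    # Residual-maker matrix M = I - X (X'X)^{-1} X'.
    M = np.eye(n) - X @ np.linalg.solve(X.T @ X, X.T)
    # A is tridiagonal: e' A e equals the sum of squared successive differences.
    A = 2.0 * np.eye(n) - np.eye(n, k=1) - np.eye(n, k=-1)
    A[0, 0] = A[-1, -1] = 1.0
    # eigvalsh returns ascending eigenvalues; the first k are numerically zero.
    eigenvalues = np.linalg.eigvalsh(M @ A @ M)
    return eigenvalues[k:]


def dw_null_lower_tail(d_obs, nu, n_draws=20000, seed=0):
    """Monte Carlo estimate of P(d <= d_obs) under no serial correlation."""
    rng = np.random.default_rng(seed)
    xi_squared = rng.standard_normal((n_draws, nu.size)) ** 2
    simulated_d = (xi_squared @ nu) / xi_squared.sum(axis=1)
    return float(np.mean(simulated_d <= d_obs))


# Example design: intercept and linear trend, n = 30 observations (assumed).
n_obs = 30
X_example = np.column_stack([np.ones(n_obs), np.arange(n_obs)])
nu = dw_null_eigenvalues(X_example)
print(dw_null_lower_tail(1.2, nu))
```

A small lower-tail probability for the observed $d$ points toward positive autocorrelation; the corresponding upper-tail probability would be used for the negative alternative.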

Although serial correlation does not affect the consistency of the estimated regression coefficients, it does affect our ability to conduct valid statistical tests. First, the F-statistic to test for overall significance of the regression may be inflated under positive serial correlation because the mean squared error (MSE) will tend to underestimate the population error variance. Second, positive serial correlation typically causes the ordinary least squares (OLS) standard errors for the regression coefficients to underestimate the true standard errors. As a consequence, if positive serial correlation is present in the regression, standard linear regression analysis will typically lead us to compute artificially small standard errors for the regression coefficients. These small standard errors will cause the estimated t-statistic to be inflated, suggesting significance where perhaps there is none. The inflated t-statistic may, in turn, lead us to incorrectly reject null hypotheses about population values of the parameters of the regression model more often than we would if the standard errors were correctly estimated.

If the Durbin–Watson statistic indicates the presence of serial correlation of the residuals, this can be remedied by using the Cochrane–Orcutt procedure.

The Durbin–Watson statistic, while displayed by many regression analysis programs, is not applicable in certain situations. For instance, when lagged dependent variables are included in the explanatory variables, it is inappropriate to use this test. Durbin's h-test (see below) or likelihood ratio tests, which are valid in large samples, should be used instead.

Durbin h-statistic

The Durbin–Watson statistic is biased for autoregressive moving average models, so that autocorrelation is underestimated. For large samples, however, one can easily compute the unbiased, normally distributed h-statistic:

$$h=\left(1-{\frac {1}{2}}d\right){\sqrt {\frac {T}{1-T\cdot {\widehat {\operatorname {Var} }}({\widehat {\beta }}_{1})}}},$$

using the Durbin–Watson statistic $d$ and the estimated variance ${\widehat {\operatorname {Var} }}({\widehat {\beta }}_{1})$ of the regression coefficient of the lagged dependent variable, provided $T\cdot {\widehat {\operatorname {Var} }}({\widehat {\beta }}_{1})<1$.
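As a worked illustration of the h-statistic, the sketch below evaluates the formula and a two-sided p-value from the standard normal distribution, which applies only in large samples as stated above. The function names are hypothetical and the input numbers are made up for the example; they do not come from any fitted model.

```python
import math


def durbin_h(d, T, var_beta1):
    """Durbin's h from the Durbin-Watson d, the sample size T and the
    estimated variance of the lagged dependent variable's coefficient."""
    denominator = 1.0 - T * var_beta1
    if denominator <= 0.0:
        # The statistic is undefined unless T * Var(beta_1) < 1.
        raise ValueError("h is undefined when T * var_beta1 >= 1")
    return (1.0 - 0.5 * d) * math.sqrt(T / denominator)


def two_sided_normal_p(z):
    """Two-sided p-value of z under a standard normal distribution."""
    return math.erfc(abs(z) / math.sqrt(2.0))


# Made-up inputs for illustration: d = 1.6, T = 100, Var(beta_1) = 0.004.
h = durbin_h(1.6, 100, 0.004)
print(f"h = {h:.4f}, p = {two_sided_normal_p(h):.4f}")
```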

Implementations in statistics packages

R: the dwtest function in the lmtest package, the durbinWatsonTest (or dwt for short) function in the car package, and pdwtest and pbnftest for panel models in the plm package.^[5]
MATLAB: the dwtest function in the Statistics Toolbox.
Mathematica: the Durbin–Watson (d) statistic is included as an option in the LinearModelFit function.
SAS: the statistic is standard output when using proc model and is an option (dw) when using proc reg.
EViews: automatically calculated when using OLS regression.
gretl: automatically calculated when using OLS regression.
Stata: the command estat dwatson, following regress in time series data.^[6] Engle's LM test for autoregressive conditional heteroskedasticity (ARCH), a test for time-dependent volatility, the Breusch–Godfrey test, and Durbin's alternative test for serial correlation are also available. All of these except dwatson test separately for higher-order serial correlations. The Breusch–Godfrey test and Durbin's alternative test also allow regressors that are not strictly exogenous.
Excel: although Microsoft Excel 2007 does not have a specific Durbin–Watson function, the d-statistic may be calculated using =SUMXMY2(x_array,y_array)/SUMSQ(array), where x_array and y_array are the residual series offset from each other by one observation and array is the full residual series.
Minitab: the option to report the statistic in the Session window can be found under the "Options" box under Regression and via the "Results" box under General Regression.
Python: a durbin_watson function is included in the statsmodels package (statsmodels.stats.stattools.durbin_watson), but statistical tables for critical values are not available there.
SPSS: included as an option in the Regression function.
Julia: the DurbinWatsonTest function is available in the HypothesisTests package.^[7]

References

^ Chatterjee, Samprit; Simonoff, Jeffrey (2013). Handbook of Regression Analysis. John Wiley & Sons. ISBN 978-1118532812.
^ Gujarati (2003) p. 469
^ Durbin, J.; Watson, G. S. (1971). "Testing for serial correlation in least squares regression. III". Biometrika. 58 (1): 1–19. doi:10.2307/2334313. JSTOR 2334313.
^ Farebrother, R. W. (1980). "Algorithm AS 153: Pan's procedure for the tail probabilities of the Durbin-Watson statistic". Journal of the Royal Statistical Society, Series C. 29 (2): 224–227.
^ Hateka, Neeraj R. (2010). "Tests for Detecting Autocorrelation". Principles of Econometrics: An Introduction (Using R). SAGE Publications. pp. 379–82. ISBN 978-81-321-0660-9.
^ "regress postestimation time series — Postestimation tools for regress with time series" (PDF). Stata Manual.
^ "Time series tests". juliastats.org. Retrieved 2020-02-04.

External links

Table for high n and k. Archived 2011-08-07 at the Wayback Machine.
Econometrics lecture (topic: Durbin–Watson statistic) on YouTube by Mark Thoma
