This article includes a list ofgeneral references, butit lacks sufficient correspondinginline citations. Please help toimprove this article byintroducing more precise citations.(February 2012) (Learn how and when to remove this message) |
Vector autoregression (VAR) is a statistical model used to capture the relationship between multiple quantities as they change over time. VAR is a type ofstochastic process model. VAR models generalize the single-variable (univariate)autoregressive model by allowing for multivariatetime series. VAR models are often used ineconomics and thenatural sciences.
Like the autoregressive model, each variable has an equation modelling its evolution over time. This equation includes the variable'slagged (past) values, the lagged values of the other variables in the model, and anerror term. VAR models do not require as much knowledge about the forces influencing a variable as dostructural models withsimultaneous equations. The only prior knowledge required is a list of variables which can be hypothesized to affect each other over time.
This section includes alist of references,related reading, orexternal links,but its sources remain unclear because it lacksinline citations. Please helpimprove this section byintroducing more precise citations.(February 2012) (Learn how and when to remove this message) |
A VAR model describes the evolution of a set ofk variables, calledendogenous variables, over time. Each period of time is numbered,t = 1, ...,T. The variables are collected in avector,yt, which is of lengthk. (Equivalently, this vector might be described as a (k × 1)-matrix.) The vector is modelled as a linear function of its previous value. The vector's components are referred to asyi,t, meaning the observation at timet of thei th variable. For example, if the first variable in the model measures the price of wheat over time, theny1,1998 would indicate the price of wheat in the year 1998.
VAR models are characterized by theirorder, which refers to the number of earlier time periods the model will use. Continuing the above example, a 5th-order VAR would model each year's wheat price as a linear combination of the last five years of wheat prices. Alag is the value of a variable in a previous time period. So in general apth-order VAR refers to a VAR model which includes lags for the lastp time periods. Apth-order VAR is denoted "VAR(p)" and sometimes called "a VAR withp lags". Apth-order VAR model is written as
The variables of the formyt−i indicate that variable's valuei time periods earlier and are called the "ith lag" ofyt. The variablec is ak-vector of constants serving as theintercept of the model.Ai is atime-invariant (k × k)-matrix andet is ak-vector oferror terms. The error terms must satisfy three conditions:
The process of choosing the maximum lagp in the VAR model requires special attention becauseinference is dependent on correctness of the selected lag order.[2][3]
Note that all variables have to be of the sameorder of integration. The following cases are distinct:
One can stack the vectors in order to write a VAR(p) as astochasticmatrix difference equation, with a concise matrix notation:
A VAR(1) in two variables can be written in matrix form (more compact notation) as
(in which only a singleA matrix appears because this example has a maximum lagp equal to 1), or, equivalently, as the following system of two equations
Each variable in the model has one equation. The current (timet) observation of each variable depends on its own lagged values as well as on the lagged values of each other variable in the VAR.
A VAR withp lags can always be equivalently rewritten as a VAR with only one lag by appropriately redefining the dependent variable. The transformation amounts to stacking the lags of the VAR(p) variable in the new VAR(1) dependent variable and appending identities to complete the precise number of equations.
For example, the VAR(2) model
can be recast as the VAR(1) model
whereI is theidentity matrix.
The equivalent VAR(1) form is more convenient for analytical derivations and allows more compact statements.
Astructural VAR with p lags (sometimes abbreviatedSVAR) is
wherec0 is ak × 1 vector of constants,Bi is ak × k matrix (for everyi = 0, ...,p) andεt is ak × 1 vector oferror terms. Themain diagonal terms of theB0 matrix (the coefficients on theith variable in theith equation) are scaled to 1.
The error terms εt (structural shocks) satisfy the conditions (1) - (3) in the definition above, with the particularity that all the elements in the off diagonal of the covariance matrix are zero. That is, the structural shocks are uncorrelated.
For example, a two variable structural VAR(1) is:
where
that is, thevariances of the structural shocks are denoted (i = 1, 2) and thecovariance is.
Writing the first equation explicitly and passingy2,t to theright hand side one obtains
Note thaty2,t can have a contemporaneous effect ony1,t ifB0;1,2 is not zero. This is different from the case whenB0 is theidentity matrix (all off-diagonal elements are zero — the case in the initial definition), wheny2,t can impact directlyy1,t+1 and subsequent future values, but noty1,t.
Because of theparameter identification problem,ordinary least squares estimation of the structural VAR would yieldinconsistent parameter estimates. This problem can be overcome by rewriting the VAR in reduced form.
From an economic point of view, if the joint dynamics of a set of variables can be represented by a VAR model, then the structural form is a depiction of the underlying, "structural", economic relationships. Two features of the structural form make it the preferred candidate to represent the underlying relations:
By premultiplying the structural VAR with the inverse ofB0
and denoting
one obtains thepth order reduced VAR
Note that in the reduced form all right hand side variables arepredetermined at timet. As there are no timet endogenous variables on the right hand side, no variable has adirect contemporaneous effect on other variables in the model.
However, the error terms in the reduced VAR are composites of the structural shockset =B0−1εt. Thus, the occurrence of one structural shockεi,t can potentially lead to the occurrence of shocks in all error termsej,t, thus creating contemporaneous movement in all endogenous variables. Consequently, the covariance matrix of the reduced VAR
can have non-zero off-diagonal elements, thus allowing non-zero correlation between error terms.
Starting from the concise matrix notation:
This can be written alternatively as:
where denotes theKronecker product and Vec thevectorization of the indicated matrix.
This estimator isconsistent andasymptotically efficient. It is furthermore equal to the conditionalmaximum likelihood estimator.[4]
As in the standard case, themaximum likelihood estimator (MLE) of the covariance matrix differs from the ordinary least squares (OLS) estimator.
MLE estimator:[citation needed]
OLS estimator:[citation needed] for a model with a constant,k variables andp lags.
In a matrix notation, this gives:
The covariance matrix of the parameters can be estimated as[citation needed]
Vector autoregression models often involve the estimation of many parameters. For example, with seven variables and four lags, each matrix of coefficients for a given lag length is 7 by 7, and the vector of constants has 7 elements, so a total of 49×4 + 7 = 203 parameters are estimated, substantially lowering thedegrees of freedom of the regression (the number of data points minus the number of parameters to be estimated). This can hurt the accuracy of the parameter estimates and hence of the forecasts given by the model.
Consider the first-order case (i.e., with only one lag), with equation of evolution
for evolving (state) vector and vector of shocks. To find, say, the effect of thej-th element of the vector of shocks upon thei-th element of the state vector 2 periods later, which is a particular impulse response, first write the above equation of evolution one period lagged:
Use this in the original equation of evolution to obtain
then repeat using the twice lagged equation of evolution, to obtain
From this, the effect of thej-th component of upon thei-th component of is thei, j element of the matrix
It can be seen from thisinduction process that any shock will have an effect on the elements ofy infinitely far forward in time, although the effect will become smaller and smaller over time assuming that the AR process is stable — that is, that all theeigenvalues of the matrixA are less than 1 inabsolute value.
An estimated VAR model can be used forforecasting, and the quality of the forecasts can be judged, in ways that are completely analogous to the methods used in univariate autoregressive modelling.
Christopher Sims has advocated VAR models, criticizing the claims and performance of earlier modeling inmacroeconomiceconometrics.[6] He recommended VAR models, which had previously appeared in time seriesstatistics and insystem identification, a statistical specialty incontrol theory. Sims advocated VAR models as providing a theory-free method to estimate economic relationships, thus being an alternative to the "incredible identification restrictions" in structural models.[6] VAR models are also increasingly used in health research for automatic analyses of diary data[7] or sensor data. Sio Iong Ao and R. E. Caraka found that the artificial neural network can improve its performance with the addition of the hybrid vector autoregression component.[8][9]