Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Simultaneous equations model

From Wikipedia, the free encyclopedia
Type of statistical model

Simultaneous equations models are a type ofstatistical model in which thedependent variables are functions of other dependent variables, rather than just independent variables.[1] This means some of the explanatory variables arejointly determined with the dependent variable, which ineconomics usually is the consequence of some underlyingequilibrium mechanism. Take the typicalsupply and demand model: whilst typically one would determine the quantity supplied and demanded to be a function of the price set by the market, it is also possible for the reverse to be true, where producers observe the quantity that consumers demandand then set the price.[2]

Simultaneity poses challenges for theestimation of the statistical parameters of interest, because theGauss–Markov assumption ofstrict exogeneity of the regressors is violated. And while it would be natural to estimate all simultaneous equations at once, this often leads to acomputationally costly non-linear optimization problem even for the simplestsystem of linear equations.[3] This situation prompted the development, spearheaded by theCowles Commission in the 1940s and 1950s,[4] of various techniques that estimate each equation in the model seriatim, most notablylimited information maximum likelihood andtwo-stage least squares.[5]

Structural and reduced form

[edit]

Suppose there arem regression equations of the form

yit=yi,tγi+xitβi+uit,i=1,,m,{\displaystyle y_{it}=y_{-i,t}'\gamma _{i}+x_{it}'\;\!\beta _{i}+u_{it},\quad i=1,\ldots ,m,}

wherei is the equation number, andt = 1, ...,T is the observation index. In these equationsxit is theki×1 vector of exogenous variables,yit is the dependent variable,y−i,t is theni×1 vector of all other endogenous variables which enter theith equation on the right-hand side, anduit are the error terms. The “−i” notation indicates that the vectory−i,t may contain any of they’s except foryit (since it is already present on the left-hand side). The regression coefficientsβi andγi are of dimensionski×1 andni×1 correspondingly. Vertically stacking theT observations corresponding to theith equation, we can write each equation in vector form as

yi=Yiγi+Xiβi+ui,i=1,,m,{\displaystyle y_{i}=Y_{-i}\gamma _{i}+X_{i}\beta _{i}+u_{i},\quad i=1,\ldots ,m,}

whereyi andui are1 vectors,Xi is aT×ki matrix of exogenous regressors, andY−i is aT×ni matrix of endogenous regressors on the right-hand side of theith equation. Finally, we can move all endogenous variables to the left-hand side and write them equations jointly in vector form as

YΓ=XB+U.{\displaystyle Y\Gamma =X\mathrm {B} +U.\,}

This representation is known as thestructural form. In this equationY = [y1y2 ...ym] is theT×m matrix of dependent variables. Each of the matricesY−i is in fact anni-columned submatrix of thisY. Them×m matrix Γ, which describes the relation between the dependent variables, has a complicated structure. It has ones on the diagonal, and all other elements of each columni are either the components of the vector−γi or zeros, depending on which columns ofY were included in the matrixY−i. TheT×k matrixX contains all exogenous regressors from all equations, but without repetitions (that is, matrixX should be of full rank). Thus, eachXi is aki-columned submatrix ofX. Matrix Β has sizek×m, and each of its columns consists of the components of vectorsβi and zeros, depending on which of the regressors fromX were included or excluded fromXi. Finally,U = [u1u2 ...um] is aT×m matrix of the error terms.

Postmultiplying the structural equation byΓ −1, the system can be written in thereduced form as

Y=XBΓ1+UΓ1=XΠ+V.{\displaystyle Y=X\mathrm {B} \Gamma ^{-1}+U\Gamma ^{-1}=X\Pi +V.\,}

This is already a simplegeneral linear model, and it can be estimated for example byordinary least squares. Unfortunately, the task of decomposing the estimated matrixΠ^{\displaystyle \scriptstyle {\hat {\Pi }}} into the individual factors Β andΓ −1 is quite complicated, and therefore the reduced form is more suitable for prediction but not inference.

Assumptions

[edit]

Firstly, the rank of the matrixX of exogenous regressors must be equal tok, both in finite samples and in the limit asT → ∞ (this later requirement means that in the limit the expression1TXX{\displaystyle \scriptstyle {\frac {1}{T}}X'\!X} should converge to a nondegeneratek×k matrix). Matrix Γ is also assumed to be non-degenerate.

Secondly, error terms are assumed to be seriallyindependent and identically distributed. That is, if thetth row of matrixU is denoted byu(t), then the sequence of vectors {u(t)} should be iid, with zero mean and some covariance matrix Σ (which is unknown). In particular, this implies thatE[U] = 0, andE[U′U] =T Σ.

Lastly, assumptions are required for identification.

Identification

[edit]

Theidentification conditions require that thesystem of linear equations be solvable for the unknown parameters.

More specifically, theorder condition, a necessary condition for identification, is that for each equationki + ni ≤ k, which can be phrased as “the number of excluded exogenous variables is greater or equal to the number of included endogenous variables”.

Therank condition, a stronger condition which is necessary and sufficient, is that therank ofΠi0 equalsni, whereΠi0 is a(k − kini matrix which is obtained fromΠ by crossing out those columns which correspond to the excluded endogenous variables, and those rows which correspond to the included exogenous variables.

Using cross-equation restrictions to achieve identification

[edit]

In simultaneous equations models, the most common method to achieveidentification is by imposing within-equation parameter restrictions.[6] Yet, identification is also possible using cross equation restrictions.

To illustrate how cross equation restrictions can be used for identification, consider the following example from Wooldridge[6]

y1=γ12y2+δ11z1+δ12z2+δ13z3+u1y2=γ21y1+δ21z1+δ22z2+u2{\displaystyle {\begin{aligned}y_{1}&=\gamma _{12}y_{2}+\delta _{11}z_{1}+\delta _{12}z_{2}+\delta _{13}z_{3}+u_{1}\\y_{2}&=\gamma _{21}y_{1}+\delta _{21}z_{1}+\delta _{22}z_{2}+u_{2}\end{aligned}}}

where z's are uncorrelated with u's and y's areendogenous variables. Without further restrictions, the first equation is not identified because there is no excluded exogenous variable. The second equation is just identified ifδ13≠0, which is assumed to be true for the rest of discussion.

Now we impose the cross equation restriction ofδ12=δ22. Since the second equation is identified, we can treatδ12 as known for the purpose of identification. Then, the first equation becomes:

y1δ12z2=γ12y2+δ11z1+δ13z3+u1{\displaystyle y_{1}-\delta _{12}z_{2}=\gamma _{12}y_{2}+\delta _{11}z_{1}+\delta _{13}z_{3}+u_{1}}

Then, we can use(z1,z2,z3) asinstruments to estimate the coefficients in the above equation since there are one endogenous variable (y2) and one excluded exogenous variable (z2) on the right hand side. Therefore, cross equation restrictions in place of within-equation restrictions can achieve identification.

Estimation

[edit]

Two-stage least squares (2SLS)

[edit]

The simplest and the most common estimation method for the simultaneous equations model is the so-calledtwo-stage least squares method,[7] developed independently byTheil (1953) andBasmann (1957).[8][9][10] It is an equation-by-equation technique, where the endogenous regressors on the right-hand side of each equation are being instrumented with the regressorsX from all other equations. The method is called “two-stage” because it conducts estimation in two steps:[7]

Step 1: RegressY−i onX and obtain the predicted valuesY^i{\displaystyle \scriptstyle {\hat {Y}}_{\!-i}};
Step 2: Estimateγi,βi by theordinary least squares regression ofyi onY^i{\displaystyle \scriptstyle {\hat {Y}}_{\!-i}} andXi.

If theith equation in the model is written as

yi=(YiXi)(γiβi)+uiZiδi+ui,{\displaystyle y_{i}={\begin{pmatrix}Y_{-i}&X_{i}\end{pmatrix}}{\begin{pmatrix}\gamma _{i}\\\beta _{i}\end{pmatrix}}+u_{i}\equiv Z_{i}\delta _{i}+u_{i},}

whereZi is a(ni + ki) matrix of both endogenous and exogenous regressors in theith equation, andδi is an (ni + ki)-dimensional vector of regression coefficients, then the 2SLS estimator ofδi will be given by[7]

δ^i=(Z^iZ^i)1Z^iyi=(ZiPZi)1ZiPyi,{\displaystyle {\hat {\delta }}_{i}={\big (}{\hat {Z}}'_{i}{\hat {Z}}_{i}{\big )}^{-1}{\hat {Z}}'_{i}y_{i}={\big (}Z'_{i}PZ_{i}{\big )}^{-1}Z'_{i}Py_{i},}

whereP =X (X ′X)−1X ′ is the projection matrix onto the linear space spanned by the exogenous regressorsX.

Indirect least squares

[edit]

Indirect least squares is an approach ineconometrics where thecoefficients in a simultaneous equations model are estimated from thereduced form model usingordinary least squares.[11][12] For this, the structural system of equations is transformed into the reduced form first. Once the coefficients are estimated the model is put back into the structural form.

Limited information maximum likelihood (LIML)

[edit]

The “limited information” maximum likelihood method was suggested byM. A. Girshick in 1947,[13] and formalized byT. W. Anderson andH. Rubin in 1949.[14] It is used when one is interested in estimating a single structural equation at a time (hence its name of limited information), say for observation i:

yi=Yiγi+Xiβi+uiZiδi+ui{\displaystyle y_{i}=Y_{-i}\gamma _{i}+X_{i}\beta _{i}+u_{i}\equiv Z_{i}\delta _{i}+u_{i}}

The structural equations for the remaining endogenous variables Y−i are not specified, and they are given in their reduced form:

Yi=XΠ+Ui{\displaystyle Y_{-i}=X\Pi +U_{-i}}

Notation in this context is different than for the simpleIV case. One has:

The explicit formula for the LIML is:[15]

δ^i=(Zi(IλM)Zi)1Zi(IλM)yi,{\displaystyle {\hat {\delta }}_{i}={\Big (}Z'_{i}(I-\lambda M)Z_{i}{\Big )}^{\!-1}Z'_{i}(I-\lambda M)y_{i},}

whereM =I − X (X ′X)−1X ′, andλ is the smallest characteristic root of the matrix:

([yiYi]Mi[yiYi])([yiYi]M[yiYi])1{\displaystyle {\Big (}{\begin{bmatrix}y_{i}\\Y_{-i}\end{bmatrix}}M_{i}{\begin{bmatrix}y_{i}&Y_{-i}\end{bmatrix}}{\Big )}{\Big (}{\begin{bmatrix}y_{i}\\Y_{-i}\end{bmatrix}}M{\begin{bmatrix}y_{i}&Y_{-i}\end{bmatrix}}{\Big )}^{\!-1}}

where, in a similar way,Mi =I − Xi (XiXi)−1Xi.

In other words,λ is the smallest solution of thegeneralized eigenvalue problem, seeTheil (1971, p. 503):

|[yiYi]Mi[yiYi]λ[yiYi]M[yiYi]|=0{\displaystyle {\Big |}{\begin{bmatrix}y_{i}&Y_{-i}\end{bmatrix}}'M_{i}{\begin{bmatrix}y_{i}&Y_{-i}\end{bmatrix}}-\lambda {\begin{bmatrix}y_{i}&Y_{-i}\end{bmatrix}}'M{\begin{bmatrix}y_{i}&Y_{-i}\end{bmatrix}}{\Big |}=0}

K class estimators

[edit]

The LIML is a special case of the K-class estimators:[16]

δ^=(Z(IκM)Z)1Z(IκM)y,{\displaystyle {\hat {\delta }}={\Big (}Z'(I-\kappa M)Z{\Big )}^{\!-1}Z'(I-\kappa M)y,}

with:

Several estimators belong to this class:

  • κ=0:OLS
  • κ=1: 2SLS. Note indeed that in this case,IκM=IM=P{\displaystyle I-\kappa M=I-M=P} the usual projection matrix of the 2SLS
  • κ=λ: LIML
  • κ=λ - α / (n-K):Fuller (1977) estimator.[17] Here K represents the number of instruments, n the sample size, and α a positive constant to specify. A value of α=1 will yield an estimator that is approximately unbiased.[16]

Three-stage least squares (3SLS)

[edit]

The three-stage least squares estimator was introduced byZellner & Theil (1962).[18][19] It can be seen as a special case of multi-equationGMM where the set ofinstrumental variables is common to all equations.[20] If all regressors are in fact predetermined, then 3SLS reduces toseemingly unrelated regressions (SUR). Thus it may also be seen as a combination oftwo-stage least squares (2SLS) with SUR.

Applications in social science

[edit]

Across fields and disciplines simultaneous equation models are applied to various observational phenomena. These equations are applied when phenomena are assumed to be reciprocally causal. The classic example is supply and demand ineconomics. In other disciplines there are examples such as candidate evaluations and party identification[21] or public opinion and social policy inpolitical science;[22][23] road investment and travel demand in geography;[24] and educational attainment and parenthood entry insociology ordemography.[25] The simultaneous equation model requires a theory of reciprocal causality that includes special features if the causal effects are to be estimated as simultaneous feedback as opposed to one-sided 'blocks' of an equation where a researcher is interested in the causal effect of X on Y while holding the causal effect of Y on X constant, or when the researcher knows the exact amount of time it takes for each causal effect to take place, i.e., the length of the causal lags. Instead of lagged effects, simultaneous feedback means estimating the simultaneous and perpetual impact of X and Y on each other. This requires a theory that causal effects are simultaneous in time, or so complex that they appear to behave simultaneously; a common example are the moods of roommates.[26] To estimate simultaneous feedback models a theory of equilibrium is also necessary – that X and Y are in relatively steady states or are part of a system (society, market, classroom) that is in a relatively stable state.[27]

See also

[edit]

References

[edit]
  1. ^Martin, Vance; Hurn, Stan; Harris, David (2013).Econometric Modelling with Time Series. Cambridge University Press. p. 159.ISBN 978-0-521-19660-4.
  2. ^Maddala, G. S.; Lahiri, Kajal (2009).Introduction to Econometrics (Fourth ed.). Wiley. pp. 355–357.ISBN 978-0-470-01512-4.
  3. ^Quandt, Richard E. (1983). "Computational Problems and Methods". In Griliches, Z.; Intriligator, M. D. (eds.).Handbook of Econometrics. Vol. I. North-Holland. pp. 699–764.ISBN 0-444-86185-8.
  4. ^Christ, Carl F. (1994). "The Cowles Commission's Contributions to Econometrics at Chicago, 1939–1955".Journal of Economic Literature.32 (1):30–59.JSTOR 2728422.
  5. ^Johnston, J. (1971). "Simultaneous-equation Methods: Estimation".Econometric Methods (Second ed.). New York: McGraw-Hill. pp. 376–423.ISBN 0-07-032679-7.
  6. ^abWooldridge, J.M., Econometric Analysis of Cross Section and Panel Data, MIT Press, Cambridge, Mass.
  7. ^abcGreene, William H. (2002).Econometric analysis (5th ed.). Prentice Hall. pp. 398–99.ISBN 0-13-066189-9.
  8. ^Theil, H. (1953). Estimation and Simultaneous Correlation in Complete Equation Systems (Memorandum). Central Planning Bureau. Reprinted inHenri Theil’s Contributions to Economics and Econometrics (Springer, 1992),doi:10.1007/978-94-011-2546-8_6.
  9. ^Basmann, R. L. (1957). "A generalized classical method of linear estimation of coefficients in a structural equation".Econometrica.25 (1):77–83.doi:10.2307/1907743.JSTOR 1907743.
  10. ^Theil, Henri (1971).Principles of Econometrics. New York: John Wiley.ISBN 978-0-471-85845-4.
  11. ^Park, S-B. (1974) "On Indirect Least Squares Estimation of a Simultaneous Equation System",The Canadian Journal of Statistics / La Revue Canadienne de Statistique, 2 (1), 75–82JSTOR 3314964
  12. ^Vajda, S.; Valko, P.; Godfrey, K.R. (1987). "Direct and indirect least squares methods in continuous-time parameter estimation".Automatica.23 (6):707–718.doi:10.1016/0005-1098(87)90027-6.
  13. ^First application byGirshick, M. A.; Haavelmo, Trygve (1947). "Statistical Analysis of the Demand for Food: Examples of Simultaneous Estimation of Structural Equations".Econometrica.15 (2):79–110.doi:10.2307/1907066.JSTOR 1907066.
  14. ^Anderson, T.W.; Rubin, H. (1949)."Estimator of the parameters of a single equation in a complete system of stochastic equations".Annals of Mathematical Statistics.20 (1):46–63.doi:10.1214/aoms/1177730090.JSTOR 2236803.
  15. ^Amemiya, Takeshi (1985).Advanced Econometrics. Cambridge, Massachusetts: Harvard University Press. p. 235.ISBN 0-674-00560-0.
  16. ^abDavidson, Russell; MacKinnon, James G. (1993).Estimation and inference in econometrics. Oxford University Press. p. 649.ISBN 0-19-506011-3.
  17. ^Fuller, Wayne (1977). "Some Properties of a Modification of the Limited Information Estimator".Econometrica.45 (4):939–953.doi:10.2307/1912683.JSTOR 1912683.
  18. ^Zellner, Arnold;Theil, Henri (1962). "Three-stage least squares: simultaneous estimation of simultaneous equations".Econometrica.30 (1):54–78.doi:10.2307/1911287.JSTOR 1911287.
  19. ^Kmenta, Jan (1986)."System Methods of Estimation".Elements of Econometrics (Second ed.). New York: Macmillan. pp. 695–701.ISBN 9780023650703.
  20. ^Hayashi, Fumio (2000)."Multiple-Equation GMM".Econometrics. Princeton University Press. pp. 276–279.ISBN 1400823838.
  21. ^Page, Benjamin I.; Jones, Calvin C. (1979-12-01). "Reciprocal Effects of Policy Preferences, Party Loyalties and the Vote".American Political Science Review.73 (4):1071–1089.doi:10.2307/1953990.ISSN 0003-0554.JSTOR 1953990.S2CID 144984371.
  22. ^Wlezien, Christopher (1995-01-01). "The Public as Thermostat: Dynamics of Preferences for Spending".American Journal of Political Science.39 (4):981–1000.doi:10.2307/2111666.JSTOR 2111666.
  23. ^Breznau, Nate (2016-07-01)."Positive Returns and Equilibrium: Simultaneous Feedback Between Public Opinion and Social Policy".Policy Studies Journal.45 (4):583–612.doi:10.1111/psj.12171.ISSN 1541-0072.
  24. ^Xie, F.; Levinson, D. (2010-05-01). "How streetcars shaped suburbanization: a Granger causality analysis of land use and transit in the Twin Cities".Journal of Economic Geography.10 (3):453–470.doi:10.1093/jeg/lbp031.hdl:11299/179996.ISSN 1468-2702.
  25. ^Marini, Margaret Mooney (1984-01-01). "Women's Educational Attainment and the Timing of Entry into Parenthood".American Sociological Review.49 (4):491–511.doi:10.2307/2095464.JSTOR 2095464.
  26. ^Wong, Chi-Sum; Law, Kenneth S. (1999-01-01). "Testing Reciprocal Relations by Nonrecursive Structuralequation Models Using Cross-Sectional Data".Organizational Research Methods.2 (1):69–87.doi:10.1177/109442819921005.ISSN 1094-4281.S2CID 122284566.
  27. ^2013. “Reverse Arrow Dynamics: Feedback Loops and Formative Measurement.” InStructural Equation Modeling: A Second Course, edited byGregory R. Hancock and Ralph O. Mueller, 2nd ed., 41–79. Charlotte, NC: Information Age Publishing

Further reading

[edit]

External links

[edit]
Retrieved from "https://en.wikipedia.org/w/index.php?title=Simultaneous_equations_model&oldid=1266875142"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp