Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

General linear model

From Wikipedia, the free encyclopedia
Statistical linear model
Not to be confused withMultiple linear regression,Generalized linear model orGeneral linear methods.
Part of a series on
Regression analysis
Models
Estimation
Background

Thegeneral linear model orgeneral multivariate regression model is a compact way of simultaneously writing severalmultiple linear regression models. In that sense it is not a separate statisticallinear model. The various multiple linear regression models may be compactly written as[1]

Y=XB+U,{\displaystyle \mathbf {Y} =\mathbf {X} \mathbf {B} +\mathbf {U} ,}

whereY is amatrix with series of multivariate measurements (each column being a set of measurements on one of thedependent variables),X is a matrix of observations onindependent variables that might be adesign matrix (each column being a set of observations on one of the independent variables),B is a matrix containing parameters that are usually to be estimated andU is a matrix containingerrors (noise). The errors are usually assumed to be uncorrelated across measurements, and follow amultivariate normal distribution. If the errors do not follow a multivariate normal distribution,generalized linear models may be used to relax assumptions aboutY andU.


The general linear model (GLM) encompasses several statistical models, includingANOVA,ANCOVA,MANOVA,MANCOVA, and ordinarylinear regression. Within this framework, both thet-test and theF-test can be applied. The general linear model is a generalization of multiple linear regression to the case of more than one dependent variable. IfY,B, andU werecolumn vectors, the matrix equation above would represent multiple linear regression.

Hypothesis tests with the general linear model can be made in two ways:multivariate or as several independentunivariate tests. In multivariate tests the columns ofY are tested together, whereas in univariate tests the columns ofY are tested independently, i.e., as multiple univariate tests with the same design matrix.

Comparison to multiple linear regression

[edit]
Further information:Multiple linear regression

Multiple linear regression is a generalization ofsimple linear regression to the case of more than one independent variable, and aspecial case of general linear models, restricted to one dependent variable. The basic model for multiple linear regression is

Yi=β0+β1Xi1+β2Xi2++βpXip+ϵi{\displaystyle Y_{i}=\beta _{0}+\beta _{1}X_{i1}+\beta _{2}X_{i2}+\ldots +\beta _{p}X_{ip}+\epsilon _{i}} or more compactlyYi=β0+k=1pβkXik+ϵi{\displaystyle Y_{i}=\beta _{0}+\sum \limits _{k=1}^{p}{\beta _{k}X_{ik}}+\epsilon _{i}}

for each observationi = 1, ... ,n.

In the formula above we considern observations of one dependent variable andp independent variables. Thus,Yi is theith observation of the dependent variable,Xik isith observation of thekth independent variable,k = 1, 2, ...,p. The valuesβk represent parameters to be estimated, andεi is theith independent identically distributed normal error.

In the more general multivariate linear regression, there is one equation of the above form for each ofm > 1 dependent variables that share the same set of explanatory variables and hence are estimated simultaneously with each other:

Yij=β0j+β1jXi1+β2jXi2++βpjXip+ϵij{\displaystyle Y_{ij}=\beta _{0j}+\beta _{1j}X_{i1}+\beta _{2j}X_{i2}+\ldots +\beta _{pj}X_{ip}+\epsilon _{ij}} or more compactlyYij=β0j+k=1pβkjXik+ϵij{\displaystyle Y_{ij}=\beta _{0j}+\sum \limits _{k=1}^{p}{\beta _{kj}X_{ik}}+\epsilon _{ij}}

for all observations indexed asi = 1, ... ,n and for all dependent variables indexed asj = 1,... ,m.

Note that, since each dependent variable has its own set of regression parameters to be fitted, from a computational point of view the general multivariate regression is simply a sequence of standard multiple linear regressions using the same explanatory variables.

Comparison to generalized linear model

[edit]

The general linear model and thegeneralized linear model (GLM)[2][3] are two commonly used families ofstatistical methods to relate some number of continuous and/or categoricalpredictors to a singleoutcome variable.

The main difference between the two approaches is that the general linear model strictly assumes that theresiduals will follow aconditionallynormal distribution,[4] while the GLM loosens this assumption and allows for a variety of otherdistributions from theexponential family for the residuals.[2] The general linear model is a special case of the GLM in which the distribution of the residuals follow a conditionally normal distribution.

The distribution of the residuals largely depends on the type and distribution of the outcome variable; different types of outcome variables lead to the variety of models within the GLM family. Commonly used models in the GLM family includebinary logistic regression[5] for binary or dichotomous outcomes,Poisson regression[6] for count outcomes, andlinear regression for continuous, normally distributed outcomes. This means that GLM may be spoken of as a general family of statistical models or as specific models for specific outcome types.

General linear modelGeneralized linear model
Typical estimation methodLeast squares,best linear unbiased predictionMaximum likelihood orBayesian
ExamplesANOVA,ANCOVA,linear regressionlinear regression,logistic regression,Poisson regression, gamma regression,[7] general linear model
Extensions and related methodsMANOVA,MANCOVA,linear mixed modelgeneralized linear mixed model (GLMM),generalized estimating equations (GEE)
R package and functionlm() in stats package (base R)glm() in stats package (base R) manova,
MATLAB functionmvregress()glmfit()
SAS proceduresPROC GLM,PROC REGPROC GENMOD,PROC LOGISTIC (for binary & ordered or unordered categorical outcomes)
Stata commandregressglm
SPSS commandregression,glmgenlin, logistic
Wolfram Language &Mathematica functionLinearModelFit[][8]GeneralizedLinearModelFit[][9]
EViews commandls[10]glm[11]
statsmodels Python Packageregression-and-linear-modelsGLM

Applications

[edit]

An application of the general linear model appears in the analysis of multiplebrain scans in scientific experiments whereY contains data from brain scanners,X contains experimental design variables and confounds. It is usually tested in a univariate way (usually referred to amass-univariate in this setting) and is often referred to asstatistical parametric mapping.[12]

See also

[edit]

Notes

[edit]
  1. ^Mardia, K. V.; Kent, J. T.; Bibby, J. M. (1979).Multivariate Analysis.Academic Press.ISBN 0-12-471252-5.
  2. ^abMcCullagh, P.;Nelder, J. A. (January 1, 1983). "An outline of generalized linear models".Generalized Linear Models. Springer US. pp. 21–47.doi:10.1007/978-1-4899-3242-6_2 (inactive 12 July 2025).ISBN 9780412317606.{{cite book}}: CS1 maint: DOI inactive as of July 2025 (link)
  3. ^Fox, J. (2015).Applied regression analysis and generalized linear models. Sage Publications.
  4. ^Cohen, J.; Cohen, P.; West, S. G.;Aiken, L. S. (2003). Applied multiple regression/correlation analysis for the behavioral sciences (Report).
  5. ^Hosmer Jr, D. W., Lemeshow, S., & Sturdivant, R. X. (2013).Applied logistic regression (Vol. 398). John Wiley & Sons.
  6. ^Gardner, W.; Mulvey, E. P.; Shaw, E. C. (1995). "Regression analyses of counts and rates: Poisson, overdispersed Poisson, and negative binomial models".Psychological Bulletin.118 (3):392–404.doi:10.1037/0033-2909.118.3.392.PMID 7501743.
  7. ^McCullagh, Peter;Nelder, John (1989).Generalized Linear Models (2nd ed.). Boca Raton: Chapman and Hall/CRC.ISBN 978-0-412-31760-6.
  8. ^LinearModelFit, Wolfram Language Documentation Center.
  9. ^GeneralizedLinearModelFit, Wolfram Language Documentation Center.
  10. ^ls, EViews Help.
  11. ^glm, EViews Help.
  12. ^Friston, K.J.; Holmes, A.P.; Worsley, K.J.; Poline, J.-B.; Frith, C.D.; Frackowiak, R.S.J. (1995). "Statistical Parametric Maps in functional imaging: A general linear approach".Human Brain Mapping.2 (4):189–210.doi:10.1002/hbm.460020402.S2CID 9898609.

References

[edit]
  • Christensen, Ronald (2020).Plane Answers to Complex Questions: The Theory of Linear Models (5th ed.). New York: Springer.ISBN 978-3-030-32096-6.
  • Wichura, Michael J. (2006).The coordinate-free approach to linear models. Cambridge Series in Statistical and Probabilistic Mathematics. Cambridge: Cambridge University Press. pp. xiv+199.ISBN 978-0-521-86842-6.MR 2283455.
  • Rawlings, John O.; Pantula, Sastry G.; Dickey, David A., eds. (1998).Applied Regression Analysis. Springer Texts in Statistics.doi:10.1007/b98890.ISBN 0-387-98454-2.
Continuous data
Center
Dispersion
Shape
Count data
Summary tables
Dependence
Graphics
Study design
Survey methodology
Controlled experiments
Adaptive designs
Observational studies
Statistical theory
Frequentist inference
Point estimation
Interval estimation
Testing hypotheses
Parametric tests
Specific tests
Goodness of fit
Rank statistics
Bayesian inference
Correlation
Regression analysis (see alsoTemplate:Least squares and regression analysis
Linear regression
Non-standard predictors
Generalized linear model
Partition of variance
Categorical
Multivariate
Time-series
General
Specific tests
Time domain
Frequency domain
Survival
Survival function
Hazard function
Test
Biostatistics
Engineering statistics
Social statistics
Spatial statistics
Retrieved from "https://en.wikipedia.org/w/index.php?title=General_linear_model&oldid=1309210392"
Category:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp