Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Coefficient of multiple correlation

From Wikipedia, the free encyclopedia
Statistical concept
This article includes a list ofgeneral references, butit lacks sufficient correspondinginline citations. Please help toimprove this article byintroducing more precise citations.(November 2010) (Learn how and when to remove this message)

Instatistics, thecoefficient of multiple correlation is a measure of how well a given variable can be predicted using alinear function of a set of other variables. It is thecorrelation between the variable's values and the best predictions that can be computedlinearly from the predictive variables.[1]

The coefficient of multiple correlation takes values between 0 and 1. Higher values indicate higher predictability of thedependent variable from theindependent variables, with a value of 1 indicating that the predictions are exactly correct and a value of 0 indicating that no linear combination of the independent variables is a better predictor than is the fixedmean of the dependent variable.[2]

Correlation Coefficient (r)Direction and Strength of Correlation
1Perfectly positive
0.8Strongly positive
0.5Moderately positive
0.2Weakly positive
0No association
-0.2Weakly negative
-0.5Moderately negative
-0.8Strongly negative
-1Perfectly negative

The coefficient of multiple correlation is known as the square root of thecoefficient of determination, but under the particular assumptions that an intercept is included and that the best possible linear predictors are used, whereas the coefficient of determination is defined for more general cases, including those of nonlinear prediction and those in which the predicted values have not been derived from a model-fitting procedure.

Definition

[edit]

The coefficient of multiple correlation, denotedR, is ascalar that is defined as thePearson correlation coefficient between the predicted and the actual values of the dependent variable in a linear regression model that includes anintercept.

Computation

[edit]

The square of the coefficient of multiple correlation can be computed using thevectorc=(rx1y,rx2y,,rxNy){\displaystyle \mathbf {c} ={(r_{x_{1}y},r_{x_{2}y},\dots ,r_{x_{N}y})}^{\top }} ofcorrelationsrxny{\displaystyle r_{x_{n}y}} between the predictor variablesxn{\displaystyle x_{n}} (independent variables) and the target variabley{\displaystyle y} (dependent variable), and thecorrelation matrixRxx{\displaystyle R_{xx}} of correlations between predictor variables. It is given by

R2=cRxx1c,{\displaystyle R^{2}=\mathbf {c} ^{\top }R_{xx}^{-1}\,\mathbf {c} ,}

wherec{\displaystyle \mathbf {c} ^{\top }} is thetranspose ofc{\displaystyle \mathbf {c} }, andRxx1{\displaystyle R_{xx}^{-1}} is theinverse of the matrix

Rxx=(rx1x1rx1x2rx1xNrx2x1rxNx1rxNxN).{\displaystyle R_{xx}=\left({\begin{array}{cccc}r_{x_{1}x_{1}}&r_{x_{1}x_{2}}&\dots &r_{x_{1}x_{N}}\\r_{x_{2}x_{1}}&\ddots &&\vdots \\\vdots &&\ddots &\\r_{x_{N}x_{1}}&\dots &&r_{x_{N}x_{N}}\end{array}}\right).}

If all the predictor variables are uncorrelated, the matrixRxx{\displaystyle R_{xx}} is theidentity matrix andR2{\displaystyle R^{2}} simply equalscc{\displaystyle \mathbf {c} ^{\top }\,\mathbf {c} }, the sum of the squared correlations with the dependent variable. If the predictor variables are correlated among themselves, the inverse of the correlation matrixRxx{\displaystyle R_{xx}} accounts for this.

The squared coefficient of multiple correlation can also be computed as the fraction of variance of the dependent variable that is explained by the independent variables, which in turn is 1 minus the unexplained fraction. The unexplained fraction can be computed as thesum of squares of residuals—that is, the sum of the squares of the prediction errors—divided by thesum of squares of deviations of the values of the dependent variable from itsexpected value.

Properties

[edit]

With more than two variables being related to each other, the value of the coefficient of multiple correlation depends on the choice of dependent variable: a regression ofy{\displaystyle y} onx{\displaystyle x} andz{\displaystyle z} will in general have a differentR{\displaystyle R} than will a regression ofz{\displaystyle z} onx{\displaystyle x} andy{\displaystyle y}. For example, suppose that in a particular sample the variablez{\displaystyle z} isuncorrelated with bothx{\displaystyle x} andy{\displaystyle y}, whilex{\displaystyle x} andy{\displaystyle y} are linearly related to each other. Then a regression ofz{\displaystyle z} ony{\displaystyle y} andx{\displaystyle x} will yield anR{\displaystyle R} of zero, while a regression ofy{\displaystyle y} onx{\displaystyle x} andz{\displaystyle z} will yield a strictly positiveR{\displaystyle R}. This follows since the correlation ofy{\displaystyle y} with its best predictor based onx{\displaystyle x} andz{\displaystyle z} is in all cases at least as large as the correlation ofy{\displaystyle y} with its best predictor based onx{\displaystyle x} alone, and in this case withz{\displaystyle z} providing noexplanatory power it will be exactly as large.

References

[edit]
  1. ^Introduction to Multiple Regression
  2. ^Multiple correlation coefficient

Further reading

[edit]
  • Allison, Paul D. (1998).Multiple Regression: A Primer. London: Sage Publications.ISBN 9780761985334
  • Cohen, Jacob, et al. (2002).Applied Multiple Regression: Correlation Analysis for the Behavioral Sciences.ISBN 0805822232
  • Crown, William H. (1998).Statistical Models for the Social and Behavioral Sciences: Multiple Regression and Limited-Dependent Variable Models.ISBN 0275953165
  • Edwards, Allen Louis (1985).Multiple Regression and the Analysis of Variance and Covariance.ISBN 0716710811
  • Keith, Timothy (2006).Multiple Regression and Beyond. Boston: Pearson Education.
  • Fred N. Kerlinger, Elazar J. Pedhazur (1973).Multiple Regression in Behavioral Research. New York: Holt Rinehart Winston.ISBN 9780030862113
  • Stanton, Jeffrey M. (2001)."Galton, Pearson, and the Peas: A Brief History of Linear Regression for Statistics Instructors",Journal of Statistics Education, 9 (3).
Retrieved from "https://en.wikipedia.org/w/index.php?title=Coefficient_of_multiple_correlation&oldid=1312481095"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2025 Movatter.jp