Movatterモバイル変換


[0]ホーム

URL:


Jump to content
WikipediaThe Free Encyclopedia
Search

Correlation coefficient

From Wikipedia, the free encyclopedia
Numerical measure of a statistical relationship between variables

Acorrelation coefficient is anumerical measure of some type oflinearcorrelation, meaning alinear function between twovariables.[a] The variables may be two columns of a givendata set of observations, often called asample, or two components of amultivariate random variable with a knowndistribution.[citation needed]

Several types of correlation coefficient exist, each with their own definition and own range of usability and characteristics. They all assume values in the range from −1 to +1, where ±1 indicates the strongest possible correlation and 0 indicates no correlation.[2] As tools of analysis, correlation coefficients present certain problems, including the propensity of some types to be distorted byoutliers and the possibility of incorrectly being used to infer acausal relationship between the variables (for more, seeCorrelation does not imply causation).[3]

Types

[edit]

There are several different measures for the degree of correlation in data, depending on the kind of data: principally whether the data is a measurement,ordinal, orcategorical.

Pearson

[edit]

ThePearson product-moment correlation coefficient, also known asr,R, orPearson's r, is a measure of the strength and direction of thelinear relationship between two variables that is defined as thecovariance of the variables divided by the product of their standard deviations.[4] This is the best-known and most commonly used type of correlation coefficient. When the term "correlation coefficient" is used without further qualification, it usually refers to the Pearson product-moment correlation coefficient.

Intra-class

[edit]

Intraclass correlation (ICC) is a descriptive statistic that can be used, when quantitative measurements are made on units that are organized into groups; it describes how strongly units in the same group resemble each other.

Rank

[edit]

Rank correlation is a measure of the relationship between the rankings of two variables, or two rankings of the same variable:

Tetrachoric and polychoric

[edit]

Thepolychoric correlation coefficient measures association between two ordered-categorical variables. It's technically defined as the estimate of the Pearson correlation coefficient one would obtain if:

  1. The two variables were measured on a continuous scale, instead of as ordered-category variables.
  2. The two continuous variables followed abivariate normal distribution.

When both variables aredichotomous instead of ordered-categorical, thepolychoric correlation coefficient is called the tetrachoric correlation coefficient.

Interpreting correlation coefficient values

[edit]

The correlation between two variables have different associations that are measured in values such asr orR. Correlation values range from −1 to +1, where ±1 indicates the strongest possible correlation and 0 indicates no correlation between variables.[5]

r orRr orRStrength or weakness of association between variables[6]
+1.0 to +0.8-1.0 to -0.8Perfect or very strong association
+0.8 to +0.6-0.8 to -0.6Strong association
+0.6 to +0.4-0.6 to -0.4Moderate association
+0.4 to +0.2-0.4 to -0.2Weak association
+0.2 to 0.0-0.2 to 0.0Very weak or no association

See also

[edit]

Notes

[edit]
  1. ^Correlation coefficient: Astatistic used to show how the scores from one measure relate to scores on a second measure for the same group of individuals. A high value (approaching +1.00) is a strong direct relationship, values near 0.50 are considered moderate and values below 0.30 are considered to show weak relationship. A low negative value (approaching -1.00) is similarly a strong inverse relationship, and values near 0.00 indicate little, if any, relationship.[1]

References

[edit]
  1. ^"correlation coefficient".NCME.org.National Council on Measurement in Education. Archived fromthe original on July 22, 2017. RetrievedApril 17, 2014.
  2. ^Taylor, John R. (1997).An Introduction to Error Analysis: The Study of Uncertainties in Physical Measurements(PDF) (2nd ed.). Sausalito, CA: University Science Books. p. 217.ISBN 0-935702-75-X. Archived fromthe original(PDF) on 15 February 2019. Retrieved14 February 2019.
  3. ^Boddy, Richard; Smith, Gordon (2009).Statistical Methods in Practice: For scientists and technologists. Chichester, U.K.: Wiley. pp. 95–96.ISBN 978-0-470-74664-6.
  4. ^Weisstein, Eric W."Statistical Correlation".mathworld.wolfram.com. Retrieved2020-08-22.
  5. ^Taylor, John R. (1997).An Introduction to Error Analysis: The Study of Uncertainties in Physical Measurements(PDF) (2nd ed.). Sausalito, CA: University Science Books. p. 217.ISBN 0-935702-75-X. Archived fromthe original(PDF) on 15 February 2019. Retrieved14 February 2019.
  6. ^"The Correlation Coefficient (r)".Boston University.
Continuous data
Center
Dispersion
Shape
Count data
Summary tables
Dependence
Graphics
Study design
Survey methodology
Controlled experiments
Adaptive designs
Observational studies
Statistical theory
Frequentist inference
Point estimation
Interval estimation
Testing hypotheses
Parametric tests
Specific tests
Goodness of fit
Rank statistics
Bayesian inference
Correlation
Regression analysis
Linear regression
Non-standard predictors
Generalized linear model
Partition of variance
Categorical
Multivariate
Time-series
General
Specific tests
Time domain
Frequency domain
Survival
Survival function
Hazard function
Test
Biostatistics
Engineering statistics
Social statistics
Spatial statistics
Portal:
Retrieved from "https://en.wikipedia.org/w/index.php?title=Correlation_coefficient&oldid=1328275893"
Categories:
Hidden categories:

[8]ページ先頭

©2009-2026 Movatter.jp