The correlation coefficient, sometimes also called the cross-correlation coefficient, Pearson correlation coefficient (PCC), Pearson's , the Perason product-moment correlation coefficient (PPMCC), or the bivariate correlation, is a quantity that gives the quality of a least squares fitting to the original data. To define the correlation coefficient, first consider the sum of squared values , , and of a set of data points about their respective means,
These quantities are simply unnormalized forms of the variances and covariance of and given by
For linear least squares fitting, the coefficient in
(16)
is given by
and the coefficient in
(19)
is given by
(20)
The correlation coefficient (sometimes also denoted ) is then defined by
The correlation coefficient is also known as the product-moment coefficient of correlation or Pearson's correlation. The correlation coefficients for linear fits to increasingly noisy data are shown above.
The correlation coefficient has an important physical interpretation. To see this, define
(23)
and denote the "expected" value for as . Sums of are then
The sum of squared errors is then
and the sum of squared residuals is
But
so
and
(53)
The square of the correlation coefficient is therefore given by
In other words, is the proportion of which is accounted for by the regression.
If there is complete correlation, then the lines obtained by solving for best-fit and coincide (since all data points lie on them), so solving (◇) for and equating to (◇) gives
(57)
Therefore, and , giving
(58)
The correlation coefficient is independent of both origin and scale, so
(59)
where
See alsoCorrelation Index,
Correlation Coefficient--Bivariate Normal Distribution,
Correlation Ratio,
Covariance,
Least Squares Fitting,
Regression Coefficient,
Spearman Rank Correlation Coefficient,
Variance Explore this topic in the MathWorld classroom Explore with Wolfram|Alpha ReferencesActon, F. S. Analysis of Straight-Line Data. New York: Dover, 1966.Edwards, A. L. "The Correlation Coefficient." Ch. 4 in An Introduction to Linear Regression and Correlation. San Francisco, CA: W. H. Freeman, pp. 33-46, 1976.Gonick, L. and Smith, W. "Regression." Ch. 11 in The Cartoon Guide to Statistics. New York: Harper Perennial, pp. 187-210, 1993.Kenney, J. F. and Keeping, E. S. "Linear Regression and Correlation." Ch. 15 in Mathematics of Statistics, Pt. 1, 3rd ed. Princeton, NJ: Van Nostrand, pp. 252-285, 1962.Press, W. H.; Flannery, B. P.; Teukolsky, S. A.; and Vetterling, W. T. "Linear Correlation." §14.5 in Numerical Recipes in FORTRAN: The Art of Scientific Computing, 2nd ed. Cambridge, England: Cambridge University Press, pp. 630-633, 1992.Snedecor, G. W. and Cochran, W. G. "The Sample Correlation Coefficient " and "Properties of ." §10.1-10.2 in Statistical Methods, 7th ed. Ames, IA: Iowa State Press, pp. 175-178, 1980.Spiegel, M. R. "Correlation Theory." Ch. 14 in Theory and Problems of Probability and Statistics, 2nd ed. New York: McGraw-Hill, pp. 294-323, 1992.Whittaker, E. T. and Robinson, G. "The Coefficient of Correlation for Frequency Distributions which are not Normal." §166 in The Calculus of Observations: A Treatise on Numerical Mathematics, 4th ed. New York: Dover, pp. 334-336, 1967. Referenced on Wolfram|AlphaCorrelation Coefficient Cite this as:Weisstein, Eric W. "Correlation Coefficient." From MathWorld--A Wolfram Resource. https://mathworld.wolfram.com/CorrelationCoefficient.html
Subject classificationsRetroSearch is an open source project built by @garambo | Open a GitHub Issue
Search and Browse the WWW like it's 1997 | Search results from DuckDuckGo
HTML:
3.2
| Encoding:
UTF-8
| Version:
0.7.4