10.2: The Linear Correlation Coefficient

alt

Figure \(\PageIndex<1>\) illustrates linear relationships between two variables \(x\) and \(y\) of varying strengths. It is visually apparent that in the situation in panel (a), \(x\) could serve as a useful predictor of \(y\), it would be less useful in the situation illustrated in panel (b), and in the situation of panel (c) the linear relationship is so weak as to be practically nonexistent. The linear correlation coefficient is a number computed directly from the data that measures the strength of the linear relationship between the two variables \(x\) and \(y\).

Definition: linear correlation coefficient

The linear correlation coefficient for a collection of \(n\) pairs \(x\) of numbers in a sample is the number \(r\) given by the formula

  1. The value of \(r\) lies between \(−1\) and \(1\), inclusive.
  2. The sign of \(r\) indicates the direction of the linear relationship between \(x\) and \(y\):
  3. The size of \(|r|\) indicates the strength of the linear relationship between \(x\) and \(y\):
    1. If \(|r|\) is near \(1\) (that is, if \(r\) is near either \(1\) or \(−1\)), then the linear relationship between \(x\) and \(y\) is strong.
    2. If \(|r|\) is near \(0\) (that is, if \(r\) is near \(0\) and of either sign). then the linear relationship between \(x\) and \(y\) is weak.

    The number quantifies what is visually apparent from Figure \(\PageIndex\) weights tends to increase linearly with height (\(r\) is positive) and although the relationship is not perfect, it is reasonably strong (\(r\) is near \(1\)).

    alt

    Example \(\PageIndex\)

    Compute the linear correlation coefficient for the height and weight pairs plotted in Figure \(\PageIndex\).

    Even for small data sets like this one computations are too long to do completely by hand. In actual practice the data are entered into a calculator or computer and a statistics program is used. In order to clarify the meaning of the formulas we will display the data and related quantities in tabular form. For each

    \(x\) \(y\) \(x^2\) \(y^2\)
    68 151 4624 10268 22801
    69 146 4761 10074 21316
    70 157 4900 10990 24649
    70 164 4900 11480 26896
    71 171 5041 12141 29241
    72 160 5184 11520 25600
    72 163 5184 11736 26569
    72 180 5184 12960 32400
    73 170 5329 12410 28900
    73 175 5329 12775 30625
    74 178 5476 13172 31684
    75 188 5625 14100 35344
    859 2003 61537 143626 336025
    Key Takeaway

    This page titled 10.2: The Linear Correlation Coefficient is shared under a CC BY-NC-SA 3.0 license and was authored, remixed, and/or curated by Anonymous via source content that was edited to the style and standards of the LibreTexts platform.

    1. Back to top
      • 10.1: Linear Relationships Between Variables
      • 10.3: Modelling Linear Relationships with Randomness Present

    Recommended articles

    1. Article type Section or Page Author Anonymous License CC BY-NC-SA License Version 3.0 OER program or Publisher The Publisher Who Must Not Be Named Show TOC no
    2. Tags
      1. linear correlation coefficient
      2. source@https://2012books.lardbucket.org/books/beginning-statistics