The Pearson correlation coefficient $r$ measures the strength and direction of a linear association between two numerical variables.
$$-1 \leq r \leq 1$$
| Value of $r$ | Interpretation |
|---|---|
| $r = 1$ | Perfect positive linear association |
| \$0.75 \leq r < 1$ | Strong positive linear association |
| \$0.5 \leq r < 0.75$ | Moderate positive linear association |
| \$0.25 \leq r < 0.5$ | Weak positive linear association |
| $r \approx 0$ | No linear association |
| $-0.25 < r < 0$ | Weak negative linear association |
| $-0.5 < r \leq -0.25$ | Moderate negative linear association |
| $-0.75 < r \leq -0.5$ | Strong negative linear association |
| $r = -1$ | Perfect negative linear association |
Note: VCAA uses the guideline: $|r| \geq 0.75$ strong, \$0.5 \leq |r| < 0.75$ moderate, \$0.25 \leq |r| < 0.5$ weak, $|r| < 0.25$ very weak/no association.
In VCE General Mathematics, $r$ is calculated using a CAS calculator:
Example: If CAS gives $r = 0.92$, this indicates a strong, positive linear association.
$$r^2 = (\text{correlation coefficient})^2$$
$r^2$ gives the proportion of variation in $y$ that is explained by the linear relationship with $x$.
| $r$ | $r^2$ | Interpretation |
|---|---|---|
| 0.9 | 0.81 | 81% of variation in y is explained by x |
| 0.7 | 0.49 | 49% of variation in y is explained by x |
| 0.5 | 0.25 | 25% of variation in y is explained by x |
EXAM TIP: VCAA often asks for both $r$ and $r^2$ and their interpretation. Always express $r^2$ as a percentage and link it to the context. E.g. “81% of the variation in exam scores is explained by the linear relationship with hours studied.”
A high value of $|r|$ tells us there is a strong association — it does not tell us that $x$ causes $y$. There may be:
- A lurking variable affecting both
- Pure coincidence
KEY TAKEAWAY: $r$ measures the strength of a linear association. It cannot be used to conclude causation, and it cannot detect non-linear relationships.
COMMON MISTAKE: Stating $r = 0.85$ means “85% of the data follows the linear pattern.” The correct interpretation uses $r^2$: $r^2 = 0.72$ means 72% of variation in $y$ is explained by $x$.