Statistical Inference: Applications, Assumptions, and Limitations

Statistical inference involves using data from a sample to make valid conclusions or generalizations about an entire population. In VCE Specialist Mathematics, this focus is primarily on the population mean ($\mu$) using the sample mean ($\bar{x}$).

1. The Foundation: Sampling Distribution of the Mean

To apply statistical inference to real data, we must understand the behavior of the sample mean $\bar{X}$.

Expected Value: $E(\bar{X}) = \mu$
Standard Error: $SD(\bar{X}) = \frac{\sigma}{\sqrt{n}}$, where $\sigma$ is the population standard deviation and $n$ is the sample size.
The Central Limit Theorem (CLT): For a sufficiently large sample size ($n \ge 30$), the sampling distribution of the mean $\bar{X}$ is approximately normally distributed, regardless of the population’s original distribution:
\$$\bar{X} \approx N\left(\mu, \frac{\sigma^2}{n}\right)$\$

KEY TAKEAWAY: The Central Limit Theorem is the “bridge” that allows us to use normal distribution techniques on real-world data even when the underlying population distribution is unknown or non-normal.

2. Confidence Intervals for the Population Mean

A confidence interval (CI) provides a range of plausible values for the population mean $\mu$, based on a sample mean $\bar{x}$.

Calculation

For a $C\%$ confidence interval:
\$$\left( \bar{x} - z\frac{\sigma}{\sqrt{n}}, \bar{x} + z\frac{\sigma}{\sqrt{n}} \right)$\$

Common $z$-values for confidence levels:
| Confidence Level | $z$-score (approx.) |
| :— | :— |
| 90% | 1.645 |
| 95% | 1.96 |
| 99% | 2.576 |

Factors Affecting the Width of the Interval

Sample Size ($n$): Increasing $n$ decreases the width (more precision).
Confidence Level: Increasing the confidence level (e.g., 95% to 99%) increases the width.
Standard Deviation ($\sigma$): Higher variability in the population increases the width.

EXAM TIP: If an exam question asks you to interpret a 95% confidence interval, avoid saying “There is a 95% probability that $\mu$ is in this interval.” Instead, say: “In the long run, 95% of intervals constructed using this method will contain the true population mean $\mu$.”

3. Hypothesis Testing with Real Data

Hypothesis testing is a formal procedure for investigating whether a claim about a population mean is supported by sample evidence.

The Components

Null Hypothesis ($H_0$): The status quo or the claim to be tested (e.g., $H_0: \mu = \mu_0$).
Alternative Hypothesis ($H_1$): What we suspect might be true instead ($H_1: \mu \neq \mu_0$, $\mu > \mu_0$, or $\mu < \mu_0$).
Test Statistic: The $z$-score calculated from the sample:
\$$z = \frac{\bar{x} - \mu_0}{\sigma/\sqrt{n}}$\$
$p$-value: The probability of obtaining a sample mean at least as extreme as the one observed, assuming $H_0$ is true.

Decision Rule

If $p < \alpha$ (significance level, usually 0.05): Reject $H_0$. There is statistically significant evidence for $H_1$.
If $p \ge \alpha$: Do not reject $H_0$. There is insufficient evidence to support $H_1$.

VCAA FOCUS: Always state your conclusion in the context of the original problem. For example: “Since $p = 0.03 < 0.05$, we reject $H_0$. There is evidence to suggest the mean height of the plants has increased.”

4. Errors in Statistical Inference

When applying inference to real data, there is always a risk of reaching the wrong conclusion.

Error Type	Definition	Probability
Type I Error	Rejecting $H_0$ when $H_0$ is actually true.	$\alpha$ (Significance level)
Type II Error	Failing to reject $H_0$ when $H_0$ is false.	$\beta$

COMMON MISTAKE: Students often think $P(\text{Type I Error}) + P(\text{Type II Error}) = 1$. This is incorrect. However, there is a trade-off: decreasing the probability of a Type I error (by lowering $\alpha$) usually increases the probability of a Type II error.

5. Assumptions and Limitations

When working with real-world datasets, the validity of our inference depends on several key assumptions. If these are violated, the results may be misleading.

Key Assumptions

Random Sampling: The data must be a simple random sample. Every member of the population must have an equal chance of being selected.
Independence: Observations must be independent of one another.
Normality:
- If $n < 30$, the population from which the sample is drawn must be normally distributed.
- If $n \ge 30$, the Central Limit Theorem allows us to assume the sampling distribution of the mean is normal.
Known Standard Deviation: The population standard deviation ($\sigma$) is assumed to be known. In practice, if $n$ is large, the sample standard deviation ($s$) is often used as an estimate.

Limitations in Real Data

Sampling Bias: Real data is often “convenience data.” If the sample is biased (e.g., only surveying people at a library about literacy), the inference cannot be generalized to the whole population.
Outliers: Real data often contains outliers that can significantly skew the sample mean $\bar{x}$, leading to incorrect $z$-scores and $p$-values.
Non-Response Bias: In surveys, people who choose not to respond may have different characteristics than those who do, affecting the mean.
Causality vs. Correlation: Statistical inference can show that a mean has changed or that a relationship exists, but it cannot prove why it happened (causation) without a controlled experiment.

STUDY HINT: When asked to “critique” or “discuss limitations” of a statistical study in an exam, look for mentions of “small sample sizes” ($n < 30$), “voluntary surveys” (bias), or “non-random selection.”

6. Summary Table: Application Steps

Step	Action
1. Check Assumptions	Is the sample random? Is $n \ge 30$ or is the population normal?
2. State Hypotheses	Define $H_0$ and $H_1$ clearly.
3. Calculate Statistic	Find $\bar{x}$ and then the $z$-score.
4. Determine $p$-value	Use CAS or normal distribution tables.
5. Conclude	Compare $p$ to $\alpha$ and write a contextual sentence.

REMEMBER: “If the $p$ is low, the null must go!” (If $p < \alpha$, reject $H_0$).

Statistical Inference: Applications, Assumptions, and Limitations

Table of Contents

About these notes

Join StudyPulse