Making Decisions from Data
This topic brings together key ideas in statistics that help us make informed decisions using data. You will explore how data can be described, summarised, and used to draw conclusions with confidence.
- The Empirical Rule — understanding the spread of data in a normal distribution using the 68–95% rule.
- Margin of Error — measuring how accurate a sample result is, using the rule \( E = \frac{1}{\sqrt{n}} \).
- Confidence Intervals — estimating a range within which the true population proportion is likely to fall.
- Hypothesis Testing — deciding whether to accept or reject a claim about a population proportion.
Each section below includes short explanations, worked examples, and interactive quizzes to help you practise applying these ideas.
The Empirical Rule (68%–95%)
Many sets of data — like exam marks or heights — follow a normal distribution: a symmetric, bell-shaped curve centred at the mean \(\bar{x}\). The standard deviation \(\sigma\) measures how spread out the data are.
The Rule
- 68% of values lie within \(\bar{x}\pm\sigma\) (between \(-1\sigma\) and \(+1\sigma\)).
- 95% of values lie within \(\bar{x}\pm2\sigma\) (between \(-2\sigma\) and \(+2\sigma\)).
68% of marks: \(60\pm10\Rightarrow [50,70]\).
95% of marks: \(60\pm20\Rightarrow [40,80]\).
68%: \([162,178]\text{ cm}\).
95%: \([154,186]\text{ cm}\).
Empirical Rule Quiz
Labels: a = mean − 2σ, b = mean − 1σ, c = mean, d = mean + 1σ, e = mean + 2σ.
Margin of Error for a Population Proportion
At a 5% level of significance, the margin of error is given by:
Example
Find the margin of error for a sample of \(n = 400\).
Practice Question
Find the margin of error when \(n = 900\).
Population Proportion \( \hat{p} \)
The sample proportion \( \hat{p} \) represents the proportion of individuals in a sample who have a certain characteristic.
- \(x\) = number in the sample with the characteristic
- \(n\) = total sample size
Example
In a survey of 200 people, 90 said they travel to school by bus. Find \( \hat{p} \).
Quiz – Finding \( \hat{p} \)
Find the sample proportion \( \hat{p} = \dfrac{x}{n} \) and choose the correct percentage.
95% Confidence Interval for a Proportion
- Compute the sample proportion \( \hat{p}=\dfrac{x}{n} \) (write as a percentage).
- Find the margin of error \( E=\dfrac{1}{\sqrt{n}} \) (convert to a percentage).
- Write the interval \( [\,\hat{p}-E,\;\hat{p}+E\,] \) (in percentages).
Sample Question
In a survey, \(n=400\) and \(x=116\) said “Yes”. Construct a 95% confidence interval for the true percentage who would say “Yes”.
Quiz — Choose the Correct 95% Confidence Interval
For each question, compute \( \hat{p}=\dfrac{x}{n} \) and \( E=\dfrac{1}{\sqrt{n}} \), then pick the correct interval \( [\,\hat{p}-E,\ \hat{p}+E\,] \) (in percentages).
Hypothesis Test using the Margin of Error
This method tests a claim about a population proportion using the Margin of Error (E).
- Find the sample proportion: \( \hat{p} = \dfrac{x}{n} \)
- Find the margin of error: \( E = \dfrac{1}{\sqrt{n}} \)
- Construct the 95% confidence interval: \( [\,\hat{p} - E,\;\hat{p} + E\,] \)
- Check if the hypothesised value \( p_0 \) is inside or outside the interval.
- Conclude:
- If \( p_0 \) is inside → Do not reject the claim.
- If \( p_0 \) is outside → Reject the claim.
Example
In a survey of \(n = 400\) students, \(220\) said they prefer digital textbooks. Test, at the 5% level, whether the true proportion of students who prefer digital textbooks is 50%.
Step 1: Sample proportion
\[ \hat{p} = \frac{220}{400} = 0.55 = 55\% \]Step 2: Margin of error
\[ E = \frac{1}{\sqrt{400}} = \frac{1}{20} = 0.05 = 5\% \]Step 3: 95% Confidence interval
\[ [\,55\% - 5\%,\; 55\% + 5\%\,] = [\,50\%,\;60\%\,] \]Step 4: The hypothesised value \(p_0 = 50\%\) is inside the interval.
Step 5: Do not reject the claim. There is no evidence that the proportion differs from 50%.
Practice Question
A survey of \(n = 900\) adults finds that \(432\) support a new local policy. Test, at the 5% level, whether the true proportion of support is 45%.
Step 1: Sample proportion
\[ \hat{p} = \frac{432}{900} = 0.48 = 48\% \]Step 2: Margin of error
\[ E = \frac{1}{\sqrt{900}} = \frac{1}{30} \approx 0.0333 = 3.3\% \]Step 3: 95% Confidence interval
\[ [\,48\% - 3.3\%,\; 48\% + 3.3\%\,] = [\,44.7\%,\;51.3\%\,] \]Step 4: The hypothesised value \(p_0 = 45\%\) is inside the interval.
Step 5: Do not reject the claim. The data are consistent with 45% support.
Hypothesis Test – Accept or Reject?
Decide whether to accept or reject the claim based on the 95% confidence interval.