Printable Worksheets
Print or save as PDF — or build a custom worksheet from any module's questions.
Data and Probability Review
A survey of 100 students shows mean study time = 5 hours, median = 4 hours. What does this suggest? Is the sample representative if only Year 9 students were asked?
Learning Intentions
Know
- Mean vs median
- Sampling bias
- Probability rules
- Data interpretation
Understand
- When to use each measure of centre
- How bias affects conclusions
- When to apply addition and multiplication rules
Can Do
- Calculate and compare measures of centre
- Identify bias in data collection
- Solve multi-step probability problems
- Interpret box plots and Venn diagrams
Choosing Measures of Centre
Guidelines for choosing:
- Mean: Best for symmetric data without outliers
- Median: Best for skewed data or data with outliers
- Mode: Best for categorical data or finding most common value
Mean > Median suggests right skew (tail to the right).
Mean < Median suggests left skew (tail to the left).
Probability Rules Summary
Essential rules:
$P(A') = 1 - P(A)$
$P(A cup B) = P(A) + P(B) - P(A cap B)$
$P(A ext{ then } B) = P(A) imes P(B|A)$
For independent events: $P(A cap B) = P(A) imes P(B)$
Mutually exclusive: $P(A cap B) = 0$
Data Collection Checklist
When evaluating a statistical study:
- Was the sample representative?
- Was the sampling method appropriate?
- Were questions unbiased?
- Was the sample size adequate?
- Are conclusions supported by the data?
Be sceptical of claims based on small, biased, or unrepresentative samples.
Check Understanding
In a class of 40, 25 passed Maths, 20 passed Science, and 12 passed both. Find P(passed at least one subject) and P(passed neither).
Data and Probability Review
Data: 2, 5, 7, 8, 10, 15, 20. Find mean, median, mode, range, IQR.
Mean = 67/7 ≈ 9.57
Median = 8
Mode = none
Range = 18
Q1 = 5, Q3 = 15, IQR = 10
A bag has 3 red, 4 blue, 3 green. Two drawn without replacement. Find P(both blue).
P(first blue) = 4/10
P(second blue) = 3/9
P(both blue) = 4/10 × 3/9 = 12/90 = 2/15
A survey asks leading questions at a shopping centre during work hours. Identify three sources of bias.
1. Leading questions (response bias)
2. Shopping centre location excludes non-shoppers (selection bias)
3. Work hours excludes working people (selection bias)
Common Misconceptions
Mean is always the best measure. No — median is better for skewed data or data with outliers.
P(A or B) = P(A) + P(B) always. Only for mutually exclusive events. Otherwise subtract the overlap.
A large sample guarantees accurate results. A large biased sample is worse than a small representative one.
Practice — Data and Probability
Social Science and Policy
The Australian Bureau of Statistics publishes guidelines for evaluating statistical claims. Understanding sampling bias and appropriate measures helps citizens critically evaluate media reports, political claims, and advertising that use statistics.
📓 Copy Into Your Books
▼Measures
- Mean - symmetric data
- Median - skewed/outliers
- Mode - categorical
Probability
- Complement: 1 − P(A)
- Addition: P(A)+P(B)−P(A∩B)
- Multiplication: P(A) × P(B|A)
Bias check
- Representative sample?
- Appropriate method?
- Unbiased questions?
- Adequate size?