Skip to content
M
hscscience Maths Std · Y11
0/100daily goal
0
0
0 due
0
L1 · 0 XP
KJ
Your weak spots
Insights load after your first practice round.
Module 4 · L2 of 6 ~20 min MS-S1 ⚡ +90 XP available

Collecting Data

A bad sample ruins good analysis. It does not matter how sophisticated your calculations are if your data comes from a biased source. This lesson teaches you how to collect data properly: census versus sample, random versus stratified sampling, and the many forms of bias that can creep into your data.

Today's hook — You want to find the average amount of pocket money for teenagers in Australia. Would you ask only students at your school? Why or why not?
0/5QUESTS
Worksheets

Practise this lesson

Three printable worksheets that build from foundations to mastery — or build your own from any module’s questions.

01
Recall — your gut answer first
+5 XP warm-up

You want to find the average amount of pocket money for teenagers in Australia. Would you ask only students at your school? Why or why not?

Without looking ahead — write your gut feeling. We'll revisit this at the end of the lesson.

auto-saved
02
The key principles you need to own
+5 XP to read

Good statistical conclusions depend on representative data collection. A biased sample produces misleading results no matter how correct the subsequent calculations are.

Census
Data from every member of a population. Most accurate but expensive and time-consuming.
Sample
Data from a selected subset. Faster and cheaper, but may not represent the whole population.
Bias
A systematic error that causes results to consistently favour one outcome. Quality beats quantity.
Key insight: A representative sample is more valuable than a large biased sample. Quality of sampling beats quantity.
03
What you'll master
Know

Key facts

  • The difference between census and sample
  • Types of sampling methods
  • Types of bias in data collection
Understand

Concepts

  • Why samples can be biased
  • How to reduce bias in sampling
  • When a census is necessary
Can do

Skills

  • Design an appropriate sampling method
  • Identify and name sources of bias
  • Calculate stratified sample sizes
04
Key terms
CensusData collected from every single member of a population. Example: Australian Census every 5 years.
SampleA selected subset of the population from which data is collected.
Random sampleEvery member of the population has an equal chance of being selected.
Stratified samplePopulation divided into groups (strata), then randomly sampled from each group in proportion.
BiasA systematic error that causes data to consistently favour one outcome over others.
Selection biasSome groups are systematically excluded or under-represented in the sample.
05
Census vs sample — who should you ask?
core concept

Census: Collects data from every single member of a population.

  • Advantages: most accurate, complete picture
  • Disadvantages: expensive, time-consuming, often impractical

Sample: Collects data from a selected subset.

  • Advantages: faster, cheaper, practical for large populations
  • Disadvantages: may not represent the whole population

The Australian Census happens every 5 years and attempts to count every person. Most research, however, uses samples.

Sampling methods: Simple random (equal chance), Stratified (proportional groups), Systematic (every nth person), Convenience (whoever is nearby — often biased), Self-selected (volunteers — attracts strong opinions).
What to write in your book
  • Census = everyone in population. Sample = representative subset.
  • Random: equal chance. Stratified: proportional groups sampled randomly. Systematic: every nth.
  • Convenience and self-selected samples are common but prone to bias.

Quick check: A school with 40% Year 7, 35% Year 8, and 25% Year 9 wants a sample of 80. How many Year 7 students should be selected in a stratified sample?

06
Types of bias — when samples go wrong
core concept

Bias is a systematic error that distorts your results in a consistent direction.

Selection bias
Some groups are systematically excluded. A phone survey about internet usage excludes people without phones.
Response bias
Questions are leading, or people give socially desirable answers rather than honest ones.
Non-response bias
Those who do not respond differ systematically from those who do. Email surveys with low return rates suffer from this.

Self-selection bias: When people voluntarily participate, those with strong opinions are over-represented (e.g., TV call-in votes).

What to write in your book
  • Selection bias: some groups systematically missed.
  • Response bias: leading questions or dishonest answers.
  • Non-response bias: non-respondents differ from respondents.
  • Self-selection: volunteers over-represent strong opinions.

True or false: A TV station asking viewers to call in and vote is an example of selection bias because only motivated viewers call.

PROBLEM 1 · STRATIFIED SAMPLING

A school has 500 Year 11 students: 300 female, 200 male. A researcher wants to survey 50 students about homework time. Describe a stratified sampling method and explain why it is better than convenience sampling.

1
Calculate proportions
Female: 300/500 = 60% · Male: 200/500 = 40%
Stratified sampling means each group is sampled in proportion to its size in the population.
What to write in your book
  • Stratified formula: (group size ÷ total population) × sample size.
  • Always randomly select within each stratum — stratified is not the same as choosing the most convenient people in each group.

Fill the gap: A city has 60% adults and 40% teenagers. For a stratified sample of 100, you would select adults and teenagers.

1

A weight loss study only includes volunteers who are already trying to lose weight. What type of bias?

2

A teacher surveys only the front row about classroom temperature. What type of bias?

3

A company emails 10,000 customers and gets 200 responses. Name at least two sources of bias.

4

Why might a random sample still be biased even if selection was truly random?

5

A school has 200 Year 10 and 300 Year 11 students. In a stratified sample of 100, how many from each year?

Match each scenario to its bias type:

Top 3 list: Name THREE advantages of using a stratified sample over a simple random sample.

08
Revisit your thinking

Look back at what you wrote in the Think First section. Asking only your school would create selection bias — your school may have different socioeconomic status, location, or culture than the national average. A better approach: stratified random sampling across multiple school types and locations. Even then, self-reporting may introduce response bias as students may exaggerate or underreport.

What did you get right? What surprised you?

auto-saved
01
Multiple choice
+5 XP per correct · +25 XP all-correct

Pick your answer, then rate your confidence — that tells the system what to drill next.

02
Short answer
ApplyBand 42 marks

SA 1. A company wants to survey customer satisfaction. They send an email to 10,000 customers and receive 200 responses. (a) What type of sample is this? (b) Identify at least two sources of bias. (c) Suggest a better sampling method. (2 marks)

auto-saved
AnalyseBand 53 marks

SA 2. (a) Explain why the Australian Census attempts to count every person rather than using a sample. (b) A researcher studies drug use among teenagers by standing outside a nightclub on Saturday night. Identify all forms of bias. (c) Design a better sampling strategy for this sensitive topic, explaining your choices. (3 marks)

auto-saved
📖 Comprehensive answers (click to reveal)

Drill 1: Selection/self-selection bias — only motivated participants volunteer. 2: Convenience bias — front row may differ from back row. 3: Non-response bias (only motivated customers reply) + selection bias (only email-accessible customers). 4: Non-response bias still possible; chance alone may over-represent one group. 5: Year 10: 40, Year 11: 60.

SA 1 (2 marks): (a) Self-selected (voluntary response) sample [0.5]. (b) Non-response bias — only satisfied/dissatisfied customers respond; selection bias — only email users included [0.5]. (c) Random sample with follow-up, or stratified by customer type [1].

SA 2 (3 marks): (a) Census provides complete data for government planning and funding — some data must be exact [1]. (b) Selection bias (nightclub-goers only), time bias (Saturday night only), response bias (may lie about illegal activity), self-selection [1]. (c) Anonymous online/paper survey through schools, stratified by school type and location — anonymity reduces response bias [1].

01
Boss battle · The Bias Buster
earn bronze · silver · gold

Five timed questions on sampling methods and bias. Beat the boss to bank a tier — gold (90% + speed), silver (75%), or bronze (50%). Replays welcome.

⚔ Enter the arena
02
Science Jump · platform challenge

Climb platforms using sampling concepts. Pool: lesson 2.

Mark lesson as complete

Tick when you've finished the practice and review.