Introduction to Bivariate Data and Scatterplots
Bureau of Meteorology records daily temperature and ice-cream sales across Australian cities — one number predicts the other surprisingly well. In this lesson you will learn what bivariate data is, how to plot it on a scatterplot, and how to read and interpret the pattern of points.
Practise this lesson
Three printable worksheets that build from foundations to mastery — or build your own from any module’s questions.
A researcher records hours of study and exam score for 20 students. What kind of graph would best show the relationship between those two variables? Write your thoughts before reading on.
Bivariate data pairs two measurements on each individual — for example (hours studied, exam score) for each student.
A scatterplot places each pair as a point on a coordinate grid. The independent variable goes on the x-axis; the dependent variable goes on the y-axis.
The pattern of points reveals whether the two variables are related and in what direction.
Key facts
- What bivariate data is
- The role of each axis on a scatterplot
- What each plotted point represents
Concepts
- Why a scatterplot suits bivariate data
- How to decide which variable is independent
- What patterns, outliers and clusters mean
Skills
- Plot bivariate data on a labelled scatterplot
- Read coordinates in context
- Describe the trend shown by a scatterplot
Bivariate means "two variables." We record two measurements per individual and explore whether they vary together.
How to set up the axes:
- Draw a horizontal x-axis and a vertical y-axis.
- Label each axis with the variable name and units.
- Choose a scale that fits all data points — it does not need to start at zero if the data is in a narrow range.
- Put the independent variable on x and the dependent variable on y.
Example: A researcher measures daily temperature (°C) and ice-cream sales (units) for 30 days. Temperature is the independent variable (x-axis) because sales may depend on temperature. Ice-cream sales is the dependent variable (y-axis).
What to write in your book
- Bivariate data = two measurements per individual. Plot as (x, y) on a scatterplot.
- Independent variable (predictor) on x-axis. Dependent variable (response) on y-axis.
- Label axes with variable name and units. Choose a scale that fits all data points.
Quick check: A study records hours of sleep (x) and reaction time in ms (y). Which variable belongs on the y-axis?
Once points are plotted, ask three questions:
- Direction: Does the pattern go up (positive) or down (negative), or is there no consistent direction?
- Strength: Are the points tightly grouped near a line (strong) or widely scattered (weak)?
- Unusual features: Are there outliers (points far from the main group) or clusters (separate subgroups)?
Example: A scatterplot of study hours (x) vs exam score (y) shows points rising from lower-left to upper-right. The points are fairly close together. One student studied 10 hours but scored only 30% — an outlier in the bottom-right region.
What to write in your book
- Direction: up = positive, down = negative, no pattern = no linear trend.
- Strength: tightly grouped = strong, widely scattered = weak.
- Identify outliers (far from group) and clusters (separate subgroups).
Which does NOT belong? Things you describe when reading a scatterplot:
Every point represents a real individual. When describing a scatterplot, always use the variable names and units, not just "x" and "y."
Reading a single point: The point (4, 72) on a study-hours vs score plot means "one student studied 4 hours and scored 72%." On a temperature vs sales plot, (28, 350) means "on a day that reached 28°C, 350 ice-creams were sold."
Describing the overall pattern:
- State the direction: "As temperature increases, ice-cream sales tend to increase."
- State the strength: "The points are fairly closely grouped, suggesting a moderately strong relationship."
- Mention outliers: "One outlier at approximately (12°C, 380 sales) is unexpectedly high for a cool day."
What to write in your book
- Always name the variables when describing a scatterplot — not just x and y.
- Template: "As [x variable] increases, [y variable] tends to [increase/decrease]."
- One point = one individual's pair of measurements in context.
Complete: On a scatterplot of daily temperature (x) vs ice-cream sales (y), the point (32, 480) represents a day when the temperature was degrees and ice-creams were sold.
Worked examples · 3 in a row, reveal as you go
Plot the data below on a scatterplot and describe the trend.
| Hours studied | 1 | 2 | 3 | 4 | 5 |
|---|---|---|---|---|---|
| Score (%) | 45 | 55 | 62 | 74 | 80 |
A scientist measures shoe size and reading speed for 30 adults. Which is independent and which is dependent?
A scatterplot shows daily rainfall (mm) on x and umbrella sales on y. What does the point (15, 42) represent?
What to write in your book
- Steps: (1) Identify variables, (2) Label axes with units, (3) Plot all points, (4) Describe trend in context.
- A point (x, y) = one individual's x-value AND y-value — always interpret in context.
A café records daily maximum temperature (°C) and cups of hot coffee sold over 6 days:
| Temp (°C) | 12 | 15 | 18 | 22 | 25 | 28 |
|---|---|---|---|---|---|---|
| Coffee sold | 95 | 88 | 74 | 60 | 52 | 38 |
- Identify the independent and dependent variables. Justify your choice.
- Describe the axis labels and scale you would use.
- Describe the trend in the data. Would the points go up or down?
- What does the point (22, 60) represent in context?
At the start you were asked what graph would best show the relationship between hours of study and exam score. The answer is a scatterplot — it places each student as one point (hours, score), making any pattern immediately visible. A bar chart or line graph cannot show the association between two continuous variables measured for the same individuals.
Pick your answer, then rate your confidence. Each retry pulls a fresh mix from the bank.
Q1. A physiotherapist records sessions attended (x) and pain score out of 10 (y) for 8 patients. (a) Which variable goes on the x-axis? Justify. (b) Describe what the point (6, 2) represents. (c) If the points go from upper-left to lower-right, what does this suggest? (3 marks)
Q2. Explain why a scatterplot is more suitable than a bar chart for displaying bivariate data. (2 marks)
Answers (click to reveal)
Q1 (3 marks):
(a) Sessions attended belongs on the x-axis because it is the independent variable — the number of sessions may predict the change in pain score [1].
(b) The point (6, 2) represents a patient who attended 6 physiotherapy sessions and reported a pain score of 2 out of 10 [1].
(c) The pattern going from upper-left to lower-right (negative trend) suggests that as sessions increase, pain scores tend to decrease — a negative association [1].
Q2 (2 marks): A scatterplot shows both variables simultaneously for each individual as a single point [1], making it possible to see whether there is an association and what direction it takes [1]. A bar chart can only display one variable at a time.
Activity answers: (1) Temperature is independent (x) because it drives coffee demand. (2) x: temperature 10–30°C; y: coffee sold 30–100 units. (3) The points go down (negative trend) — as temperature increases, hot coffee sales decrease. (4) The point (22, 60) represents a day when the temperature reached 22°C and 60 cups of coffee were sold.
Identify variables, describe trends, and interpret points. Beat the boss to bank a tier. Replays welcome.
⚔ Enter the arenaClimb platforms answering scatterplot questions. Pool: lesson 01.
Mark lesson as complete
Tick when you've finished the practice and review.