The Hertzsprung-Russell Diagram
In 1913, Henry Norris Russell at Princeton plotted absolute magnitude against spectral class for 300 nearby stars, independently replicating a discovery Ejnar Hertzsprung in Copenhagen had made using colour in 1909. Both found that stars cluster into distinct populations: a main sequence diagonal, a giant branch, and faint white dwarfs. The Gaia satellite (2022) has now extended the HR diagram to 1.7 billion stars, confirming the same structure across the entire Milky Way. The distance to any star cluster can be determined by matching its turn-off point to the theoretical main sequence.
Practise this lesson
Four printable worksheets that build from the foundations up to exam-style questions — start at whatever level suits you.
Imagine plotting every visible star by its surface temperature (hot to cool) on the x-axis and its luminosity (dim to bright) on the y-axis.
Before reading on, answer:
- Would you expect stars to be scattered randomly, or to cluster in distinct regions?
- Where on such a diagram would you place the Sun?
- What must be true about a star that is very cool (red) yet extremely luminous?
Warm-up: On the HR diagram, temperature increases in which direction along the horizontal axis?
Know — HR Diagram Features
- Main sequence, red giants, supergiants
- White dwarfs
- Spectral types O B A F G K M
Understand — Stellar Properties
- $L = 4\pi R^2 \sigma T^4$ links L, R, T
- Mass–luminosity relation on main sequence
- Turn-off point and cluster age
Can Do — Analyse Stellar Data
- Classify stars from HR position
- Estimate radius from $L$ and $T$
- Infer cluster age from turn-off mass
Core Content
Luminosity vs temperature reveals stellar evolution
Plot the luminosity of 300 nearby stars against their surface temperature and something remarkable emerges: they do not scatter randomly across the graph. About 90% fall along a single diagonal band running from top-left (hot, luminous) to bottom-right (cool, dim). A smaller group of cool but extremely luminous stars clusters in the upper right, and a group of hot but very faint stars sits in the lower left. This is the Hertzsprung-Russell (HR) diagram, which plots luminosity on the vertical axis against surface temperature on the horizontal axis — with temperature increasing to the left.
Stars are not randomly distributed. They cluster into distinct regions that reflect their evolutionary state:
- Main sequence: A diagonal band running from top-left (hot, luminous, massive O/B stars) to bottom-right (cool, dim, low-mass M stars). Stars spend roughly 90% of their lives here, fusing hydrogen in their cores. Mass increases from bottom-right to top-left.
- Red giants and supergiants: Above and to the right of the main sequence. These are evolved stars that have exhausted their core hydrogen and expanded enormously. They are cool but very luminous, which — by the Stefan-Boltzmann law — requires a very large radius.
- White dwarfs: Below and to the left. Hot but very dim, implying a tiny radius. These are the dense remnants of low-to-intermediate-mass stars after they shed their outer layers.
The Stefan-Boltzmann law connects these three quantities directly:
$$L = 4\pi R^2 \sigma T^4$$At the same temperature, a red giant's enormous radius $R$ makes it orders of magnitude more luminous than the Sun. A white dwarf's tiny radius makes it dim despite a high $T$.
Figure 1 — Schematic HR diagram. Temperature increases to the left; luminosity increases upward. Stars cluster into distinct populations reflecting their evolutionary stage.
$L = 4\pi R^2 \sigma T^4$ — luminosity, radius, temperature
$\sigma = 5.67\times10^{-8}\ \text{W m}^{-2}\text{K}^{-4}$ — Stefan-Boltzmann constant
At fixed $T$: $R \propto \sqrt{L}$ — doubling luminosity increases radius by $\sqrt{2}$
At fixed $L$: $R \propto T^{-2}$ — hotter means smaller at the same luminosity
A star has $T = 3\,500$ K and $L = 100\,L_\odot$. Is it on the main sequence? Use $L = 4\pi R^2 \sigma T^4$ and the Sun's values ($T_\odot = 5\,800$ K, $L_\odot = 1$) to estimate its radius in solar radii.
The HR diagram plots luminosity (y) vs temperature (x) with temperature increasing to the left. Main sequence = diagonal H-fusion band (OBAFGKM: hottest→coolest; Sun = G2V); red giants top-right (cool, huge $R$); white dwarfs bottom-left (hot, tiny $R$). Stefan-Boltzmann: $L = 4\pi R^2\sigma T^4$, so at the same $T$, higher $L$ means larger $R$.
Pause — copy the highlighted structure and law into your book before moving on.
A star has $T = 3\,000$ K and $L = 10\,000\,L_\odot$. Where does it sit on the HR diagram?
From observation to physical properties
We just saw the HR diagram's structure — main sequence, red giants, and white dwarfs — and the Stefan-Boltzmann law linking luminosity, temperature, and radius. That raises a question: how do we actually use the HR diagram to find a star's mass, lifetime, or distance? This card answers it → via the mass-luminosity relation $L \propto M^{3.5}$, the lifetime relation $t_{MS} \propto M^{-2.5}$, and spectroscopic parallax.
The HR diagram is more than a classification tool — it reveals fundamental physical properties of stars that cannot be directly observed.
- Mass: Along the main sequence, mass increases from bottom-right to top-left. The mass-luminosity relation $L \propto M^{3.5}$ means a $10\,M_\odot$ star is roughly $10^{3.5} \approx 3\,000$ times more luminous than the Sun.
- Radius: From $L = 4\pi R^2\sigma T^4$ we get $R \propto \sqrt{L}/T^2$. Stars of the same temperature at different luminosities must have different radii — this is what separates giants from dwarfs.
- Age (cluster turn-off): In a star cluster all stars formed simultaneously. More massive stars have shorter main-sequence lifetimes ($t_{MS} \propto M^{-2.5}$) and evolve off the main sequence first. The turn-off point — the most massive star still on the main sequence — directly indicates the cluster's age.
- Distance (spectroscopic parallax): Measure a star's spectral type (hence $T$), use the HR diagram to read off its absolute magnitude, then compare to its apparent magnitude. The difference gives the distance modulus and hence distance.
Figure 2 — HR diagram of a star cluster. The ZAMS (dashed) shows where stars start. The turn-off point (gold) marks the most massive star still on the main sequence; evolved stars peel off to the right and upward. The higher the turn-off, the younger the cluster.
$L \propto M^{3.5}$ — mass-luminosity relation (main sequence)
$t_{MS} \propto M/L \propto M^{-2.5}$ — main sequence lifetime
$\lambda_{max} T = 2.898\times10^{-3}\ \text{m·K}$ — Wien's displacement law
$t_{MS}(M) = t_\odot \times (M/M_\odot)^{-2.5}$ — cluster age from turn-off mass ($t_\odot \approx 10$ Gyr)
A star cluster's main sequence turns off at spectral type B ($M \approx 10\,M_\odot$). Estimate the cluster's age relative to the Sun's main sequence lifetime (~10 Gyr). Use $t_{MS} \propto M^{-2.5}$.
Mass-luminosity: $L \propto M^{3.5}$ (a $10\,M_\odot$ star is ~3000× more luminous). Main sequence lifetime: $t_{MS} \propto M^{-2.5}$, so cluster age $t = 10\,(M/M_\odot)^{-2.5}$ Gyr. Spectroscopic parallax: spectral type → HR diagram absolute magnitude → distance modulus → distance. Radius: $R \propto \sqrt{L}/T^2$ from Stefan-Boltzmann.
Add the highlighted relations to your notes before the check below.
A higher turn-off point (more massive turn-off star) indicates a younger star cluster.
Two stars at the same temperature must have the same luminosity.
The mass-luminosity relation $L \propto M^{3.5}$ means a $10\,M_\odot$ star is roughly 3 000 times more luminous than the Sun.
Applying Stefan-Boltzmann to estimate stellar radii
We just saw the key HR diagram relations for mass, lifetime, and distance. That raises a question: how do we chain those formulas together in a single worked problem to calculate, say, a red giant's radius and a cluster's age? This card answers it → step by step: use $R/R_\odot = \sqrt{L/L_\odot} \times (T_\odot/T)^2$, and $t = 10\,(M/M_\odot)^{-2.5}$ Gyr.
Star X has a surface temperature $T_X = 3\,500$ K and luminosity $L_X = 100\,L_\odot$. The Sun has $T_\odot = 5\,800$ K and $L_\odot = 3.83\times10^{26}$ W.
- Calculate the ratio $R_X / R_\odot$.
- Classify Star X on the HR diagram.
- A cluster's turn-off is at $5\,M_\odot$. Estimate the cluster's age.
- Radius ratio. From $L = 4\pi R^2\sigma T^4$: $$\frac{R_X}{R_\odot} = \sqrt{\frac{L_X}{L_\odot}} \cdot \left(\frac{T_\odot}{T_X}\right)^2 = \sqrt{100} \times \left(\frac{5800}{3500}\right)^2$$ $$= 10 \times (1.657)^2 = 10 \times 2.75 \approx \mathbf{27.5\ R_\odot}$$
- Classification. $T = 3\,500$ K (M-type, cool) with $L = 100\,L_\odot$ (well above main sequence for that temperature) and $R \approx 28\,R_\odot$ — this is a red giant.
- Cluster age. Using $t = t_\odot(M/M_\odot)^{-2.5}$ with $t_\odot = 10$ Gyr: $$t = 10 \times (5)^{-2.5} = 10 \times 5^{-2.5}$$ $$5^{2.5} = 5^2 \times 5^{0.5} = 25 \times 2.236 = 55.9$$ $$t = 10/55.9 \approx \mathbf{0.18\ \text{Gyr} = 180\ \text{Myr}}$$
A white dwarf has $T = 20\,000$ K and $L = 0.01\,L_\odot$. Calculate its radius relative to the Sun. Is this roughly Earth-sized, Sun-sized, or in between?
Radius ratio: $R_X/R_\odot = \sqrt{L_X/L_\odot} \times (T_\odot/T_X)^2$. Red giants (cool, ~3000–5000 K, but $10^2$–$10^4\,L_\odot$) have radii tens–hundreds of $R_\odot$; white dwarfs (hot, dim, $L \sim 10^{-3}$–$10^{-1}\,L_\odot$) have radii $\sim 0.01\,R_\odot \approx$ Earth's radius. For $5\,M_\odot$ turn-off: $t \approx 180$ Myr.
Pause — write the highlighted formula and worked results into your book before the check below.
A star has the same temperature as the Sun but luminosity $L = 9\,L_\odot$. Using $R \propto \sqrt{L}$ at fixed $T$, its radius is _____ solar radii. (Enter a whole number.)
The most common exam trap: temperature increases to the left on the HR diagram — this is opposite to most graphs. A second trap: confusing luminosity (intrinsic power output) with apparent brightness (which depends on distance). The HR diagram plots luminosity. A third trap on radius calculations: from $L = 4\pi R^2\sigma T^4$, at fixed temperature $R \propto \sqrt{L}$, so 100× luminosity means only 10× radius. A fourth trap: cluster age. Use $t \propto M^{-2.5}$, not $M^{-1}$. A $10\,M_\odot$ star lives $10^{-2.5} \approx 0.003$ times as long as the Sun — about 30 Myr, not 1 Gyr.
Use Stefan-Boltzmann to classify stars and estimate radii
- A star has $T = 10\,000$ K and $L = 80\,L_\odot$. Calculate its radius in solar radii and classify it (main sequence, giant, or white dwarf).
- Star A has $T = 6\,000$ K and $L = 1\,L_\odot$. Star B has $T = 6\,000$ K and $L = 10\,000\,L_\odot$. Calculate the ratio $R_B/R_A$ and identify what type of star B is.
- Using $\lambda_{max}T = 2.898\times10^{-3}$ m·K, find the peak wavelength for: (a) an O star at 30 000 K, and (b) a red giant at 3 500 K. What colour do these appear?
- Explain why two stars of identical colour (same spectral type) can have very different luminosities.
Connect the turn-off point to cluster age using lifetime scaling
- A cluster has a main-sequence turn-off at $2\,M_\odot$. Estimate its age using $t = 10\,(M/M_\odot)^{-2.5}$ Gyr.
- A cluster is estimated to be 1 Gyr old. What is the approximate turn-off mass?
- Explain why the most massive stars in a cluster are the first to leave the main sequence, even though they have more fuel to burn.
- Describe how the HR diagram of a very old cluster (12 Gyr) would differ from that of a very young cluster (10 Myr). Sketch the key difference.
Three of these statements about the HR diagram are correct. Pick the odd one out.
A fresh five-question set drawn from this lesson's bank — feedback shown immediately. +5 XP per correct · +25 XP all correct
Pick your answer, then rate your confidence — that tells the system what to drill next.
ApplyBand 4(3 marks) 1. A star has a surface temperature of $4\,000$ K and luminosity $1\,000\,L_\odot$. (a) Use the Stefan-Boltzmann law to calculate the star's radius in solar radii ($T_\odot = 5\,800$ K). (b) State which region of the HR diagram it occupies. (c) Name the most likely stellar type.
1 mark: correct $R/R_\odot$ · 1 mark: correct region · 1 mark: correct type
AnalyseBand 6(5 marks) 2. (a) Describe the key features of the HR diagram, including the location and physical characteristics of main sequence stars, red giants, supergiants, and white dwarfs. (b) Explain how the Stefan-Boltzmann law accounts for red giants being located above and to the right of the main sequence. (c) A star cluster's main sequence turns off at $8\,M_\odot$. Estimate the cluster's age in millions of years. (d) Explain why spectroscopic parallax can be used to determine the distance to stars too far away for direct parallax measurement.
1 mark: key features described · 1 mark: S-B argument · 1 mark: correct age · 1 mark: spectroscopic parallax method · 1 mark: limitation of direct parallax
Show all answers
Multiple choice
MC answers and full explanations are shown inline as you complete each question. Use the retry button to attempt a fresh set drawn from the lesson bank.
Short Answer — Model Answers
Q1 (3 marks): (a) $R/R_\odot = \sqrt{L/L_\odot}\times(T_\odot/T)^2 = \sqrt{1000}\times(5800/4000)^2 = 31.6\times2.10 \approx 66\,R_\odot$ (1 mark). (b) Above and to the right of the main sequence — the red giant / supergiant region (1 mark). (c) Red giant (or red supergiant given the high luminosity) (1 mark).
Q2 (5 marks): (a) Main sequence: diagonal band top-left (O, hot, massive, luminous) to bottom-right (M, cool, low-mass, dim); stars fuse H in core. Red giants: cool ($\sim$3 000–5 000 K), high luminosity, above and right of MS. Supergiants: extremely luminous, can be hot or cool, top of diagram. White dwarfs: hot but very dim, below and left of MS, tiny radius (1 mark). (b) A red giant has low $T$ but high $L$; from $L = 4\pi R^2\sigma T^4$ at low $T$, $R$ must be very large to produce high $L$ — hence giants have radii tens to hundreds of times the Sun's (1 mark). (c) $t = 10\times(8)^{-2.5} = 10/(8^{2.5})$; $8^{2.5} = 8^2\times8^{0.5} = 64\times2.83 = 181$; $t = 10/181 \approx 0.055$ Gyr $\approx \mathbf{55\ \text{Myr}}$ (1 mark). (d) From a star's spectral type (absorption lines), determine its temperature; use the HR diagram to read its absolute magnitude (intrinsic luminosity). Compare to apparent magnitude: the difference (distance modulus $m - M = 5\log(d/10)$) gives the distance. This extends to stars too far for direct trigonometric parallax ($>$a few kpc) (1 mark). Direct parallax fails at large distances because the parallax angle becomes too small to measure (1 mark).
At the start you were asked about the pattern that Henry Norris Russell at Princeton discovered in 1913 (and Ejnar Hertzsprung in Copenhagen had independently found in 1909): that stars cluster into distinct groups rather than scattering randomly when plotted by luminosity and temperature.
- Did you predict stars cluster in distinct regions? Correct — the HR diagram reveals four main populations: main sequence, red giants, supergiants, and white dwarfs, each reflecting a different evolutionary stage — exactly the pattern Russell and Hertzsprung identified a century ago.
- Did you place the Sun on the main sequence? Correct — the Sun is a G2V star at $T_\odot \approx 5\,800$ K, $L_\odot = 1$, sitting in the middle of the main sequence diagonal.
- Did you reason that a cool but very luminous star must have an enormous radius? Correct — from $L = 4\pi R^2\sigma T^4$, low $T$ and high $L$ forces a very large $R$. Red supergiants can exceed $1\,000\,R_\odot$.
Extend: A star has spectral type M and luminosity $L = 50\,L_\odot$. (a) Estimate its surface temperature (M-type: $\sim$3 500 K). (b) Calculate its radius in solar radii. (c) Classify it (main sequence, red giant, or supergiant). (d) How does its main-sequence lifetime compare to the Sun's if its mass is $0.4\,M_\odot$?
Five timed questions on the HR diagram and stellar properties. Beat the boss to bank a tier — gold (perfect + fast), silver (80%+), or bronze (cleared).
Enter the arena