Biology • Year 12 • Module 5 • Lesson 19

Predicting Population Genetic Patterns — Strengths, Limits and Synthesis

Build HSC Band 5–6 extended-response technique on prediction-with-uncertainty, evaluating sources, and synthesising the full Module 5 chain from reproduction to population data.

Master · Extended Response

1. Extended response — compare and evaluate prediction vs uncertainty in population genetics (Band 5–6)

7 marks   Band 5–6

Q1. Compare and evaluate what population genetics can predict reliably and what it cannot predict with certainty, using examples drawn from Module 5. In your response you must:

  • Define prediction and uncertainty as used in population genetics.
  • Compare the two on at least three criteria (e.g. type of conclusion, scale of evidence, role of environment, exam wording).
  • Use at least one named biological example per side (e.g. BRCA1 risk, ABO allele frequencies, mtDNA relatedness trends, Punnett-square ratios vs exact phenotype, allele-frequency projections under selection / drift / migration).
  • Reach a context-dependent judgement — strong claims at population level, cautious claims at individual level — rather than a one-winner ranking.
Stuck? Plan first: define both terms → 3 criteria with examples → "what's reliable / what isn't" structure → environment-dependent judgement. Use BRCA1 as your hinge example.

2. Stimulus-based extended response — predicting cystic fibrosis risk in a population (Band 5–6)

8 marks   Band 5–6

Stimulus. Cystic fibrosis (CF) is an autosomal recessive condition caused by mutations in the CFTR gene. In a large European-ancestry population, large-scale sequencing studies estimate the carrier frequency at approximately 1 in 25 (allele frequency ≈ 0.02). Hardy–Weinberg predicts that the homozygous affected genotype occurs in ≈ 1 in 2 500 newborns, a figure that closely matches clinical incidence. A genetic counsellor is asked by a couple — both unaffected, both carriers (confirmed by sequencing) — to predict (i) the chance that any given pregnancy will produce an affected child, (ii) the severity of disease their affected child would experience, and (iii) the future carrier frequency in this population 50 years from now.

Source: pooled carrier frequencies summarised in Ratbi et al. (2019), J Cyst Fibros 18(3): 318–322.

Q2. Analyse and evaluate how confidently the counsellor can answer each of the three questions, drawing on the full Module 5 chain (reproduction → meiosis → gene expression → inheritance models → population data).

In your answer:

  • Use a Punnett square (or written equivalent) to address question (i).
  • Explain why question (ii) cannot be answered with the same level of confidence as question (i).
  • Identify which assumptions question (iii) depends on, and explain why those assumptions add uncertainty.
  • Reach a justified summary of which questions population genetics answers well, and which it does not.
Stuck? Use the lesson's three-card "individuals / future populations / phenotype" framework as your spine, then attach a worked Punnett square as your hinge for question (i).

3. Evaluate this claim (Band 5–6)

6 marks   Band 5–6

"Module 5 has taught us that DNA is the master controller. Once you know an organism's full genome sequence, you can predict its phenotype exactly, predict how its population will look in 100 generations, and even predict which two populations are most closely related — all with no remaining uncertainty. Anything else is just sloppy biology."

Q3. Evaluate this claim against the content of Module 5 (Lessons 1–19). Identify which elements are defensible, which are wrong, and reformulate the claim into a biologically defensible statement that respects the Module 5 → Module 6 handoff.

Stuck? Revisit Lesson 19 § Card 4 ("Strong vs Weak wording"), the misconceptions box, and Lesson 12 on gene–environment interaction.
Answers — Do not peek before attempting

Q1 — Sample Band 6 response (7 marks), annotated

In population genetics, a prediction is a scientifically supported expectation based on evidence and stated assumptions, while uncertainty is the recognised limit on how exact or complete that expectation can be, even when the evidence is strong. [1 — defines both terms]

Population genetics predicts reliably when conclusions are made at the same level as the evidence: risk patterns (e.g. BRCA1 carriers face approximately a 70% lifetime breast-cancer risk versus ~12% in the general population), allele distribution trends (e.g. the ABO blood-group allele frequencies differ predictably between populations), and relatedness trends (e.g. mitochondrial DNA evidence supports inference of human migration patterns across continents). [1 — what is reliable; 1 — example of reliable]

What it does not predict reliably is the exact phenotype of a specific individual — identical twins sharing the same genome can develop different phenotypes for many traits, including disease onset — and the exact future state of a population, because long-run projections diverge dramatically under different assumptions about mutation, selection, drift and migration. [1 — what is uncertain; 1 — example of uncertainty]

The reason for this asymmetry is structural. Population evidence is built from many samples, so random noise averages out and trends emerge robustly. Individual outcomes, by contrast, are shaped not only by genotype but also by environment, gene interactions and chance — gene–environment interaction was explicit in Lesson 12 and underlies every "risk-not-destiny" example since. [1 — why the two differ; gene–environment link]

The strongest Biology answers therefore match the wording to the level of evidence: strong claims at population level ("indicates increased risk", "suggests a trend", "supports inference of relatedness") and cautious claims at individual level ("the actual outcome cannot be predicted with certainty"). Neither approach is universally superior; they are matched to different scales of inference, which is exactly what Lesson 19 frames as the heart of population-genetics reasoning. [1 — explicit context-dependent judgement]

Q2 — Sample Band 6 response (8 marks), annotated

(i) Risk per pregnancy. Both parents are carriers (Cc × Cc). A Punnett square gives 1 CC : 2 Cc : 1 cc, i.e. 25% affected, 50% carrier, 25% unaffected non-carrier per pregnancy. [1 — correct Punnett-square application] This 25% figure is a robust population-level expectation, but it does not guarantee that exactly one in four of this couple's children will be affected — the actual outcome of any individual pregnancy is governed by which gametes happen to fuse. [1 — Mendelian model + individual-level uncertainty]

(ii) Severity. Severity of cystic fibrosis depends on which specific CFTR mutations are inherited (different mutations have different functional impact on the chloride channel), as well as modifier genes elsewhere in the genome, environmental factors, infection history and access to clinical care. The counsellor can predict the presence of CF reliably from genotype but cannot predict severity with the same confidence — phenotype is shaped by gene–environment interaction (Lesson 12). [1 — severity depends on genotype + environment; not predictable from genotype alone]

(iii) Future carrier frequency. Predicting the carrier frequency 50 years from now requires the Hardy–Weinberg assumptions: large population, random mating, no migration, no selection, no new mutation, equal allele frequencies between sexes. [1 — names Hardy–Weinberg assumptions] In a real population, at least two of these will plausibly fail: (a) carrier screening programs identify carriers before reproduction, allowing reproductive choices that can shift allele frequency; (b) migration in / out of the population alters allele frequencies; (c) emerging CF treatments and gene therapies alter the fitness of affected individuals; (d) selection against affected individuals historically reduced the cc frequency. [1 — at least two plausible violations]

The current carrier frequency itself is well-characterised because Lesson 16 (frequency data and SNP analysis), Lesson 17 (sequencing and profiling) and Lesson 18 (large-scale population data) supply the empirical evidence — a 1-in-25 carrier estimate is robust because it is built from millions of sequenced individuals. So the present population trend is a reliable prediction; it is the future projection that grows uncertain over decades. [1 — invokes Lessons 16–18 to justify reliability of the present estimate]

Summary. The counsellor can answer question (i) confidently as a population-level probability; can answer question (ii) only partially and with substantial individual-level uncertainty; and can answer question (iii) only with major caveats about Hardy–Weinberg assumptions and future evolutionary forces. This is the exact pattern Lesson 19 frames: population-level trends are stronger than exact individual or future-state predictions. [1 — evaluative summary linked to Lesson 19's framing] Precise terminology used throughout (allele frequency, carrier, autosomal recessive, Hardy–Weinberg, gene–environment interaction, prediction, uncertainty). [1 — terminology + cautious wording throughout]

Q3 — Sample Band 6 response (6 marks)

The claim is partly defensible but seriously overstated. [1 — judgement]

What is defensible. Full genome sequence data can reliably support broad inferences about relatedness between populations — Lesson 17 (sequencing / profiling) and Lesson 18 (large-scale population data) demonstrate exactly this, with mtDNA and Y-chromosome data underpinning credible human-ancestry trends. [1 — concedes defensible element]

What is wrong.

  • "Predict phenotype exactly." Wrong. Phenotype is shaped by genotype and environment, gene interactions and chance (Lesson 12). Even identical twins, who share a genome, can differ in phenotype — particularly for complex traits and many diseases. [1 — refutes exact phenotype prediction]
  • "Predict population state in 100 generations." Wrong. Long-run population projections depend on assumptions about mutation, selection, drift and migration; the same starting frequency produces very different outcomes 100 generations later depending on which forces dominate. Lesson 19 explicitly identifies future-state uncertainty as a core limit of the framework. [1 — refutes long-run projection certainty]
  • "No remaining uncertainty / sloppy biology." Wrong, and backwards. Strong Biology answers use probability and trend language precisely because uncertainty is a feature of population-level inference, not a defect. The lesson's "Strong wording" table ("suggests a trend", "indicates increased risk", "supports inference of relatedness") is the marker of high-quality reasoning. [1 — refutes "no uncertainty" with explicit reference to wording norms]

Defensible reformulation. "Module 5 supports strong, evidence-based claims about population-level trends — risk patterns, allele distributions and relatedness — while exact phenotype, exact individual outcomes and exact long-run population states all carry uncertainty because they depend on environment, gene interactions and assumptions about future evolutionary forces. Module 6 then extends this framework by asking how mutation and biotechnology can further alter or investigate these inherited patterns." [1 — defensible reformulation with explicit Module 5 → Module 6 handoff]