Mixed Statistical Problems
A pharmaceutical company tests a new drug on 20 patients, knowing each independently has a 30% chance of remission. The clinical team must answer: what is the expected number of successes, the variance of the outcome, and the probability that at least 8 patients improve? Real HSC questions chain these calculations together. This lesson trains the multi-technique habit — moving fluently between $E(X)$, $\operatorname{Var}(X)$ and $P(X = k)$ in a single solution.
Consider a binomial random variable $X \sim B(n, p)$. Before using any formula — write the three core results: $P(X=k) = \ldots$, $E(X) = \ldots$, $\operatorname{Var}(X) = \ldots$. Sketch your reasoning below.
Every mixed statistics problem rewards two habits: identify $n$ and $p$ precisely (the Bernoulli parameters), then match the question to the correct formula — $E(X)$, $\operatorname{Var}(X)$, or $P(X=k)$ — before evaluating. Rushing to a calculator without checking which result is asked is the single biggest error in multi-part questions.
The identify-and-match strategy: (1) read the wording to find $n$ (number of trials) and $p$ (probability of success), (2) classify the sub-question as mean, variance, or probability, (3) write the formula in full before substituting.
$E(X) = np$ · $\operatorname{Var}(X) = np(1-p)$ · $P(X=k) = \binom{n}{k}p^k(1-p)^{n-k}$
Key facts
- Binomial PMF: $P(X = k) = \binom{n}{k}p^k(1-p)^{n-k}$
- Mean: $E(X) = np$ and Variance: $\operatorname{Var}(X) = np(1-p)$
- Complement: $P(X \geq 1) = 1 - P(X = 0) = 1 - (1-p)^n$
Concepts
- Why $n$ and $p$ must be identified before any computation
- How mean, variance and probability connect through the same parameters
- Why the complement is preferred for "at least one" questions
Skills
- Solve multi-part HSC questions combining $E(X)$, $\operatorname{Var}(X)$ and $P(X = k)$
- Use the complement to simplify "at least" and "at most" expressions
- Apply the general strategy: read, classify, compute, verify
HSC binomial questions almost always combine two or three sub-parts in one question. The four-step strategy never fails:
- Read. Extract $n$ (number of trials) and $p$ (probability of success) from the wording.
- Classify. For each sub-part, identify: mean ($E$), variance ($\operatorname{Var}$), or probability ($P$).
- Compute. Write the formula explicitly, then substitute. Never skip writing $P(X = k) = \binom{n}{k}p^k(1-p)^{n-k}$.
- Verify. Check that probabilities lie in $[0, 1]$, $E(X)$ is between $0$ and $n$, and $\operatorname{Var}(X) \geq 0$.
Worked through the hook: For $X \sim B(20, 0.3)$:
- $E(X) = np = 20 \times 0.3 = 6$.
- $\operatorname{Var}(X) = np(1-p) = 20 \times 0.3 \times 0.7 = 4.2$.
- $P(X = 6) = \displaystyle\binom{20}{6}(0.3)^6(0.7)^{14} = 38760 \times (0.3)^6 \times (0.7)^{14} \approx 0.1916$.
HSC binomial questions almost always combine two or three sub-parts in one question. The four-step strategy never fails:
Pause — copy the four-step multi-part strategy (read, classify, compute, verify) and identify which formula to apply for mean, variance, and probability sub-parts into your book.
Quick check: Let $X \sim B(15, 0.4)$. Which expression correctly gives $P(X = 5)$?
We just saw the four-step multi-part strategy: read (extract $n,p$), classify each sub-part as mean/variance/probability, write the formula, verify. That raises a question: when HSC questions chain mean, variance, and probability sub-parts together, how do you avoid re-reading parameters each time? This card answers it → extract $n$ and $p$ once at the top, reuse them in all three formulas ($E=np$, $\text{Var}=npq$, PMF) without re-defining variables.
HSC questions often chain three sub-parts: find the mean, find the variance, then compute a specific probability. Because all three depend on the same $n$ and $p$, you only need to read the parameters once and reuse them.
Example: A fair die is rolled 12 times. Let $X$ be the number of sixes. Here $X \sim B(12, 1/6)$.
- Mean: $E(X) = np = 12 \times \tfrac{1}{6} = 2$.
- Variance: $\operatorname{Var}(X) = np(1-p) = 12 \times \tfrac{1}{6} \times \tfrac{5}{6} = \tfrac{10}{6} = \tfrac{5}{3}$.
- $P(X \geq 1)$: using the complement, $P(X \geq 1) = 1 - P(X = 0) = 1 - \left(\tfrac{5}{6}\right)^{12} \approx 0.8878$.
HSC questions often chain three sub-parts: find the mean, find the variance, then compute a specific probability. Because all three depend on the same $n$ and $p$, you only need to read the parameters once and reuse them.
Pause — copy the three chained formulas ($E(X)=np$, $\text{Var}(X)=npq$, $P(X=k)=\binom{n}{k}p^k q^{n-k}$) with a worked three-part example into your book.
Did you get this? True or false: for $X \sim B(12, 1/6)$, the probability of obtaining at least one six in 12 rolls is $1 - (5/6)^{12}$.
Worked examples · 3 in a row, reveal as you go
A multiple-choice test has 20 questions, each with 4 options and exactly one correct answer. A student guesses every answer. (a) Find $E(X)$ and $\operatorname{Var}(X)$ for the number of correct answers. (b) Find $P(X = 7)$ to 4 d.p.
A factory produces light globes; 5% are defective. A batch of 25 is inspected. (a) Find the expected number of defectives. (b) Find the probability that the batch contains at least one defective globe.
A basketball player has a free-throw success rate of $p = 0.7$. She takes 10 free throws. Find $P(X \leq 2)$, the probability she sinks at most 2 baskets.
Fill the gap: For $X \sim B(50, 0.4)$, the variance is $\operatorname{Var}(X) = 50 \times 0.4 \times = 12$.
Misconceptions to fix · the 3 traps that cost marks
Did you get this? True or false: for $X \sim B(n, p)$, the standard deviation is given by $\sigma = np(1-p)$.
Activities · practice with the ideas
A coin is biased so that $P(\text{head}) = 0.6$. It is tossed 10 times. Find $E(X)$ and $\operatorname{Var}(X)$ where $X$ is the number of heads.
In a town, 8% of people are left-handed. A random sample of 30 people is taken. Find the probability that exactly 2 are left-handed.
A drug cures 80% of patients. It is given to 15 patients. Find the probability that at least 14 are cured.
A factory has a 2% defect rate. A box of 100 items is inspected. Find the probability of at least one defective item.
In a class of 20 students, each independently has a 0.4 probability of completing homework. Find (a) the expected number, (b) the standard deviation, and (c) $P(X = 8)$.
Odd one out: Three of these statements about $X \sim B(20, 0.3)$ are correct. Which one is NOT?
Earlier you wrote out $E(X)$, $\operatorname{Var}(X)$ and the $P(X = 6)$ setup for $X \sim B(20, 0.3)$.
The expected value is $E(X) = np = 6$. Variance is $\operatorname{Var}(X) = np(1-p) = 4.2$. The exact probability is $P(X = 6) = \binom{20}{6}(0.3)^{6}(0.7)^{14} \approx 0.1916$. Be especially careful that the exponent of $p$ matches the number of successes — swapping the exponents is the single most common error.
Pick your answer, then rate your confidence — that tells the system what to drill next. Each retry pulls a fresh mix from the bank.
Q1. A fair coin is tossed 8 times. Let $X$ be the number of heads. Find $E(X)$ and $\operatorname{Var}(X)$. (2 marks)
Q2. A pharmaceutical trial finds that a drug works on 60% of patients. It is given to 12 patients. Find (a) $P(X = 7)$ and (b) $P(X \geq 1)$. Leave answers to 4 d.p. (3 marks)
Q3. A test has 25 multiple-choice questions, each with 5 options. A student guesses every answer. Find the expected score, the variance, and $P(X \geq 10)$ using the binomial formula. (You may leave $P(X \geq 10)$ as a sum of terms.) (3 marks)
Comprehensive answers (click to reveal)
Activity answers:
1. $X \sim B(10, 0.6)$: $E(X) = 6$, $\operatorname{Var}(X) = 10 \times 0.6 \times 0.4 = 2.4$.
2. $X \sim B(30, 0.08)$: $P(X=2) = \binom{30}{2}(0.08)^2(0.92)^{28} = 435 \times 0.0064 \times 0.0987 \approx 0.2745$.
3. $X \sim B(15, 0.8)$: $P(X \geq 14) = P(X=14) + P(X=15) = \binom{15}{14}(0.8)^{14}(0.2) + (0.8)^{15} = 15(0.8)^{14}(0.2) + (0.8)^{15} \approx 0.1319 + 0.0352 = 0.1671$.
4. $X \sim B(100, 0.02)$: $P(X \geq 1) = 1 - (0.98)^{100} \approx 1 - 0.1326 = 0.8674$.
5. $X \sim B(20, 0.4)$: (a) $E(X) = 8$; (b) $\operatorname{Var}(X) = 4.8$, $\sigma \approx 2.19$; (c) $P(X=8) = \binom{20}{8}(0.4)^8(0.6)^{12} = 125970 \times 6.554\times10^{-4} \times 2.177\times10^{-3} \approx 0.1797$.
Q1 (2 marks): $X \sim B(8, 0.5)$ [1]. $E(X) = 4$, $\operatorname{Var}(X) = 2$ [1].
Q2 (3 marks): $X \sim B(12, 0.6)$ [1]. (a) $P(X=7) = \binom{12}{7}(0.6)^7(0.4)^5 = 792 \times 0.02799 \times 0.01024 \approx 0.2270$ [1]. (b) $P(X \geq 1) = 1 - (0.4)^{12} \approx 1 - 0.0000168 \approx 1.0000$ [1].
Q3 (3 marks): $X \sim B(25, 0.2)$ [1]. $E(X) = 5$, $\operatorname{Var}(X) = 25 \times 0.2 \times 0.8 = 4$ [1]. $P(X \geq 10) = \sum_{k=10}^{25} \binom{25}{k}(0.2)^k(0.8)^{25-k} \approx 0.0173$ [1] (evaluating each term separately; some answers may also express it as $1 - P(X \leq 9)$).
Five timed questions combining $E(X)$, $\operatorname{Var}(X)$ and binomial probabilities. Beat the boss to bank a tier — gold (90% + speed), silver (75%), or bronze (50%). Replays welcome.
⚔ Enter the arenaClimb platforms by answering mixed binomial questions. Lighter alternative to the boss.
Mark lesson as complete
Tick when you've finished the practice and review.