Statistics: All Roads Lead to Hypothesis Testing
Whether you are testing a coin, a correlation, or a variance, every hypothesis test asks exactly the same question. Once you see this, the subject stops feeling like a collection of unrelated procedures and becomes one repeating idea.
The unifying idea — every time
Assume the null hypothesis H₀ is true. Ask: how probable is it that we would see data at least this extreme? If that probability is small enough, we have evidence against H₀.
For a two-tailed test at significance level α, split the region equally: compare each tail to α/2. For tests that use a critical value from tables (PMCC, Spearman, Wilcoxon), the table already encodes this threshold — the underlying logic is identical.
1. Binomial distribution
A-level MathsEdexcelAQAOCROCR MEIUsed when counting the number of successes in a fixed number of independent trials, each with the same probability of success. The classic example is testing whether a coin (or process) is biased.
Setup
Under H₀, the test statistic X (number of successes) follows:
Decision rule
2. Normal distribution (z-test)
A-level MathsEdexcelAQAOCROCR MEIUsed to test the mean of a normally distributed population when the population variance σ² is known. The sample mean X̄ is itself normally distributed.
Setup
Under H₀:
Decision rule
3. Product moment correlation coefficient (PMCC)
A-level MathsEdexcelAQAOCROCR MEITests whether there is linear correlation between two variables in a bivariate normal population. The null hypothesis is always that the population correlation ρ is zero.
Setup
The test statistic is the sample PMCC r. The critical values of r are read from tables for the given n and significance level; they implicitly encode P(R ≥ r | ρ = 0) = α.
Decision rule
4. Poisson distribution
A-level Further MathsEdexcel (Further Statistics 1)AQA (Statistics)OCR (Statistics)OCR MEI (Statistics)Used when counting events that occur randomly in a fixed interval of time or space. Tests whether the underlying rate λ has changed from a known baseline.
Setup
Under H₀:
Decision rule
5. Geometric distribution
A-level Further MathsAQA (Statistics)OCR MEI (Statistics)Models the number of trials needed to achieve the first success. Useful for testing whether the underlying probability of success has changed.
Setup
Under H₀, X = number of trials to first success has distribution:
A lower probability of success means we expect to wait longer, so H₁: p < p₀ corresponds to large values of X.
Decision rule
6. Negative binomial distribution
A-level Further MathsOCR MEI (Statistics)Extends the geometric distribution to the number of trials needed to achieve the r-th success. It is examined in OCR MEI Further Maths and uses the same hypothesis-testing framework.
Setup
Under H₀, X = number of trials to r-th success:
Decision rule
7. Chi-squared test
A-level Further MathsEdexcel (Further Statistics 1)AQA (Statistics)OCR (Statistics A)OCR MEI (Statistics)The χ² test appears in two forms: goodness of fit (does data follow a proposed distribution?) and independence (are two categorical variables related?). The test statistic and decision rule are the same in both cases.
Test statistic
where Oi are observed frequencies, Ei are expected frequencies, and ν is the degrees of freedom:
Decision rule
8. Spearman's rank correlation coefficient
A-level Further MathsEdexcel (Further Statistics 1)AQA (Statistics)OCR MEI (Statistics)A non-parametric alternative to the PMCC. It tests for monotonic association between two variables by ranking the data — no normality assumption is needed.
Setup
The test statistic is:
where di is the difference between ranks for each pair.
Decision rule
9. F distribution
A-level Further MathsEdexcel (Further Statistics 2)AQA (Statistics)OCR MEI (Statistics)Used to compare the variances of two independent normal populations. The F statistic is the ratio of two sample variances. By convention, the larger sample variance goes in the numerator so the observed F is always ≥ 1.
Setup
Under H₀:
Decision rule
10. Student's t distribution
A-level Further MathsEdexcel (Further Statistics 1)AQA (Statistics)OCR (Statistics A)OCR MEI (Statistics)Used to test a population mean when the variance is unknown and must be estimated from the sample. This is more realistic than the z-test. A two-sample version tests whether two population means are equal.
One-sample setup
Under H₀:
Decision rule
Two-sample t-test
To compare the means of two independent populations with unknown but equal variances (pooled t-test):
11. Wilcoxon rank-sum test (Mann–Whitney)
A-level Further MathsEdexcel (Further Statistics 2)AQA (Statistics)OCR MEI (Statistics)A non-parametric test for comparing two independent populations — useful when data are not normally distributed. It tests whether the two populations have the same distribution (or equivalently, the same median).
Procedure
- Combine all n₁ + n₂ observations and rank them from 1 to n₁ + n₂.
- Sum the ranks for one group: W = sum of ranks for group 1.
- Compare W to the critical values Wlower and Wupper from tables.
Decision rule
Summary: the same question, eleven ways
| Test | Level | Model under H₀ | "More extreme" means |
|---|---|---|---|
| Binomial | Maths | X ~ B(n, p₀) | X ≥ x or X ≤ x |
| Normal z-test | Maths | Z ~ N(0, 1) | |Z| ≥ z or Z ≥ z |
| PMCC | Maths | ρ = 0 | |r| ≥ rcrit |
| Poisson | Further | X ~ Po(λ₀) | X ≥ x or X ≤ x |
| Geometric | Further | X ~ Geo(p₀) | X ≤ x or X ≥ x |
| Negative binomial | Further | X ~ NB(r, p₀) | X ≥ x or X ≤ x |
| Chi-squared | Further | χ² ~ χ²(ν) | χ² ≥ χ²obs |
| Spearman's rank | Further | ρs = 0 | |rs| ≥ rs,crit |
| F distribution | Further | F ~ F(ν₁, ν₂) | F ≥ Fcrit |
| Student's t | Further | T ~ t(n−1) | |T| ≥ tcrit |
| Wilcoxon rank sum | Further | same median | W ≤ Wlo or W ≥ Whi |
