Multilevel Model: Variance Decomposition
How much of the variation in school achievement is persistent school quality, shared cohort effects, or subject-specific? A multilevel binomial model decomposes the variance properly — accounting for school size, bounded outcomes, and province-level test difficulty shifts.
Variance decomposition
On the logit scale, total variance in school×year×subject achievement splits into four components:
% School
Persistent school quality — stable across years and subjects. Driven by SES, demographics, staffing.
% Subject profile
Persistent school×subject divergence — e.g. a school that is consistently stronger in Math than its overall level.
% Cohort
Year-specific school effect shared across all three subjects — the general cohort factor.
% Noise
Observation-level variation — sampling noise plus any true within-cell fluctuation.
Within-school decomposition
Zooming in on the within-school variance only (excluding persistent school quality):
Shrinkage: raw vs model-estimated school quality
The multilevel model pulls noisy small-school estimates toward the grand mean — the classic Gelman shrinkage effect. Each dot is one school; the x-axis is the raw mean L3/4 proportion (averaged across all years and subjects), the y-axis is the model's partial-pooling estimate.
Cohort effects over time
The cohort effect (school×year random intercept minus school random intercept) captures the year-specific shared component — how much better or worse a school's entire cohort performed relative to the school's long-run average, across all subjects.
The cohort effects are centred near zero each year (by construction — the fixed effects absorb year-level means). The spread shows how much schools vary in their year-to-year fortunes. Wider distributions indicate more cohort-driven volatility.
Model progression
Subject profile effects
A school's subject profile is its persistent subject-specific deviation — how much stronger or weaker it is in one subject relative to its own overall level. A school with a positive Math profile is consistently better at Math than its school intercept alone would predict, across all years.
Each dot below is one school×subject pair; x-axis is the subject profile on the logit scale (positive = relatively stronger, negative = relatively weaker). Schools near zero have uniform profiles across subjects.
Interpretation
The dominant source of achievement variation is persistent school quality (
Persistent subject profiles account for
The "general cohort factor" accounts for
Observation-level noise is
Practical implications:
- A single year's achievement score for a small school tells you mostly about persistent school quality, a little about the cohort, and almost nothing subject-specific. Partial pooling (shrinkage) gives better estimates than raw percentages.
- Year-over-year changes at a school are ~
% "real cohort" and the rest is subject-specific noise. Attributing a one-year swing to a curriculum intervention requires more evidence. - Persistent subject profiles mean that a school's relative strength in Math vs Reading is a real,
stable feature — not just year-to-year noise. But at
% of total variance, it is a small effect compared to overall school quality. - The logit-scale model properly handles schools near the ceiling (>90% L3/4) or floor (<20%), where percentage-point changes are compressed. A 5pp gain at 50% is a different signal than 5pp at 90%.