Episode 4 · Cognitive biases · Research notes
WWII planes came home riddled with bullet holes, and the obvious fix was to armor where the damage was. A statistician named Abraham Wald said the opposite: armor the clean spots. The planes hit there never came back to be counted. That gap between the winners you see and the losers you don't is survivorship bias.
Abraham Wald fled the Nazis in 1938; most of his family was later killed at Auschwitz. In America he joined the Statistical Research Group at Columbia, a secret team of elite statisticians. His 1943 memoranda, "A Method of Estimating Plane Vulnerability Based on Damage of Survivors," used only the returning planes to reconstruct the damage the missing planes must have taken. The skill was not refusing to look at winners; it was modeling the filter that produced them.
The logic is real. Most of the drama around it is not. Wald wrote about "a plane," generically; the heroic B-17 framing came later. He never wrote "armor the engines," that memorable line is a distillation by later authors (notably Jordan Ellenberg), not a quote. The scene of Wald facing down stubborn generals is undocumented. And the iconic diagram of a plane peppered with red dots was made by designer Cameron Moll around 2005, his own dots on a modern aircraft outline. Over the decades, only the tidy version survived, which is itself a quiet case of survivorship bias.
Mutual funds that fail get shut down and quietly drop out of the long-run averages, so the surviving-fund average overstates real returns; correcting for the dead funds can pull a headline figure down by a percentage point or more. "This billionaire dropped out of college" shows you Gates and Jobs, not the vast invisible crowd who dropped out and did not win. "They don't build them like they used to" compares today's average to history's survivors, after the flimsy majority already rotted away. In each case the missing denominator, the failures nobody kept, is exactly the data you never see.
No, and that is the honest part. Wald looked at winners; survivor data can be perfectly valid. The error is only ignoring the filter, and the danger scales with how tightly survival is tied to the thing you are measuring. Some famous examples are shakier than they sound, the claim that cats fall from higher floors and get hurt less is a real study but a contested survivorship reading, with a later study finding the opposite. And in controlled experiments (Enke, 2020), many people do ignore hidden data, but a single reminder to "consider what you are not being shown" roughly halved the mistake. It is a default that slips under load, not a fixed flaw.