TWO ANSWERS, ONE VOICE
The Most Valuable Thing an AI Can Say Is "I Don't Know"
The same model can make you forty percent better or nineteen points worse while sounding identically sure. Abstention is the only signal that tells the two apart.
SEVEN HUNDRED CONSULTANTS
Capability keeps no smooth edge
Two tasks of equal apparent difficulty can fall on opposite sides of the jagged frontier. The boundary between them is not a line. It is a coastline.
- 758 consultants, BCG field experiment
- Inside: +40% quality, +25% speed
- Outside: 19 points more likely wrong
- Equal-looking tasks, opposite sides
The cliff gives no warning
Most persuasive exactly where it is most wrong: you are most likely to be fooled at the moment the answer sounds its best.
NOTHING TO LEARN FROM
It guesses from the nearest neighbour
- Out of distribution: outside the training region
- The model snaps a new input to its nearest known case
- Hallucination runs strongest where there is nothing to draw on
An input the model has never seen is out of distribution: not too hard, just absent from everything it learned on.
RISK AND UNCERTAINTY
Uncertainty must be taken in a sense radically distinct from the familiar notion of Risk, from which it has never been properly separated.
Frank Knight, Risk, Uncertainty and Profit, 1921
A genuinely unprecedented event carries Knightian uncertainty: no reference class, no odds to estimate, nothing for the model to stand on.
THE WINS YOU CAN MEASURE
Real gains, every one inside the line
- Support agents: +14% average, +34% for novices (Brynjolfsson, Li, Raymond 2023)
- Scoped coding task: +55.8% faster (Peng 2023)
- Well specified, high frequency, cheap to check
The measurable upside lives in commoditised, checkable work: the wins you can measure are the ones inside the frontier, where the gap between making and checking stays narrow enough to trust.
FIND A VERIFIER
A verifier, not a vibe
- 01Generate many candidates, cheaply
- 02Test each against an external check: proof, physics, measurement
- 03Keep only what the check passes
- 04Where no cheap check exists, abstain
Reliable success pairs a model with an external check across a searchable verifiable space. When checking turns expensive, an honest I-don't-know is the cheapest verification of all.
TWO ENEMIES OF THE CHECK
Rarity floods the signal; fluency fools the judge
- Rarity: a one-in-10,000 event leaves about 99% of alerts false
- Rarity: the single true signal drowns in spurious flags
- Fluency: developers 19% slower measured, 20% faster felt (METR 2025)
- Fluency: confidence reads as competence
Rarity is the base-rate trap. Judging work you cannot command is the metacognitive demand. Together they make the verifier you were counting on fail quietly.
THE MAP NO ONE SHIPS
Flat confidence over a ragged coast
The keystone: confidence runs straight, competence runs ragged. An upgrade does not only push the frontier outward. It can drown ground that used to sit inside the model's reach.
- Stated confidence: a flat line
- Actual accuracy: jagged, then a cliff
- The gap between them is every silent failure
- An upgrade can flood ground that used to be safe