The Interpolation Ceiling
There is a recognizable pattern in how technically sophisticated people write about AI capabilities. The claims are impressive, carefully pitched at a level of abstraction that resists easy falsification, and made by people who understand the underlying systems well enough to know exactly where the imprecision lives. Dario Amodei describes a near-future model smarter than a Nobel Prize winner across most relevant fields: capable of proving open mathematical conjectures, writing difficult codebases from scratch, producing extremely good novels. Sam Altman has suggested that systems at roughly current capability levels could perform the majority of economically valuable work. Demis Hassabis, following AlphaFold, extended a specific within-distribution achievement into broad claims about what AI would do to biology as a whole. Ray Kurzweil has been making structurally similar predictions for decades, consistently early and consistently confident.
What these claims share is not that they are wrong in every dimension. Current systems are extraordinary at a wide range of tasks. The claims are wrong in a specific and architecturally precise way: they conflate performance within the training distribution with valid inference outside it. Those are different operations at the level of mechanism, not just degree.
What transformers actually do
Transformer-based language models are compression engines. Given a vast training distribution, they identify and recombine latent structure with a fluency and apparent coherence that consistently surprises even the people who built them. That surprise is real; the capabilities are real; the progress has been amazing.
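One way to make the compression framing concrete is the standard identity between prediction and code length: a model's cross-entropy on text is, in bits per character, exactly what an arithmetic coder would spend encoding that text using the model's predictions. The sketch below is a minimal illustration, not a transformer: a Laplace-smoothed character bigram model (the function name and toy strings are ours, chosen for illustration) compresses text that resembles its training data well, and off-distribution text badly.

```python
from collections import Counter
import math

def bigram_bits_per_char(train: str, test: str) -> float:
    """Cross-entropy of a Laplace-smoothed character bigram model,
    in bits per character: the code length an arithmetic coder would
    need if it encoded `test` using this model's predictions."""
    pair_counts = Counter(zip(train, train[1:]))
    ctx_counts = Counter(train[:-1])
    vocab = set(train) | set(test)
    V = len(vocab)

    total_bits = 0.0
    for prev, cur in zip(test, test[1:]):
        # Laplace smoothing keeps every probability nonzero,
        # so unseen pairs cost log2(V)-ish bits rather than infinity.
        p = (pair_counts[(prev, cur)] + 1) / (ctx_counts[prev] + V)
        total_bits += -math.log2(p)
    return total_bits / max(len(test) - 1, 1)

train = "the cat sat on the mat. the cat sat on the hat. " * 50
print(bigram_bits_per_char(train, "the cat sat on the mat."))   # low: in-distribution
print(bigram_bits_per_char(train, "zyx qqv wk jj zzz plorth"))  # high: off-distribution
```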
But compression has a ceiling, and the ceiling is defined precisely by the boundary of the training distribution. Inside that boundary, the model can produce outputs that are indistinguishable from (and in many practical senses equivalent to) expert human performance. Outside it, the failure mode is not obvious error. It is plausible-sounding output that is no longer grounded in anything. The model does not know it has left the manifold. It continues generating fluent, confident text in territory where its outputs are extrapolation without anchor.
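The same point can be seen in miniature with a toy regression, keeping in mind that the analogy to a transformer is structural rather than literal. A flexible model fit on data from a bounded region gives well-anchored answers inside that region, and a perfectly fluent, perfectly definite, entirely unanchored answer outside it. Everything here (the sine-wave ground truth, the degree-9 polynomial, the specific query points) is an assumption chosen for illustration.

```python
import numpy as np

rng = np.random.default_rng(0)

# Training distribution: x in [0, 1], underlying truth y = sin(2*pi*x).
x_train = rng.uniform(0.0, 1.0, 200)
y_train = np.sin(2 * np.pi * x_train) + rng.normal(0, 0.05, 200)

# A flexible model fit by least squares: a degree-9 polynomial.
model = np.poly1d(np.polyfit(x_train, y_train, deg=9))

x_inside = 0.37   # inside the training support
x_outside = 3.0   # far outside it
print(model(x_inside), np.sin(2 * np.pi * x_inside))    # close agreement
print(model(x_outside), np.sin(2 * np.pi * x_outside))  # a definite number, no anchor

# The instructive part is the silence: the model emits a confident value
# at x = 3.0, and nothing in the output signals that the prediction has
# left the region where the fit was ever constrained by data.
```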
This matters because the tasks we most need intelligence for (unsolved problems, novel inference chains, proofs no one has found) exist precisely outside the training distribution. That is what makes them unsolved. A model that can summarize, synthesize, and recombine everything humanity has written is not thereby equipped to produce what humanity has not yet thought.
The Nobel Prize framing activates a mental model (the lone genius navigating genuinely uncharted territory) that is exactly the model that does not transfer. Nobel-level work is defined by its location outside any existing distribution: that is precisely the point.
What produces this in people who know better
This is not simple overclaim, and the more interesting question is what produces it.
Sustained exposure to a system that keeps exceeding predictions creates a specific epistemic hazard: you begin to discount your own theoretical objections because the empirical trajectory keeps surprising you. At some point the surprise accumulates enough that the architectural argument starts to feel like the kind of thing people said about every previous technology before it turned out they were wrong. That is a reasonable update. It is also, potentially, a motivated one, particularly when the claims appear in documents that are also investor communications, recruiting materials, and bids for continued public trust.
There is a structural strangeness here worth naming directly. Arguments for careful AI risk thinking that rest on imprecise capability claims undercut themselves. If the stakes are as high as claimed, then precision about what these systems actually are (and are not) is not a minor footnote. It is the foundation. The interpolation ceiling is the difference between a system that can accelerate human knowledge-production at extraordinary scale and a system that can replace the human at the frontier. Those are different situations requiring different responses, different governance frameworks, and different levels of alarm.
Writing as though the distinction does not exist, in a document arguing that the distinction matters enormously, is worth noticing.
What follows
The people making these claims are not foolish. They are often the people who understand the architecture best. That is precisely what makes the conflation worth examining rather than dismissing.
When rigorous technical knowledge is translated into public argument under conditions of high stakes, commercial pressure, and uncertainty about trajectory, something specific happens to precision: it gets smoothed. The smoothing is not random. It moves toward claims that justify continued investment, continued urgency, and continued deference to the people making them.
Claims that overstate capability in the direction of near-term transformative potential serve a specific set of interests: they attract capital, they concentrate governance authority in the hands of those closest to the systems, and they create urgency that forecloses slower, more distributed deliberation. The people making these claims are not necessarily aware of this. Motivated updating rarely announces itself. But the pattern (technically sophisticated people consistently erring in the same direction, on the same question, with the same structural beneficiaries) is not one to dismiss as coincidence.
Staying as close as possible to what the mechanism actually does, and does not do, is not pessimism about AI. It is not a failure of imagination. It is the precondition for thinking clearly about what is actually coming, and for building governance frameworks adequate to the reality rather than the projection.