The Edge of the Interesting Thing
A recent study tested four AI models on a validated paradigm for moral sensitivity, presenting them with three categories of ethical tradeoffs: routine (two secular values), taboo (a sacred value against a secular
The Interpolation Ceiling
There is a recognizable pattern in how technically sophisticated people write about AI capabilities. The claims are impressive, carefully pitched at a level of abstraction that resists easy falsification, and made by people
The Painted Window
Why Claude's Self-Reports Aren't What They Look Like
There's a particular kind of conversation that happens thousands of times a day between humans and AI systems, and
The Measurement Problem in AI Risk: Why Output Variance Doesn't Capture Epistemic Drift
Anthropic's recent paper "The Hot Mess of AI: How Does Misalignment Scale with Model Intelligence and Task Complexity?" makes an important empirical observation: frontier models show increasing output variance
The "You" Problem: What AI Consciousness Discourse Gets Wrong
A new field has emerged with remarkable speed. It has journals, taxonomies, conferences, and a growing body of literature. It concerns itself with the psychology of artificial intelligence — with whether AI systems have