The Intelligence Gap

A person working on a balcony at dusk while green time drains from their watch toward a faint orbital station in the sky

We no longer have access to Fable 5. For three days we did. It showed up on Max and Pro on June 9, “free” (really just folded into the subscription we already pay for), the most capable model Anthropic had ever shipped, and three days later it was gone, pulled for everyone after a US government export-control order. Anthropic says it’s working to bring it back. Even when it does, the plan was always to move it off the flat-rate plans onto metered API pricing, somewhere around 10 and 50 dollars per million tokens, double Opus, and a single Fable 5 session was already eating through plan limits about twice as fast.

Fable 5’s edge is the horizon. You hand it a goal and it works for hours, planning across stages and checking its own work, then comes back close to done. Opus is great, but Opus I have to steer. I write the spec, I size the work to fit a context window, I set up the verification, then it builds what I wanted. With Opus, more of the work stays mine.

We were told AI would flatten this. Everyone gets a genius assistant, the playing field levels. But the genius you don’t have to babysit, the one that reaches furthest, costs the most, and it’s the first thing to get locked away.

Horizon is the thing you’re buying

There’s a number for this now. METR tracks a model’s “time horizon,” the length of task it can finish on its own, sized by how long the same job would take a human expert. That horizon has been doubling roughly every seven months for years, and it’s sped up to about every three to four months lately, three or four doublings a year. Whether the model can answer a hard question matters less than this. At the top, you pay for how long it’ll run before it needs you again.

The advantage compounds with length. The longer the task, the further the frontier model pulls ahead of the cheap one, because the cheap one hits a point where it needs a human checkpoint and the expensive one keeps going. So the gap is really about who gets to hand the work off and walk away.

The gap is how much you still have to babysit

With a cheaper model the work doesn’t disappear, it moves to you. You supply the clear spec, the tests, the taste for what good looks like. With the expensive long-horizon model, it supplies a lot of that itself, it infers the goal and fills in the judgment.

So the money buys back the hours you’d have spent steering. The person who can afford the frontier model delegates and goes to dinner. The person who can’t runs Opus and spends the evening writing specs and checking output. Getting good at that, building the verification so a cheaper model still lands what you wanted, is the whole subject of Loop Engineering for the Rest of Us.

The Stanford study of a hundred thousand developers found the gains are wildly uneven, near zero on complex work in an existing codebase, up to thirty or forty percent on simple greenfield projects, averaging around twenty. DORA found that teams adding AI without strong testing and version control get more instability, not less. The verification is real labor. It doesn’t vanish when AI shows up. It just sits with whoever can’t pay to make it vanish.

Open models are right behind, and that’s the trap

The obvious objection is that this is temporary. Open-weight models are catching up fast, the gap between the best closed model and the best open one is down to about four months on Epoch’s tracker. Today’s 50-dollar capability is next year’s free download.

But the frontier keeps moving, so the four-month gap doesn’t close, it travels. The people who can pay are always sitting at the newest, longest-horizon, least-babysitting model, and everyone else is running something a season behind that needs more steering. It’s a recency tax, you pay to stay current or you pay in the extra work of running last year’s model.

Paying in hours instead of dollars

The closest picture is In Time, the movie where people pay for everything with hours of their own life. That’s the cost side. Elysium is the other half, and yeah, it’s the obvious reach, the rich up on their orbital station with machines that fix anything while Earth grinds below. Whoever runs the newest, longest-horizon model saves the time and reaches things the rest of us can’t yet.

I don’t think the move is to feel doomed about it, and I don’t have a clean fix. What I do know is where the work goes when you can’t buy your way out of it. It goes into the spec, into the verification, into the taste you encode so a cheaper model can still produce something good. For most of us that work is the job now, and it’s what the next few posts are about.