FRIDAY, APRIL 10, 2026 · VOL. XXVI · NO. 17
TechDispatch

Anthropic Built a Monster. Then Asked Everyone Else to Help Clean It Up.

Claude Mythos is either the most consequential cybersecurity model ever shipped, or the most expensive way to generate a press release. The coverage can't decide. That's the story.

By Chasing Seconds · APRIL 10, 2026 · 3 minute read


The most telling detail about Claude Mythos isn't what it can do. It's that Anthropic built it, got scared, and then called in the government.

That sequence matters. It's not a safety story. It's a power move — and the coverage of it is splitting exactly along the lines you'd expect.

What Actually Happened

Anthropic trained a general-purpose model that turned out to be, by their own account, unusually good at finding and exploiting software vulnerabilities. Good enough that they declined to release it publicly. Instead, they launched something called Project Glasswing — a coalition effort pulling in major tech companies and U.S. government stakeholders to use the model's capabilities defensively, patching the kinds of critical vulnerabilities the model itself could theoretically weaponize.

Platformer frames this as a genuine alarm bell. The concern is real: a model this capable at offensive security tasks represents a category shift, not just an incremental improvement. If Anthropic has it, others are close. The window to harden critical infrastructure is narrow.

Daring Fireball largely takes Anthropic at its word. The technical documentation from Anthropic's Frontier Red Team is detailed, the framing is careful, and the decision not to release publicly reads as credible restraint from a company that could have shipped and moved on.

Tom's Hardware doesn't buy it. Their piece lands the sharpest punch: those alarming claims about thousands of severe zero-day vulnerabilities rest on a sample of 198 manually reviewed cases. That's not a dataset. That's a demo. The conclusion — that Anthropic may be manufacturing urgency around a problem it's also selling the solution to — is uncomfortable, and it should be.

The Structural Problem Nobody Wants to Name

All three pieces cover the same announcement, and none of them fully resolves the central tension: Anthropic is simultaneously the entity that created the threat, the entity defining the threat's severity, and the entity leading the response to it. That's a closed loop. It doesn't mean they're wrong. It means you can't verify the claims without them.

The 198-review figure is doing a lot of work here. Anthropic's language around it — thousands of potential severe vulnerabilities — is extrapolated, not observed. Extrapolation from 198 manual reviews to an industry-wide crisis is a significant leap, and the fact that this leap appears in what is functionally a product announcement should make anyone careful.
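
To see how much that leap can swing, here's a minimal back-of-envelope sketch in Python. The only number taken from the coverage is the 198 reviewed cases; the confirmed-severe count and the size of the machine-flagged pool are hypothetical placeholders, since neither figure has been published.

```python
import math

def wilson_interval(k: int, n: int, z: float = 1.96) -> tuple[float, float]:
    """95% Wilson score interval for a binomial proportion k/n."""
    p = k / n
    denom = 1 + z**2 / n
    center = (p + z**2 / (2 * n)) / denom
    half = (z / denom) * math.sqrt(p * (1 - p) / n + z**2 / (4 * n**2))
    return center - half, center + half

# Hypothetical inputs: suppose 40 of the 198 reviewed cases were confirmed
# severe, sampled from a pool of 20,000 machine-flagged candidates.
# (Both 40 and 20,000 are illustrative assumptions, not reported figures.)
k, n, pool = 40, 198, 20_000

low, high = wilson_interval(k, n)
print(f"observed severe rate: {k/n:.1%}")
print(f"95% interval on the true rate: [{low:.1%}, {high:.1%}]")
print(f"extrapolated severe count: {pool*low:,.0f} to {pool*high:,.0f}")
# With these inputs the count estimate spans roughly 3,000 to 5,300,
# before accounting for any bias in how the 198 cases were chosen.
```

Even under these generous assumptions, the statistical interval alone spans nearly a factor of two, and it says nothing about false positives in the flagging pipeline or whether the 198 were a representative sample. "Thousands" can be defensible arithmetic and still be soft.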

But here's what I keep coming back to: the alternative isn't reassuring either. If the model is as capable as described and Anthropic is overstating the danger for positioning purposes, that's cynical. If they're understating it to avoid panic while quietly coordinating with the government, that's something else entirely. Neither version of this story is boring.

The coalition itself is real. Big Tech doesn't show up to joint press releases about AI safety out of altruism. They showed up because someone convinced them the threat was credible enough to act on — or because the optics of not showing up were worse. Probably both.

What the Coverage Misses

Every piece treats this primarily as a story about Claude Mythos. I think it's a story about the emerging playbook for how AI companies handle capability overhang — the gap between what a model can do and what the world is ready for.

Withholding a model and building a defensive coalition around it is new behavior. It might be the right behavior. But it also hands Anthropic extraordinary influence over which vulnerabilities get patched, in which order, on whose timeline. That's not a criticism of Project Glasswing. It's an observation about what it means when a private company becomes the de facto coordinator of critical infrastructure security.

The monster is real enough that people showed up. Whether the monster is exactly as described is a different question.

And Anthropic is the only one that's seen it.

End — Filed from the desk