Marcus on AI · · 1 min read

Claude Mythos, evaluated

Mirrored from Marcus on AI for archival readability. Support the source by reading on the original site.

Very interesting evaluation from the UK’s AI Security Institute of the not yet publicly available Claude Mythos Preview.

On the happy side, in its current form, Myth is nowhere near as scary as Tom Fridman (who worries about schoolchildren accidentally taking down power grids) and others made it out to be.

On the darker side, it really does arm attackers to a greater degree than Mythos’s predecessors.

Here’s the scariest part of the thread on X:

and here’s the part (see second paragraph) that gives a little bit of comfort:

One hopes that by now no mission-critical infrastructure is “small, weakly defended, and vulnerable” with ready network access. One hopes.

You can read a longer report here.

I agree with their conclusion that

Even if Mythos was somewhat oversold in the media, the time to get our cybersecurity house in order is now (or better yet last year) — especially given the sudden profusion of agent-written code that may in fact be both weakly defended and vulnerable.

Subscribe now

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

Sign in →

No comments yet. Sign in and be the first to say something.

More from Marcus on AI