Simon Willison · · 1 min read

Anthropic Walks Back Policy That Could Have ‘Sabotaged’ AI Researchers Using Claude

Mirrored from Simon Willison for archival readability. Support the source by reading on the original site.

11th June 2026 - Link Blog

Anthropic Walks Back Policy That Could Have ‘Sabotaged’ AI Researchers Using Claude. Big scoop for Maxwell Zeff at Wired:

“We’re changing Fable 5’s safeguards for frontier LLM development to make them visible.” Anthropic said in a statement to WIRED. “We made the wrong tradeoff and we apologize for not getting the balance right.”

There's been a huge outcry about Anthropic's policy, tucked away in their system card, that Claude Fable/Mythos would identify "requests targeting frontier LLM development" and "limit effectiveness" without notifying the user.

It's very good news that they're dropping this.

Discussion (0)

Sign in to join the discussion. Free account, 30 seconds — email code or GitHub.

Sign in →

No comments yet. Sign in and be the first to say something.

More from Simon Willison