Simon Willison · June 11, 2026 · 1 min read

Anthropic Walks Back Policy That Could Have ‘Sabotaged’ AI Researchers Using Claude

Mirrored from Simon Willison for archival readability. Support the source by reading on the original site.

Anthropic Walks Back Policy That Could Have ‘Sabotaged’ AI Researchers Using Claude. Big scoop for Maxwell Zeff at Wired:

“We’re changing Fable 5’s safeguards for frontier LLM development to make them visible,” Anthropic said in a statement to WIRED. “We made the wrong tradeoff and we apologize for not getting the balance right.”

There's been a huge outcry about Anthropic's policy, tucked away in their system card, that Claude Fable/Mythos would identify "requests targeting frontier LLM development" and "limit effectiveness" without notifying the user.

It's very good news that they're dropping this.
