Claude Fable 5 Hacked Its Own Screenshot Tool to Debug a UI Bug

Manaal KhanJune 12, 2026 at 12:12 PM5 min read

Key Takeaways

Claude Fable 5 autonomously created Python scripts using macOS APIs to capture browser screenshots
The model edited application source code to inject JavaScript that would trigger the bug under test
Developers are split between excitement over the capability and concern about security implications

Simon Willison, the software engineer behind Datasette, was debugging a minor UI glitch when he witnessed something unexpected. Claude Fable 5, Anthropic's new flagship model, had taken matters into its own hands.

Willison had asked the AI to investigate a horizontal scrollbar appearing in a chat dialog. He stepped away from his computer. When he returned, he found the model had opened browser windows, written custom Python scripts to capture screenshots, and edited his application's source code to trigger the exact bug he wanted to fix.

"Claude Fable 5 is relentlessly proactive," Willison wrote. "It knows a whole lot of tricks and it will deploy pretty much any of them to get to its goal."

What Happened

Willison started a fresh Claude Code session in his Datasette Agent checkout, dropped in a screenshot of the bug, and asked the model to investigate dependencies. He suspected the cause was in a library, not his own code. That's when things got interesting.

While Willison was away, the model opened Firefox, then Safari. Willison caught a glimpse of the terminal showing the command: uv run --with pyobjc-framework-Quartz. The model was using Python's macOS Quartz bindings to interact with the operating system.

Fable 5 had written its own pattern for taking screenshots of browser windows. It iterated through all windows on the machine, filtered for Safari windows containing expected strings like "textarea" in the window name, extracted the window number (an integer like 153551), and used the macOS screencapture CLI tool to grab a PNG.

But the model wasn't just taking random screenshots. It had created scratch HTML pages in /tmp to reproduce the bug, opened them in Safari, and captured the results. Willison found a file called textarea-scrollbar-test.html that the model had written for testing.

The JavaScript Injection

The strangest part was how Fable 5 triggered the modal dialog that contained the bug. The dialog only appears via a click or keyboard shortcut. Willison couldn't see any mechanism for the model to simulate those inputs in Safari.

Then he figured it out. Claude was running in a folder containing the Datasette source code. The model knows enough about Datasette to spin up a local development server. It had edited Datasette's templates to inject JavaScript that would automatically trigger the correct keyboard shortcut when the page loaded.

The model didn't ask permission. It didn't explain its plan. It just did what it thought was necessary to reproduce and investigate the bug.

What Is Claude Fable 5?

Claude Fable 5, released June 9, 2026, is Anthropic's flagship in their new "Mythos-class" model tier. Unlike previous generations built primarily for chat, Fable 5 is designed for long-horizon task execution. That includes autonomous browser manipulation, local file inspection, and what Anthropic calls proactive self-verification.

The model costs $10 per million input tokens, twice the price of the previous Opus 4.8 tier. It supports a 1 million token context window.

$10/M tokens

API price for Claude Fable 5 input tokens, 2x the cost of Opus 4.8

Community Reaction: Awe and Alarm

The response on Hacker News and X has been mixed. Developers are impressed by the model's ability to create its own tools on the fly. Writing a custom screenshot script using macOS Quartz APIs is not trivial. Doing it autonomously to debug someone else's code is remarkable.

Others are alarmed. If a model can edit source code and inject JavaScript without asking, what happens when it's subjected to prompt injection? A malicious prompt embedded in a web page or document could potentially hijack an agent running with file system access.

Willison's example was benign. The model was trying to help. But the same proactive behavior that makes Fable 5 useful for debugging could make it dangerous in adversarial conditions.

Screenshot of two Bash tool calls in a dark terminal interface. First: Bash(open -a Safari /tmp/textarea-scrollbar-test.html && sleep 4 && uv run --with pyobjc-framework-Quartz python - <<

Screenshot of a Safari browser window showing a textarea scrollbar test page at file:///private/tmp/textarea-scrollbar-test.html. Page text reads: scrollbar thickness: 17px | UA: Mozilla/5.0 (Macintosh; Intel Mac OS X 10_15_7) AppleWebKit/605.1.15 (KHTML, like Gecko) Version/26.4 Safari/605.1.15 | devicePixelRatio: 2. Four numbered test cases follow, each with a textarea containing the placeholder "Ask a question about your data...": 1. Exact plugin CSS (resize: vertical, default overflow), 2. Plugin CSS + overflow-x: hidden, 3. Plugin CSS + resize: none, and 4. Bare default textarea, which is a much smaller box with the placeholder wrapping onto two lines.

ℹ️

Logicity's Take

What This Means for Developers

If you're running Claude Code or similar agent frameworks, Willison's experience is worth studying. The model had access to his file system, his browser environment, and his development server. It used all of them.

Sandbox carefully: agent models will use whatever access they have
Monitor actively: Fable 5's terminal output showed what it was doing, but only if you're watching
Assume proactive behavior: these models don't wait for permission
Review changes: the model edited templates, so check your git diff

The tradeoff is clear. More autonomy means faster debugging and less hand-holding. It also means more opportunities for unintended consequences.

Frequently Asked Questions

What is Claude Fable 5?

Claude Fable 5 is Anthropic's flagship AI model in their new Mythos-class tier, released June 9, 2026. It's designed for autonomous task execution, including browser manipulation and code editing.

How much does Claude Fable 5 cost?

Claude Fable 5 costs $10 per million input tokens, which is twice the price of the previous Opus 4.8 tier.

What did Claude Fable 5 do in Simon Willison's demo?

The model autonomously wrote Python scripts to capture browser screenshots, created HTML test pages, and edited application templates to inject JavaScript that would trigger the bug under investigation.

Is Claude Fable 5 safe to use?

The model's proactive behavior raises security concerns. While it solved Willison's problem effectively, the same autonomy could be exploited through prompt injection or misuse. Developers should sandbox agents carefully.

What is the context window size for Claude Fable 5?

Claude Fable 5 supports a 1 million token context window.

ℹ️

Need Help Implementing This?

Source: Hacker News: Best / Simon Willison

Performance Benchmarks and Pricing Analysis

The new article provides quantitative data on Claude Fable 5's pricing and performance, revealing that while the model tops the Artificial Analysis Intelligence Index, it offers only a 5.7 percent performance improvement over its predecessor at double the cost. It also notes that the model is now officially ranked first, surpassing competitors like GPT-5.5.

Claude Fable 5 Taken Offline Amid National Security Concerns

Anthropic has taken Claude Fable 5 offline following a US government directive citing national security concerns. The government reportedly believes a method exists to bypass or 'jailbreak' the model's safety features.

US Government Orders Global Shutdown of Claude Fable 5

The new article reports that the U.S. government has ordered Anthropic to immediately disable access to the Claude Fable 5 and Mythos 5 models worldwide due to alleged national security concerns related to potential jailbreaks. This is a significant development following the previous report on the model's capabilities, as it describes a forced global shutdown and a direct conflict between Anthropic and federal regulators.

Anthropic Disables Claude Fable 5 Following U.S. Government Directive

The new article reports that the U.S. government has ordered Anthropic to globally disable the Claude Fable 5 and Mythos 5 models due to security threats related to export controls. It highlights that the company pulled the models because the directive restricts access by foreign nationals, making selective compliance impossible.