Anthropic's AI Model Finds 10,000 Critical Bugs in One Month

Huma ShaziaMay 24, 2026 at 5:47 PM5 min read

Key Takeaways

Mythos Preview identified 10,000+ critical vulnerabilities across 1,000 open-source projects in one month
Independent reviewers confirmed 90.6% of flagged bugs were legitimate security flaws
The discovery rate now exceeds human patching capacity, creating a 'remediation bottleneck'

Project Glasswing's First Results

Anthropic has released the first results from Project Glasswing, a restricted cybersecurity program that gives select organizations access to its new AI model, Mythos Preview. The numbers are striking: more than 10,000 high or critical severity vulnerabilities identified across widely used open-source software in just one month.

Around 50 partners participated in the trial, including technology companies and research organizations. Mythos Preview scanned more than 1,000 open-source software projects during this period.

90.6%

of AI-flagged vulnerabilities confirmed as legitimate by independent security reviewers

Independent security firms reviewed 1,752 of the model's findings. They confirmed 90.6% were legitimate vulnerabilities. Of those, 62.4% qualified as genuinely high or critical risks.

Cloudflare and Mozilla Report 10x Improvement

Cloudflare found about 2,000 bugs in its internal software using Mythos Preview. Four hundred of those qualified as high or critical severity. The company reported the AI generated fewer false alarms than traditional human-led testing.

“After one month, most partners have each found hundreds of critical- or high-severity vulnerabilities in their software. Collectively, they've found more than ten thousand. Several have told us that their rate of bug-finding has increased by more than a factor of ten.”

— Anthropic blog post

Mozilla used Mythos Preview to analyze Firefox code and fixed 271 vulnerabilities in Firefox 150. The company said the new system performed significantly better than Anthropic's earlier Claude Opus 4.6 model.

The Remediation Bottleneck

Anthropic raised a concern that accompanies the good news: human teams may struggle to review and fix the large number of vulnerabilities uncovered by advanced AI systems.

“The bottleneck is no longer discovery, but remediation. We have essentially automated the process of finding security holes, but the speed of human patching has not kept pace.”

— Dario Amodei, CEO of Anthropic

The company noted in its blog post that there's often a long lag between discovering a vulnerability, creating a patch, and deploying that patch to end users. AI-powered discovery at this scale could widen that gap.

Partners are currently bound by a 90-day non-disclosure agreement that shields most technical details of the discovered bugs. This window allows maintainers time to implement fixes before vulnerabilities become public knowledge.

Legacy Code Under the Microscope

The AI's speed has revealed how many security issues have been hiding in older codebases. Some of these bugs have existed for decades without detection.

"Seeing a 27-year-old bug in a critical system library discovered in seconds was a sobering reminder that our legacy codebases have been hiding these ticking time bombs for decades." — Sarah Jenkins, Lead Security Analyst at the Cyber Verification Program

This finding underscores a broader pattern. Open-source software powers much of the internet's infrastructure. These projects are often maintained by small teams or even individual developers. A sudden influx of thousands of verified security reports could overwhelm them.

Community Response: Excitement and Concern

Discussion on Hacker News focused heavily on the remediation bottleneck. Many engineers expressed concern that high-quality vulnerability reports will overwhelm already under-resourced open-source maintainers.

Security-focused communities on Reddit are debating the ethics of AI models that could potentially generate exploits alongside discovery. Some are calling for clearer disclosure timelines beyond the current 90-day window.

What Happens Next

Anthropic hasn't announced plans to expand access beyond the current 50 partners. The company appears to be proceeding carefully, likely aware of the dual-use nature of such powerful vulnerability detection.

The immediate question for the software industry: how do you handle 10,000 critical bugs when your security teams were built for dozens? The tools for finding problems have leaped ahead. The systems for fixing them have not.

Logicity's Take

Frequently Asked Questions

What is Anthropic's Mythos Preview model?

Mythos Preview is Anthropic's new AI model designed specifically for cybersecurity applications. It scans software code to identify security vulnerabilities and has been tested through the company's restricted Project Glasswing initiative.

How accurate is AI bug detection compared to human testing?

Independent security firms confirmed 90.6% of Mythos Preview's flagged vulnerabilities were legitimate. Cloudflare reported the AI produced fewer false alarms than traditional human-led testing methods.

What is the 'remediation bottleneck' in AI security scanning?

The remediation bottleneck refers to the gap between AI-powered discovery of vulnerabilities (which is now extremely fast) and human capacity to review, prioritize, and patch those issues (which remains limited by available developer time).

How many open-source projects did Anthropic's AI scan?

Mythos Preview scanned more than 1,000 open-source software projects during the one-month trial period with 50 partner organizations.

What is the disclosure timeline for discovered bugs?

Partners are bound by a 90-day non-disclosure agreement that shields technical details of discovered vulnerabilities. This gives maintainers time to implement fixes before the bugs become public.

ℹ️

Need Help Implementing This?

Source: Tech-Economic Times / ET

A new technology is set to revolutionize the way AI agents learn and adapt, enabling them to accumulate wisdom and apply it to new situations. This innovation has the potential to significantly boost the reliability of AI agents, especially in complex tasks. By converting raw agent trajectories into reusable guidelines, this tech is poised to transform the AI landscape.

9 Apr 2026

Trending Tech·10 min

The Dark Side of AI: How Bots Are Fueling a Monetized Abuse Ecosystem

A recent analysis of 2.8 million Telegram messages reveals a shocking truth: AI-powered bots are being used to create and sell non-consensual intimate images. These bots can turn ordinary photos into synthetic nude images, and the abuse is being monetized through affiliate programs and subscription-based archives. The researchers behind the study are calling for stricter regulations to combat this growing problem.

9 Apr 2026

Trending Tech·8 min

AI's Secret Sauce: How Journalism Became the Unlikely Ingredient

A recent study reveals that AI chatbots rely heavily on journalistic sources for their quotes, with one in four coming from news outlets. This shocking discovery has significant implications for the media industry and our understanding of AI's information gathering processes. As AI technology continues to evolve, it's essential to consider the role of journalism in shaping its responses.

9 Apr 2026