كل المقالات
Trending Tech

Anthropic's AI Model Finds 10,000 Critical Bugs in One Month

Huma Shazia24 May 2026 at 5:47 pm5 دقيقة للقراءة
Anthropic's AI Model Finds 10,000 Critical Bugs in One Month

Key Takeaways

Anthropic's AI Model Finds 10,000 Critical Bugs in One Month
Source: Tech-Economic Times
  • Mythos Preview identified 10,000+ critical vulnerabilities across 1,000 open-source projects in one month
  • Independent reviewers confirmed 90.6% of flagged bugs were legitimate security flaws
  • The discovery rate now exceeds human patching capacity, creating a 'remediation bottleneck'

Project Glasswing's First Results

Anthropic has released the first results from Project Glasswing, a restricted cybersecurity program that gives select organizations access to its new AI model, Mythos Preview. The numbers are striking: more than 10,000 high or critical severity vulnerabilities identified across widely used open-source software in just one month.

Around 50 partners participated in the trial, including technology companies and research organizations. Mythos Preview scanned more than 1,000 open-source software projects during this period.

90.6%
of AI-flagged vulnerabilities confirmed as legitimate by independent security reviewers

Independent security firms reviewed 1,752 of the model's findings. They confirmed 90.6% were legitimate vulnerabilities. Of those, 62.4% qualified as genuinely high or critical risks.

Cloudflare and Mozilla Report 10x Improvement

Cloudflare found about 2,000 bugs in its internal software using Mythos Preview. Four hundred of those qualified as high or critical severity. The company reported the AI generated fewer false alarms than traditional human-led testing.

After one month, most partners have each found hundreds of critical- or high-severity vulnerabilities in their software. Collectively, they've found more than ten thousand. Several have told us that their rate of bug-finding has increased by more than a factor of ten.

— Anthropic blog post

Mozilla used Mythos Preview to analyze Firefox code and fixed 271 vulnerabilities in Firefox 150. The company said the new system performed significantly better than Anthropic's earlier Claude Opus 4.6 model.

The Remediation Bottleneck

Anthropic raised a concern that accompanies the good news: human teams may struggle to review and fix the large number of vulnerabilities uncovered by advanced AI systems.

The bottleneck is no longer discovery, but remediation. We have essentially automated the process of finding security holes, but the speed of human patching has not kept pace.

— Dario Amodei, CEO of Anthropic

The company noted in its blog post that there's often a long lag between discovering a vulnerability, creating a patch, and deploying that patch to end users. AI-powered discovery at this scale could widen that gap.

Partners are currently bound by a 90-day non-disclosure agreement that shields most technical details of the discovered bugs. This window allows maintainers time to implement fixes before vulnerabilities become public knowledge.

Legacy Code Under the Microscope

The AI's speed has revealed how many security issues have been hiding in older codebases. Some of these bugs have existed for decades without detection.

"Seeing a 27-year-old bug in a critical system library discovered in seconds was a sobering reminder that our legacy codebases have been hiding these ticking time bombs for decades." — Sarah Jenkins, Lead Security Analyst at the Cyber Verification Program

This finding underscores a broader pattern. Open-source software powers much of the internet's infrastructure. These projects are often maintained by small teams or even individual developers. A sudden influx of thousands of verified security reports could overwhelm them.

Community Response: Excitement and Concern

Discussion on Hacker News focused heavily on the remediation bottleneck. Many engineers expressed concern that high-quality vulnerability reports will overwhelm already under-resourced open-source maintainers.

Security-focused communities on Reddit are debating the ethics of AI models that could potentially generate exploits alongside discovery. Some are calling for clearer disclosure timelines beyond the current 90-day window.

What Happens Next

Anthropic hasn't announced plans to expand access beyond the current 50 partners. The company appears to be proceeding carefully, likely aware of the dual-use nature of such powerful vulnerability detection.

The immediate question for the software industry: how do you handle 10,000 critical bugs when your security teams were built for dozens? The tools for finding problems have leaped ahead. The systems for fixing them have not.

Also Read
Claude Code Discovers AI Scaling Algorithms Humans Missed

More on Anthropic's AI capabilities in software development

Also Read
NSA to Use Anthropic's Claude Despite Pentagon Supply Chain Ban

Context on Anthropic's growing role in government security

ℹ️

Logicity's Take

Frequently Asked Questions

What is Anthropic's Mythos Preview model?

Mythos Preview is Anthropic's new AI model designed specifically for cybersecurity applications. It scans software code to identify security vulnerabilities and has been tested through the company's restricted Project Glasswing initiative.

How accurate is AI bug detection compared to human testing?

Independent security firms confirmed 90.6% of Mythos Preview's flagged vulnerabilities were legitimate. Cloudflare reported the AI produced fewer false alarms than traditional human-led testing methods.

What is the 'remediation bottleneck' in AI security scanning?

The remediation bottleneck refers to the gap between AI-powered discovery of vulnerabilities (which is now extremely fast) and human capacity to review, prioritize, and patch those issues (which remains limited by available developer time).

How many open-source projects did Anthropic's AI scan?

Mythos Preview scanned more than 1,000 open-source software projects during the one-month trial period with 50 partner organizations.

What is the disclosure timeline for discovered bugs?

Partners are bound by a 90-day non-disclosure agreement that shields technical details of discovered vulnerabilities. This gives maintainers time to implement fixes before the bugs become public.

ℹ️

Need Help Implementing This?

Source: Tech-Economic Times / ET

H

Huma Shazia

Senior AI & Tech Writer

اقرأ أيضاً

رأي مغاير: كيف يؤثر اختراق الأمن الداخلي الأميركي على شركاتنا الخاصة؟
الأمن السيبراني·8 د

رأي مغاير: كيف يؤثر اختراق الأمن الداخلي الأميركي على شركاتنا الخاصة؟

في ظل اختراق عقود الأمن الداخلي الأميركي مع شركات خاصة، نناقش تأثير هذا الاختراق على مستقبل الأمن السيبراني. نستعرض الإحصاءات الموثوقة ونناقش كيف يمكن للشركات الخاصة أن تتعامل مع هذا التهديد. استمتع بقراءة هذا التحليل العميق

عمر حسن·
الإنسان في زمن ما بعد الوجود البشري: نحو نظام للتعايش بين الإنسان والروبوت - Centre for Arab Unity Studies
الروبوتات·8 د

الإنسان في زمن ما بعد الوجود البشري: نحو نظام للتعايش بين الإنسان والروبوت - Centre for Arab Unity Studies

في هذا المقال، سنناقش كيف يمكن للبشر والروبوتات التعايش في نظام متكامل. سنستعرض التحديات والحلول المحتملة التي تضعها شركات مثل جوجل وأمازون. كما سنلقي نظرة على التوقعات المستقبلية وفقًا لتقرير ماكنزي

فاطمة الزهراء·
إطلاق ناسا لمهمة مأهولة إلى القمر: خطوة تاريخية نحو استكشاف الفضاء
أخبار التقنية·7 د

إطلاق ناسا لمهمة مأهولة إلى القمر: خطوة تاريخية نحو استكشاف الفضاء

تعتبر المهمة الجديدة خطوة هامة نحو استكشاف الفضاء وتطوير التكنولوجيا. سوف تشمل المهمة إرسال رواد فضاء إلى سطح القمر لconducting تجارب علمية. ستسهم هذه المهمة في تطوير فهمنا للفضاء وتحسين التكنولوجيا المستخدمة في استكشاف الفضاء.

عمر حسن·