What Is the AI Cybersecurity Threat? How Claude Mythos Found a 27-Year-Old Vulnerability
Claude Mythos found thousands of zero-day vulnerabilities including a 27-year-old OpenBSD bug. Learn what this means for cybersecurity and AI safety.
A 27-Year-Old Bug Nobody Found Until an AI Looked
Security researchers spend careers hunting for vulnerabilities. Entire teams at major companies run bug bounty programs, conduct penetration testing, and comb through codebases line by line. And yet, a bug that had been sitting in OpenBSD since around 1997 went undetected until an AI found it.
That AI was Claude, working as part of Anthropic’s security research project known as Claude Mythos. The discovery wasn’t a one-off. The project surfaced thousands of potential zero-day vulnerabilities across real-world codebases — raising urgent questions about what AI cybersecurity threats actually look like, who benefits from this capability, and whether we’re ready for what comes next.
This article explains what Claude Mythos is, how it works, and why a single AI security project is forcing the entire cybersecurity community to rethink its assumptions.
What Is Claude Mythos?
Claude Mythos is a security research initiative developed by Anthropic to test and demonstrate Claude’s ability to identify real-world software vulnerabilities at scale. Unlike traditional security tools that rely on pattern matching or known exploit signatures, Claude Mythos uses Claude’s large language model capabilities to reason about code — understanding context, intent, and potential failure modes the way a human security researcher might.
The project operates by feeding Claude large volumes of source code and prompting it to identify security flaws. What makes it different from prior automated scanning tools is that Claude can understand why a particular code pattern might be dangerous, not just whether it matches a list of known bad patterns.
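The actual Mythos pipeline is not public, but the loop described above — splitting source code into windows and prompting a model to reason about each one — can be sketched roughly as follows. `ask_model` here is a stand-in for a call to any LLM API, not a real Anthropic function, and the prompt wording is invented for illustration:

```python
import textwrap

# Hypothetical sketch of an AI-assisted scanning loop. `ask_model` is a
# placeholder for any LLM API call; nothing here reflects the real
# Claude Mythos implementation.

PROMPT = textwrap.dedent("""\
    You are a security auditor. Review the following code for
    memory-safety and logic flaws. Describe any finding you see.

    {chunk}
    """)

def chunk_source(source: str, max_lines: int = 200, overlap: int = 20):
    """Split a file into overlapping windows so a flaw that spans a
    chunk boundary still appears whole in at least one window."""
    lines = source.splitlines()
    step = max_lines - overlap
    chunks = []
    for start in range(0, max(len(lines), 1), step):
        window = lines[start:start + max_lines]
        if window:
            chunks.append("\n".join(window))
        if start + max_lines >= len(lines):
            break
    return chunks

def scan(source: str, ask_model) -> list:
    """Send each window to the model and collect non-empty findings."""
    findings = []
    for chunk in chunk_source(source):
        report = ask_model(PROMPT.format(chunk=chunk))
        if report.strip():
            findings.append(report)
    return findings
```

The overlap between windows matters: without it, a vulnerability whose setup and trigger straddle a chunk boundary would never be visible in a single prompt.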
Anthropic designed Mythos explicitly to understand what capable AI systems can do in the security domain — and to inform their safety research accordingly. The findings were significant enough to change how the company thinks about AI risk in this space.
What Claude Mythos Actually Found
The headline number is striking: Claude Mythos reportedly identified thousands of zero-day vulnerabilities across production codebases. Zero-day means these were previously unknown vulnerabilities — ones no patch existed for and no CVE (Common Vulnerabilities and Exposures) entry had been filed.
The most attention-grabbing find was a vulnerability in OpenBSD, one of the most security-focused operating systems ever developed. OpenBSD has a long reputation for aggressive security auditing. The bug Claude found had been present in the codebase for approximately 27 years — introduced around 1997 and never caught despite decades of human review.
That single discovery encapsulates both the promise and the danger of AI-assisted security research.
How AI Finds Vulnerabilities Humans Miss
To understand why this is such a big deal, it helps to understand why human security researchers miss things.
The Scale Problem
Modern software codebases are enormous. The Linux kernel alone contains tens of millions of lines of code; even OpenBSD's deliberately lean base system runs to millions. Even dedicated security teams can only audit a fraction of any given codebase at any time. Humans prioritize — they focus on recently modified code, high-risk components, or areas where previous vulnerabilities were found.
This means vast sections of older, “stable” code never get serious scrutiny. If a bug slips into a codebase in 1997 and doesn’t cause obvious crashes or erratic behavior, it can sit dormant for decades.
The Context Problem
Many serious vulnerabilities aren’t obvious in isolation. A function might look safe on its own but become exploitable only when called from a specific context, combined with a particular input type, or paired with another function that modifies shared state.
Human reviewers can get fatigued or mentally anchor to how code should work rather than how it does work under adversarial conditions. An AI system doesn’t get tired, doesn’t carry assumptions about a developer’s intent, and can trace data flows across the entire codebase simultaneously.
What Claude Does Differently
Claude reasons about code semantically. It understands that a buffer might overflow not because of an obvious off-by-one error, but because of an implicit assumption buried in a type conversion three function calls earlier. It can hold the entire context of a complex vulnerability chain in working memory in a way that’s difficult for humans to sustain.
This makes it particularly good at finding what security researchers call “logic vulnerabilities” — flaws that don’t look like bugs at all until you understand the full execution path.
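A toy illustration (invented for this article, not drawn from the Mythos findings) shows the shape of such a flaw. Each function below looks correct in isolation; the bug only exists in the combination, because the validation step counts characters while the storage layer is limited by bytes:

```python
# Minimal illustration of a cross-function logic vulnerability.
# Neither function is wrong on its own — the flaw lives in the
# mismatch between their assumptions.

MAX_FIELD = 8  # storage limit, in BYTES

def validate(name: str) -> str:
    # Looks correct: rejects anything "too long"...
    if len(name) > MAX_FIELD:          # ...but counts CHARACTERS
        raise ValueError("name too long")
    return name

def store(name: str) -> bytes:
    # Multi-byte characters expand here: 8 characters can become
    # 16 bytes. A fixed-size buffer at this layer would overflow;
    # we just return the encoded value so the size is observable.
    return name.encode("utf-8")

def save(name: str) -> int:
    return len(store(validate(name)))
```

The string `"é" * 8` is 8 characters but 16 UTF-8 bytes: it sails through `validate()` yet arrives at the storage layer at double the limit. Spotting this requires holding both functions' assumptions in view at once — exactly the whole-codebase reasoning described above.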
The OpenBSD Vulnerability: Why It Matters
OpenBSD isn’t a random codebase. It was created specifically with security as the primary design goal. The team behind it has historically been more aggressive about security auditing than virtually any other operating system project. The phrase “only two remote holes in the default install, in a heck of a long time” was long used as a marketing tagline.
A 27-year-old vulnerability surviving in OpenBSD isn’t just a technical footnote. It’s a statement about the limits of human-led security processes.
What This Says About Legacy Code
Most of the software running the world’s infrastructure is old. Banking systems, industrial controls, telecommunications infrastructure, and government systems run on code that was written decades ago by developers who are often no longer around. That code has been modified, extended, and patched — but rarely audited end-to-end.
If Claude Mythos can find a 27-year-old bug in one of the most security-conscious codebases ever written, the implication for less rigorously audited legacy systems is sobering.
The Responsible Disclosure Question
Anthropic reportedly handled findings through responsible disclosure processes — notifying affected maintainers before publishing details, giving time for patches to be developed. This is the established norm in security research, and it’s the right call.
But it raises a harder question: what happens when a less responsible actor runs similar AI-assisted vulnerability research and doesn’t disclose? The same capability that makes Claude Mythos useful for defense makes it dangerous in other hands.
The Dual-Use Problem in AI Cybersecurity
This is where the AI cybersecurity threat conversation gets complicated.
Every security capability is inherently dual-use. A tool that helps defenders find and patch vulnerabilities is also a tool that helps attackers find and exploit them. This has always been true — fuzzing tools, disassemblers, and exploit frameworks like Metasploit all exist on this spectrum.
But AI changes the scale and accessibility equation significantly.
The Democratization of Offensive Capability
Before AI-assisted security research, finding zero-day vulnerabilities at scale required significant expertise — years of specialized training, deep knowledge of specific platforms and architectures, and a lot of time. It was hard, and the barrier to entry was high enough to limit who could do it.
AI lowers that barrier. A technically savvy but non-expert user can now potentially direct an AI system to hunt for vulnerabilities in ways that previously required a senior security engineer. This expands both the pool of defenders and, critically, the pool of potential attackers.
Nation-State and Criminal Actor Risks
Nation-state actors already employ large teams of skilled security researchers. For them, AI-assisted vulnerability discovery isn’t democratization — it’s acceleration. If Claude Mythos can find thousands of zero-days at scale, similar capabilities in the hands of state-sponsored groups could mean a dramatic increase in both the number and speed of sophisticated attacks.
Criminal ransomware groups are also increasingly sophisticated. Access to AI-assisted vulnerability research could let smaller criminal operations punch far above their weight.
Anthropic’s Position
Anthropic has been unusually transparent about this tension. The Mythos project wasn’t just about finding vulnerabilities — it was partly research into AI risk itself. Understanding what Claude can do in the security domain informs how Anthropic builds safety guardrails, how they think about model deployment, and what capabilities they choose to restrict or control.
This reflects a broader industry challenge: you can’t build meaningful safety constraints around capabilities you don’t fully understand.
What This Means for Enterprise Security Teams
For security professionals and IT leaders, Claude Mythos should trigger a few concrete responses.
Reassess Your Legacy Code Risk
If you have code that’s been running for more than five years without a thorough security audit, you should treat it as higher risk than you previously did. The argument that “if it had a serious bug, someone would have found it by now” is weaker than it used to be. AI-assisted discovery is changing what’s findable.
AI-Assisted Defense Is Now Table Stakes
The same capabilities that make AI useful for finding vulnerabilities make it useful for defending against them. Security teams that aren’t incorporating AI-assisted code review, automated vulnerability scanning, and AI-augmented threat detection are increasingly at a disadvantage.
Tools that use AI to continuously monitor codebases, flag anomalous patterns, and cross-reference new vulnerabilities against your specific stack are moving from “nice to have” to baseline practice.
The Patch Lag Problem Gets Worse
One of the consistent findings in enterprise security is that organizations are slow to patch known vulnerabilities. The average time between a CVE being published and systems being patched is often measured in months, not days.
If AI-assisted research accelerates the rate at which zero-days are discovered and then weaponized, that patch lag window becomes even more dangerous. Security operations teams need to think seriously about how to accelerate their patch deployment cycles.
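The patch lag itself is straightforward to measure from your own records. A minimal sketch, with entirely made-up dates, of the metric the paragraphs above describe:

```python
from datetime import date

# Illustrative only: computing mean patch lag from (published, patched)
# date pairs. The example history below is invented.

def patch_lag_days(records):
    """Mean days between CVE publication and patch deployment."""
    lags = [(patched - published).days for published, patched in records]
    return sum(lags) / len(lags)

history = [
    (date(2024, 1, 3),  date(2024, 3, 1)),   # 58 days
    (date(2024, 2, 10), date(2024, 2, 17)),  # 7 days
    (date(2024, 4, 1),  date(2024, 7, 15)),  # 105 days
]
```

Tracked over time, a number like this makes "accelerate the patch cycle" a measurable goal rather than an aspiration.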
AI Safety and the Bigger Picture
Claude Mythos is one data point in a much larger conversation about what it means to deploy capable AI systems responsibly.
Why Anthropic Published This
Anthropic didn’t have to disclose what Claude Mythos found. The commercial incentive to quietly patch and move on would be understandable. The fact that they published findings and engaged with the security community reflects a bet that transparency about AI capabilities — even uncomfortable ones — is more valuable than obscuring them.
Understanding what capable AI systems can do is necessary for building appropriate guardrails. A safety argument that doesn’t account for what the system is actually capable of is not a useful safety argument.
The Frontier Model Safety Debate
Claude Mythos feeds into a broader debate about whether advanced AI models should have their most powerful capabilities restricted, watermarked, or controlled. Security applications are a clear case where the same capability serves both beneficial and harmful ends, and where the harmful use cases are concrete and serious.
This is part of why AI safety research at labs like Anthropic focuses so heavily on understanding model capabilities before deployment, not after.
What Responsible AI in Security Looks Like
The emerging consensus in responsible AI security research involves a few principles:
- Capability transparency — Disclose what AI systems can do in security contexts, even when the implications are uncomfortable
- Coordinated disclosure — Follow established responsible disclosure norms when AI systems find real vulnerabilities
- Access controls — Limit access to the most powerful security-relevant capabilities to vetted researchers and organizations
- Red-teaming — Proactively test AI systems for misuse potential before deployment
None of these are complete answers. But they represent the beginning of a framework for managing dual-use AI security capabilities.
How MindStudio Fits Into AI Security Workflows
Most organizations won’t be running projects like Claude Mythos directly. But the broader principle — using AI to automate complex analysis tasks — is immediately applicable to security operations work.
MindStudio lets security and IT teams build AI-powered workflows without writing code. You can connect Claude (or any of 200+ other models) to your existing tools and set up agents that run on schedules, respond to triggers, or process data from integrations like Slack, email, or internal ticketing systems.
For security operations specifically, this means practical applications like:
- Automated vulnerability triage agents that pull new CVEs from feeds, cross-reference them against your asset inventory, and prioritize by exposure
- Incident summary agents that take raw log data and produce human-readable summaries for faster response
- Security awareness workflows that generate and distribute training content based on current threat intelligence
- Compliance monitoring agents that run regular checks against policy requirements and flag deviations
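The first workflow above — triaging new CVEs against an asset inventory — reduces to a small filtering-and-ranking step regardless of which platform runs it. A hedged sketch, with invented field names and weights:

```python
# Hypothetical CVE triage logic: keep only vulnerabilities affecting
# software you actually run, then rank by severity and exposure.
# The record shapes and exposure weights are invented for illustration.

def triage(cves, inventory):
    """Return CVEs affecting inventoried products, highest priority first.

    cves:      list of dicts with "id", "product", "cvss" (0-10)
    inventory: dict mapping product name -> "internet" or "internal"
    """
    exposure_weight = {"internet": 2.0, "internal": 1.0}
    relevant = []
    for cve in cves:
        product = cve["product"]
        if product in inventory:
            priority = cve["cvss"] * exposure_weight[inventory[product]]
            relevant.append({**cve, "priority": priority})
    return sorted(relevant, key=lambda c: c["priority"], reverse=True)
```

Note how exposure reorders the queue: a medium-severity bug in an internet-facing service can outrank a critical one in an internal system, and anything not in the inventory drops out entirely.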
The average build on MindStudio takes 15 minutes to an hour, and you don’t need to manage API keys or stand up infrastructure. You can try it free at mindstudio.ai.
The point isn’t that MindStudio replaces a security research capability like Mythos — it doesn’t. But for security teams that want to operationalize AI for day-to-day work without building custom tooling, it’s a practical starting point.
Frequently Asked Questions
What is Claude Mythos?
Claude Mythos is Anthropic’s security research project that uses Claude to find software vulnerabilities at scale. It applies Claude’s code reasoning capabilities to identify zero-day vulnerabilities in real-world codebases. The project found thousands of previously unknown vulnerabilities, including a 27-year-old bug in OpenBSD, and was designed in part to help Anthropic understand AI capabilities in the security domain for safety research purposes.
What is a zero-day vulnerability?
A zero-day vulnerability is a security flaw in software that is unknown to the software’s developer or maintainers — meaning the developer has had zero days to work on a fix. Because no patch exists, zero-days are particularly dangerous. Attackers who discover or purchase zero-days can exploit them before any defense is available.
How can AI find vulnerabilities that humans miss?
AI models like Claude can analyze entire codebases simultaneously, trace data flows across complex call chains, and reason about how seemingly safe code becomes dangerous in specific contexts. Unlike human reviewers, AI doesn’t get fatigued, doesn’t carry cognitive biases about how code “should” work, and can hold large volumes of context in working memory. This makes it effective at finding logic vulnerabilities and subtle flaws in old, rarely-audited code.
Is AI-assisted vulnerability research dangerous?
It’s inherently dual-use. The same capabilities that help defenders find and patch vulnerabilities can help attackers find and exploit them. This is why responsible disclosure practices, access controls on powerful security AI capabilities, and transparency about what AI systems can do are all important. Anthropic has been explicit that one goal of Claude Mythos was to understand this risk better, not just to find bugs.
What should companies do in response to AI-assisted threat discovery?
Security teams should treat legacy code as higher-risk than previously assumed, adopt AI-assisted code review and vulnerability scanning, accelerate patch deployment cycles, and invest in continuous monitoring. The window between vulnerability discovery and exploitation is shrinking as AI makes finding bugs faster and more accessible.
Does Claude help with cybersecurity tasks?
Claude can assist with a range of security-related tasks including code review, explaining vulnerability concepts, analyzing security documentation, and helping draft security policies. However, Anthropic applies safety guidelines to prevent Claude from being used to develop exploits or assist with offensive security activities targeting systems without authorization.
Key Takeaways
- Claude Mythos demonstrated that AI can find thousands of zero-day vulnerabilities at scale, including a 27-year-old bug in one of the most security-audited codebases ever written
- The core challenge is dual-use: the same capability that helps defenders is dangerous in offensive hands, particularly as it lowers the barrier to sophisticated vulnerability research
- Legacy code across enterprise and infrastructure systems carries more risk than previously assumed — the argument that “old bugs would have been found” no longer holds the way it once did
- Anthropic’s transparency about Mythos findings reflects a broader principle: safety arguments need to be grounded in what AI systems can actually do
- For security teams, the practical response involves adopting AI-assisted defense tools, tightening patch cycles, and treating AI-accelerated threat discovery as the new baseline — not a future concern
- Platforms like MindStudio make it practical for teams to build AI-powered security workflows without writing custom code, connecting models like Claude to existing tools and processes
The discovery of a 27-year-old vulnerability isn’t just a technical milestone — it’s a signal that the security landscape is changing faster than most organizations are prepared for.