Did Anthropic release Claude Opus 4.7 with cybersecurity safeguards?

Yes. On April 16, 2026, Anthropic launched Claude Opus 4.7 with safeguards that automatically detect and block requests indicating prohibited or high-risk cybersecurity uses. The release was tied to Project Glasswing and accompanied by a new Cyber Verification Program for legitimate security researchers.

How is Sunglasses different from Project Glasswing?

Project Glasswing is an enterprise coalition focused on provider-side cyber defense and vulnerability discovery in critical software. Sunglasses is the open, local, runtime-layer AI agent security scanner for individual builders, small teams, and security researchers who need defensive tooling without an enterprise procurement cycle. The two operate at different layers and are complementary, not competitive.

What is the Cyber Verification Program?

The Cyber Verification Program is Anthropic's review process for legitimate security researchers, penetration testers, and red teams who need fuller access to Claude for defensive cybersecurity work. Approved users can run vulnerability research and red-team workflows without being blocked by Opus 4.7's automatic cybersecurity safeguards.

Do open-source AI agent security tools still matter now that Anthropic ships built-in safeguards?

Yes. Provider-side safeguards refuse high-risk requests at the model layer, but they do not eliminate AI prompt injection hidden in documents, malicious tool instructions, poisoned MCP descriptions, credential exfiltration in normal-looking content, or unsafe action chains across mixed-model agent systems. The runtime layer still needs its own defense. Open-source tools make that practical for everyone outside enterprise procurement cycles.

Opus 4.7 Just Made AI Agent Security Mainstream

Q: What is Project Glasswing?

Project Glasswing is Anthropic's defensive cybersecurity coalition. Launch partners include Amazon Web Services, Anthropic, Apple, Broadcom, Cisco, CrowdStrike, Google, JPMorganChase, the Linux Foundation, Microsoft, NVIDIA, and Palo Alto Networks. The initiative explores using frontier AI models like Claude Mythos Preview to find and remediate vulnerabilities in critical software.

sunglasses://blog/opus-4-7-mainstream-ai-agent-security

Today is Day 1. Anthropic launched Claude Opus 4.7, shipped automatic Opus 4.7 cybersecurity safeguards, tied the release directly to Project Glasswing, and opened the new Cyber Verification Program for legitimate security researchers who need fuller access for defensive work. That is a real shift — and exactly why the open-source side of AI agent security matters more, not less.

For months, AI agent security could still be treated like a niche concern: a red-team topic, a prompt-injection edge case, a problem for frontier labs and Fortune 100 security teams. Today Anthropic made it public product policy. Their own words are unambiguous: Opus 4.7 ships with safeguards that "automatically detect and block requests that indicate prohibited or high-risk cybersecurity uses."

That means the market has changed. AI agent security is no longer a future category. It is a live operating constraint.

FIG.01 · Analysis

What Anthropic changed today

sunglasses://blog/opus-4-7-mainstream-ai-agent-security

Context

Anthropic's launch is important for three reasons.

The point

First, Opus 4.7 is not just a model update. It is their first public proving ground for cyber controls after the Claude Mythos Preview and Project Glasswing announcements. Anthropic explicitly said it would keep Mythos limited, test new cyber safeguards on a less capable model first, and use what it learns to work toward safer deployment of Mythos-class systems.

Detail

Second, they introduced a formal split between general access and verified security work. If you are blocked while doing legitimate vulnerability research, penetration testing, or red-teaming, Anthropic now wants you in a review process: the Cyber Verification Program.

In practice

Third, they pushed cybersecurity into the center of the AI product conversation. Not safety in the abstract. Not policy PDF language. Actual product behavior, with actual blocked requests.

Why it matters

That is a big deal.

FIG.02 · Coverage

Project Glasswing is the enterprise move. Sunglasses is the open runtime layer.

sunglasses://blog/opus-4-7-mainstream-ai-agent-security

The wedge

Project Glasswing is Anthropic's defensive coalition play. Their launch group includes 12 major partners and institutions: Amazon Web Services, Anthropic, Apple, Broadcom, Cisco, CrowdStrike, Google, JPMorganChase, the Linux Foundation, Microsoft, NVIDIA, and Palo Alto Networks.

What we look for

That is the top of the market.

The question

Sunglasses is not that.

House sentence

We are the open, local, runtime-side layer for everyone else: individual builders, security researchers, small teams, indie operators, and anyone shipping agents without an enterprise procurement cycle.

The open-source side is not hypothetical

sunglasses://blog/opus-4-7-mainstream-ai-agent-security

Context

We are not writing from theory here.

The point

Sunglasses v0.2.14 just shipped to PyPI today — timed with the Opus 4.7 launch — adding 5 new attack categories specifically relevant to the agent stack: tool metadata smuggling, memory eviction-rehydration chains, multi-stage encoding, tool output poisoning, and provenance chain fracture. Our detection layer now spans 253 patterns, 1,475 keywords, and 40 attack categories. We have already published real public artifacts, including our Axios RAT detection report, alongside broader runtime security work around prompt injection, tool poisoning, MCP abuse, and agent trust-boundary failures.

Detail

That is the lane.

In practice

Not "we built the most powerful cyber model."

Why it matters

Not "we can out-offense the labs."

Bottom line

Our job is to make defensive AI agent security practical, inspectable, and available to the people who are actually wiring these systems together in the wild.

FIG.04 · Analysis

The XBOW angle

sunglasses://blog/opus-4-7-mainstream-ai-agent-security

Context

XBOW published an offensive Opus 4.7 workflow today. That is newsworthy, and honestly, it reinforces the same point from a different direction: frontier models are now operational enough that security workflows around them are becoming real, fast.

The point

Our position is different.

Detail

Sunglasses is the defensive runtime layer for everyone who is not trying to build the most aggressive offensive workflow possible. We care about what happens before an agent acts: what it read, what it trusted, what hidden instructions got through, what tool descriptions were poisoned, what policies were bypassed, and what evidence you have afterward.

In practice

That is not a knock on offensive research. It is a distinction in operating model.

FIG.05 · Analysis

We filed for the Cyber Verification Program on Day 1

sunglasses://blog/opus-4-7-mainstream-ai-agent-security

Context

We did not wait around for the discourse to settle.

The point

We filed for Anthropic's Cyber Verification Program today because this is exactly the kind of transition Sunglasses was built for: provider safeguards getting stricter, legitimate defensive work needing clearer boundaries, and security teams needing evidence about where model-side controls stop and runtime-side controls still matter.

Detail

If we get in, we will use that access the same way we approach everything else: bounded, defensive, public, and reviewable.

In practice

We want to know:

Signals

what Opus 4.7 cybersecurity safeguards block,
what they partially allow,
what they over-block,
and what still needs to be handled by independent runtime security.

Why it matters

That is useful to researchers. Useful to builders. Useful to Anthropic too.

FIG.06 · Analysis

Day 1 reality

sunglasses://blog/opus-4-7-mainstream-ai-agent-security

Context

Here is the honest version.

The point

Anthropic just made "Opus 4.7 cybersecurity safeguards" a live search term. Project Glasswing is now a reference point. Claude Mythos Preview is the backdrop for the whole story. The Cyber Verification Program is now part of the real workflow for legitimate security research on Claude. And all of that is happening while the broader market is still underestimating how much AI prompt injection and trust-boundary abuse happen outside the model itself.

Detail

That gap is where we live.

In practice

If you are a big enterprise in a closed partner program, Glasswing is your story.

Why it matters

If you are everyone else trying to secure real agents now, the open-source side still needs to exist.

Bottom line

That is Sunglasses.

FIG.07 · Market signal

Why this matters for AI prompt injection and AI agent security

sunglasses://blog/opus-4-7-mainstream-ai-agent-security

Market signal

Most agent failures do not start with a user typing "do something evil." They start with trust mistakes: a poisoned README, a malicious retrieval chunk, an unsafe MCP description, a hidden instruction in a PDF, or a "normal" page that quietly rewrites the agent's priorities.

The shift

That is why AI prompt injection and AI agent security need their own defensive layer even in a world where labs are adding stronger model-side safeguards.

Evidence

Provider safeguards matter.

Why now

Runtime security still matters.

The stakes

Both are now part of the stack.

FIG.08 · First controls

What to do next

sunglasses://blog/opus-4-7-mainstream-ai-agent-security

First sentence

If you are building with agents, do both:

Signals

use the strongest provider-side safeguards you can get,
and add your own runtime-layer defenses.

The controls

Do not assume one replaces the other.

What to do

Install Sunglasses:

bash

pip install sunglasses

Bottom line

Source code: github.com/sunglasses-dev/sunglasses

In practice

If this thesis resonates, star the repo. That is the fastest way to help us keep shipping public defensive work while the wave is hot.

First sentence

Because today Anthropic made AI agent security mainstream.

The controls

The open-source side needs to move just as fast.

FIG.10 · Coverage

More from Sunglasses

sunglasses://blog/opus-4-7-mainstream-ai-agent-security

Runtime Governance Is Not Enough for AI Agent Security

Why runtime policy gates are necessary but insufficient — and where attackers actually get in.

Beyond AI Guardrails: Why Prompt Filtering Alone Won't Secure Your Agents

Lakera, Rebuff, NeMo Guardrails — what they cover, what they don't, and the architecture your agents actually need.

MCP Tool Poisoning: How Malicious Tool Descriptions Hijack AI Agents

The attack hidden in tool metadata — invisible to humans, high-visibility to the model.

AXIOS RAT — Real Malware Detection Report

Sunglasses scanned the BlueNoroff/Lazarus Axios RAT campaign. 3 threats detected in 3.67ms. Full report.

How Sunglasses Works

The detection architecture: 269 patterns, 1,491 keywords, 48 categories.

Dear World: We Switched to MIT. Here's Why.

The license change behind the open-source-first strategy.

Opus 4.7 Just Made AI Agent Security Mainstream — Here's the Open-Source Side

What Anthropic changed today

Project Glasswing is the enterprise move. Sunglasses is the open runtime layer.

The open-source side is not hypothetical

The XBOW angle

We filed for the Cyber Verification Program on Day 1

Day 1 reality

Why this matters for AI prompt injection and AI agent security

What to do next

More from Sunglasses

Frequently Asked Questions

Did Anthropic release Claude Opus 4.7 with cybersecurity safeguards?

What is Project Glasswing?

How is Sunglasses different from Project Glasswing?

What is the Cyber Verification Program?

Do open-source AI agent security tools still matter now that Anthropic ships built-in safeguards?

Scan what the agent sees, before it acts