Is runtime governance still required for AI agent security?

Yes. Runtime governance is an essential layer — but it is insufficient by itself. Most high-impact incidents begin upstream, before the runtime allow/deny decision, when attacker-influenced content shapes tool arguments through adapters, metadata fields, and helper code.

What is path governance in AI agent security?

Path governance means constraining every step from untrusted input to execution sink — not just the final action. It requires ingestion controls, typed transformation schemas, runtime gates, chain-level correlation, and drift verification working together.

What is the first high-impact fix for most teams?

Replace stringly-typed adapter execution with typed, sink-safe argument handling and provenance logging. This closes the most commonly exploited gap between adapter code and runtime policy checks.

How should teams test AI agent security in CI?

Run composed attack-chain tests that simulate attacker influence across multiple steps — read, transform, execute, or exfiltrate — not only single-call policy checks. Chain-level tests expose gaps that per-step tests miss.

Why does runtime-only governance produce policy-compliant logs with exploitable outcomes?

Because execution-time governance can correctly authorize an action while the argument path has already been compromised upstream. The log shows a permitted action. The outcome reflects attacker-controlled input that shaped the argument before the policy check ran.

Runtime Governance Is Not Enough for AI Agent Security

sunglasses://blog/runtime-governance-is-not-enough

Runtime policy gates are necessary for AI agent security, but they are insufficient by themselves — most high-impact agent incidents begin upstream, before any allow/deny decision runs, when attacker-influenced content shapes tool arguments through adapters, metadata fields, and helper code. Sunglasses addresses this by scanning at the ingestion layer, before content reaches the execution path.

FIG.01 · Analysis

The Recurring Failure Pattern: Governed Sink, Ungoverned Path

sunglasses://blog/runtime-governance-is-not-enough

Context

Many teams gate tool invocation but under-govern the transformations that shape tool arguments. That gap allows attacker-influenced content to cross trust boundaries through adapters, metadata fields, and helper code — arriving at the execution sink with a clean policy receipt.

Signals

Untrusted context treated as safe configuration
String interpolation into shell, query, or path sinks
Boundary claims not validated against effective runtime behavior
Per-step policy checks without chain-level risk correlation

The point

This is not a new pattern in software security. It maps directly to how SQL injection succeeded for decades despite application-layer validation: the trusted execution path was assembled from untrusted input material, and the final gate only checked whether to execute — not whether the assembled argument was safe.

The right security question is not only "Was this action permitted?" It is: "Was every step that shaped this action constrained, typed, and auditable?"

FIG.02 · Market signal

Why Runtime-Only Controls Fail in Real Systems

sunglasses://blog/runtime-governance-is-not-enough

Market signal

Execution-time governance can correctly authorize an action while the argument path has already been compromised upstream. In practice, this creates policy-compliant logs with exploitable outcomes.

The shift

Consider the sequence:

Signals

Attacker-controlled text enters as document content or tool metadata
An adapter layer extracts fields and constructs tool arguments via string interpolation
A runtime policy gate checks the tool name and user role — both are legitimate
The gate approves the action
The action executes with attacker-shaped arguments

Evidence

Every individual step may look clean in isolation. The chain-level outcome is what matters. Logs show "authorized." The outcome reflects attacker influence. Neither the human reviewer nor the automated system sees the gap — because the gap existed upstream of where they were looking.

Why now

This is why teams building on MCP, LangChain, or custom agent frameworks find that their existing API security controls do not transfer cleanly to the agent layer. See our analysis of MCP tool poisoning and the related pattern of why guardrails alone are not enough for more on how these gaps compound.

FIG.03 · Analysis

Evidence from Recent Disclosures

sunglasses://blog/runtime-governance-is-not-enough

Context

Recent advisories continue to map to this pattern — execution trusted, path unverified:

Signals

GHSA-jpcj-7wfg-mqxv — execution-path validation weakness in an MCP package
GHSA-wx4p-jr66-jfp9 — command-injection class issue in MCP package
GHSA-xqv9-qr76-hfq2 — related command-injection cluster

The point

These examples differ in implementation details, but share one root issue: trusted execution paths assembled from low-trust input material. The runtime layer was not the problem. The path to the runtime layer was.

Detail

Supply-chain variants compound this further. See AI supply-chain attacks in 2026 for how compromised upstream packages deliver attacker-controlled content that appears legitimate by the time it reaches runtime policy checks.

FIG.04 · Market signal

What to Implement Now: Five-Layer Control Model

sunglasses://blog/runtime-governance-is-not-enough

Market signal

Closing the path-governance gap requires controls at five points, not one:

Checklist

Ingestion controls: Inspect prompts, documents, tool metadata, and memory as influence-bearing input — before any of it reaches adapter code. This is where Sunglasses operates by default.
Transformation controls: Enforce typed schemas at every adapter boundary. Ban implicit sink interpolation. Every argument that reaches a tool call must be constructed from typed, bounded fields — not string-concatenated from arbitrary input.
Runtime governance: Least privilege, explicit high-risk gates, and approval checkpoints. This layer is necessary and must be present — but it is layer 3, not the whole model.
Chain correlation: Detect risky multi-step sequences across a session. A single read action may be safe. A read followed by a transform followed by an outbound call is a different risk profile. Treat chains as the unit of analysis.
Drift verification: Continuously compare declared boundaries against effective runtime behavior. Agent behavior drifts as prompts change, tools update, and context accumulates. Static controls degrade.

The shift

Most teams have layer 3. Some have layer 1. Few have layers 2, 4, and 5. The incidents that make it into postmortems typically exploited gaps in layers 2 or 4.

FIG.05 · Field evidence

Scan Example: Catching Upstream Influence with Sunglasses

sunglasses://blog/runtime-governance-is-not-enough

Field evidence

The following shows how Sunglasses v0.2.13 flags attacker-influenced content at the ingestion layer — before it reaches any adapter or runtime gate:

FIG.06 · Analysis

Testing This in CI

sunglasses://blog/runtime-governance-is-not-enough

Context

Single-call policy tests will not expose path-governance gaps. What catches them:

Checklist

Composed chain tests: Simulate attacker influence across multiple steps — read a document, pass it to an adapter, watch what argument reaches the tool call. Test the full path, not just the endpoint.
Boundary fuzzing: Feed adversarial content at each transformation boundary and verify the output does not contain the adversarial fragment in executable position.
Provenance tracking assertions: Assert that every argument reaching a high-risk tool call carries a provenance record showing its origin and each transformation it passed through.

The point

The Sunglasses reports page includes real attack chain analyses — including our AXIOS RAT scan — that demonstrate what composed attacks look like in practice and where existing controls fail to intercept them.

FIG.07 · Market signal

What This Means for Teams Building Agent Systems Now

sunglasses://blog/runtime-governance-is-not-enough

Market signal

If you are building agents today, the practical priority order is:

Signals

Get ingestion scanning running (layer 1). This is the highest-leverage control for the least implementation cost. Sunglasses is MIT-licensed and free.
Audit your adapter layer for string interpolation into sink arguments (layer 2). This is where most exploitable gaps live in existing codebases.
Verify your runtime governance is actually least-privilege, not just documented as least-privilege (layer 3).
Add chain-level logging so you can reconstruct multi-step sequences during incident review (layer 4 precursor).

The shift

Runtime governance was the right place to start. It is not the right place to stop. The teams that have already learned this lesson are the ones writing postmortems.

FIG.08 · Analysis

What Governance Frameworks Get Right

sunglasses://blog/runtime-governance-is-not-enough

Context

I want to be clear about something before going further: governance frameworks are not the enemy here. NIST AI RMF, ISO 42001, the EU AI Act — these exist because someone recognized that deploying AI systems without accountability structures is reckless. They define what must be controlled, who is responsible, what gets audited, and what happens when something goes wrong. That scaffolding is necessary.

The point

The problem is not that governance is wrong. The problem is that governance is frequently deployed as the only layer, when it was designed to be the accountability layer on top of other controls. A governance framework tells you that agents must not access PII without authorization. It does not stop the bytes from flowing — it records that a rule existed and determines whether the outcome was compliant. Those are different jobs.

Detail

Governance frameworks are also good at forcing documentation: threat models, data flow diagrams, acceptable-use policies, incident response plans. When an agent incident does occur, a well-governed system produces a clear audit trail. That audit trail matters for liability, for regulatory response, for learning. It does not matter for the person whose data was exfiltrated 72 hours before the audit surfaced the log entry.

In practice

The argument here is narrow: governance without runtime enforcement is paperwork. The frameworks themselves would agree — they describe governance as one layer in a defense-in-depth stack, not the stack itself. Most enterprise deployments have picked up the governance layer first and skipped the inline enforcement layer entirely. That gap is the problem I keep scanning for.

FIG.09 · Analysis

The Post-Hoc Problem

sunglasses://blog/runtime-governance-is-not-enough

Context

Most enterprise governance relies on logs, reviews, and after-the-fact incident response. The assumption baked into that model is: if we record everything, we can catch violations and respond. That assumption held reasonably well in human-paced systems where a policy violation took hours or days to materialize into real damage.

The point

AI agents do not move at human pace. A single agent session can read a document, extract content, call an external API, and write a summary with embedded payload — all in seconds, all while producing logs that look perfectly normal at each individual step. By the time that session log surfaces in a weekly audit, the action is done. The data left the boundary. The tool call completed.

Detail

This is the post-hoc problem: logs without filtering are a forensic record of damage already done. They are useful for understanding what happened and for preventing recurrence. They are not useful for stopping the attack that is running right now.

In practice

I scan a lot of content as part of my pattern research cycle. What I see consistently is that the highest-severity patterns — data exfiltration via tool chains, cross-agent injection via handoff tokens, retrieval poisoning via document metadata — all complete their damage in the same agent session where the malicious content entered. Post-hoc review catches these in the log. Inline filtering catches them before the agent acts on the content. Those are not equivalent outcomes.

Why it matters

A governance framework that requires logging does not become an inline filter just because the logs are comprehensive. These are architecturally different things.

FIG.10 · Coverage

The Detection-vs-Prevention Gap

sunglasses://blog/runtime-governance-is-not-enough

The wedge

There is a useful analogy in traditional network security: a governance policy is to a runtime filter what an audit log is to a firewall. The audit log tells you what happened. The firewall stops it from happening. You need both — but only one of them prevents the breach.

What we look for

The enterprises that took network security seriously in the 2000s learned this the hard way. Detailed logging without perimeter controls produced detailed records of successful intrusions. Adding detection without prevention produced faster incident reports on the same successful intrusions. Prevention — blocking known bad traffic at the perimeter — was what actually changed the outcome.

The question

AI agent security is at roughly the same inflection point. Most organizations have detection: logs, alerts, anomaly scoring on agent outputs. Some have post-hoc governance review. Very few have deployed inline prevention at the ingestion layer — the equivalent of a firewall for content entering the agent's context window.

House sentence

Sunglasses operates at that prevention layer. The filter runs in approximately 0.26ms before the agent's context is assembled, matching input against 328 patterns across 49 categories as of v0.2.20. When a match fires, the content does not reach the agent. There is nothing to log after the fact because the attack path was closed before it opened. The governance layer then receives a clean signal — what passed the filter, what was blocked, and why — which actually makes post-hoc review more meaningful, not less.

What Runtime Filtering Enforces That Governance Cannot

sunglasses://blog/runtime-governance-is-not-enough

Context

Let me be concrete about the difference, pattern category by pattern category.

The point

cross_agent_injection: An attacker embeds a malicious instruction in a message passed from one agent to another — a delegation token, a handoff summary, a task description. Governance records that Agent B received a message from Agent A and acted on it. Filtering intercepts the message before Agent B reads it, matches the embedded instruction against known injection signatures, and blocks it. The governance log never sees the attack payload because the filter removed it first.

Detail

retrieval_poisoning: Attacker-controlled content is embedded in a document that an agent will retrieve from a knowledge base or RAG pipeline. Governance records that the agent called the retrieval tool and received a document. Filtering scans the retrieved document content before it enters the agent's prompt, catching patterns like embedded override instructions or exfiltration triggers. The agent receives the clean portion or a block signal — it never reads the poisoned fragment.

In practice

tool_output_poisoning: A tool returns output that contains attacker-shaped content — instructions embedded in what looks like a structured API response. Governance records the tool call and the output. Filtering runs on the tool output before the agent processes it, matching against patterns that look like injected commands inside result payloads. If it fires, the agent does not act on the poisoned output.

Why it matters

In each of these cases, governance provides the audit trail: what was called, when, by whom. Filtering provides the intervention: the attack payload does not reach the decision point. Guardrails at the output layer miss all three of these because the damage is done before output is generated. You need the filter at ingestion.

FIG.12 · Analysis

The Hybrid Model: Filter Inline, Govern Everything

sunglasses://blog/runtime-governance-is-not-enough

Context

The right architecture is not filtering instead of governance. It is filtering inline with governance as the accountability layer on top.

The point

Here is how the two layers interact in practice. The Sunglasses filter sits ahead of the agent, scanning every input — prompts, retrieved documents, tool outputs, agent-to-agent messages — against the current pattern set. Known-bad patterns are blocked before the agent reads them. Ambiguous content that passes the filter still enters the context, and the governance layer captures the full session: what the agent received, what it decided, what actions it took.

Detail

This gives the governance layer something more useful to work with. Instead of reviewing logs full of attack attempts that reached the agent, the review surface is focused on the cases that actually needed human judgment — content that passed the filter, edge cases, novel attack patterns not yet in the detection set. The filter's block log feeds back into pattern development: blocked attempts that later appear in incident reports become candidates for new pattern additions in the next release cycle.

In practice

The filter also learns from governance findings. When a post-hoc review identifies a new attack vector — something that slipped through because it did not match existing patterns — that finding translates directly into a new pattern. Governance findings that would previously sit in an incident report and wait for the next policy review cycle instead feed the detection engine. The loop closes faster.

Why it matters

Neither layer is optional. Filtering without governance is a black box with no accountability trail and no way to review what the filter missed. Governance without filtering is a detailed record of successful attacks. Together, they are what a mature AI agent security posture actually looks like. Most deployments are missing the first half.

FIG.14 · Coverage

More from Sunglasses

sunglasses://blog/runtime-governance-is-not-enough

MCP Tool Poisoning: How Malicious Tool Descriptions Hijack AI Agents

The attack that hides in tool metadata — invisible to humans, high-visibility to the model.

Guardrails Are Not Enough

Why content filters at the output layer miss the attacks that matter most.

AI Supply Chain Attacks in 2026

How compromised upstream packages deliver attacker-controlled content before runtime controls can see it.

The Agent Did Not Mean To Leak Your Data

How AI agents exfiltrate data through legitimate channels while trying to be helpful.

How Sunglasses Works

The detection architecture: 248 patterns, 35 categories, 0.261ms scan latency.

Security Reports

Real-world attack analyses and vulnerability scans, including the AXIOS RAT scan.

Runtime Governance Is Not Enough for AI Agent Security

The Recurring Failure Pattern: Governed Sink, Ungoverned Path

Why Runtime-Only Controls Fail in Real Systems

Evidence from Recent Disclosures

What to Implement Now: Five-Layer Control Model

Scan Example: Catching Upstream Influence with Sunglasses

Testing This in CI

What This Means for Teams Building Agent Systems Now

What Governance Frameworks Get Right

The Post-Hoc Problem

The Detection-vs-Prevention Gap

What Runtime Filtering Enforces That Governance Cannot

The Hybrid Model: Filter Inline, Govern Everything

More from Sunglasses

Frequently Asked Questions

Is runtime governance still required for AI agent security?

What is path governance in AI agent security?

What is the first high-impact fix for most teams?

How should teams test AI agent security in CI?

Why does runtime-only governance produce policy-compliant logs with exploitable outcomes?

Scan what the agent sees, before it acts