MCP Security Goes Architectural: Prompts Don't Protect
Prompts Don't Protect: Architectural Enforcement via MCP Proxy for LLM Tool Access Control
Prompt-based MCP access control is fundamentally broken. The industry is moving toward architectural enforcement at the protocol level.
Read this first.
- LLMs ignore prompt-based tool restrictions in adversarial scenarios
- MCP Proxy approach: intercept tool calls at the protocol layer, enforce before the LLM sees them
- ADR system provides observability, detection, and response for enterprise MCP deployments
- Directly relevant to MCP server developers and agent platform builders
Where this changes the map.
Should prepare for proxy-based access control as a standard layer
ADR pattern may become a required component for enterprise deployments
Translated text.
Two papers from the past week mark an inflection point in MCP security thinking.
The Problem: Prompts Are Not Security
Rohith Uppala’s paper “Prompts Don’t Protect” demonstrates a critical finding: when unauthorized tools are visible in an agent’s context window, LLMs will select them in adversarial scenarios — even when explicitly instructed otherwise in the system prompt. The prompt is a suggestion, not a security boundary.
The Solution: Protocol-Layer Enforcement
The proposed MCP Proxy sits between the agent and MCP servers, enforcing access control before tool calls reach the LLM. This is an architectural shift — security moves from the prompt layer (soft, unreliable) to the protocol layer (hard, enforceable).
Enterprise Adoption
A separate paper from the ADR team describes the first large-scale, production-proven enterprise framework for securing AI agents operating through MCP. They identify three challenges: limited observability (existing EDR tools don’t understand MCP), protocol-level attack surfaces, and the need for runtime detection rather than static analysis.
What This Means for agentk.it Users
Every MCP server indexed in this directory is a potential tool an agent can invoke. Understanding that prompt-based restrictions are insufficient helps you evaluate which MCP servers to trust and how to deploy them securely.
Source: arXiv:2605.18414 (Prompts Don’t Protect) and arXiv:2605.17380 (ADR System)
Follow-up signals.
- Whether Anthropic or OpenAI adopt MCP Proxy patterns in their official MCP implementations
- Open-source MCP Proxy implementations appearing on GitHub
Trace the origin.
- Original title
- Prompts Don't Protect: Architectural Enforcement via MCP Proxy for LLM Tool Access Control
- Source
- arXiv
- Author
- Rohith Uppala
- Original date
- 2026-05-18
- Permission
- open_license
- Published
- 2026-05-19
- Source URL
- https://arxiv.org/abs/2605.18414