Layered Security Review of Autonomous Agent Frameworks

Summary

As autonomous agent frameworks built on large language models (LLMs) evolve into complex, tool-integrated, and continuously operating systems, their security surface expands far beyond traditional prompt-level vulnerabilities. This survey by Xu and Chen addresses a critical gap: while individual attack surfaces (e.g., prompt injection, tool misuse) have been studied, there has been no systematic, layered review of security risks across the entire agent stack.

The authors propose a four-layer security taxonomy: (1) Context and Instruction Layer (prompt injection, jailbreaking), (2) Tool and Action Layer (tool misuse, parameter manipulation), (3) State and Persistence Layer (memory poisoning, state corruption), and (4) Ecosystem and Automation Layer (multi-agent collusion, automation chain abuse). Using OpenClaw as a case study, they demonstrate how attacks can propagate across layers—from a manipulated instruction to unsafe tool calls, persistent state contamination, and automated ecosystem-level impact.

Key Contributions

First layered security taxonomy for autonomous agent frameworks, organizing risks into four functional layers.
Cross-layer threat propagation analysis, showing how a single vulnerability can cascade across the entire agent stack.
OpenClaw case study providing concrete examples of attack and defense patterns in a real-world agent framework.
Identification of research imbalances: most work focuses on the context/instruction layer, while state/persistence and ecosystem layers remain underexplored.
Future research agenda highlighting the need for long-horizon evaluation, ecosystem trust models, and integrated defenses.

Implications

For Researchers

This taxonomy provides a structured foundation for future security research. The identified research imbalance—heavy focus on prompt-level attacks versus sparse work on state persistence and ecosystem automation—points to clear opportunities. Cross-layer threat propagation is a particularly underexplored area that could yield high-impact findings.

For Developers

Developers building agent frameworks (e.g., LangChain, AutoGPT, CrewAI) can use this layered model to audit their systems. Key action items include: implementing tool sandboxing and parameter validation (tool layer), adding state integrity checks and memory encryption (persistence layer), and designing trust models for inter-agent communication (ecosystem layer).

For Users

End users of agent tools should be aware that security risks extend beyond “bad prompts.” Agents that persist state across sessions or autonomously interact with external services introduce new attack surfaces. The paper underscores the importance of transparency and user control over agent state and automation chains.

References

https://arxiv.org/abs/2604.27464v1

Layered Security Review of Autonomous Agent Frameworks

Read this first.

Where this changes the map.

Translated text.