LLM Agents Self-Adapt Security for IoT at the Edge

Summary

The rapid adoption of IoT systems at the network edge has created an urgent need for security mechanisms that are both adaptive and resource-efficient. Traditional approaches relying on static rule sets or learned policies struggle to guarantee feasibility, conflict safety, and execution correctness under latency, energy, and computational constraints. This paper introduces ASPO (Adaptive Security Pattern Optimizer), a self-adaptive multi-agent framework that integrates Large Language Model (LLM)-based reasoning with deterministic enforcement within a MAPE-K (Monitor, Analyze, Plan, Execute - Knowledge) control loop.

ASPO’s key architectural innovation is the explicit separation of stochastic decision generation from deterministic execution. LLM agents propose candidate mitigation portfolios, while a deterministic optimization core enforces closed-world action integrity, conflict-free composition, and resource feasibility at every decision epoch. The framework was deployed on a distributed edge-gateway testbed and evaluated across two workloads comprising 500 and 1,000 runtime security decisions using replayed IoT attack traffic.

The results demonstrate invariant safety properties including 100% conflict-free activation, consistent resource feasibility across workloads, and stable pattern dominance with perfect rank preservation. Notably, deeper decision exploration reduced extreme-case execution costs, compressing tail latency and energy overheads by 21.9% and 23.1% respectively, without increasing mean energy consumption.

Key Contributions

A novel hybrid architecture that separates LLM-based stochastic decision generation from deterministic execution enforcement within a MAPE-K control loop
Empirical validation of 100% conflict-free security pattern activation across 1,500 runtime decisions in a distributed edge-gateway testbed
Demonstration that deeper LLM exploration reduces tail latency by 21.9% and energy overheads by 23.1% without increasing mean consumption
Formal guarantees of resource feasibility, action integrity, and conflict avoidance through deterministic optimization core

Implications

For Researchers

This work provides a rigorous empirical framework for studying hybrid LLM-deterministic systems in resource-constrained environments. The separation of stochastic reasoning from deterministic enforcement offers a testable hypothesis for safe multi-agent system design. Researchers can build on ASPO’s architecture to explore formal verification methods for LLM-driven decisions and extend the approach to other domains requiring adaptive, safe decision-making under constraints.

For Developers

ASPO presents a production-ready architectural pattern for deploying LLM agents in environments where safety and resource constraints are paramount. The clear separation between the LLM’s generative role and the deterministic enforcement layer provides a modular design that can be adapted to various edge computing scenarios. Developers can implement similar hybrid architectures to combine the flexibility of LLMs with the reliability of rule-based systems.

For Users

End users of IoT systems benefit from security automation that is both adaptive and provably safe. ASPO’s ability to maintain 100% conflict-free operation while reducing worst-case resource consumption means more reliable and efficient security protection without manual configuration overhead. This is particularly valuable for smart home, industrial IoT, and healthcare applications where security failures or resource exhaustion can have serious consequences.

References

https://arxiv.org/abs/2605.00741v1

LLM Agents Self-Adapt Security for IoT at the Edge

Read this first.

Where this changes the map.

Translated text.