ColPackAgent: MCP-Powered AI for Colloidal Packing Simulations

Summary

ColPackAgent introduces a novel approach to scientific computing by combining a Model Context Protocol (MCP) tool server with a portable agent skill to automate colloidal packing simulations. The core innovation is the colpack Python package, which wraps HOOMD-blue’s hard-particle Monte Carlo engine, and an MCP server that exposes its functions as tools. The agent skill encodes a four-stage workflow contract (setup, planning, execution, analysis) that guides the LLM through the simulation process.

The paper demonstrates the system in three modes: interactive (with human feedback), autonomous (from an end-to-end prompt), and autoresearch (following a program file). Examples include 3D cube particles, a binary 2D system of disks and capsules, and the 2D hard-disk freezing transition. A key contribution is the benchmark of 17 LLMs on 17 stage-specific prompts, providing a granular view of model reliability for scientific workflows.

Key Contributions

MCP-based tool server for scientific simulation: Exposes the colpack Python package (wrapping HOOMD-blue) as callable tools, enabling any MCP-compatible agent to run Monte Carlo simulations.
Portable agent skill for structured workflows: Encodes a four-stage workflow contract (setup, planning, execution, analysis) that transforms LLMs from workflow describers to reliable executors.
Multi-mode operation: Supports interactive, autonomous, and autoresearch modes, demonstrating flexibility for different research scenarios.
Comprehensive LLM benchmark: Evaluates 17 LLMs on 17 stage-specific prompts, providing a stage-level reliability check for scientific workflow following.
Open-source implementation: The colpack package and MCP server are available, enabling reproducibility and extension by the community.

Implications

For Researchers

ColPackAgent lowers the barrier to running complex Monte Carlo simulations. Researchers can now describe a simulation in natural language and have it executed autonomously, rather than writing and debugging Python scripts. The autoresearch mode is particularly powerful for high-throughput screening of colloidal systems.

For Developers

This paper provides a clear architectural pattern for building scientific agents: wrap a domain-specific Python package as an MCP tool server, then pair it with a portable agent skill that encodes the workflow. Developers can reuse this pattern for other simulation engines (e.g., LAMMPS, GROMACS) or data analysis pipelines.

For Users

End users—including students, technicians, and scientists without deep programming expertise—can now interact with advanced simulation software through a chat interface. The system’s ability to handle human feedback in interactive mode also makes it suitable for educational settings and exploratory research.

References

https://arxiv.org/abs/2605.15625v1

ColPackAgent: MCP-Powered AI for Colloidal Packing Simulations

Read this first.

Where this changes the map.

Translated text.