Catmaster Achieves Faster Heterogeneous Catalysis Research Using LLM-Driven Workflows

Researchers are addressing a major bottleneck in computational heterogeneous catalysis through a novel agentic autonomous system called CatMaster. Developed by Honghao Chen, Jiangjie Qiu, and Yi Shen Tew from the Beijing Key Laboratory of Artificial Intelligence for Advanced Chemical Engineering Materials at Tsinghua University, together with Xiaonan Wang and collaborators, CatMaster is a large-language-model–driven platform that automates complex computational workflows traditionally dependent on time-consuming, iterative, and often irreproducible manual scripting. The system translates natural-language requests into complete and executable calculation workspaces, including structures, input files, outputs, and detailed run records. By substantially reducing the overhead associated with workflow management, CatMaster enables researchers to focus on chemical interpretation and modelling decisions rather than the technical intricacies of setup and execution, thereby accelerating materials discovery and advancing the understanding of catalytic processes.

Although powerful libraries such as ASE and pymatgen, along with workflow managers for job orchestration, already exist, a significant operational gap remains in their day-to-day use. In practice, researchers frequently rely on manual intervention to chain tools together, convert data formats, and track intermediate structures. This manual oversight involves numerous micro-decisions, such as managing calculation files, tuning convergence parameters, diagnosing failures, and selecting candidates for higher-fidelity validation, all of which slow down iteration and reduce reproducibility. Even seemingly simple tasks, such as screening candidates for reactions like the hydrogen evolution reaction, can require extensive manual effort to execute calculations and extract chemically meaningful results, such as final energies or bond lengths.

To evaluate CatMaster’s capabilities, the authors selected the surface chemistry of body-centered cubic (BCC) iron as a representative test case, focusing on two key aspects: accuracy against established benchmarks and strict adherence to user-defined constraints in a multi-stage, end-to-end workflow. As a baseline test of physical correctness, CatMaster was instructed to compute surface energies for Fe(110), Fe(100), and Fe(111) using a standard protocol involving symmetric slabs, the PBE functional, and no dispersion corrections. The agent autonomously retrieved the appropriate bulk reference structure (mp-13), performed lattice relaxation, and constructed the required symmetric slab models. The resulting surface energies—γ₁₁₀ = 2.48, γ₁₀₀ = 2.56, and γ₁₁₁ = 2.76 J m⁻²—closely match literature values, with relative errors within 3%. This close agreement confirms that CatMaster can independently instantiate and execute standard thermodynamic workflows with high accuracy, validating its ability to handle geometry construction, parameter selection, and execution in a fully autonomous manner.

LLM-driven Workspaces for Catalysis Computation offer accelerated discovery

The research team engineered a hierarchical architecture comprising a Planner, an Executor, and a Summarizer, all anchored by a persistent whiteboard record, to manage complex catalytic computations. The Executor then carries out these tasks using schema-validated tools, while the Summarizer meticulously documents the process and results on the whiteboard, providing a clear audit trail. All demonstrations within the study were performed utilising GPT-5.2-thinking-medium or GPT-5.2-thinking-high, selected based on the complexity of each specific case. This tiered approach allows for efficient screening of a large chemical space, followed by rigorous verification of promising candidates with more computationally demanding methods. This allows for automatic human-in-the-loop intervention, enabling users to modify the task planning chain as needed. The system delivers a file-centric execution contract, persisting the authoritative project state in the file system rather than the LLM’s context window, ensuring resilience, inspectability, and reproducibility of results as Supporting Information.

CatMaster streamlines catalysis via LLM agents

This innovative approach addresses critical limitations in current workflows, which are often costly, iterative, and sensitive to setup choices. Experiments revealed that CatMaster effectively manages the complexities of catalysis research by treating the calculation workspace as the primary scientific record. The team demonstrated the system’s capabilities through four progressively complex demonstrations, beginning with an O2 spin-state check executed remotely. Measurements confirm that CatMaster’s multi-fidelity tool library enables rapid surrogate relaxations alongside accurate DFT calculations when needed.

Results demonstrate that the system’s hierarchical agent design, coupled with persistent whiteboard memory, supports long-horizon workflows and deferred resolution of intermediate outcomes. The team achieved a file-centric execution contract, transforming natural-language protocols into restartable and inspectable workspaces, ensuring a complete evidence trail. Scientists recorded that the workspace serves as an execution contract, with each task emitting a minimal evidence package including input decks, output logs, and key scalar summaries, facilitating downstream analysis and revision. This approach promises to accelerate discovery and enhance reproducibility in the field of computational catalysis.

CatMaster streamlines catalysis via LLM agents

Demonstrations across diverse workflows, including spin-state checks, surface thermodynamics, adsorption exploration, and high-throughput alloy screening, highlight its versatility and effectiveness. The authors acknowledge limitations in handling complex calculation failures, such as SCF non-convergence, and suggest developing specialised “debugger agents” as a future direction. Further research could focus on expanding the tool library and enhancing the agent’s ability to autonomously resolve intricate computational issues, ultimately enabling fully unattended operation.

👉 More information
🗞 CatMaster: An Agentic Autonomous System for Computational Heterogeneous Catalysis Research
🧠 ArXiv: https://arxiv.org/abs/2601.13508

Rohail T.

Rohail T.

As a quantum scientist exploring the frontiers of physics and technology. My work focuses on uncovering how quantum mechanics, computing, and emerging technologies are transforming our understanding of reality. I share research-driven insights that make complex ideas in quantum science clear, engaging, and relevant to the modern world.

Latest Posts by Rohail T.:

Stripe Modelling Achieves 13% Carbon Retention in Diii-D Tokamak PMI Simulations

Stripe Modelling Achieves 13% Carbon Retention in Diii-D Tokamak PMI Simulations

January 23, 2026
Cai4sg Advances Social Good: Examining Roles, Challenges and Emerging Trends

Cai4sg Advances Social Good: Examining Roles, Challenges and Emerging Trends

January 23, 2026
Llms Achieve 206% Improvement with Codified Expert Knowledge for AI Agents

Llms Achieve 206% Improvement with Codified Expert Knowledge for AI Agents

January 23, 2026