Researchers are increasingly seeking to model built environments not simply as collections of physical particles, but as spaces experienced by cognitive agents. Javier Argota Sánchez-Vaquerizo from ETH Zürich, Luis Borunda Monsivais from Virginia Tech, and et al. present a novel approach, termed Agentic Environmental Simulations, which utilises large generative models to predict environmental states based on semantic expectation. This work is significant because it proposes a shift towards Episodic Spatial Reasoning, advancing simulations through surprisal-triggered events and formalising ‘Cognitive Friction’ to reveal ambiguities in spatial design , termed “Phantom Affordances”. Ultimately, the team challenges conventional human-computer interaction paradigms, advocating for environments to be treated as dynamic cognitive partners within a human-centered framework for designing more intuitive and responsive simulations.
Ultimately, the team challenges conventional human-computer interaction paradigms, advocating for environments to be treated as dynamic cognitive partners within a human-centered framework for designing more intuitive and responsive simulations.
Predictive Spatial Simulations with Large Language Models offer
Scientists have demonstrated a paradigm shift in architectural simulations, moving beyond traditional physics-based models to embrace agentic environmental simulations driven by large generative models. This breakthrough research introduces a method where simulations actively predict the next state of spatial environments based on semantic expectation, rather than deterministic calculations. The team achieved this by integrating Large Language Models into spatial pipelines, enabling the simulation of cognitive processes alongside physical interactions, allowing for the prediction of complex phenomena such as crowd behaviour influenced by ambiguous signage. By formalising Cognitive Friction, the work establishes a quantifiable metric to identify areas where environmental signals are unclear or misleading to users, allowing for the creation of Cognitive Simulations that quantify qualitative experience previously limited to post-occupancy evaluations. Experiments show the development of a pipeline decoupling exhaustive geometric surveying from experiential reality, employing a dual-process cognitive architecture featuring a low-compute “Heuristic Autopilot” for routine tasks and a high-compute “Episodic Reasoning” module activated during critical events, such as navigating spatial junctions or encountering ambiguous signage. This decoupling optimises computational resources and enables the generation of heatmaps representing affordance, perception, and emotional loads, offering insights into how individuals with diverse neurocognitive profiles perceive space.
The research produces a multimodal narrative log of “Moments of Disorientation” and “Semiotic-Cognitive Misalignment”, identifying potential cognitive hazards within spatial design. Furthermore, the study challenges conventional HCI paradigms by treating environments as dynamic cognitive partners, proposing a framework for cognitive orchestration that prioritises the preservation of autonomy, affective clarity, and cognitive integrity within AI-driven simulations. Crucially, scientists reframe generative AI “hallucinations” not as errors, but as valuable design heuristics, indicating areas of architectural semiotic ambiguity where environmental signals fail to clearly communicate affordances. The research team engineered a system where large generative models actively predict the next state of spatial environments, grounded in semantic expectation rather than chronological time-steps. Researchers implemented a dual-process cognitive architecture, decoupling routine locomotion from critical episodes requiring high-compute multimodal Large Language Model modules.
Experiments employ a Heuristic Autopilot, a low-compute background layer handling standard movement via physics-based heuristics, while Episodic Reasoning, powered by LLMs, activates only during significant events like spatial junctions or semiotic ambiguities. This system delivers a pipeline that mirrors dual-process cognition, optimising computational resource allocation by triggering high-compute modules when the Heuristic Autopilot exceeds a defined surprisal threshold. This innovative approach enables the quantification of qualitative experience, previously limited to post-occupancy evaluations, generating heatmaps of affordance, perception, and emotional loads such as stress, risk, and comfort. The study pioneered a method for formalising Cognitive Friction, allowing the revelation of “Phantom Affordances”, or semiotic ambiguities within built spaces.
Scientists harnessed LLMs to proxy human affective transitions, providing rich contextual narratives to simulate how neurodivergent individuals or specific demographic profiles perceive space differently. This resulted in a multimodal narrative log of “Moments of Disorientation” and “Semiotic-Cognitive Misalignment”, distinguishing between beneficial cognitive friction and hazardous disorientation. The research team formalised Cognitive Friction (Cf) as a metric to reveal “Phantom Affordances”, representing semiotic ambiguities within built spaces. Experiments demonstrate that by quantifying the divergence between an agent’s expectation (Egen) and physical reality (Rphys), researchers can generate Cognitive Friction Heatmaps, pinpointing zones of ambiguity where legibility of intent is unclear. The core of this work lies in operationalising hallucination as a heuristic, defining Cognitive Friction (Cf) as a measure of Semiotic Divergence, calculated as Cf = 1 − sim(Egen, Rphys), where ‘sim’ represents cosine similarity within a shared multimodal embedding space.
A high Cf value signifies a “Phantom Affordance”, indicating a strong architectural signal that the physical environment fails to fulfil. Mapping these points of divergence creates Cognitive Friction Heatmaps, diagnosing semiotic ambiguity and treating hallucinations not as errors, but as Latent Design Requirements. This allows for a shift in design strategy, aligning environmental cues like lighting and texture with the agent’s expectations. Tests prove that the simulation operates on a physics autopilot until prediction error exceeds a defined threshold (τ), triggering the generative model to produce an expectation.
The semantic divergence between this expectation and ground truth then quantifies the Cognitive Friction, revealing potential design flaws. Researchers posit that this framework moves beyond optimisation, advocating for “Cognitive Orchestration”, a human-centered approach ensuring transparency and preserving autonomy in adaptive environments. Design principles include legible Cf heatmaps for non-experts, opt-out mechanisms for manual overrides, demographic equity in training data, and auditable simulation parameters. Furthermore, the study outlines three key contributions: a shift to episodic spatial reasoning mirroring human memory, a formalisation of hallucinations as a cognitive friction metric, and a framework for ethical AI-driven environmental simulation. Current implementations, however, assume Western spatial semiotics, necessitating diversified training data for cross-cultural applicability. This new framework treats occupants as cognitive agents, actively predicting environmental states based on semantic expectation rather than deterministic particle behaviour. By employing episodic spatial reasoning, aligned with human memory structures, the research enables prediction of experiential qualities previously accessible only through post-occupancy evaluation. Central to this work is the formalisation of ‘Cognitive Friction’ and the identification of ‘Phantom Affordances’, representing semiotic ambiguities within built spaces.
Researchers propose that generative errors, or ‘hallucinations’ within the simulations, can function as diagnostic tools, revealing mismatches between environmental design and cognitive intent. Furthermore, the authors suggest a shift from using geometry as a fixed constraint to viewing it as a self-fulfilling prophecy aligned with cognitive needs, potentially enabling generative design processes driven by narrative prompts. Acknowledging the ethical implications of this technology, the authors emphasise the importance of prioritising cognitive integrity over behavioural optimisation. Their principle of cognitive orchestration aims to ensure that AI-driven spatial design enhances, rather than undermines, human autonomy, affective clarity, and cognitive sovereignty. Future research should focus on developing ethical frameworks to guide the application of these predictive cognitive modelling techniques, allowing for the creation of environments that respect the complexity of human spatial experience.
👉 More information
🗞 From Particles to Agents: Hallucination as a Metric for Cognitive Friction in Spatial Simulation
🧠 ArXiv: https://arxiv.org/abs/2601.21977
