SimuRA: Towards General Goal-Oriented Agent via Simulative Reasoning Architecture with LLM-Based World Model

AI-generated keywords: AI agents

AI-generated Key Points

  • AI agents built on large language models (LLMs) have shown great promise
  • Current approaches often focus on a one-task-one-agent model lacking scalability and generality
  • Humans are general problem-solvers who can reason and plan across diverse environments by simulating outcomes
  • Introduction of SimuRA (Simulative Reasoning Architecture), a goal-oriented framework for generalized agentic reasoning
  • SimuRA leverages a world model for planning through simulation, overcoming constraints of autoregressive LLMs
  • Experiments show success rate improvement in flight searches using SimuRA's world-model-based planning
  • SimuRA architecture includes policy module, world model, and critic module for action selection based on goals and outcomes evaluation
  • Natural language used as a compact representation for simulation in SimuRA ensures robustness and adaptability across tasks
  • SimuRA available as an open-source library through LLM Reasoners with REASONERAGENT-WEB serving as research preview
  • Ongoing efforts to expand the system to tackle broader challenges and showcase versatility across different task domains
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Mingkai Deng, Jinyu Hou, Yilin Shen, Hongxia Jin, Graham Neubig, Zhiting Hu, Eric Xing

License: CC BY-NC-SA 4.0

Abstract: AI agents built on large language models (LLMs) hold enormous promise, but current practice focuses on a one-task-one-agent approach, which not only falls short of scalability and generality, but also suffers from the fundamental limitations of autoregressive LLMs. On the other hand, humans are general agents who reason by mentally simulating the outcomes of their actions and plans. Moving towards a more general and powerful AI agent, we introduce SimuRA, a goal-oriented architecture for generalized agentic reasoning. Based on a principled formulation of optimal agent in any environment, \modelname overcomes the limitations of autoregressive reasoning by introducing a world model for planning via simulation. The generalized world model is implemented using LLM, which can flexibly plan in a wide range of environments using the concept-rich latent space of natural language. Experiments on difficult web browsing tasks show that \modelname improves the success of flight search from 0\% to 32.2\%. World-model-based planning, in particular, shows consistent advantage of up to 124\% over autoregressive planning, demonstrating the advantage of world model simulation as a reasoning paradigm. We are excited about the possibility for training a single, general agent model based on LLMs that can act superintelligently in all environments. To start, we make SimuRA, a web-browsing agent built on \modelname with pretrained LLMs, available as a research demo for public testing.

Submitted to arXiv on 31 Jul. 2025

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2507.23773v1

AI agents built on large language models (LLMs) have shown great promise, but current approaches often focus on a one-task-one-agent model that lacks scalability and generality. These agents also face limitations inherent in autoregressive reasoning. In contrast, humans are general problem-solvers who can reason and plan across diverse environments by simulating outcomes and planning accordingly. To address these challenges, we introduce SimuRA (Simulative Reasoning Architecture), a goal-oriented framework for generalized agentic reasoning. By leveraging a world model for planning through simulation, SimuRA overcomes the constraints of autoregressive LLMs. This world model is implemented using LLMs, allowing for flexible planning in various environments using the rich latent space of natural language. Experiments conducted on challenging web browsing tasks demonstrate the effectiveness of SimuRA. The success rate of flight searches improved from 0% to 32.2%, with world-model-based planning consistently outperforming autoregressive planning by up to 124%. This highlights the advantage of simulation-based reasoning as a paradigm for AI agents. The architecture of SimuRA involves a policy module that proposes potential actions based on goals, a world model that simulates outcomes, and a critic module that evaluates these outcomes to select the best action. By utilizing natural language as a compact representation for simulation, SimuRA ensures robustness and adaptability across tasks. We have made SimuRA available as an open-source library through LLM Reasoners, with the web agent REASONERAGENT-WEB serving as a research preview. Ongoing efforts are focused on expanding the system to tackle broader challenges and showcase its versatility across different task domains. Overall, our results demonstrate that SimuRA offers significant improvements over baseline approaches in complex website navigation tasks. The architecture's ability to reason through simulation shows promise for developing more general and powerful AI agents capable of superintelligent performance across diverse environments.
Created on 26 Aug. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.