Memp: Exploring Agent Procedural Memory

AI-generated keywords: Procedural Memory

AI-generated Key Points

⚠The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

Authors address limitations of Large Language Models (LLMs) in handling procedural memory
Proposal of Memp as a solution to provide agents with learnable, updatable, and lifelong procedural memory
Memp distills past agent trajectories into detailed step-by-step instructions and higher-level script-like abstractions
Strategies for Build, Retrieval, and Update processes of procedural memory discussed to enhance effectiveness
Continuous updating and correction of outdated information in sync with new experiences allows Memp to evolve over time
Empirical evaluations on TravelPlanner and ALFWorld show higher success rates and improved efficiency with refined memory repository
Transferring procedural memory from a robust model to a weaker one leads to significant performance gains
Importance of adaptive and evolving procedural memory mechanisms highlighted for enhancing AI agent performance based on Large Language Models

Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Runnan Fang, Yuan Liang, Xiaobin Wang, Jialong Wu, Shuofei Qiao, Pengjun Xie, Fei Huang, Huajun Chen, Ningyu Zhang

arXiv: 2508.06433v1 - DOI (cs.CL)

Work in progress

License: NONEXCLUSIVE-DISTRIB 1.0

Abstract: Large Language Models (LLMs) based agents excel at diverse tasks, yet they suffer from brittle procedural memory that is manually engineered or entangled in static parameters. In this work, we investigate strategies to endow agents with a learnable, updatable, and lifelong procedural memory. We propose Memp that distills past agent trajectories into both fine-grained, step-by-step instructions and higher-level, script-like abstractions, and explore the impact of different strategies for Build, Retrieval, and Update of procedural memory. Coupled with a dynamic regimen that continuously updates, corrects, and deprecates its contents, this repository evolves in lockstep with new experience. Empirical evaluation on TravelPlanner and ALFWorld shows that as the memory repository is refined, agents achieve steadily higher success rates and greater efficiency on analogous tasks. Moreover, procedural memory built from a stronger model retains its value: migrating the procedural memory to a weaker model yields substantial performance gains.

Submitted to arXiv on 08 Aug. 2025

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

⚠The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2508.06433v1

⚠This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

Comprehensive Summary
Key points
Layman's Summary
Blog article

In their work titled "Memp: Exploring Agent Procedural Memory," authors Runnan Fang, Yuan Liang, Xiaobin Wang, Jialong Wu, Shuofei Qiao, Pengjun Xie, Fei Huang, Huajun Chen, and Ningyu Zhang address the limitations of Large Language Models (LLMs) based agents in handling procedural memory. These agents excel at various tasks but struggle with rigidly defined procedural memory that is either manually crafted or constrained by fixed parameters. To overcome this challenge, the authors propose Memp as a solution to endow agents with a learnable, updatable, and lifelong procedural memory. Memp operates by distilling past agent trajectories into detailed step-by-step instructions as well as higher-level script-like abstractions. The study delves into different strategies for the Build, Retrieval, and Update processes of procedural memory to enhance its effectiveness. By implementing a dynamic regimen that continuously updates, corrects, and removes outdated information from the repository in sync with new experiences, Memp evolves over time. Empirical evaluations conducted on TravelPlanner and ALFWorld demonstrate that refining the memory repository leads to progressively higher success rates and improved efficiency in performing similar tasks. Notably, transferring procedural memory from a more robust model to a weaker one results in significant performance gains. This highlights the value of a well-constructed procedural memory system in enhancing an agent's capabilities across various domains. Overall,the findings presented in this study shed light on the importance of adaptive and evolving procedural memory mechanisms for enhancing the performance of AI agents based on Large Language Models.

- Authors address limitations of Large Language Models (LLMs) in handling procedural memory
- Proposal of Memp as a solution to provide agents with learnable, updatable, and lifelong procedural memory
- Memp distills past agent trajectories into detailed step-by-step instructions and higher-level script-like abstractions
- Strategies for Build, Retrieval, and Update processes of procedural memory discussed to enhance effectiveness
- Continuous updating and correction of outdated information in sync with new experiences allows Memp to evolve over time
- Empirical evaluations on TravelPlanner and ALFWorld show higher success rates and improved efficiency with refined memory repository
- Transferring procedural memory from a robust model to a weaker one leads to significant performance gains
- Importance of adaptive and evolving procedural memory mechanisms highlighted for enhancing AI agent performance based on Large Language Models

Summary- Authors talk about problems with big talking computers in remembering how to do things. - They suggest using something called Memp to help these computers remember and learn better. - Memp takes what the computer has done before and turns it into step-by-step instructions and simpler ideas. - Ways to make sure Memp works well, like building, finding, and updating memories, are discussed. - By keeping memories up-to-date with new experiences, Memp can get better over time. Definitions- Large Language Models (LLMs): Big talking computers that can understand and generate human language. - Procedural memory: The memory of how to do things or perform tasks. - Trajectories: Paths or routes followed by something over time. - Abstractions: Simplified versions or general ideas of something complex. - Empirical evaluations: Tests or studies based on real-world observations rather than just theories.

Introduction

In recent years, Large Language Models (LLMs) have shown remarkable performance in various tasks such as language understanding, question answering, and dialogue generation. However, these models struggle with rigidly defined procedural memory that is either manually crafted or constrained by fixed parameters. This limitation hinders their ability to adapt and learn from new experiences over time. To address this challenge, a team of researchers from the University of Science and Technology of China has proposed Memp - an innovative solution that endows agents with a learnable, updatable, and lifelong procedural memory. In their paper titled "Memp: Exploring Agent Procedural Memory," authors Runnan Fang, Yuan Liang, Xiaobin Wang, Jialong Wu, Shuofei Qiao, Pengjun Xie, Fei Huang, Huajun Chen,and Ningyu Zhang delve into the details of Memp and its potential impact on improving the capabilities of AI agents based on LLMs.

The Limitations of Large Language Model-based Agents

Large Language Models (LLMs) are pre-trained neural networks that excel at various natural language processing tasks by learning patterns from large amounts of text data. These models can generate human-like text responses and perform well in open-ended conversations. However,the lack of a robust procedural memory system limits their ability to handle complex tasks that require step-by-step instructions or higher-level script-like abstractions. The current approach for incorporating procedural memory into LLM-based agents involves manually crafting rules or using fixed parameters to define procedures. This method is not only labor-intensive but also inflexible since it cannot adapt to new scenarios or update existing knowledge.

Memp: A Solution for Adaptive Procedural Memory

To overcome the limitations mentioned above,Memp proposes a novel framework for creating adaptive and evolving procedural memory systems for LLM-based agents. The key idea behind Memp is to distill past agent trajectories into detailed step-by-step instructions as well as higher-level script-like abstractions. Memp operates through three main processes: Build, Retrieval, and Update. In the Build process, Memp extracts information from the agent's past experiences and constructs a memory repository that contains both low-level instructions and high-level scripts. The Retrieval process involves searching for relevant procedures in the memory repository based on the current task at hand. Finally, in the Update process,Memp continuously updates, corrects,and removes outdated information from the repository in sync with new experiences.

Strategies for Building Procedural Memory

The authors of this paper explore different strategies for building procedural memory using Memp. One approach is to use a hierarchical structure where lower-level instructions are grouped under higher-level scripts. This allows for more efficient retrieval and utilization of knowledge since similar tasks can be performed by reusing existing scripts. Another strategy is to incorporate attention mechanisms that prioritize important steps or actions within an instruction sequence. This helps agents focus on critical details while performing a task and improves their overall performance.

Efficient Retrieval of Procedures

Retrieving relevant procedures from the memory repository plays a crucial role in enhancing an agent's performance. To achieve this,Memp uses a combination of similarity-based retrieval and reinforcement learning techniques. Similarity-based retrieval involves comparing the current task with stored procedures based on various metrics such as semantic similarity or action sequences. Reinforcement learning techniques help refine these retrieved procedures by providing feedback on their effectiveness in completing a given task.

Continuous Updating of Procedural Memory

One of the most significant advantages of Memp is its ability to continuously update procedural memory over time.Memp implements a dynamic regimen that ensures outdated or incorrect information is removed from the repository while new experiences are incorporated into it.This adaptive mechanism allows Memp to evolve and improve its performance over time.

Evaluation of Memp

To evaluate the effectiveness of Memp, the researchers conducted experiments on two different domains: TravelPlanner and ALFWorld. In both cases,Memp was compared with baseline models that did not have a procedural memory system. The results showed that refining the memory repository led to progressively higher success rates and improved efficiency in performing similar tasks. Notably, transferring procedural memory from a more robust model to a weaker one resulted in significant performance gains.This highlights the value of a well-constructed procedural memory system in enhancing an agent's capabilities across various domains.

Conclusion

In conclusion, "Memp: Exploring Agent Procedural Memory" presents an innovative solution for overcoming the limitations of Large Language Model-based agents in handling procedural memory. By distilling past agent trajectories into detailed step-by-step instructions as well as higher-level script-like abstractions,Memp provides agents with a learnable, updatable,and lifelong procedural memory system. The study also explores different strategies for building, retrieving,and updating this memory repository to enhance its effectiveness over time. The empirical evaluations conducted on TravelPlanner and ALFWorld demonstrate that incorporating Memp leads to significantly improved performance in completing complex tasks. This research sheds light on the importance of adaptive and evolving procedural memory mechanisms for enhancing the capabilities of AI agents based on Large Language Models.

Created on 15 Aug. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Similar papers summarized with our AI tools

78.7%

Memory Sharing for Large Language Model based Agents

cs.CL

77.3%

Mass-Editing Memory in a Transformer

cs.CL

77.1%

MemoryBank: Enhancing Large Language Models with Long-Term Memory

cs.CL

77.0%

$\text{Memory}^3$: Language Modeling with Explicit Memory

cs.CL

75.2%

Augmenting Language Models with Long-Term Memory

cs.CL

72.8%

Memory-assisted prompt editing to improve GPT-3 after deployment

cs.CL

72.5%

Large Language Model based Multi-Agents: A Survey of Progress and Challenges

cs.CL

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.