Agent-S: LLM Agentic workflow to automate Standard Operating Procedures

AI-generated keywords: Automating Standard Operating Procedures Large Language Models AI agents SOP workflow Fault-tolerance mechanism

AI-generated Key Points

  • Novel approach to automating Standard Operating Procedures (SOP) using Large Language Models (LLMs)
  • Proposed agentic workflow utilizes LLMs augmented with memory and various environments
  • Architecture includes task-specific LLMs, Global Action Repository (GAR), execution memory, and multiple environments
  • SOP workflow represented as a logical block of text where agent uses current execution memory for next actions
  • Fault-tolerance mechanism allows dynamic action repetition or seeking input from external knowledge sources
  • Experiments conducted on e-commerce seller domain SOPs to validate agent's performance under complex scenarios
  • Successful applications of LLM-based agents in web navigation, travel planning, video understanding, biomedical discovery, code documentation, etc.
  • Importance of memory in functionality of LLM-based agents with different methods explored in literature
  • Approach offers flexibility in handling SOPs and error management through single-state LLM prompt for predicting actions across all SOPs
  • Effectiveness of proposed agentic workflow demonstrated in experimental results for automating SOPs
  • Framework's simplicity and adaptability suitable for various applications beyond customer care operations
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Mandar Kulkarni

License: CC BY 4.0

Abstract: AI agents using Large Language Models (LLMs) as foundations have shown promise in solving complex real-world tasks. In this paper, we propose an LLM-based agentic workflow for automating Standard Operating Procedures (SOP). For customer care operations, an SOP defines a logical step-by-step process for human agents to resolve customer issues. We observe that any step in the SOP can be categorized as user interaction or API call, while the logical flow in the SOP defines the navigation. We use LLMs augmented with memory and environments (API tools, user interface, external knowledge source) for SOP automation. Our agentic architecture consists of three task-specific LLMs, a Global Action Repository (GAR), execution memory, and multiple environments. SOP workflow is written as a simple logical block of text. Based on the current execution memory and the SOP, the agent chooses the action to execute; it interacts with an appropriate environment (user/API) to collect observations and feedback, which are, in turn, inputted to memory to decide the next action. The agent is designed to be fault-tolerant, where it dynamically decides to repeat an action or seek input from an external knowledge source. We demonstrate the efficacy of the proposed agent on the three SOPs from the e-commerce seller domain. The experimental results validate the agent's performance under complex real-world scenarios.

Submitted to arXiv on 03 Feb. 2025

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2503.15520v1

In this paper, we present a novel approach to automating Standard Operating Procedures (SOP) using Large Language Models (LLMs) as the foundation for AI agents. SOPs are crucial in customer care operations as they provide a structured process for human agents to resolve customer issues. Our proposed agentic workflow utilizes LLMs augmented with memory and various environments such as API tools, user interfaces, and external knowledge sources. The architecture of our AI agent includes three task-specific LLMs, a Global Action Repository (GAR), execution memory, and multiple environments. The SOP workflow is represented as a simple logical block of text where the agent uses its current execution memory to determine the next action based on the SOP instructions. It interacts with the appropriate environment (user or API) to gather feedback and observations which are then inputted back into memory to determine subsequent actions. One key feature of our agent is its fault-tolerance mechanism that allows it to dynamically repeat actions or seek input from external knowledge sources when needed. We conducted experiments using three SOPs from the e-commerce seller domain to validate our agent's performance under complex real-world scenarios. Additionally, we discuss how LLM-based agents have been successfully applied in various domains such as web navigation, travel planning, video understanding, biomedical discovery, code documentation, knowledge graph reasoning, and knowledge base question answering. Memory plays a crucial role in these agents' functionality with different methods explored in existing literature. Our approach differs from previous work by offering more flexibility in handling SOPs and error management through a single-state LLM prompt that can predict actions across all SOPs. The experimental results demonstrate the effectiveness of our proposed agentic workflow for automating SOPs and highlight the importance of having an LLM with strong reasoning capabilities. In conclusion, our research presents a novel framework for automating SOP workflows using LLM-based AI agents. The architecture's simplicity and adaptability make it suitable for various applications beyond customer care operations. Further exploration could extend this framework to efficiently automate other types of workflows.
Created on 14 Nov. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.