In this paper, we present a novel approach to automating Standard Operating Procedures (SOPs) using Large Language Models (LLMs) as the foundation for AI agents. SOPs are crucial in customer care operations because they give human agents a structured process for resolving customer issues. Our proposed agentic workflow uses LLMs augmented with memory and several environments: API tools, user interfaces, and external knowledge sources. The agent's architecture comprises three task-specific LLMs, a Global Action Repository (GAR), execution memory, and multiple environments. The SOP workflow is represented as a simple logical block of text; the agent uses its current execution memory to determine the next action from the SOP instructions. It then interacts with the appropriate environment (user or API) to gather feedback and observations, which are fed back into memory to determine subsequent actions. A key feature of the agent is its fault-tolerance mechanism, which lets it dynamically repeat actions or seek input from external knowledge sources when needed. We conducted experiments on three SOPs from the e-commerce seller domain to validate the agent's performance under complex real-world scenarios. We also discuss how LLM-based agents have been applied in domains such as web navigation, travel planning, video understanding, biomedical discovery, code documentation, knowledge graph reasoning, and knowledge base question answering. Memory plays a crucial role in these agents, and the existing literature explores several methods for incorporating it. Our approach differs from previous work by offering more flexibility in handling SOPs and managing errors through a single-state LLM prompt that can predict actions across all SOPs.
The experimental results demonstrate the effectiveness of our proposed agentic workflow for automating SOPs and highlight the importance of having an LLM with strong reasoning capabilities. In conclusion, our research presents a novel framework for automating SOP workflows using LLM-based AI agents. The architecture's simplicity and adaptability make it suitable for various applications beyond customer care operations. Further exploration could extend this framework to efficiently automate other types of workflows.
- Novel approach to automating Standard Operating Procedures (SOPs) using Large Language Models (LLMs)
- Proposed agentic workflow utilizes LLMs augmented with memory and various environments
- Architecture includes task-specific LLMs, Global Action Repository (GAR), execution memory, and multiple environments
- SOP workflow represented as a logical block of text where agent uses current execution memory for next actions
- Fault-tolerance mechanism allows dynamic action repetition or seeking input from external knowledge sources
- Experiments conducted on e-commerce seller domain SOPs to validate agent's performance under complex scenarios
- Successful applications of LLM-based agents in web navigation, travel planning, video understanding, biomedical discovery, code documentation, etc.
- Importance of memory in functionality of LLM-based agents with different methods explored in literature
- Approach offers flexibility in handling SOPs and error management through single-state LLM prompt for predicting actions across all SOPs
- Effectiveness of proposed agentic workflow demonstrated in experimental results for automating SOPs
- Framework's simplicity and adaptability suitable for various applications beyond customer care operations
Summary:
- A new way to make tasks easier using smart computer programs called Large Language Models (LLMs).
- These programs are designed to remember things and work in different situations.
- The setup includes specific programs for different tasks, a memory system, and different environments to work in.
- Tasks are written as instructions that the program follows using its memory.
- If something goes wrong, the program can try again or ask for help from other sources.
Definitions:
- Novel: New and different
- Automating: Making something happen automatically without human intervention
- Standard Operating Procedures (SOP): A set of step-by-step instructions for completing a task
- Large Language Models (LLMs): Smart computer programs that understand and generate human language
- Agentic: Acting like an agent or helper
- Workflow: The sequence of steps involved in completing a task
- Architecture: The structure or design of a system
- Fault-tolerance: Ability to continue working even if there are mistakes or problems
- Memory: The ability to store and recall information
Introduction:
Standard Operating Procedures (SOPs) are essential in customer care operations as they provide a structured process for human agents to resolve customer issues. However, these procedures can be time-consuming and tedious for human agents to follow, leading to delays and errors in the resolution process. To address this issue, researchers have proposed using Artificial Intelligence (AI) agents to automate SOPs. In this paper, we present a novel approach that utilizes Large Language Models (LLMs) as the foundation for AI agents to automate SOP workflows.
Background:
In recent years, LLMs have shown great potential in various natural language processing tasks due to their ability to generate coherent text and understand complex language structures. This has led researchers to explore their application in other domains beyond natural language processing.
Our Approach:
Our proposed agentic workflow uses LLMs augmented with memory and various environments such as API tools, user interfaces, and external knowledge sources. The architecture of our AI agent includes three task-specific LLMs, a Global Action Repository (GAR), execution memory, and multiple environments.
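The components named above can be pictured as plain data structures. The following is a minimal sketch only: the class names, fields, and the `get_order_status` action are hypothetical illustrations, not the paper's actual implementation, and the three task-specific LLMs are left out entirely.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, List


@dataclass
class Action:
    """One entry in the Global Action Repository (fields are assumed)."""
    name: str
    environment: str              # which environment runs it, e.g. "user" or "api"
    run: Callable[[dict], str]    # executes the action and returns an observation


@dataclass
class GlobalActionRepository:
    """Registry of every action the agent may take, keyed by name."""
    actions: Dict[str, Action] = field(default_factory=dict)

    def register(self, action: Action) -> None:
        self.actions[action.name] = action

    def get(self, name: str) -> Action:
        return self.actions[name]


@dataclass
class ExecutionMemory:
    """Ordered record of the actions taken so far and what they returned."""
    steps: List[dict] = field(default_factory=list)

    def add(self, action: str, observation: str) -> None:
        self.steps.append({"action": action, "observation": observation})

    def render(self) -> str:
        # Text form of the memory, suitable for inclusion in a prompt.
        return "\n".join(f"{s['action']} -> {s['observation']}" for s in self.steps)


# Usage: register a stubbed API action, run it, and record the observation.
gar = GlobalActionRepository()
gar.register(Action("get_order_status", "api", lambda args: "shipped"))
memory = ExecutionMemory()
action = gar.get("get_order_status")
memory.add(action.name, action.run({"order_id": "A1"}))
```

Keeping the repository separate from the memory means the set of available actions can be shared across SOPs while each execution keeps its own history.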
How it Works:
The SOP workflow is represented as a simple logical block of text. At each step, the agent uses its current execution memory to determine the next action from the SOP instructions. It then interacts with the appropriate environment (user or API) to gather feedback and observations, which are fed back into memory to determine subsequent actions.
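The loop described here (choose an action from memory, execute it in an environment, store the observation) can be sketched as follows. This is an illustration under stated assumptions: `run_sop`, the `STOP` sentinel, the `route` table, and the scripted policy that stands in for the LLM are all hypothetical and not taken from the paper.

```python
from typing import Callable, Dict, List, Tuple

# Assumed types: a policy maps (SOP text, memory) to the next action name,
# and an environment maps an action name to an observation string.
Memory = List[Tuple[str, str]]
Policy = Callable[[str, Memory], str]
Environment = Callable[[str], str]


def run_sop(
    sop_text: str,
    choose_action: Policy,
    environments: Dict[str, Environment],
    route: Dict[str, str],
    max_steps: int = 10,
) -> Memory:
    """Run the observe-decide-act loop until the policy stops or steps run out."""
    memory: Memory = []
    for _ in range(max_steps):
        action = choose_action(sop_text, memory)
        if action == "STOP":                      # assumed termination signal
            break
        observation = environments[route[action]](action)
        memory.append((action, observation))      # feed the observation back
    return memory


def scripted_policy(sop_text: str, memory: Memory) -> str:
    # Stand-in for the LLM: follows a fixed plan indexed by steps taken so far.
    plan = ["ask_order_id", "lookup_order", "STOP"]
    return plan[len(memory)]


trace = run_sop(
    sop_text="Ask for the order ID, then look up the order.",
    choose_action=scripted_policy,
    environments={
        "user": lambda action: "order 123",        # simulated user reply
        "api": lambda action: "status: shipped",   # simulated API response
    },
    route={"ask_order_id": "user", "lookup_order": "api"},
)
```

In a real agent, `choose_action` would call an LLM with the SOP text and the rendered memory; the `max_steps` cap is a safeguard added for this sketch.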
Fault-Tolerance Mechanism:
One key feature of our agent is its fault-tolerance mechanism that allows it to dynamically repeat actions or seek input from external knowledge sources when needed. This ensures that even if there are errors or unexpected situations during the execution of an SOP, the agent can still continue with minimal disruption.
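One way to read "repeat actions or seek input from external knowledge sources" is a retry-then-fallback wrapper around each action. The sketch below assumes that reading; the function name, the validity check, and the retry limit are hypothetical and may differ from the paper's mechanism.

```python
from typing import Callable, Optional


def execute_with_recovery(
    action: Callable[[], str],
    is_valid: Callable[[str], bool],
    knowledge_lookup: Callable[[], Optional[str]],
    max_attempts: int = 3,
) -> str:
    """Repeat a failing action, then fall back to an external knowledge source."""
    for _ in range(max_attempts):
        try:
            observation = action()
        except Exception:
            continue                  # the action raised: try it again
        if is_valid(observation):
            return observation        # usable observation: done
    fallback = knowledge_lookup()     # retries exhausted: consult knowledge source
    if fallback is None:
        raise RuntimeError("action failed and no fallback knowledge was found")
    return fallback


# Usage: a simulated API call that times out twice before succeeding.
attempts = {"count": 0}


def flaky_lookup() -> str:
    attempts["count"] += 1
    if attempts["count"] < 3:
        raise TimeoutError("simulated API timeout")
    return "status: shipped"


result = execute_with_recovery(
    flaky_lookup,
    is_valid=lambda obs: obs.startswith("status:"),
    knowledge_lookup=lambda: None,
)
```

Bounding the number of attempts keeps a persistently failing action from stalling the SOP; what the agent does with the fallback text is left open here.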
Experimental Results:
To validate our agent's performance under complex real-world scenarios, we conducted experiments using three SOPs from the e-commerce seller domain. The results showed that our agentic workflow was able to successfully automate all three SOPs with high accuracy and efficiency.
Applications in Other Domains:
LLM-based agents have been successfully applied in various domains such as web navigation, travel planning, video understanding, biomedical discovery, code documentation, knowledge graph reasoning, and knowledge base question answering. This highlights the versatility of LLMs and their potential for use in different applications.
Importance of Memory:
Memory plays a crucial role in the functionality of LLM-based agents. Different methods have been explored in existing literature to incorporate memory into these agents. Our approach differs from previous work by offering more flexibility in handling SOPs and error management through a single-state LLM prompt that can predict actions across all SOPs.
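A single prompt that serves every SOP might be built from a template whose only variable parts are the SOP text, the execution memory, and the list of available actions. The sketch below is one possible reading of that idea; the prompt wording and the `build_prompt` helper are invented for illustration and do not reproduce the paper's prompt.

```python
from typing import List, Tuple


def build_prompt(
    sop_text: str,
    memory: List[Tuple[str, str]],
    action_names: List[str],
) -> str:
    """Fill one shared template with the SOP, the memory, and the action list."""
    history = "\n".join(f"- {a}: {o}" for a, o in memory) or "- (nothing yet)"
    return (
        "You are executing a Standard Operating Procedure.\n\n"
        f"SOP:\n{sop_text}\n\n"
        f"Steps taken so far:\n{history}\n\n"
        f"Available actions: {', '.join(action_names)}\n"
        "Reply with the name of the single next action."
    )


prompt = build_prompt(
    sop_text="Ask for the order ID, then look up the order.",
    memory=[("ask_order_id", "order 123")],
    action_names=["ask_order_id", "lookup_order", "STOP"],
)
```

Because the template itself never changes, supporting a new SOP under this reading only requires new SOP text and, where needed, new entries in the action list.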
Conclusion:
Our research presents a novel framework for automating SOP workflows using LLM-based AI agents. The architecture's simplicity and adaptability make it suitable for applications beyond customer care operations, and further work could extend the framework to automate other types of workflows. As LLM capabilities continue to grow, we believe this approach can support future developments in automated processes.