SheetAgent: Towards A Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models
AI-generated Key Points
⚠The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.
- Recent advancements in spreadsheet manipulation involve the integration of Large Language Models (LLMs) for automating tasks and enhancing efficiency.
- LLMs have shown promise in simpler operations but remain limited in complex scenarios that require intricate reasoning.
- The $\textbf{SheetRM}$ benchmark addresses this gap with long-horizon, multi-category tasks in which the required manipulation depends on reasoning, reflecting real-life challenges.
- $\textbf{SheetAgent}$ is proposed as an autonomous agent with three collaborative modules (Planner, Informer, and Retriever) that uses LLMs for advanced reasoning and accurate spreadsheet manipulation without human interaction.
- Using iterative task reasoning and reflection, SheetAgent improves pass rates by 20-30% over baselines on multiple benchmarks.
- This enhanced precision underscores SheetAgent's superior table reasoning abilities, contributing towards developing a generalist agent tailored for spreadsheet reasoning and manipulation using LLMs.
- The research paper detailing these findings has been accepted by the Large Language Models and Cognition conference at ICML 2024. Interested individuals can explore further insights and visualizations of SheetAgent's capabilities at https://sheetagent.github.io.
Authors: Yibin Chen, Yifu Yuan, Zeyu Zhang, Yan Zheng, Jinyi Liu, Fei Ni, Jianye Hao
Abstract: Spreadsheet manipulation is ubiquitous in everyday work and significantly improves working efficiency. Large language models (LLMs) have recently been applied to automatic spreadsheet manipulation but have not yet been investigated on complicated, realistic tasks where reasoning challenges exist (e.g., long-horizon manipulation with multi-step reasoning and ambiguous requirements). To bridge the gap with real-world requirements, we introduce $\textbf{SheetRM}$, a benchmark featuring long-horizon and multi-category tasks with reasoning-dependent manipulation caused by real-life challenges. To address these challenges, we further propose $\textbf{SheetAgent}$, a novel autonomous agent that utilizes the power of LLMs. SheetAgent consists of three collaborative modules: $\textit{Planner}$, $\textit{Informer}$, and $\textit{Retriever}$, achieving both advanced reasoning and accurate manipulation over spreadsheets without human interaction through iterative task reasoning and reflection. Extensive experiments demonstrate that SheetAgent delivers 20-30% pass rate improvements on multiple benchmarks over baselines, achieving enhanced precision in spreadsheet manipulation and demonstrating superior table reasoning abilities. More details and visualizations are available at https://sheetagent.github.io.
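The abstract names three modules and an iterative reason-and-reflect loop but does not say what each module does. The sketch below is only a toy illustration of such a loop, not the authors' implementation: the roles given to `informer`, `retriever`, and `planner`, the dictionary-based `Sheet` type, and the `run_agent` function are all hypothetical, and the planner is a hard-coded stub rather than an LLM call.

```python
from typing import Callable, Dict, List, Optional, Tuple

# Toy stand-in for a spreadsheet: column name -> list of cell values.
Sheet = Dict[str, List[float]]
Action = Callable[[Sheet], None]


def informer(sheet: Sheet, task: str) -> str:
    """Hypothetical Informer: describe the columns the task mentions."""
    cols = [c for c in sheet if c.lower() in task.lower()]
    return f"relevant columns: {cols or list(sheet)}"


def retriever(error: str) -> str:
    """Hypothetical Retriever: look up a hint for a failed action."""
    hints = {"KeyError": "check the column name against the sheet header"}
    for key, hint in hints.items():
        if key in error:
            return hint
    return "no hint found"


def planner(task: str, context: str, feedback: Optional[str]) -> Action:
    """Hypothetical Planner (stub, no LLM; task and context are unused here).

    The first attempt deliberately uses wrong column names so that the
    reflection step has something to correct.
    """
    if feedback is None:
        def action(sheet: Sheet) -> None:
            sheet["Total"] = [p + t for p, t in zip(sheet["price"], sheet["tax"])]
    else:
        def action(sheet: Sheet) -> None:
            sheet["Total"] = [p + t for p, t in zip(sheet["Price"], sheet["Tax"])]
    return action


def run_agent(sheet: Sheet, task: str, max_steps: int = 3) -> Tuple[bool, List[str]]:
    """Plan, act, and on failure feed the error back into the next plan."""
    feedback: Optional[str] = None
    log: List[str] = []
    for step in range(1, max_steps + 1):
        context = informer(sheet, task)
        action = planner(task, context, feedback)
        try:
            action(sheet)
        except KeyError as exc:
            # Reflection: turn the failure into feedback for the next attempt.
            feedback = f"KeyError: {exc}; hint: {retriever('KeyError')}"
            log.append(f"step {step}: failed ({feedback})")
            continue
        log.append(f"step {step}: succeeded")
        return True, log
    return False, log
```

Calling `run_agent({"Price": [10.0, 20.0], "Tax": [1.0, 2.0]}, "Add a Total column summing Price and Tax")` fails on the first step, retries with the error as feedback, and then adds the `Total` column. In a real agent the stubbed planner would be replaced by a model call that receives the task, the context, and the feedback.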