SheetAgent: Towards A Generalist Agent for Spreadsheet Reasoning and Manipulation via Large Language Models

AI-generated keywords: Spreadsheet manipulation Large Language Models SheetRM SheetAgent reasoning challenges

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Recent advancements in spreadsheet manipulation involve the integration of Large Language Models (LLMs) for automating tasks and enhancing efficiency.
  • LLMs have shown promise in simpler operations but are limited in more complex scenarios requiring intricate reasoning challenges.
  • The introduction of the $\textbf{SheetRM}$ benchmark addresses this gap by encompassing long-horizon tasks across multiple categories that demand manipulation based on reasoning-dependent factors from real-life complexities.
  • The innovative $\textbf{SheetAgent}$ is proposed as an autonomous agent comprising three collaborative modules - Planner, Informer, and Retriever - leveraging LLM capabilities for advanced reasoning and precise spreadsheet manipulation without human intervention.
  • SheetAgent showcases notable improvements ranging from 20% to 30% in pass rates across various benchmarks compared to baseline models through iterative task reasoning and reflection mechanisms.
  • This enhanced precision underscores SheetAgent's superior table reasoning abilities, contributing towards developing a generalist agent tailored for spreadsheet reasoning and manipulation using LLMs.
  • The research paper detailing these findings has been accepted by the Large Language Models and Cognition conference at ICML 2024. Interested individuals can explore further insights and visualizations of SheetAgent's capabilities at https://sheetagent.github.io.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Yibin Chen, Yifu Yuan, Zeyu Zhang, Yan Zheng, Jinyi Liu, Fei Ni, Jianye Hao

Paper of new version. Accepted by Large Language Models and Cognition @ ICML 2024

Abstract: Spreadsheet manipulation is widely existing in most daily works and significantly improves working efficiency. Large language model (LLM) has been recently attempted for automatic spreadsheet manipulation but has not yet been investigated in complicated and realistic tasks where reasoning challenges exist (e.g., long horizon manipulation with multi-step reasoning and ambiguous requirements). To bridge the gap with the real-world requirements, we introduce $\textbf{SheetRM}$, a benchmark featuring long-horizon and multi-category tasks with reasoning-dependent manipulation caused by real-life challenges. To mitigate the above challenges, we further propose $\textbf{SheetAgent}$, a novel autonomous agent that utilizes the power of LLMs. SheetAgent consists of three collaborative modules: $\textit{Planner}$, $\textit{Informer}$, and $\textit{Retriever}$, achieving both advanced reasoning and accurate manipulation over spreadsheets without human interaction through iterative task reasoning and reflection. Extensive experiments demonstrate that SheetAgent delivers 20-30% pass rate improvements on multiple benchmarks over baselines, achieving enhanced precision in spreadsheet manipulation and demonstrating superior table reasoning abilities. More details and visualizations are available at https://sheetagent.github.io.

Submitted to arXiv on 06 Mar. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2403.03636v2

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

In the realm of spreadsheet manipulation, recent advancements have seen the integration of Large Language Models (LLMs) for automating tasks and enhancing efficiency. While LLMs have shown promise in simpler operations, their application has been limited to more complex scenarios involving intricate reasoning challenges. To address this gap and cater to real-world demands, a new benchmark known as $\textbf{SheetRM}$ has been introduced. This benchmark encompasses long-horizon tasks across multiple categories that require manipulation based on reasoning-dependent factors stemming from real-life complexities. To tackle these challenges, the innovative $\textbf{SheetAgent}$ has been proposed as an autonomous agent leveraging the capabilities of LLMs. Comprising three collaborative modules - namely the $\textit{Planner}$, $\textit{Informer}$, and $\textit{Retriever}$ - SheetAgent excels in advanced reasoning and precise spreadsheet manipulation without requiring human intervention. Through iterative task reasoning and reflection mechanisms, SheetAgent showcases its prowess by delivering notable improvements ranging from 20% to 30% in pass rates across various benchmarks when compared to baseline models. This enhanced precision in spreadsheet manipulation underscores SheetAgent's superior table reasoning abilities. The work conducted by Yibin Chen, Yifu Yuan, Zeyu Zhang, Yan Zheng, Jinyi Liu, Fei Ni, and Jianye Hao culminates in a significant contribution towards developing a generalist agent tailored for spreadsheet reasoning and manipulation using Large Language Models. The research paper detailing these findings has been accepted by the Large Language Models and Cognition conference at ICML 2024. For further insights and visualizations pertaining to SheetAgent's capabilities, interested individuals can explore additional information available at https://sheetagent.github.io.
Created on 09 Oct. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.