FLAP: Flow Adhering Planning with Constrained Decoding in LLMs

AI-generated keywords: FLAP

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Authors address planning for agents in task-oriented dialogs (TODs)
  • Challenge of ensuring faithful plans with Large Language Models (LLMs) due to bias towards pretraining data
  • Introduction of constrained decoding algorithm based on lookahead heuristic for faithful planning in TODs
  • Algorithm outperforms other baselines in performance metrics, eliminates need for fine-tuning LLMs using domain-specific data
  • Comparable results achieved with smaller LLMs (7B) compared to larger models (30B-40B parameters)
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Shamik Roy, Sailik Sengupta, Daniele Bonadiman, Saab Mansour, Arshit Gupta

Under submission
License: CC BY-NC-ND 4.0

Abstract: Planning is a crucial task for agents in task oriented dialogs (TODs). Human agents typically resolve user issues by following predefined workflows, decomposing workflow steps into actionable items, and performing actions by executing APIs in order; all of which require reasoning and planning. With the recent advances in LLMs, there have been increasing attempts to use LLMs for task planning and API usage. However, the faithfulness of the plans to predefined workflows and API dependencies, is not guaranteed with LLMs because of their bias towards pretraining data. Moreover, in real life, workflows are custom-defined and prone to change, hence, quickly adapting agents to the changes is desirable. In this paper, we study faithful planning in TODs to resolve user intents by following predefined flows and preserving API dependencies. We propose a constrained decoding algorithm based on lookahead heuristic for faithful planning. Our algorithm alleviates the need for finetuning LLMs using domain specific data, outperforms other decoding and prompting-based baselines, and applying our algorithm on smaller LLMs (7B) we achieve comparable performance to larger LLMs (30B-40B).

Submitted to arXiv on 09 Mar. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2403.05766v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

, , , , In the paper titled "FLAP: Flow Adhering Planning with Constrained Decoding in LLMs," authored by Shamik Roy, Sailik Sengupta, Daniele Bonadiman, Saab Mansour, and Arshit Gupta, the authors address the critical task of planning for agents in task-oriented dialogs (TODs). Human agents typically adhere to predefined workflows and execute actions through APIs in a sequential manner to address user issues. However, this requires reasoning and planning skills. With the advancements in Large Language Models (LLMs), there has been a growing interest in utilizing them for task planning and API execution. One challenge faced with LLMs is ensuring faithful plans that adhere to predefined workflows and API dependencies due to their inherent bias towards pretraining data. Additionally, real-life workflows are often customized and subject to changes, highlighting the importance of quickly adapting agents to these modifications. To effectively address these challenges, the authors introduce a constrained decoding algorithm based on lookahead heuristic for faithful planning in TODs. This algorithm eliminates the need for fine-tuning LLMs using domain-specific data and outperforms other decoding and prompting-based baselines in performance metrics. Moreover, by applying their proposed algorithm on smaller LLMs (7B), comparable results are achieved compared to larger LLM models ranging from 30B-40B parameters. This research contributes valuable insights into enhancing planning accuracy in TOD scenarios while leveraging the capabilities of LLMs effectively.
Created on 17 Aug. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.