AutoTAMP: Autoregressive Task and Motion Planning with LLMs as Translators and Checkers
AI-generated Key Points
- Robots' ability to comprehend, strategize, and carry out intricate, long-term tasks articulated in natural language is crucial for effective human-robot interaction.
- Recent advancements in large language models (LLMs) show promise in translating natural language into sequences of actions for robots to execute complex tasks.
- A new approach involves few-shot translation from natural language task descriptions to an intermediary task representation, which can be utilized by a traditional task-and-motion planning (TAMP) algorithm to collaboratively solve both the task and motion plan.
- Automatic detection and correction of syntactic and semantic errors through autoregressive re-prompting enhance the translation process and result in notable enhancements in task completion rates.
- The newly proposed method showcased significant superiority over existing methods that employ LLMs as planners in navigating complex task domains.
- Challenges persist when dealing with complexities such as temporally-dependent multi-step actions, action sequence optimization, and task constraints despite efforts made towards enhancing executability through feedback mechanisms and verifying sub-task sequences' executability within the framework.
Authors: Yongchao Chen, Jacob Arkin, Yang Zhang, Nicholas Roy, Chuchu Fan
Abstract: For effective human-robot interaction, robots need to understand, plan, and execute complex, long-horizon tasks described by natural language. The recent and remarkable advances in large language models (LLMs) have shown promise for translating natural language into robot action sequences for complex tasks. However, many existing approaches either translate the natural language directly into robot trajectories, or factor the inference process by decomposing language into task sub-goals, then relying on a motion planner to execute each sub-goal. When complex environmental and temporal constraints are involved, inference over planning tasks must be performed jointly with motion plans using traditional task-and-motion planning (TAMP) algorithms, making such factorization untenable. Rather than using LLMs to directly plan task sub-goals, we instead perform few-shot translation from natural language task descriptions to an intermediary task representation that can then be consumed by a TAMP algorithm to jointly solve the task and motion plan. To improve translation, we automatically detect and correct both syntactic and semantic errors via autoregressive re-prompting, resulting in significant improvements in task completion. We show that our approach outperforms several methods using LLMs as planners in complex task domains.
Ask questions about this paper to our AI assistant
You can also chat with multiple papers at once here.
Assess the quality of the AI-generated content by voting
Score: 0
Why do we need votes?
Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.
The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.
Similar papers summarized with our AI tools
Navigate through even more similar papers through a
tree representationLook for similar papers (in beta version)
By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.
Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.