AdaCoder: An Adaptive Planning and Multi-Agent Framework for Function-Level Code Generation

AI-generated keywords: Multi-agent frameworks, Function-level code generation, Large Language Models (LLMs), Adaptability, AdaCoder

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Authors Yueheng Zhu, Chao Liu, Xuan He, Xiaoxue Ren, Zhongxin Liu, Ruwei Pan, and Hongyu Zhang focus on multi-agent frameworks for function-level code generation
  • Frameworks aim to enhance software development productivity by automatically generating source code based on task descriptions
  • Agents powered by Large Language Models (LLMs) handle planning, code generation, testing, and debugging tasks
  • Study evaluates generalizability of existing frameworks across different foundation LLMs
  • The authors introduce AdaCoder, an adaptive planning and multi-agent framework for function-level code generation
  • AdaCoder operates in two phases: initial code generation without planning in Phase-1 and iterative code generation with strategic planning in Phase-2
  • Evaluation shows AdaCoder's superior generalizability across diverse LLMs compared to existing frameworks
  • Compared with the best baseline, MapCoder, AdaCoder is on average 27.69% higher in Pass@1, 16 times faster in inference, and 12 times lower in token consumption

Authors: Yueheng Zhu, Chao Liu, Xuan He, Xiaoxue Ren, Zhongxin Liu, Ruwei Pan, Hongyu Zhang

Abstract: Recently, researchers have proposed many multi-agent frameworks for function-level code generation, which aim to improve software development productivity by automatically generating function-level source code based on task descriptions. A typical multi-agent framework consists of Large Language Model (LLM)-based agents that are responsible for task planning, code generation, testing, debugging, etc. Studies have shown that existing multi-agent code generation frameworks perform well on ChatGPT. However, their generalizability across other foundation LLMs remains unexplored systematically. In this paper, we report an empirical study on the generalizability of four state-of-the-art multi-agent code generation frameworks across six open-source LLMs with varying parameter sizes, architectures, and performance levels. Our study reveals the unstable generalizability of existing frameworks on diverse foundation LLMs. Based on the findings obtained from the empirical study, we propose AdaCoder, a novel adaptive planning, multi-agent framework for function-level code generation. AdaCoder has two phases. Phase-1 is an initial code generation step without planning, which uses an LLM-based coding agent and a script-based testing agent to unleash LLM's native power, identify cases beyond LLM's power, and determine the errors hindering execution. Phase-2 adds a rule-based debugging agent and an LLM-based planning agent for iterative code generation with planning. Our evaluation shows that AdaCoder achieves higher generalizability on diverse LLMs. Compared to the best baseline MapCoder, AdaCoder is on average 27.69% higher in Pass@1, 16 times faster in inference, and 12 times lower in token consumption.
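The two-phase workflow described in the abstract can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the names `run_tests`, `rule_based_debug`, and `adaptive_generate`, the single debugging rule, the iteration limit, and the order of steps inside Phase-2 are all hypothetical, and the `coder` and `planner` callables stand in for LLM-based agents.

```python
from dataclasses import dataclass
from typing import Callable, Optional, Tuple


@dataclass
class TestResult:
    passed: bool
    error: Optional[str] = None


def run_tests(code: str, tests: str) -> TestResult:
    """Script-based testing agent (sketch): execute the candidate code,
    then the assert-style tests, in one shared namespace."""
    namespace: dict = {}
    try:
        exec(code, namespace)
        exec(tests, namespace)
    except Exception as exc:  # any failure is reported back as an error string
        return TestResult(False, f"{type(exc).__name__}: {exc}")
    return TestResult(True)


def rule_based_debug(code: str) -> str:
    """Rule-based debugging agent (hypothetical rule): drop any text that
    precedes the first function definition, such as stray explanatory prose."""
    start = code.find("def ")
    return code[start:] if start > 0 else code


def adaptive_generate(
    task: str,
    tests: str,
    coder: Callable[[str, Optional[str]], str],
    planner: Callable[[str, Optional[str]], str],
    max_iters: int = 3,  # assumed limit, not taken from the paper
) -> Tuple[str, int]:
    """Return the final code and the phase (1 or 2) in which the loop stopped."""
    # Phase-1: generate code directly, without any plan.
    code = coder(task, None)
    result = run_tests(code, tests)
    if result.passed:
        return code, 1

    # Phase-2: iterate with rule-based repair and LLM-based planning.
    for _ in range(max_iters):
        code = rule_based_debug(code)
        result = run_tests(code, tests)
        if result.passed:
            return code, 2
        plan = planner(task, result.error)
        code = coder(task, plan)
        result = run_tests(code, tests)
        if result.passed:
            return code, 2
    return code, 2


if __name__ == "__main__":
    # Stub agents: the coder only gets the sign right once it receives a plan.
    def stub_coder(task: str, plan: Optional[str]) -> str:
        op = "+" if plan else "-"
        return f"def add(a, b):\n    return a {op} b"

    def stub_planner(task: str, error: Optional[str]) -> str:
        return "Add the two arguments."

    final_code, phase = adaptive_generate(
        "Return the sum of a and b.", "assert add(1, 2) == 3",
        stub_coder, stub_planner,
    )
    print(f"stopped in Phase-{phase}")
```

The point of the sketch is the adaptive control flow: planning is only invoked for tasks that the unplanned first attempt fails, which is one plausible reading of how the framework saves inference time and tokens.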

Submitted to arXiv on 05 Apr. 2025

Results of the summarizing process for the arXiv paper: 2504.04220v1

In their paper titled "AdaCoder: An Adaptive Planning and Multi-Agent Framework for Function-Level Code Generation," authors Yueheng Zhu, Chao Liu, Xuan He, Xiaoxue Ren, Zhongxin Liu, Ruwei Pan, and Hongyu Zhang study multi-agent frameworks for function-level code generation. These frameworks aim to improve software development productivity by automatically generating function-level source code from task descriptions. They typically consist of agents powered by Large Language Models (LLMs) that handle planning, code generation, testing, and debugging. Previous studies have shown that existing multi-agent code generation frameworks perform well on ChatGPT, but their generalizability across other foundation LLMs has not been systematically explored. To address this gap, the authors conducted an empirical study of four state-of-the-art multi-agent code generation frameworks across six open-source LLMs with varying parameter sizes, architectures, and performance levels. The study found that the generalizability of existing frameworks is unstable across diverse foundation LLMs.
Building on these findings, the authors introduce AdaCoder, a novel adaptive planning, multi-agent framework for function-level code generation. AdaCoder operates in two phases. Phase-1 is an initial code generation step without planning: an LLM-based coding agent and a script-based testing agent draw on the LLM's native capability, identify cases beyond that capability, and determine the errors hindering execution. Phase-2 adds a rule-based debugging agent and an LLM-based planning agent for iterative code generation with planning.
The evaluation shows that AdaCoder generalizes better across diverse LLMs than existing frameworks. Compared with the best baseline, MapCoder, AdaCoder is on average 27.69% higher in Pass@1, 16 times faster in inference, and 12 times lower in token consumption.
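To make the headline metric concrete: Pass@1 is the fraction of tasks whose single generated solution passes all tests. The sketch below computes it and shows two possible readings of a "higher Pass@1" figure, since the metadata summarized here does not state whether 27.69% is a relative gain or a difference in percentage points. The function names and the toy inputs are illustrative assumptions and are not data from the paper.

```python
from typing import Sequence


def pass_at_1(first_attempt_passed: Sequence[bool]) -> float:
    """Fraction of tasks solved by the first (and only) generated sample."""
    if not first_attempt_passed:
        raise ValueError("need at least one task result")
    return sum(first_attempt_passed) / len(first_attempt_passed)


def relative_gain_percent(new: float, baseline: float) -> float:
    """Gain expressed relative to the baseline score."""
    return (new - baseline) / baseline * 100


def absolute_gain_points(new: float, baseline: float) -> float:
    """Gain expressed as a difference in percentage points."""
    return (new - baseline) * 100


if __name__ == "__main__":
    # Toy outcomes for four tasks (illustrative only, not results from the paper).
    framework = pass_at_1([True, True, True, False])
    baseline = pass_at_1([True, False, True, False])
    print(framework, baseline)
    print(relative_gain_percent(framework, baseline))
    print(absolute_gain_points(framework, baseline))
```

The two readings can differ substantially for the same pair of scores, so the reported figure should be interpreted against the full paper.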
Created on 12 Jul. 2025
