MCP-Zero: Proactive Toolchain Construction for LLM Agents from Scratch

AI-generated keywords: Proactive Toolchain Construction, LLM Agents, MCP-Zero Framework, Semantic Grounding, Multi-Turn Tool Invocation

AI-generated Key Points

  • Authors Xiang Fei, Xiawu Zheng, and Hao Feng introduce the MCP-Zero framework for large language models (LLMs)
  • Traditional approach of injecting tool schemas into prompts is costly and error-prone
  • MCP-Zero enables LLMs to autonomously determine when and which external tools to retrieve by constructing a task-specific toolchain from scratch
  • Framework consists of three key components: Proactive Tool Request, Hierarchical Vector Routing, and Iterative Proactive Invocation
  • Evaluation on the MCP-tools dataset (308 MCP servers, 2,797 tools) shows that MCP-Zero reduces context overhead and accurately selects tools; on APIbank it cuts token consumption by 98% while maintaining high accuracy, and it supports multi-turn tool invocation with consistent accuracy across rounds
  • Semantic grounding through sample demonstrations enhances model outputs by providing specific definitions for MCP servers and tools
  • The demonstration patch acts as a schema anchor that future work could build on to enhance model understanding through grammar-based decoders
  • Overall, MCP-Zero offers an innovative solution for proactive toolchain construction in LLM agents with improved efficiency in selecting external tools while maintaining high accuracy across tasks.

Authors: Xiang Fei, Xiawu Zheng, Hao Feng

License: CC BY 4.0

Abstract: Function-calling has enabled large language models (LLMs) to act as tool-using agents, but injecting thousands of tool schemas into the prompt is costly and error-prone. We introduce MCP-Zero, a proactive agent framework that lets the LLM itself decide when and which external tools to retrieve, thereby assembling a task-specific toolchain from scratch. The framework is built upon three components: (1) Proactive Tool Request, where the model emits a structured <tool_assistant> block that explicitly specifies the desired server and task; (2) Hierarchical Vector Routing, a coarse-to-fine retrieval algorithm that first selects candidate servers and then ranks tools within each server based on semantic similarity; (3) Iterative Proactive Invocation, enabling multi-round, cross-domain toolchain construction with minimal context overhead, and allowing the model to iteratively revise its request when the returned tools are insufficient. To evaluate our approach, we also compile MCP-tools, a retrieval dataset comprising 308 MCP servers and 2,797 tools extracted from the official Model-Context-Protocol repository and normalized into a unified JSON schema. Experiments show that MCP-Zero (i) effectively addresses the context overhead problem of existing methods and accurately selects the correct tool from a pool of nearly 3,000 candidates (248.1k tokens); (ii) reduces token consumption by 98% on the APIbank while maintaining high accuracy; and (iii) supports multi-turn tool invocation with consistent accuracy across rounds. The code and dataset will be released soon.
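
The coarse-to-fine retrieval described in the abstract can be illustrated with a minimal sketch. This is not the authors' implementation: the bag-of-words `embed` function stands in for a real embedding model, and the `SERVERS` catalog, the `route` function, and its parameters are all hypothetical names chosen for illustration. The sketch only shows the two-stage idea, selecting candidate servers first and then ranking tools inside them by similarity.

```python
import math
from collections import Counter

def embed(text):
    """Toy bag-of-words vector; an assumed stand-in for a real embedding model."""
    return Counter(text.lower().split())

def cosine(a, b):
    """Cosine similarity between two sparse count vectors."""
    dot = sum(a[t] * b[t] for t in a)
    na = math.sqrt(sum(v * v for v in a.values()))
    nb = math.sqrt(sum(v * v for v in b.values()))
    return dot / (na * nb) if na and nb else 0.0

# Hypothetical two-server catalog; the real dataset has 308 servers.
SERVERS = {
    "filesystem": {
        "description": "read and write local files and directories",
        "tools": {
            "read_file": "read the contents of a file",
            "write_file": "write text to a file",
        },
    },
    "github": {
        "description": "manage github repositories issues and pull requests",
        "tools": {
            "create_issue": "create a new issue in a repository",
            "list_pull_requests": "list open pull requests in a repository",
        },
    },
}

def route(server_query, tool_query, top_servers=1, top_tools=1):
    """Stage 1: pick the best-matching servers. Stage 2: rank only their tools."""
    sq, tq = embed(server_query), embed(tool_query)
    ranked = sorted(
        SERVERS,
        key=lambda s: cosine(sq, embed(SERVERS[s]["description"])),
        reverse=True,
    )[:top_servers]
    candidates = [
        (cosine(tq, embed(desc)), server, name)
        for server in ranked
        for name, desc in SERVERS[server]["tools"].items()
    ]
    candidates.sort(reverse=True)
    return [(server, name) for _, server, name in candidates[:top_tools]]

print(route("github repositories", "create a new issue"))
```

Because only the tools of the shortlisted servers are scored, the second stage never touches the full pool, which is the property that keeps retrieval cheap as the catalog grows.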

Submitted to arXiv on 01 Jun. 2025


Results of the summarizing process for the arXiv paper: 2506.01056v1

In their paper "MCP-Zero: Proactive Toolchain Construction for LLM Agents from Scratch," authors Xiang Fei, Xiawu Zheng, and Hao Feng introduce a framework that addresses the cost of integrating external tools into large language models (LLMs). The traditional approach of injecting numerous tool schemas into the prompt is costly and error-prone. MCP-Zero instead lets the LLM itself decide when and which external tools to retrieve, constructing a task-specific toolchain from scratch. The framework consists of three key components: Proactive Tool Request, Hierarchical Vector Routing, and Iterative Proactive Invocation.

To evaluate their approach, the authors compiled the MCP-tools dataset, comprising 308 MCP servers and 2,797 tools extracted from the official Model-Context-Protocol repository. Experimental results show that MCP-Zero reduces context overhead and accurately selects the correct tool from a pool of nearly 3,000 candidates. It also reduces token consumption by 98% on APIbank while maintaining high accuracy, and it supports multi-turn tool invocation with consistent accuracy across rounds.

Additionally, the authors highlight the importance of semantic grounding provided by sample demonstrations in refining model outputs. By clarifying field meanings and providing specific definitions for MCP servers and tools, the demonstrations make semantic matching more precise. This demonstration patch acts as a schema anchor that future work could build on to enhance model understanding through grammar-based decoders.

Overall, MCP-Zero presents a solution for proactive toolchain construction in LLM agents. It offers improved efficiency in selecting external tools while maintaining high accuracy across various tasks.
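
The interaction between Proactive Tool Request and Iterative Proactive Invocation can be sketched as a small loop. The sketch below rests on several assumptions: the `server:` and `task:` field layout inside the `<tool_assistant>` block follows the abstract's wording rather than a confirmed format, the model is replaced by a list of pre-scripted outputs, and `parse_tool_request`, `run_agent`, and `retrieve` are hypothetical names, not part of any released code.

```python
import re

# Matches the body of the first <tool_assistant> block in a model output.
BLOCK_RE = re.compile(r"<tool_assistant>(.*?)</tool_assistant>", re.DOTALL)

def parse_tool_request(model_output):
    """Return the block's 'key: value' fields as a dict, or None if no block exists."""
    match = BLOCK_RE.search(model_output)
    if match is None:
        return None
    fields = {}
    for line in match.group(1).strip().splitlines():
        key, sep, value = line.partition(":")
        if sep:  # skip lines that are not 'key: value'
            fields[key.strip().lower()] = value.strip()
    return fields

def run_agent(model_turns, retrieve, max_rounds=5):
    """Drive a request/retrieve loop over scripted outputs (a stub for a real LLM)."""
    toolchain = []
    for output in model_turns[:max_rounds]:
        request = parse_tool_request(output)
        if request is None:
            break  # no block: the model answered without asking for a tool
        tool = retrieve(request)
        # A None result means nothing suitable came back; the next turn may revise.
        if tool is not None and tool not in toolchain:
            toolchain.append(tool)
    return toolchain

# Usage with a hypothetical retriever keyed on the requested server.
turns = [
    "I need a file.\n<tool_assistant>\nserver: filesystem\ntask: read a file\n</tool_assistant>",
    "<tool_assistant>\nserver: github\ntask: open an issue\n</tool_assistant>",
    "Done.",
]
catalog = {"filesystem": "read_file", "github": "create_issue"}
print(run_agent(turns, lambda req: catalog.get(req.get("server"))))
```

Only the tools the model actually asks for enter the toolchain, so the prompt carries a handful of schemas rather than the full catalog.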
Created on 13 Oct. 2025

