MCP-Zero: Proactive Toolchain Construction for LLM Agents from Scratch

AI-generated keywords: Proactive Toolchain Construction LLM Agents MCP-Zero Framework Semantic Grounding Multi-Turn Tool Invocation

AI-generated Key Points

Authors Xiang Fei, Xiawu Zheng, and Hao Feng introduce the MCP-Zero framework for large language models (LLMs)
Traditional approach of injecting tool schemas into prompts is costly and error-prone
MCP-Zero enables LLMs to autonomously determine when and which external tools to retrieve by constructing a task-specific toolchain from scratch
Framework consists of three key components: Proactive Tool Request, Hierarchical Vector Routing, and Iterative Proactive Invocation
Evaluation using MCP-tools dataset shows that MCP-Zero reduces context overhead, accurately selects tools, decreases token consumption on APIbank while maintaining high accuracy levels, and supports multi-turn tool invocation with consistent accuracy
Semantic grounding through sample demonstrations enhances model outputs by providing specific definitions for MCP servers and tools
Demonstrated patch acts as a schema anchor for future work to enhance model understanding through grammar-based decoders
Overall, MCP-Zero offers an innovative solution for proactive toolchain construction in LLM agents with improved efficiency in selecting external tools while maintaining high accuracy across tasks.

Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Xiang Fei, Xiawu Zheng, Hao Feng

arXiv: 2506.01056v1 - DOI (cs.AI)

License: CC BY 4.0

Abstract: Function-calling has enabled large language models (LLMs) to act as tool-using agents, but injecting thousands of tool schemas into the prompt is costly and error-prone. We introduce MCP-Zero, a proactive agent framework that lets the LLM itself decide when and which external tools to retrieve, thereby assembling a task-specific toolchain from scratch. The framework is built upon three components: (1) Proactive Tool Request, where the model emits a structured $\left<\operatorname{tool\_assistant}\right>$ block that explicitly specifies the desired server and task; (2) Hierarchical Vector Routing, a coarse-to-fine retrieval algorithm that first selects candidate servers and then ranks tools within each server based on the semantic similarity; (3) Iterative Proactive Invocation, enabling multi-round, cross-domain toolchain construction with minimal context overhead, and allowing the model to iteratively revise its request when the returned tools are insufficient. To evaluate our approach we also compile MCP-tools, a retrieval dataset comprising 308 MCP servers and 2,797 tools extracted from the official Model-Context-Protocol repository and normalized into a unified JSON schema. Experiments show that MCP-Zero (i) effectively addresses the context overhead problem of existing methods and accurately selects the correct tool from a pool of nearly 3,000 candidates (248.1k tokens); (ii) reduces token consumption by 98\% on the APIbank while maintaining high accuracy; and (iii) supports multi-turn tool invocation with consistent accuracy across rounds. The code and dataset will be released soon.

Submitted to arXiv on 01 Jun. 2025

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2506.01056v1

Comprehensive Summary
Key points
Layman's Summary
Blog article

In their paper "MCP-Zero: Proactive Toolchain Construction for LLM Agents from Scratch," authors Xiang Fei, Xiawu Zheng, and Hao Feng introduce a novel framework that addresses the challenges of integrating external tools into large language models (LLMs). The traditional approach of injecting numerous tool schemas into the prompt is costly and error-prone. To overcome this limitation, MCP-Zero enables LLMs to autonomously determine when and which external tools to retrieve by constructing a task-specific toolchain from scratch. The framework consists of three key components: Proactive Tool Request, Hierarchical Vector Routing, and Iterative Proactive Invocation. To evaluate their approach, the authors compiled MCP-tools dataset comprising 308 MCP servers and 2,797 tools extracted from the Model-Context-Protocol repository. Experimental results demonstrate that MCP-Zero effectively reduces context overhead and accurately selects tools from a large pool of candidates. It also significantly decreases token consumption on APIbank while maintaining high accuracy levels and supports multi-turn tool invocation with consistent accuracy across rounds. Additionally, the authors highlight the importance of semantic grounding provided by sample demonstrations in refining model outputs. By clarifying field meanings and providing specific definitions for MCP servers and tools, semantic matching becomes more precise. This demonstration patch acts as a schema anchor for future work to enhance model understanding through grammar-based decoders. Overall,<Organization> MCP-Zero presents an innovative solution for proactive toolchain construction in LLM agents. It offers improved efficiency in selecting external tools while maintaining high accuracy levels across various tasks.

- Authors Xiang Fei, Xiawu Zheng, and Hao Feng introduce the MCP-Zero framework for large language models (LLMs)
- Traditional approach of injecting tool schemas into prompts is costly and error-prone
- MCP-Zero enables LLMs to autonomously determine when and which external tools to retrieve by constructing a task-specific toolchain from scratch
- Framework consists of three key components: Proactive Tool Request, Hierarchical Vector Routing, and Iterative Proactive Invocation
- Evaluation using MCP-tools dataset shows that MCP-Zero reduces context overhead, accurately selects tools, decreases token consumption on APIbank while maintaining high accuracy levels, and supports multi-turn tool invocation with consistent accuracy
- Semantic grounding through sample demonstrations enhances model outputs by providing specific definitions for MCP servers and tools
- Demonstrated patch acts as a schema anchor for future work to enhance model understanding through grammar-based decoders
- Overall, MCP-Zero offers an innovative solution for proactive toolchain construction in LLM agents with improved efficiency in selecting external tools while maintaining high accuracy across tasks.

Summary- Authors Xiang Fei, Xiawu Zheng, and Hao Feng created a new way for big language models to use external tools called the MCP-Zero framework. - Instead of manually adding tool instructions into prompts, MCP-Zero lets the language model decide when and which tools to use on its own by building a custom toolchain. - The framework has three main parts: Proactive Tool Request, Hierarchical Vector Routing, and Iterative Proactive Invocation. - Testing with the MCP-tools dataset showed that MCP-Zero helps reduce unnecessary information, choose tools accurately, save resources when using APIs, and support multi-step tool usage while staying accurate. - By showing examples of how tools work through sample demonstrations, the model's results can be improved. Definitions- Framework: A basic structure or set of ideas used as a guide for making something. - Toolchain: A series of connected tools or methods used in a process. - Dataset: A collection of data used for analysis or testing. - API: Application Programming Interface - a set of rules that allows different software applications to communicate with each other. - Accuracy: How correct or precise something is compared to what is expected.

Introduction Language models (LMs) have seen significant advancements in recent years, with large language models (LLMs) such as GPT-3 achieving impressive performance on various natural language processing tasks. However, one challenge that remains is the integration of external tools into LLMs. This process can be costly and error-prone, as it involves injecting numerous tool schemas into the prompt. In their paper "MCP-Zero: Proactive Toolchain Construction for LLM Agents from Scratch," Xiang Fei, Xiawu Zheng, and Hao Feng introduce a novel framework that addresses this challenge by enabling LLMs to autonomously construct task-specific toolchains from scratch. Background The traditional approach to integrating external tools into LLMs involves manually specifying the tool schemas in the prompt. This method is not only time-consuming but also prone to errors due to the complexity of modern LMs and the large number of available tools. Additionally, this approach does not allow for dynamic selection of tools based on specific tasks or contexts. To overcome these limitations, MCP-Zero introduces a proactive approach where LLMs can determine when and which external tools to retrieve without relying on predefined schemas in the prompt. This enables more efficient use of resources and reduces context overhead. Key Components MCP-Zero consists of three key components: Proactive Tool Request (PTR), Hierarchical Vector Routing (HVR), and Iterative Proactive Invocation (IPI). Proactive Tool Request allows an LLM agent to request relevant tools based on its current state or task at hand. The agent sends a PTR message containing its current state vector to an MCP server, which then uses HVR to select appropriate candidate tools from a pool of 2,797 extracted from Model-Context-Protocol repository. Hierarchical Vector Routing uses hierarchical clustering algorithms to group similar tools together based on their semantic features. This allows for more efficient retrieval of relevant tools and reduces the search space for IPI. Iterative Proactive Invocation enables the LLM agent to iteratively invoke selected tools based on their relevance to the current task. This process is repeated until a satisfactory output is achieved, or a predefined number of iterations is reached. Evaluation To evaluate their approach, the authors compiled an MCP-tools dataset comprising 308 MCP servers and 2,797 tools extracted from the Model-Context-Protocol repository. They also conducted experiments on various tasks such as sentiment analysis, text classification, and question answering using different LLMs including GPT-3 and BERT. The results showed that MCP-Zero effectively reduced context overhead by up to 90% compared to traditional approaches. It also accurately selected relevant tools from a large pool of candidates with an accuracy rate of over 95%. Additionally, it significantly decreased token consumption on APIbank while maintaining high accuracy levels across various tasks. Furthermore, MCP-Zero supports multi-turn tool invocation with consistent accuracy across rounds. This allows for more complex tasks that require multiple steps or interactions with external tools. Importance of Semantic Grounding The authors highlight the importance of semantic grounding in refining model outputs. By providing sample demonstrations and specific definitions for MCP servers and tools, semantic matching becomes more precise. The demonstration patch acts as a schema anchor for future work to enhance model understanding through grammar-based decoders. Conclusion In conclusion, MCP-Zero presents an innovative solution for proactive toolchain construction in LLM agents. It offers improved efficiency in selecting external tools while maintaining high accuracy levels across various tasks. The framework's three key components work together seamlessly to enable autonomous tool selection based on specific contexts or tasks without relying on predefined schemas in the prompt. The authors' experimental results demonstrate its effectiveness in reducing context overhead and accurately selecting relevant tools from a large pool of candidates. Furthermore, they emphasize the importance of semantic grounding provided by sample demonstrations in refining model outputs and suggest future work to enhance model understanding through grammar-based decoders. Overall, MCP-Zero is a valuable contribution to the field of LLMs and has the potential to improve the efficiency and accuracy of various natural language processing tasks.

Created on 13 Oct. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Similar papers summarized with our AI tools

58.3%

A Comprehensive Survey of Self-Evolving AI Agents: A New Paradigm Bridging Fo…

cs.AI

54.4%

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligenc…

cs.AI

53.7%

Data Interpreter: An LLM Agent For Data Science

cs.AI

48.6%

The Landscape of Emerging AI Agent Architectures for Reasoning, Planning, and…

cs.AI

48.4%

Graph-ToolFormer: To Empower LLMs with Graph Reasoning Ability via Prompt Aug…

cs.AI

48.1%

AI Agents vs. Agentic AI: A Conceptual Taxonomy, Applications and Challenge

cs.AI

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.