Towards Agentic RAG with Deep Reasoning: A Survey of RAG-Reasoning Systems in LLMs

AI-generated keywords: Retrieval-Augmented Generation, Reasoning, Large Language Models, Multimodality, Trustworthiness

AI-generated Key Points

  • Comprehensive survey on Retrieval-Augmented Generation (RAG) and reasoning with Large Language Models (LLMs)
  • Synthesis of over 200 research papers to create a unified taxonomy for advanced reasoning techniques in RAG
  • Categorization into three main frameworks: Reasoning-Enhanced RAG, RAG-Enhanced Reasoning, and Synergized RAG-Reasoning systems
  • Emphasis on optimizing each stage of RAG through multi-step reasoning and leveraging retrieved knowledge for complex inference tasks
  • Future research focus on genuine multimodality with Multi-modal Large Language Models (MLLMs) and agentic capabilities through hybrid-modal chain-of-thought reasoning
  • Importance of retrieval trustworthiness for reliable downstream reasoning in Synergized RAG-Reasoning systems
  • Suggestions for enhancing system traceability with techniques like watermarking and digital fingerprinting
  • Need for dynamic and adaptive methods to combat adversarial attacks and ensure system robustness
  • Tight coupling between retrieval and reasoning improves factual grounding, logical coherence, and adaptability in LLMs

Authors: Yangning Li, Weizhi Zhang, Yuyao Yang, Wei-Chieh Huang, Yaozu Wu, Junyu Luo, Yuanchen Bei, Henry Peng Zou, Xiao Luo, Yusheng Zhao, Chunkit Chan, Yankai Chen, Zhongfen Deng, Yinghui Li, Hai-Tao Zheng, Dongyuan Li, Renhe Jiang, Ming Zhang, Yangqiu Song, Philip S. Yu

Submitted to ARR May
License: CC BY 4.0

Abstract: Retrieval-Augmented Generation (RAG) lifts the factuality of Large Language Models (LLMs) by injecting external knowledge, yet it falls short on problems that demand multi-step inference; conversely, purely reasoning-oriented approaches often hallucinate or mis-ground facts. This survey synthesizes both strands under a unified reasoning-retrieval perspective. We first map how advanced reasoning optimizes each stage of RAG (Reasoning-Enhanced RAG). Then, we show how retrieved knowledge of different types supplies missing premises and expands context for complex inference (RAG-Enhanced Reasoning). Finally, we spotlight emerging Synergized RAG-Reasoning frameworks, where (agentic) LLMs iteratively interleave search and reasoning to achieve state-of-the-art performance across knowledge-intensive benchmarks. We categorize methods, datasets, and open challenges, and outline research avenues toward deeper RAG-Reasoning systems that are more effective, multimodally-adaptive, trustworthy, and human-centric. The collection is available at https://github.com/DavidZWZ/Awesome-RAG-Reasoning.

Submitted to arXiv on 13 Jul. 2025

Results of the summarizing process for the arXiv paper: 2507.09477v1

This comprehensive survey explores the intersection of Retrieval-Augmented Generation (RAG) and reasoning with Large Language Models (LLMs). The authors synthesize over 200 research papers to provide a unified taxonomy that encompasses advanced reasoning techniques in RAG as well as the integration of retrieved knowledge for complex inference tasks. The survey prioritizes breadth over depth and categorizes methods into three main frameworks: Reasoning-Enhanced RAG, RAG-Enhanced Reasoning, and Synergized RAG-Reasoning systems. Respectively, these frameworks focus on optimizing each stage of RAG through multi-step reasoning, leveraging retrieved knowledge for complex inference tasks, and combining search and reasoning iteratively to achieve state-of-the-art performance across knowledge-intensive benchmarks.

The authors also highlight the need for future research to move beyond traditional vision-text paradigms towards genuine multimodality by strengthening foundational abilities of Multi-modal Large Language Models (MLLMs) such as grounding and cross-modal reasoning. They also emphasize enhancing agentic capabilities through hybrid-modal chain-of-thought reasoning for real-world interaction via multimodal search tools.

Furthermore, retrieval trustworthiness is crucial in maintaining reliable downstream reasoning in Synergized RAG-Reasoning systems. Techniques like watermarking and digital fingerprinting are suggested to enhance system traceability. Future research should focus on developing dynamic and adaptive methods to combat adversarial attacks and ensure system robustness.

In conclusion, this survey charts the rapid convergence of retrieval and reasoning in LLMs, showcasing how tight coupling between retrieval and reasoning improves factual grounding, logical coherence, and adaptability. The authors identify research avenues towards deeper RAG-Reasoning systems that are more effective, multimodally-adaptive, trustworthy, and human-centric.
The collection of resources related to this survey can be found at https://github.com/DavidZWZ/Awesome-RAG-Reasoning.
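
As an illustration of the Synergized RAG-Reasoning pattern, in which a model alternates between searching and reasoning, the loop can be sketched roughly as follows. This is a minimal toy sketch and not a method taken from the survey: the corpus, the word-overlap `retrieve` function, and the rule-based `reason` function (a stand-in for an LLM call) are all hypothetical placeholders chosen so the example runs without any external service.

```python
# Toy corpus standing in for an external knowledge source (hypothetical data).
CORPUS = [
    "Marie Curie was born in Warsaw.",
    "Warsaw is the capital of Poland.",
    "Paris is the capital of France.",
]


def _words(text: str) -> set[str]:
    """Lowercase a string and split it into a set of words without punctuation."""
    return set(text.lower().replace("?", "").replace(".", "").split())


def retrieve(query: str, k: int = 1) -> list[str]:
    """Placeholder retriever: rank documents by word overlap with the query."""
    ranked = sorted(CORPUS, key=lambda d: len(_words(query) & _words(d)), reverse=True)
    return ranked[:k]


def reason(question: str, evidence: list[str]) -> tuple[str, str]:
    """Hypothetical stand-in for an LLM step.

    Returns ("search", next_query) when a premise is still missing,
    or ("answer", text) once the evidence supports a conclusion.
    """
    facts = " ".join(evidence)
    if "born in " not in facts:
        return "search", "Marie Curie born"
    city = facts.split("born in ")[1].split(".")[0]
    marker = f"{city} is the capital of "
    if marker not in facts:
        return "search", f"{city} capital"
    return "answer", facts.split(marker)[1].split(".")[0]


def answer(question: str, max_steps: int = 4):
    """Interleave reasoning and retrieval until an answer or the step budget."""
    evidence: list[str] = []
    trace: list[tuple[str, str]] = []
    for _ in range(max_steps):
        action, payload = reason(question, evidence)
        trace.append((action, payload))
        if action == "answer":
            return payload, trace
        # The reasoning step asked for more knowledge: fetch it and continue.
        for doc in retrieve(payload):
            if doc not in evidence:
                evidence.append(doc)
    return None, trace


result, trace = answer("Which country's capital is Marie Curie's birthplace?")
print(result, trace)
```

In this sketch each reasoning step decides what is still missing, and each retrieval step supplies the next premise, so the two-hop question is resolved with two searches followed by one answer. A real system would replace `reason` with a model call and `retrieve` with an actual search index.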
Created on 31 Jul. 2025
