From Local to Global: A Graph RAG Approach to Query-Focused Summarization

AI-generated keywords: GraphRAG

AI-generated Key Points

  • GraphRAG is a novel approach that combines retrieval-augmented generation (RAG) with knowledge graph generation and query-focused summarization (QFS) to facilitate human sensemaking over entire text corpora.
  • Initial evaluations show significant enhancements in comprehensiveness and diversity of answers compared to a vector RAG baseline.
  • GraphRAG outperforms a global but graph-free approach utilizing map-reduce source text summarization.
  • For datasets requiring numerous global queries, summaries of root-level communities in the entity-based graph index serve as a superior data index compared to vector RAG.
  • Opportunities for further refinement include exploring RAG schemes that operate locally through embedding-based matching of user queries and graph annotations, as well as hybrid RAG strategies combining embedding-based matching with just-in-time community report generation before employing map-reduce summarization mechanisms.
  • Considerations around the broader impacts of using GraphRAG for question answering over large document collections are crucial, including clear disclosures regarding AI use and potential errors in outputs to mitigate risks to downstream sensemaking and decision-making tasks.
  • GraphRAG presents a viable solution to address risks associated with questions of a global nature by providing more accurate representations of source data compared to vector RAG.

Authors: Darren Edge, Ha Trinh, Newman Cheng, Joshua Bradley, Alex Chao, Apurva Mody, Steven Truitt, Dasha Metropolitansky, Robert Osazuwa Ness, Jonathan Larson

License: CC BY 4.0

Abstract: The use of retrieval-augmented generation (RAG) to retrieve relevant information from an external knowledge source enables large language models (LLMs) to answer questions over private and/or previously unseen document collections. However, RAG fails on global questions directed at an entire text corpus, such as "What are the main themes in the dataset?", since this is inherently a query-focused summarization (QFS) task, rather than an explicit retrieval task. Prior QFS methods, meanwhile, do not scale to the quantities of text indexed by typical RAG systems. To combine the strengths of these contrasting methods, we propose GraphRAG, a graph-based approach to question answering over private text corpora that scales with both the generality of user questions and the quantity of source text. Our approach uses an LLM to build a graph index in two stages: first, to derive an entity knowledge graph from the source documents, then to pregenerate community summaries for all groups of closely related entities. Given a question, each community summary is used to generate a partial response, before all partial responses are again summarized in a final response to the user. For a class of global sensemaking questions over datasets in the 1 million token range, we show that GraphRAG leads to substantial improvements over a conventional RAG baseline for both the comprehensiveness and diversity of generated answers.
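
The query-time step described in the abstract (one partial response per community summary, then a final summarization over all partial responses) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: the function name answer_global_question, the prompt wording, and the llm callable (any function that takes a prompt string and returns text) are assumptions made for this example, and the community summaries are assumed to have been pregenerated already.

```python
from typing import Callable, List

# Hypothetical model interface: prompt text in, completion text out.
LLM = Callable[[str], str]


def answer_global_question(
    question: str, community_summaries: List[str], llm: LLM
) -> str:
    """Map each community summary to a partial answer, then reduce to one answer."""
    if not community_summaries:
        raise ValueError("at least one community summary is required")

    # Map step: generate one partial response per community summary.
    partial_answers = [
        llm(f"Question: {question}\nCommunity summary: {summary}\nPartial answer:")
        for summary in community_summaries
    ]

    # Reduce step: summarize all partial responses into a final response.
    combined = "\n".join(f"- {partial}" for partial in partial_answers)
    return llm(f"Question: {question}\nPartial answers:\n{combined}\nFinal answer:")


def toy_llm(prompt: str) -> str:
    """Hypothetical stand-in for a real model call; returns a short tag."""
    return f"<answer based on {len(prompt)} prompt characters>"


# Illustrative usage with made-up summaries.
final = answer_global_question(
    "What are the main themes in the dataset?",
    ["Summary of community 1", "Summary of community 2"],
    toy_llm,
)
print(final)
```

Passing the model as a plain callable keeps the sketch independent of any particular LLM client; a real system would also need to handle context-window limits when combining partial responses.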

Submitted to arXiv on 24 Apr. 2024

Results of the summarizing process for the arXiv paper: 2404.16130v2

In this study, we introduce GraphRAG, a novel approach that combines retrieval-augmented generation (RAG) with knowledge graph generation and query-focused summarization (QFS) to facilitate human sensemaking over entire text corpora. Our initial evaluations demonstrate significant enhancements in both the comprehensiveness and diversity of answers compared to a vector RAG baseline. Additionally, GraphRAG outperforms a global but graph-free approach utilizing map-reduce source text summarization.

One key advantage of GraphRAG is its ability to provide summaries of root-level communities in the entity-based graph index for datasets requiring numerous global queries. These community summaries serve as a superior data index compared to vector RAG and achieve competitive performance with other global methods at a fraction of the token cost.

Moving forward, there are opportunities for further refinement and adaptation of the GraphRAG approach. For instance, exploring RAG schemes that operate locally through embedding-based matching of user queries and graph annotations could enhance performance. Hybrid RAG strategies combining embedding-based matching with just-in-time community report generation before employing map-reduce summarization mechanisms show promise for future development.

Moreover, considerations around the broader impacts of using GraphRAG for question answering over large document collections are crucial. Clear disclosures regarding AI use and potential errors in outputs should accompany system utilization to mitigate risks to downstream sensemaking and decision-making tasks. Compared to vector RAG, GraphRAG presents a viable solution to address these risks for questions of a global nature by providing more accurate representations of source data.

Furthermore, our evaluation methodology includes a "control criterion" called Directness, which assesses how specifically and clearly an answer addresses the question. This criterion serves as a reference point against which other evaluation criteria can be judged. In our assessments, the LLM compares answers generated by different systems based on various criteria before giving a final judgment on the preferred answer or indicating a tie if they are similar.

Overall, GraphRAG offers an innovative approach to question answering over private text corpora by leveraging knowledge graphs and QFS techniques. With further research and development, this method has the potential to enhance sensemaking capabilities across diverse domains and use cases while addressing challenges associated with large-scale information retrieval tasks.
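
The pairwise comparison described above, in which an LLM judge names a preferred answer or a tie for each criterion, yields verdicts that can be aggregated into per-criterion win rates. The sketch below shows one way to do that tally. It is an assumption-laden illustration rather than the paper's scoring code: the function name win_rates, the "A"/"B"/"tie" verdict labels, and the choice to count a tie as half a win are all hypothetical, and the verdicts in the usage example are made up.

```python
from collections import defaultdict
from typing import Dict, Iterable, Tuple

# (criterion, verdict) where verdict is "A", "B", or "tie" (labels are assumptions).
Judgment = Tuple[str, str]


def win_rates(judgments: Iterable[Judgment], system: str = "A") -> Dict[str, float]:
    """Return the share of comparisons won by `system` for each criterion.

    Assumption for this sketch: a tie counts as half a win for each system.
    """
    if system not in {"A", "B"}:
        raise ValueError("system must be 'A' or 'B'")

    points: Dict[str, float] = defaultdict(float)
    counts: Dict[str, int] = defaultdict(int)
    for criterion, verdict in judgments:
        if verdict not in {"A", "B", "tie"}:
            raise ValueError(f"unknown verdict: {verdict!r}")
        counts[criterion] += 1
        if verdict == system:
            points[criterion] += 1.0
        elif verdict == "tie":
            points[criterion] += 0.5
    return {criterion: points[criterion] / counts[criterion] for criterion in counts}


# Illustrative usage with made-up verdicts, not results from the paper.
example = [
    ("comprehensiveness", "A"),
    ("comprehensiveness", "tie"),
    ("directness", "B"),
]
print(win_rates(example, system="A"))
```

Keeping the control criterion in the same tally as the other criteria makes it easy to compare its win rate against theirs as a reference point.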
Created on 24 May 2025
