Corrective Retrieval Augmented Generation

AI-generated keywords: Large language models

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Hallucinations persist in large language models (LLMs) due to limitations in ensuring text accuracy based on parametric knowledge alone
  • Retrieval-augmented generation (RAG) is a potential solution to complement LLMs, but its effectiveness depends on the quality of retrieved documents
  • Corrective Retrieval Augmented Generation (CRAG) aims to enhance text generation robustness by evaluating and improving the quality of retrieved documents
  • CRAG integrates large-scale web searches to enhance retrieval quality and uses a decompose-then-recompose algorithm to focus on key information while filtering out irrelevant details
  • Experimental evaluations show significant performance improvements with CRAG compared to traditional RAG-based methods
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Shi-Qi Yan, Jia-Chen Gu, Yun Zhu, Zhen-Hua Ling

Update results, add more analysis, and fix typos

Abstract: Large language models (LLMs) inevitably exhibit hallucinations since the accuracy of generated texts cannot be secured solely by the parametric knowledge they encapsulate. Although retrieval-augmented generation (RAG) is a practicable complement to LLMs, it relies heavily on the relevance of retrieved documents, raising concerns about how the model behaves if retrieval goes wrong. To this end, we propose the Corrective Retrieval Augmented Generation (CRAG) to improve the robustness of generation. Specifically, a lightweight retrieval evaluator is designed to assess the overall quality of retrieved documents for a query, returning a confidence degree based on which different knowledge retrieval actions can be triggered. Since retrieval from static and limited corpora can only return sub-optimal documents, large-scale web searches are utilized as an extension for augmenting the retrieval results. Besides, a decompose-then-recompose algorithm is designed for retrieved documents to selectively focus on key information and filter out irrelevant information in them. CRAG is plug-and-play and can be seamlessly coupled with various RAG-based approaches. Experiments on four datasets covering short- and long-form generation tasks show that CRAG can significantly improve the performance of RAG-based approaches.

Submitted to arXiv on 29 Jan. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2401.15884v3

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

, , , , In the realm of large language models (LLMs), the issue of hallucinations persists due to inherent limitations in ensuring the accuracy of generated texts solely based on their encapsulated parametric knowledge. While retrieval-augmented generation (RAG) presents itself as a viable solution to complement LLMs, its effectiveness heavily relies on the relevance and quality of retrieved documents. This raises concerns about potential model behavior deviations in cases where retrieval fails. To address this challenge, a novel approach known as Corrective Retrieval Augmented Generation (CRAG) is introduced with the primary objective of enhancing the robustness of text generation processes. The core concept behind CRAG involves implementing a lightweight retrieval evaluator specifically designed to assess the overall quality of retrieved documents for a given query. This evaluation process results in the generation of a confidence degree, which subsequently triggers various knowledge retrieval actions based on its value. Recognizing that traditional retrieval methods from static and limited corpora may yield suboptimal results, CRAG integrates large-scale web searches as an extension to augment and enhance the quality of retrieved information. Furthermore, CRAG incorporates a unique decompose-then-recompose algorithm that enables selective focus on key information within retrieved documents while filtering out irrelevant or extraneous details. This strategic approach ensures that only pertinent data is utilized in the generation process, thereby improving overall output quality. Notably, CRAG is designed to be seamlessly integrated with diverse RAG-based approaches, offering flexibility and adaptability across different applications. Experimental evaluations conducted across four datasets encompassing both short- and long-form generation tasks demonstrate significant performance improvements achieved by CRAG when compared to conventional RAG-based methodologies. By effectively addressing issues related to retrieval accuracy and document relevance, CRAG emerges as a promising solution for enhancing text generation capabilities within large language models.
Created on 11 Nov. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.