Corrective Retrieval Augmented Generation

AI-generated keywords: Large Language Models (LLMs) Corrective Retrieval Augmented Generation (CRAG) retrieval-augmented generation (RAG) lightweight retrieval evaluator decompose-then-recompose algorithm

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • The paper introduces the concept of Corrective Retrieval Augmented Generation (CRAG) to improve text generation by large language models (LLMs).
  • CRAG addresses limitations of retrieval-augmented generation (RAG) by incorporating a lightweight retrieval evaluator.
  • CRAG leverages large-scale web searches to augment results from static and limited corpora.
  • CRAG utilizes a decompose-then-recompose algorithm to filter out irrelevant information and focus on key information.
  • The proposed approach can be easily integrated with various RAG-based approaches.
  • Experimental results on four datasets show that CRAG significantly improves performance for both short- and long-form generation tasks.
  • CRAG presents an innovative solution to enhance text generation by addressing challenges faced by LLMs.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Shi-Qi Yan, Jia-Chen Gu, Yun Zhu, Zhen-Hua Ling

Abstract: Large language models (LLMs) inevitably exhibit hallucinations since the accuracy of generated texts cannot be secured solely by the parametric knowledge they encapsulate. Although retrieval-augmented generation (RAG) is a practicable complement to LLMs, it relies heavily on the relevance of retrieved documents, raising concerns about how the model behaves if retrieval goes wrong. To this end, we propose the Corrective Retrieval Augmented Generation (CRAG) to improve the robustness of generation. Specifically, a lightweight retrieval evaluator is designed to assess the overall quality of retrieved documents for a query, returning a confidence degree based on which different knowledge retrieval actions can be triggered. Since retrieval from static and limited corpora can only return sub-optimal documents, large-scale web searches are utilized as an extension for augmenting the retrieval results. Besides, a decompose-then-recompose algorithm is designed for retrieved documents to selectively focus on key information and filter out irrelevant information in them. CRAG is plug-and-play and can be seamlessly coupled with various RAG-based approaches. Experiments on four datasets covering short- and long-form generation tasks show that CRAG can significantly improve the performance of RAG-based approaches.

Submitted to arXiv on 29 Jan. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2401.15884v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

The paper introduces the concept of Corrective Retrieval Augmented Generation (CRAG) as a solution to improve the accuracy and reliability of text generation by large language models (LLMs). CRAG addresses the limitations of retrieval-augmented generation (RAG) by incorporating a lightweight retrieval evaluator that assesses the quality of retrieved documents and triggers different knowledge retrieval actions. It also leverages large-scale web searches to augment the results from static and limited corpora. Additionally, CRAG utilizes a decompose-then-recompose algorithm to filter out irrelevant information and focus on key information in retrieved documents. The proposed approach can be easily integrated with various RAG-based approaches, making it a plug-and-play solution. Experimental results on four datasets show that CRAG significantly improves the performance of RAG-based approaches for both short- and long-form generation tasks. In summary, this paper presents an innovative solution to enhance text generation by addressing the challenges faced by LLMs through corrective retrieval augmented generation.
Created on 06 Feb. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.