BRIEF-Pro: Universal Context Compression with Short-to-Long Synthesis for Fast and Accurate Multi-Hop Reasoning

AI-generated keywords: BRIEF-Pro

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Authors introduce BRIEF-Pro as a solution for challenges in retrieval-augmented generation (RAG) for multi-hop reasoning tasks
  • BRIEF-Pro is a universal and lightweight compressor that distills relevant evidence from retrieved documents into concise summaries
  • The model is trained on seed data with short contexts (fewer than 1k words) to compress extended contexts exceeding 10k words
  • Users can control the length of the summary by specifying the desired number of sentences
  • Experiments on four open-domain multi-hop question-answering datasets show BRIEF-Pro generates more concise and relevant summaries than existing methods, enhancing performance across small, large, and proprietary language models
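
To make the compress-then-read idea in the key points concrete, here is a minimal Python sketch of where a query-aware compressor with a sentence budget would sit in a RAG pipeline. It is not BRIEF-Pro: the paper describes a trained abstractive compressor, whereas `toy_compress` below is a hypothetical extractive stand-in that ranks sentences by word overlap with the query. All function names and the example documents are assumptions made for illustration.

```python
import re


def split_sentences(text: str) -> list[str]:
    """Naive splitter: break after '.', '!' or '?' followed by whitespace."""
    return [s.strip() for s in re.split(r"(?<=[.!?])\s+", text) if s.strip()]


def toy_compress(query: str, documents: list[str], num_sentences: int) -> str:
    """Hypothetical stand-in for a compressor (NOT the BRIEF-Pro model).

    Keeps the `num_sentences` sentences that share the most words with the
    query, preserving their original order.
    """
    query_words = set(re.findall(r"\w+", query.lower()))
    sentences = [s for doc in documents for s in split_sentences(doc)]
    scored = []
    for idx, sentence in enumerate(sentences):
        words = set(re.findall(r"\w+", sentence.lower()))
        scored.append((len(query_words & words), idx))
    # Highest overlap first; ties are broken by earlier position.
    top = sorted(scored, key=lambda pair: (-pair[0], pair[1]))[:num_sentences]
    keep = sorted(idx for _, idx in top)
    return " ".join(sentences[i] for i in keep)


def compression_ratio(documents: list[str], summary: str) -> float:
    """Word-level ratio of original context length to summary length."""
    original = sum(len(doc.split()) for doc in documents)
    return original / max(len(summary.split()), 1)


# Illustrative inputs (invented for this sketch).
docs = [
    "Marie Curie was born in Warsaw. She won two Nobel Prizes. "
    "Warsaw is the capital of Poland.",
    "The Vistula flows through Warsaw. Paris hosts the Curie Institute. "
    "Poland joined the European Union in 2004.",
]
query = "In which country was Marie Curie born?"

# The caller sets the sentence budget, mirroring the user-controlled length.
summary = toy_compress(query, docs, num_sentences=2)
print(summary)
print(f"compression ratio: {compression_ratio(docs, summary):.1f}x")
```

In a full pipeline, the summary would replace the retrieved documents in the reader model's prompt. The toy ranking here drops the bridging sentence about Poland, since that sentence shares no words with the query. This kind of second-hop evidence is what a learned compressor for multi-hop questions would need to retain.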

Authors: Jia-Chen Gu, Junyi Zhang, Di Wu, Yuankai Li, Kai-Wei Chang, Nanyun Peng

Code and data: https://github.com/JasonForJoy/BRIEF

Abstract: As retrieval-augmented generation (RAG) tackles complex tasks, increasingly expanded contexts offer richer information, but at the cost of higher latency and increased cognitive load on the model. To mitigate this bottleneck, especially for intricate multi-hop questions, we introduce BRIEF-Pro. It is a universal, lightweight compressor that distills relevant evidence for a given query from retrieved documents into a concise summary for seamless integration into in-context RAG. Using seed data consisting of relatively short contexts (fewer than 1k words), BRIEF-Pro is trained to perform abstractive compression of extended contexts exceeding 10k words across a wide range of scenarios. Furthermore, BRIEF-Pro offers flexible user control over summary length by allowing users to specify the desired number of sentences. Experiments on four open-domain multi-hop question-answering datasets show that BRIEF-Pro generates more concise and relevant summaries, enhancing performance across small, large, and proprietary language models. With the 70B reader model, 32x compression by BRIEF-Pro improves QA performance by 4.67% on average over LongLLMLingua's 9x, while requiring only 23% of its computational overhead.

Submitted to arXiv on 15 Oct. 2025

Results of the summarizing process for the arXiv paper: 2510.13799v1

In their paper titled "BRIEF-Pro: Universal Context Compression with Short-to-Long Synthesis for Fast and Accurate Multi-Hop Reasoning," authors Jia-Chen Gu, Junyi Zhang, Di Wu, Yuankai Li, Kai-Wei Chang, and Nanyun Peng address a bottleneck that arises when retrieval-augmented generation (RAG) is applied to complex tasks. They note that expanded contexts provide richer information but increase latency and the cognitive load on the model. To mitigate this, especially for intricate multi-hop questions, the authors propose BRIEF-Pro.

BRIEF-Pro is described as a universal, lightweight compressor that distills the evidence relevant to a given query from retrieved documents into a concise summary that integrates seamlessly into in-context RAG. The model is trained using seed data with relatively short contexts (fewer than 1k words) to perform abstractive compression of extended contexts exceeding 10k words across a wide range of scenarios. BRIEF-Pro also lets users control summary length by specifying the desired number of sentences.

The authors ran experiments on four open-domain multi-hop question-answering datasets. The results show that BRIEF-Pro generates more concise and relevant summaries than existing methods, enhancing performance across small, large, and proprietary language models. In particular, with the 70B reader model, 32x compression by BRIEF-Pro improves QA performance by 4.67% on average over LongLLMLingua's 9x compression, while requiring only 23% of its computational overhead. These results suggest that BRIEF-Pro can condense complex retrieved context efficiently for multi-hop reasoning.
Created on 17 Oct. 2025
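
As a rough sense of scale for the compression rates quoted above, the sketch below computes how many words would reach the reader model at each rate. The 10,000-word context is an illustrative assumption taken from the "exceeding 10k words" figure in the abstract, and measuring length in words (rather than tokens) is also an assumption. The outputs are plain arithmetic, not results reported by the paper.

```python
def compressed_length(original_words: int, ratio: int) -> int:
    """Approximate summary length in words at a given compression ratio."""
    return original_words // ratio


context_words = 10_000  # illustrative context size, not a reported figure

for label, ratio in [("32x", 32), ("9x", 9)]:
    words = compressed_length(context_words, ratio)
    print(f"{label} compression: about {words} words reach the reader")
```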
