Generating Query-Relevant Document Summaries via Reinforcement Learning

AI-generated keywords: E-commerce search engines ranking models ReLSum framework reinforcement learning text summarization

AI-generated Key Points

  • Traditional reliance on product titles for ranking models in e-commerce search engines is insufficient in capturing query intent and delivering optimal relevance predictions.
  • Product descriptions are often too verbose and lengthy for real-time ranking, leading to challenges in achieving search relevance.
  • The novel reinforcement learning framework ReLSum aims to generate concise and query-relevant summaries of product descriptions efficiently.
  • ReLSum leverages relevance scores as rewards to align summarization and ranking objectives effectively, addressing misaligned learning targets seen in previous methods.
  • The framework utilizes a trainable large language model (LLM) to produce summaries that enhance the performance of cross-encoder ranking models.
  • ReLSum has shown significant improvements in offline metrics such as recall and NDCG, as well as online user engagement metrics.
  • Generating single summaries for each document or product through reinforcement learning can enhance performance on downstream tasks like search relevance.
  • Strategies for constructing training datasets have been included to facilitate efficient fine-tuning of summary-generating models within the ReLSum framework.
  • The paper compares Direct Preference Optimization (DPO) and Group Relative Policy Optimization (GRPO) methodologies for text summarization purposes, contributing to optimizing text summarization processes using advanced approaches like LLMs in zero-shot and few-shot settings.
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Nitin Yadav, Changsung Kang, Hongwei Shang, Ming Sun

License: CC BY 4.0

Abstract: E-commerce search engines often rely solely on product titles as input for ranking models with latency constraints. However, this approach can result in suboptimal relevance predictions, as product titles often lack sufficient detail to capture query intent. While product descriptions provide richer information, their verbosity and length make them unsuitable for real-time ranking, particularly for computationally expensive architectures like cross-encoder ranking models. To address this challenge, we propose ReLSum, a novel reinforcement learning framework designed to generate concise, query-relevant summaries of product descriptions optimized for search relevance. ReLSum leverages relevance scores as rewards to align the objectives of summarization and ranking, effectively overcoming limitations of prior methods, such as misaligned learning targets. The framework employs a trainable large language model (LLM) to produce summaries, which are then used as input for a cross-encoder ranking model. Experimental results demonstrate significant improvements in offline metrics, including recall and NDCG, as well as online user engagement metrics. ReLSum provides a scalable and efficient solution for enhancing search relevance in large-scale e-commerce systems.

Submitted to arXiv on 11 Aug. 2025

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2508.08404v1

In the realm of e-commerce search engines, traditional reliance on product titles for ranking models has proven insufficient in capturing query intent and delivering optimal relevance predictions. This is due to the fact that product descriptions, while richer in information, are often too verbose and lengthy for real-time ranking. This issue is further exacerbated by computationally expensive architectures like cross-encoder models. To address this challenge, a novel reinforcement learning framework called ReLSum has been introduced. This framework aims to generate concise and query-relevant summaries of product descriptions to enhance search relevance efficiently. By leveraging relevance scores as rewards to align summarization and ranking objectives effectively, ReLSum addresses issues such as misaligned learning targets seen in previous methods. One key aspect of the ReLSum framework is its use of a trainable large language model (LLM) to produce summaries that serve as input for cross-encoder ranking models. This approach has shown significant improvements in offline metrics such as recall and NDCG, as well as online user engagement metrics. The paper also delves into the concept of generating single summaries for each document or product through reinforcement learning with the ultimate goal of enhancing performance on downstream tasks such as search relevance. To facilitate efficient fine-tuning of summary-generating models within this framework, strategies for constructing training datasets have been included. Additionally, the paper compares two methodologies - Direct Preference Optimization (DPO) and Group Relative Policy Optimization (GRPO) - for summarization purposes. The exploration of these techniques contributes to advancing the field by optimizing text summarization processes using cutting-edge approaches like LLMs in zero-shot and few-shot settings. Overall, this comprehensive study not only sheds light on the challenges faced by e-commerce search engines but also presents innovative solutions through advanced reinforcement learning frameworks tailored for summarization tasks. By improving the quality of generated summaries and aligning them with ranking objectives, this research paves the way for enhanced search relevance in large-scale e-commerce systems while balancing performance with latency constraints effectively.
Created on 14 Aug. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.