Enhancing Relevance of Embedding-based Retrieval at Walmart

AI-generated keywords: Embedding-based Neural Retrieval (EBR) Relevance Reward Model (RRM) product search retrieval accuracy customer shopping experience

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Research focuses on enhancing relevance of Embedding-based Neural Retrieval (EBR) at Walmart for product search
  • Initial implementation of EBR system at Walmart showed promising results in improving relevance and add-to-cart rates
  • Challenges included relevance degradation due to false positives/negatives in training data and difficulties in handling query misspellings
  • Proposed approaches to strengthen EBR model capabilities, including:
  • Introduction of Relevance Reward Model (RRM) based on human relevance feedback
  • Techniques like typo-aware training and semi-positive generation employed to enhance performance
  • Strategies aim to improve retrieval accuracy by addressing common issues encountered during product search queries
  • Effectiveness of enhancements validated through offline relevance evaluation, online AB tests, and successful deployments in live production environments
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Juexin Lin, Sachin Yadav, Feng Liu, Nicholas Rossi, Praveen Reddy Suram, Satya Chembolu, Prijith Chandran, Hrushikesh Mohapatra, Tony Lee, Alessandro Magnani, Ciya Liao

8 pages, 3 figures, CIKM 2024

Abstract: Embedding-based neural retrieval (EBR) is an effective search retrieval method in product search for tackling the vocabulary gap between customer search queries and products. The initial launch of our EBR system at Walmart yielded significant gains in relevance and add-to-cart rates [1]. However, despite EBR generally retrieving more relevant products for reranking, we have observed numerous instances of relevance degradation. Enhancing retrieval performance is crucial, as it directly influences product reranking and affects the customer shopping experience. Factors contributing to these degradations include false positives/negatives in the training data and the inability to handle query misspellings. To address these issues, we present several approaches to further strengthen the capabilities of our EBR model in terms of retrieval relevance. We introduce a Relevance Reward Model (RRM) based on human relevance feedback. We utilize RRM to remove noise from the training data and distill it into our EBR model through a multi-objective loss. In addition, we present the techniques to increase the performance of our EBR model, such as typo-aware training, and semi-positive generation. The effectiveness of our EBR is demonstrated through offline relevance evaluation, online AB tests, and successful deployments to live production. [1] Alessandro Magnani, Feng Liu, Suthee Chaidaroon, Sachin Yadav, Praveen Reddy Suram, Ajit Puthenputhussery, Sijie Chen, Min Xie, Anirudh Kashi, Tony Lee, et al. 2022. Semantic retrieval at walmart. In Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining. 3495-3503.

Submitted to arXiv on 09 Aug. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2408.04884v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

The research presented in this study focuses on enhancing the relevance of Embedding-based Neural Retrieval (EBR) at Walmart for product search. The initial implementation of the EBR system at Walmart showed promising results in improving relevance and add-to-cart rates. However, there were instances of relevance degradation due to factors such as false positives/negatives in training data and difficulties in handling query misspellings. To address these challenges, the researchers proposed several approaches to strengthen the capabilities of the EBR model in terms of retrieval relevance. One key contribution is the introduction of a Relevance Reward Model (RRM) based on human relevance feedback. This model helps filter out noise from training data and incorporates it into the EBR model through a multi-objective loss function. Additionally, techniques like typo-aware training and semi-positive generation were employed to enhance the performance of the EBR model further. These strategies aim to improve retrieval accuracy by addressing common issues encountered during product search queries. The effectiveness of these enhancements was validated through offline relevance evaluation, online AB tests, and successful deployments in live production environments. The study showcases how refining the EBR model can lead to significant improvements in retrieval relevance, ultimately enhancing the overall customer shopping experience at Walmart.
Created on 26 Sep. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.