An Embedding-Based Grocery Search Model at Instacart

AI-generated keywords: Embedding-based model Grocery search Instacart E-commerce search optimization Self-adversarial learning

AI-generated Key Points

The license of the paper does not allow us to build upon its content and the key points are generated using the paper metadata rather than the full article.

  • Authors address the challenge of optimizing e-commerce search by leveraging large yet noisy log data
  • Introduce an embedding-based model for grocery search on Instacart platform
  • System employs a two-tower transformer-based encoder architecture to learn representations of user queries and product information
  • Focus on content-based features to overcome cold-start problem in e-commerce search engines
  • Propose self-adversarial learning method and cascade training approach to effectively train model on noisy data
  • Report significant 10% relative improvement in RECALL@20 metrics through rigorous testing on offline human evaluation dataset
  • Model demonstrates notable enhancements in online A/B testing scenarios: 4.1% increase in cart-adds per search (CAPS) and 1.5% boost in gross merchandise value (GMV)
  • Authors provide detailed insights into training and deployment of embedding-based search model, highlighting its effectiveness
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Yuqing Xie, Taesik Na, Xiao Xiao, Saurav Manchanda, Young Rao, Zhihong Xu, Guanghua Shu, Esther Vasiete, Tejaswi Tenneti, Haixun Wang

Accepted by SIGIR eCom, July 15, 2022

Abstract: The key to e-commerce search is how to best utilize the large yet noisy log data. In this paper, we present our embedding-based model for grocery search at Instacart. The system learns query and product representations with a two-tower transformer-based encoder architecture. To tackle the cold-start problem, we focus on content-based features. To train the model efficiently on noisy data, we propose a self-adversarial learning method and a cascade training method. AccOn an offline human evaluation dataset, we achieve 10% relative improvement in RECALL@20, and for online A/B testing, we achieve 4.1% cart-adds per search (CAPS) and 1.5% gross merchandise value (GMV) improvement. We describe how we train and deploy the embedding based search model and give a detailed analysis of the effectiveness of our method.

Submitted to arXiv on 12 Sep. 2022

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

The license of the paper does not allow us to build upon its content and the AI assistant only knows about the paper metadata rather than the full article.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2209.05555v1

This paper's license doesn't allow us to build upon its content and the summarizing process is here made with the paper's metadata rather than the article.

In their paper titled "An Embedding-Based Grocery Search Model at Instacart," authors Yuqing Xie, Taesik Na, Xiao Xiao, Saurav Manchanda, Young Rao, Zhihong Xu, Guanghua Shu, Esther Vasiete, Tejaswi Tenneti, and Haixun Wang address the challenge of optimizing e-commerce search by leveraging large yet noisy log data. The study introduces an embedding-based model specifically designed for grocery search on the Instacart platform. This system employs a two-tower transformer-based encoder architecture to learn representations of both user queries and product information. To overcome the cold-start problem commonly encountered in e-commerce search engines, the focus is placed on content-based features. To effectively train the model on noisy data, the researchers propose a self-adversarial learning method along with a cascade training approach. Through rigorous testing on an offline human evaluation dataset, they report a significant 10% relative improvement in RECALL@20 metrics. Furthermore, during online A/B testing scenarios, the model demonstrates notable enhancements with a 4.1% increase in cart-adds per search (CAPS) and a 1.5% boost in gross merchandise value (GMV). The authors delve into the details of how they trained and deployed this embedding-based search model while providing an insightful analysis of its effectiveness. Their findings shed light on the potential of utilizing advanced techniques to enhance e-commerce search functionalities and improve user experience within online grocery platforms like Instacart.
Created on 28 Feb. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

The previous summary was created more than a year ago and can be re-run (if necessary) by clicking on the Run button below.

The license of this specific paper does not allow us to build upon its content and the summarizing tools will be run using the paper metadata rather than the full article. However, it still does a good job, and you can also try our tools on papers with more open licenses.

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.