Pfeed: Generating near real-time personalized feeds using precomputed embedding similarities

AI-generated keywords: personalized recommender systems embeddings approximate nearest neighbor search dynamic feed generation e-commerce platforms

AI-generated Key Points

Personalized recommender systems commonly use embeddings to encode customer actions and items
Retrieval is done in the embedding space through approximate nearest neighbor search
Challenges include restrictions on diversity within user embeddings and costly real-time infrastructure maintenance
A method was proposed to address these challenges by dynamically updating customer profiles and generating personalized feeds every two minutes using precomputed embeddings
The method was implemented at Bol, an e-commerce platform, resulting in enhanced customer engagement and a 4.9% uplift in conversions
Historical customer interactions data is used with an encoder model to generate query embeddings for items, which are matched against item embeddings using ANNS for recommendations
The system incorporates ranking algorithms, diversity considerations, business rules integration, and merging recommendations based on recent interactions
By updating near real-time every two minutes, the method successfully delivers tailored feeds while addressing concerns about diverse user interests and minimizing infrastructure costs

Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Binyam Gebre, Karoliina Ranta, Stef van den Elzen, Ernst Kuiper, Thijs Baars, Tom Heskes

arXiv: 2402.16073v2 - DOI (cs.IR)

9 pages, 8 figures

License: CC BY 4.0

Abstract: In personalized recommender systems, embeddings are often used to encode customer actions and items, and retrieval is then performed in the embedding space using approximate nearest neighbor search. However, this approach can lead to two challenges: 1) user embeddings can restrict the diversity of interests captured and 2) the need to keep them up-to-date requires an expensive, real-time infrastructure. In this paper, we propose a method that overcomes these challenges in a practical, industrial setting. The method dynamically updates customer profiles and composes a feed every two minutes, employing precomputed embeddings and their respective similarities. We tested and deployed this method to personalise promotional items at Bol, one of the largest e-commerce platforms of the Netherlands and Belgium. The method enhanced customer engagement and experience, leading to a significant 4.9% uplift in conversions.

Submitted to arXiv on 25 Feb. 2024

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2402.16073v2

Comprehensive Summary
Key points
Layman's Summary
Blog article

In personalized recommender systems, embeddings are commonly utilized to encode customer actions and items. Retrieval is then conducted in the embedding space through approximate nearest neighbor search. However, this approach presents challenges such as potential restrictions on the diversity of captured interests within user embeddings and the necessity for costly real-time infrastructure to maintain their up-to-date status. To address these issues in a practical industrial setting, a method was proposed that dynamically updates customer profiles and generates personalized feeds every two minutes by leveraging precomputed embeddings and their corresponding similarities. This method was implemented and tested at Bol, a prominent e-commerce platform serving the Netherlands and Belgium, to personalize promotional items. The results were promising with enhanced customer engagement and experience leading to a notable 4.9% uplift in conversions. The process involved utilizing historical customer interactions data alongside an encoder model to generate query embeddings for items in the catalogue. These query embeddings were then matched against item embeddings stored in an index vector database using approximate nearest neighbor search (ANNS) to retrieve relevant recommendations. Furthermore, the system incorporated elements of ranking algorithms, diversity considerations, business rules integration, and merging recommendations based on recent customer interactions. By combining these components effectively within a near real-time framework that updates every two minutes, the method successfully delivered tailored feeds to users while addressing concerns related to maintaining diverse user interests and minimizing infrastructure costs associated with real-time updates. Authors Binyam Gebre, Karoliina Ranta, Stef van den Elzen, Ernst Kuiper, Thijs Baars, and Tom Heskes present this innovative approach in their paper titled "Pfeed: Generating near real-time personalized feeds using precomputed embedding similarities," offering valuable insights for enhancing personalization strategies in e-commerce platforms through efficient use of embedding techniques and dynamic feed generation processes.

- Personalized recommender systems commonly use embeddings to encode customer actions and items
- Retrieval is done in the embedding space through approximate nearest neighbor search
- Challenges include restrictions on diversity within user embeddings and costly real-time infrastructure maintenance
- A method was proposed to address these challenges by dynamically updating customer profiles and generating personalized feeds every two minutes using precomputed embeddings
- The method was implemented at Bol, an e-commerce platform, resulting in enhanced customer engagement and a 4.9% uplift in conversions
- Historical customer interactions data is used with an encoder model to generate query embeddings for items, which are matched against item embeddings using ANNS for recommendations
- The system incorporates ranking algorithms, diversity considerations, business rules integration, and merging recommendations based on recent interactions
- By updating near real-time every two minutes, the method successfully delivers tailored feeds while addressing concerns about diverse user interests and minimizing infrastructure costs

Summary1. Recommender systems help suggest things you might like based on what you've done before. 2. They find these suggestions by looking at a special way of representing your actions and the things you like. 3. Sometimes it's hard to make sure the suggestions are different enough and don't cost too much to do quickly. 4. A new way was made to solve these problems by updating your profile often and making personalized suggestions every two minutes. 5. This new way was used in a shop called Bol and made customers more interested, leading to more people buying things. Definitions- Personalized: Made just for you - Recommender systems: Tools that suggest things you might like - Embeddings: Special codes that represent actions or items - Nearest neighbor search: Finding similar things - Infrastructure maintenance: Taking care of all the technology needed - Precomputed: Figured out in advance - Conversions: People buying things after seeing them - Interactions data: Information about what people do online

Personalization has become a crucial aspect of e-commerce platforms, with the aim of providing tailored recommendations to customers based on their interests and preferences. In order to achieve this, personalized recommender systems often utilize embeddings to encode customer actions and items. However, this approach presents challenges such as limited diversity in captured interests within user embeddings and the need for costly real-time infrastructure updates. To address these issues in a practical industrial setting, a team of researchers from Bol, a prominent e-commerce platform serving the Netherlands and Belgium, proposed a method that dynamically updates customer profiles and generates personalized feeds every two minutes by leveraging precomputed embeddings and their corresponding similarities. This innovative approach was presented in their paper titled "Pfeed: Generating near real-time personalized feeds using precomputed embedding similarities." The research team consisted of Binyam Gebre, Karoliina Ranta, Stef van den Elzen, Ernst Kuiper, Thijs Baars, and Tom Heskes. Their work offers valuable insights into enhancing personalization strategies in e-commerce platforms through efficient use of embedding techniques and dynamic feed generation processes. The Methodology The proposed method involved utilizing historical customer interactions data alongside an encoder model to generate query embeddings for items in the catalogue. These query embeddings were then matched against item embeddings stored in an index vector database using approximate nearest neighbor search (ANNS) to retrieve relevant recommendations. In addition to ANNS, the system also incorporated elements of ranking algorithms, diversity considerations, business rules integration, and merging recommendations based on recent customer interactions. By combining these components effectively within a near real-time framework that updates every two minutes, the method successfully delivered tailored feeds to users while addressing concerns related to maintaining diverse user interests and minimizing infrastructure costs associated with real-time updates. Implementation at Bol To test the effectiveness of their method in a practical industrial setting, the research team implemented it at Bol – one of Europe's largest online retailers. The platform serves millions of customers in the Netherlands and Belgium, making it an ideal testing ground for their approach. The team used a dataset consisting of customer interactions with items on the Bol platform over a period of three months. This data was used to train an encoder model that generated query embeddings for each item in the catalogue. These embeddings were then stored in an index vector database along with their corresponding similarities. Results and Impact The results of implementing this method at Bol were promising, with enhanced customer engagement and experience leading to a notable 4.9% uplift in conversions. The dynamic feed generation process every two minutes ensured that customers received personalized recommendations based on their recent interactions, resulting in increased user satisfaction and retention. Moreover, by leveraging precomputed embeddings and ANNS, the system was able to efficiently retrieve relevant recommendations without the need for costly real-time infrastructure updates. This helped reduce operational costs for Bol while still providing high-quality personalized feeds to its customers. Conclusion In conclusion, the paper "Pfeed: Generating near real-time personalized feeds using precomputed embedding similarities" presents a practical solution for addressing challenges faced by personalized recommender systems such as limited diversity within user embeddings and costly real-time infrastructure updates. By leveraging precomputed embeddings and ANNS within a dynamic feed generation process every two minutes, this method successfully delivered tailored feeds to users while improving customer engagement and conversion rates at Bol – one of Europe's largest e-commerce platforms.

Created on 03 Jun. 2024

Assess the quality of the AI-generated content by voting

Score: 0

Similar papers summarized with our AI tools

60.7%

Pre-training Tasks for User Intent Detection and Embedding Retrieval in E-com…

cs.IR

58.6%

Recommendation as Instruction Following: A Large Language Model Empowered Rec…

cs.IR

57.9%

SPAR: Personalized Content-Based Recommendation via Long Engagement Attention

cs.IR

57.1%

Retrieve Anything To Augment Large Language Models

cs.IR

56.4%

Page-level Optimization of e-Commerce Item Recommendations

cs.IR

56.4%

An Incremental Update Framework for Online Recommenders with Data-Driven Prior

cs.IR

56.2%

Recommendation Unlearning

cs.IR

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.