Monolith: Real Time Recommendation System With Collisionless Embedding Table

AI-generated keywords: Scalable

AI-generated Key Points

  • Having a scalable and real-time recommendation system is crucial in customer feedback-driven businesses
  • Traditional deep learning frameworks like TensorFlow or PyTorch may fall short for recommendation scenarios due to being optimized for static parameters and dense computations
  • Monolith is a novel approach tailored specifically for online training, addressing challenges of traditional frameworks by observing application workloads and production environments
  • Key contributions of Monolith include collisionless embedding table creation, memory-saving optimizations, robust online training architecture with high fault-tolerance capabilities
  • Monolith prioritizes real-time interactions over absolute reliability, successfully integrated into BytePlus Recommend product
Also access our AI generated: Comprehensive summary, Lay summary, Blog-like article; or ask questions about this paper to our AI assistant.

Authors: Zhuoran Liu, Leqi Zou, Xuan Zou, Caihua Wang, Biao Zhang, Da Tang, Bolin Zhu, Yijie Zhu, Peng Wu, Ke Wang, Youlong Cheng

ORSUM@ACM RecSys 2022
License: CC BY 4.0

Abstract: Building a scalable and real-time recommendation system is vital for many businesses driven by time-sensitive customer feedback, such as short-videos ranking or online ads. Despite the ubiquitous adoption of production-scale deep learning frameworks like TensorFlow or PyTorch, these general-purpose frameworks fall short of business demands in recommendation scenarios for various reasons: on one hand, tweaking systems based on static parameters and dense computations for recommendation with dynamic and sparse features is detrimental to model quality; on the other hand, such frameworks are designed with batch-training stage and serving stage completely separated, preventing the model from interacting with customer feedback in real-time. These issues led us to reexamine traditional approaches and explore radically different design choices. In this paper, we present Monolith, a system tailored for online training. Our design has been driven by observations of our application workloads and production environment that reflects a marked departure from other recommendations systems. Our contributions are manifold: first, we crafted a collisionless embedding table with optimizations such as expirable embeddings and frequency filtering to reduce its memory footprint; second, we provide an production-ready online training architecture with high fault-tolerance; finally, we proved that system reliability could be traded-off for real-time learning. Monolith has successfully landed in the BytePlus Recommend product.

Submitted to arXiv on 16 Sep. 2022

Ask questions about this paper to our AI assistant

You can also chat with multiple papers at once here.

AI assistant instructions?

Results of the summarizing process for the arXiv paper: 2209.07663v2

, , , , In the fast-paced world of customer feedback-driven businesses, having a scalable and real-time recommendation system is crucial. While widely used in production settings, traditional deep learning frameworks like TensorFlow or PyTorch often fall short when it comes to meeting the specific demands of recommendation scenarios. These frameworks are optimized for static parameters and dense computations, making them less effective for dynamic and sparse feature-based recommendations. Additionally, their separation of batch-training and serving stages hinders real-time interaction with customer feedback. To address these challenges, a novel approach has been developed - Monolith, a system tailored specifically for online training. This innovative design sets it apart from conventional recommendation systems by carefully observing application workloads and production environments. The key contributions of Monolith include the creation of a collisionless embedding table with memory-saving optimizations such as expirable embeddings and frequency filtering. Furthermore, the system offers a robust online training architecture with high fault-tolerance capabilities. A significant aspect highlighted by this research is the trade-off between system reliability and real-time learning efficiency. By prioritizing real-time interactions over absolute reliability, Monolith has proven its effectiveness in practice by successfully integrating into the BytePlus Recommend product. This groundbreaking approach marks a departure from traditional methodologies and opens up new possibilities for enhancing recommendation systems in time-critical business settings.
Created on 06 Jan. 2025

Assess the quality of the AI-generated content by voting

Score: 0

Why do we need votes?

Votes are used to determine whether we need to re-run our summarizing tools. If the count reaches -10, our tools can be restarted.

Similar papers summarized with our AI tools

Navigate through even more similar papers through a

tree representation

Look for similar papers (in beta version)

By clicking on the button above, our algorithm will scan all papers in our database to find the closest based on the contents of the full papers and not just on metadata. Please note that it only works for papers that we have generated summaries for and you can rerun it from time to time to get a more accurate result while our database grows.

Disclaimer: The AI-based summarization tool and virtual assistant provided on this website may not always provide accurate and complete summaries or responses. We encourage you to carefully review and evaluate the generated content to ensure its quality and relevance to your needs.