A Learned Cache Eviction Framework with Minimal Overhead

AI-generated keywords: Machine Learning

AI-generated Key Points

  • Machine Learning (ML) is effective in reducing cache miss ratios compared to traditional heuristics
  • Existing ML-based caching systems often require a large number of predictions for eviction decisions
  • A new framework called Machine learning At the Tail (MAT) integrates ML with a traditional cache system based on a heuristic algorithm
  • MAT aims to reduce the number of costly predictions required per eviction while achieving comparable performance to state-of-the-art ML caches
  • Simulation experiments on 8 production workloads showed MAT reduced predictions-per-eviction from 63 to 2 while achieving miss ratios comparable to the state-of-the-art ML cache system
  • A MAT prototype system achieved request rates similar to an LRU-based caching system in the same setting
  • MAT combines strengths of heuristic algorithms and machine learning techniques to improve cache performance in high-throughput environments

Authors: Dongsheng Yang, Daniel S. Berger, Kai Li, Wyatt Lloyd

License: CC BY-SA 4.0

Abstract: Recent work shows the effectiveness of Machine Learning (ML) to reduce cache miss ratios by making better eviction decisions than heuristics. However, state-of-the-art ML caches require many predictions to make an eviction decision, making them impractical for high-throughput caching systems. This paper introduces Machine learning At the Tail (MAT), a framework to build efficient ML-based caching systems by integrating an ML module with a traditional cache system based on a heuristic algorithm. MAT treats the heuristic algorithm as a filter to receive high-quality samples to train an ML model and likely candidate objects for evictions. We evaluate MAT on 8 production workloads, spanning storage, in-memory caching, and CDNs. The simulation experiments show MAT reduces the number of costly ML predictions-per-eviction from 63 to 2, while achieving comparable miss ratios to the state-of-the-art ML cache system. We compare a MAT prototype system with an LRU-based caching system in the same setting and show that they achieve similar request rates.

Submitted to arXiv on 27 Jan. 2023

Results of the summarizing process for the arXiv paper: 2301.11886v1

Machine Learning (ML) has been shown to reduce cache miss ratios by making better eviction decisions than traditional heuristics. However, existing ML-based caching systems often require many predictions to make a single eviction decision, which makes them impractical for high-throughput caching systems.

To address this issue, the paper introduces Machine learning At the Tail (MAT), a framework that integrates an ML module with a traditional cache system based on a heuristic algorithm. The heuristic algorithm acts as a filter: it supplies high-quality samples for training the ML model and likely candidate objects for eviction. With this hybrid approach, MAT aims to achieve miss ratios comparable to state-of-the-art ML caches while minimizing the number of costly predictions required per eviction.

MAT was evaluated on 8 production workloads spanning storage, in-memory caching, and Content Delivery Networks (CDNs). In simulation, MAT reduced the number of predictions-per-eviction from 63 to 2 while achieving miss ratios comparable to the state-of-the-art ML cache system. In addition, a MAT prototype system and an LRU-based caching system, compared in the same setting, achieved similar request rates.

Overall, MAT combines the strengths of heuristic algorithms and machine learning techniques. The framework has the potential to improve cache performance in high-throughput environments while keeping the computational overhead of eviction decisions low.
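The idea of using the heuristic as a filter can be illustrated with a minimal Python sketch. This is not the authors' implementation: the class name `TailFilteredCache`, the `keep_predictor` callable (a stand-in for an ML model), and the `max_predictions` budget are all hypothetical, and the fallback to plain LRU once the budget is spent is an assumption made for illustration. LRU nominates the object at its tail, and the predictor is consulted only for that candidate, so the number of predictions per eviction stays small.

```python
from collections import OrderedDict
from typing import Callable, Hashable


class TailFilteredCache:
    """LRU cache that consults a predictor only for objects at the LRU tail.

    Illustrative sketch only; names and policy details are assumptions.
    """

    def __init__(self, capacity: int,
                 keep_predictor: Callable[[Hashable], bool],
                 max_predictions: int = 2) -> None:
        if capacity < 1:
            raise ValueError("capacity must be at least 1")
        self.capacity = capacity
        self.keep_predictor = keep_predictor  # hypothetical stand-in for an ML model
        self.max_predictions = max_predictions  # assumed per-eviction budget
        self.items: OrderedDict = OrderedDict()  # first = LRU tail, last = MRU head
        self.predictions = 0
        self.evictions = 0

    def _evict_one(self) -> Hashable:
        for _ in range(self.max_predictions):
            key = next(iter(self.items))  # candidate nominated by the heuristic
            self.predictions += 1
            if not self.keep_predictor(key):
                break  # predictor agrees: evict this candidate
            self.items.move_to_end(key)  # predicted useful: reinsert at the head
        else:
            key = next(iter(self.items))  # budget spent: fall back to plain LRU
        del self.items[key]
        self.evictions += 1
        return key

    def access(self, key: Hashable) -> bool:
        """Return True on a hit, False on a miss (the object is then admitted)."""
        if key in self.items:
            self.items.move_to_end(key)
            return True
        if len(self.items) >= self.capacity:
            self._evict_one()
        self.items[key] = True
        return False
```

Dividing `predictions` by `evictions` after replaying a trace gives the predictions-per-eviction figure for this toy policy. Setting `max_predictions=0` reduces the sketch to plain LRU, which makes it easy to compare the two on the same request sequence.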
Created on 25 Apr. 2025
