Affiliation:
1. Carnegie Mellon University, Pittsburgh, PA, USA
Abstract
Many recent caching systems aim to improve miss ratios, but there is no good sense among practitioners of how much further miss ratios can be improved. In other words, should the systems community continue working on this problem? Currently, there is no principled answer to this question. In practice, object sizes often vary by several orders of magnitude, and in this setting computing the optimal miss ratio (OPT) is known to be NP-hard. The few known results on caching with variable object sizes provide very weak bounds and are impractical to compute on traces of realistic length. We propose a new method to compute upper and lower bounds on OPT. Our key insight is to represent caching as a min-cost flow problem; hence we call our method the flow-based offline optimal (FOO). We prove that, under simple independence assumptions, FOO's bounds become tight as the number of objects goes to infinity. Indeed, FOO's error over 10M requests of production CDN and storage traces is negligible: at most 0.3%. FOO thus reveals, for the first time, the limits of caching with variable object sizes. While FOO is very accurate, it is computationally impractical on traces with hundreds of millions of requests. We therefore extend FOO to obtain more efficient bounds on OPT, which we call practical flow-based offline optimal (PFOO). We evaluate PFOO on several full production traces and use it to compare OPT to prior online policies. This analysis shows that current caching systems are in fact still far from optimal, suffering 11–43% more cache misses than OPT, whereas the best prior offline bounds suggest that there is essentially no room for improvement.
Funder
National Science Foundation
Google
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Networks and Communications; Hardware and Architecture; Safety, Risk, Reliability and Quality; Computer Science (miscellaneous)
Cited by
28 articles.
1. Beyond Belady to Attain a Seemingly Unattainable Byte Miss Ratio for Content Delivery Networks;IEEE Transactions on Parallel and Distributed Systems;2024-11
2. LBSC: A Cost-Aware Caching Framework for Cloud Databases;2024 IEEE 40th International Conference on Data Engineering (ICDE);2024-05-13
3. A Learning-Based Caching Mechanism for Edge Content Delivery;Proceedings of the 15th ACM/SPEC International Conference on Performance Engineering;2024-05-07
4. Latency-Aware Cache Mechanism for Resolver Service of Domain Name Systems;NOMS 2024-2024 IEEE Network Operations and Management Symposium;2024-05-06
5. PreVision: An Out-of-Core Matrix Computation System with Optimal Buffer Replacement;Proceedings of the ACM on Management of Data;2024-03-12