research-article

TinyLFU: A Highly Efficient Cache Admission Policy

Authors:
Gil Einziger

Nokia Bell Labs, Kfar Saba, Israel

Nokia Bell Labs, Kfar Saba, Israel
View Profile

,
Roy Friedman

Technion, Haifa, Israel

Technion, Haifa, Israel
View Profile

,
Ben Manes

Independent

Independent
View Profile

Authors Info & Claims

ACM Transactions on Storage Volume 13 Issue 4Article No.: 35pp 1–31https://doi.org/10.1145/3149371

Published:17 November 2017Publication History

ACM Transactions on Storage

Abstract

This article proposes to use a frequency-based cache admission policy in order to boost the effectiveness of caches subject to skewed access distributions. Given a newly accessed item and an eviction candidate from the cache, our scheme decides, based on the recent access history, whether it is worth admitting the new item into the cache at the expense of the eviction candidate.

This concept is enabled through a novel approximate LFU structure called TinyLFU, which maintains an approximate representation of the access frequency of a large sample of recently accessed items. TinyLFU is very compact and lightweight as it builds upon Bloom filter theory.

We study the properties of TinyLFU through simulations of both synthetic workloads and multiple real traces from several sources. These simulations demonstrate the performance boost obtained by enhancing various replacement policies with the TinyLFU admission policy. Also, a new combined replacement and eviction policy scheme nicknamed W-TinyLFU is presented. W-TinyLFU is demonstrated to obtain equal or better hit ratios than other state-of-the-art replacement policies on these traces. It is the only scheme to obtain such good results on all traces.

References

Ismail Ari, Melanie Gottwals, and Dick Henze. 2006. Performance boosting and workload isolation in storage area networks with SANCache. In Proceedings of the 23rd IEEE/14th NASA Goddard Conference on Mass Storage Systems and Technologies (MSST’06). 263--273.Google Scholar
Martin Arlitt, Ludmila Cherkasova, John Dilley, Rich Friedrich, and Tai Jin. 1999. Evaluating content management techniques for web proxy caches. In Proceedings of the 2nd Workshop on Internet Server Performance.Google Scholar
Martin Arlitt, Rich Friedrich, and Tai Jin. 2000. Performance evaluation of web proxy cache replacement policies. Perform. Eval. 39, 1--4 (Feb. 2000), 149--164. Google ScholarDigital Library
Hilla Atzmon, Roy Friedman, and Roman Vitenberg. 2002. Replacement policies for a distributed object caching service. In Confederated Int. Conf. DOA, CoopIS and ODBASE. 661--674. Google ScholarDigital Library
Sorav Bansal and Dharmendra S. Modha. 2004. CAR: Clock with adaptive replacement. In Proceedings of the 3rd USENIX Conference on File and Storage Technologies (FAST’04). 187--200. Google ScholarDigital Library
László Belady. 1966. A study of replacement algorithms for a virtual storage computer. IBM Syst. J. 5, 2 (1966), 78--101. Google ScholarDigital Library
Giuseppe Bianchi, Elisa Boschi, Simone Teofili, and Brian Trammell. 2010. Measurement data reduction through variation rate metering. In Proceedings IEEE INFOCOM. Google ScholarDigital Library
Giuseppe Bianchi, Nico d’Heureuse, and Saverio Niccolini. 2011. On-demand time-decaying Bloom filters for telemarketer detection. SIGCOMM Comput. Commun. Rev. 41, 5 (Oct. 2011), 5--12. Google ScholarDigital Library
G. Bianchi, S. Teofili, E. Boschi, B. Trammell, and C. Greco. 2010. Scalable and fast approximate excess rate detection. In Proceedings of the Future Network and Mobile Summit.Google Scholar
Allan Borodin and Ran El-Yaniv. 1998. Online Computation and Competitive Analysis. Cambridge University Press. Google ScholarDigital Library
Lee Breslau, Pei Cao, Li Fan, Graham Phillips, and Scott Shenker. 1999. Web caching and zipf-like distributions: Evidence and implications. In INFOCOM. 126--134.Google Scholar
Yu Jin Cao, Jing, Aiyou Chen, Tian Bu, and Zhi-Li Zhang. 2009. Identifying high cardinality internet hosts. In IEEE INFOCOM. 810--818.Google Scholar
Wei Koong Chai, Diliang He, Ioannis Psaras, and George Pavlou. 2012. Cache “less for more” in information-centric networks. In Proceedings of the 11th International IFIP TC 6 Conference on Networking. Springer-Verlag, 27--40. Google ScholarDigital Library
Kai Cheng and Yahiko Kambayashi. 2000. LRU-SP: A size-adjusted and popularity-aware LRU replacement algorithm for web caching. In Proceedings of the 24th IEEE International Computer Software and Applications Conference (COMPSAC’00). 48--53. Google ScholarDigital Library
Xu Cheng, C. Dale, and Jiangchuan Liu. 2008. Statistics and social network of youtube videos. In Proceedings of the 16th International Workshop on Quality of Service (IWQoS’08). 229--238.Google ScholarCross Ref
Ludmila Cherkasova. 1998. Improving WWW Proxies Performance with Greedy-Dual-Size-Frequency Caching Policy. HP Technical Report.Google Scholar
Gregory Chockler, Guy Laden, and Ymir Vigfusson. 2010. Data caching as a cloud service. In Proceedings of the 4th ACM International Workshop on Large Scale Distributed Systems and Middleware (LADIS’10). ACM, 18--21. Google ScholarDigital Library
Gregory Chockler, Guy Laden, and Ymir Vigfusson. 2011. Design and implementation of caching services in the cloud. IBM J. Res. Devel. 55, 6 (2011), 9:1--9:11. Google ScholarDigital Library
Baek-Young Choi, Jaesung Park, and Zhi-Li Zhang. 2002. Adaptive random sampling for load change detection. SIGMETRICS Perform. Eval. Rev. 30, 1 ( June 2002), 272--273. Google ScholarDigital Library
Edith Cohen, Haim Kaplan, and Uri Zwick. 2002. Competitive analysis of the LRFU paging algorithm. Algorithmica 33, 4 (2002), 511--516.Google ScholarCross Ref
Saar Cohen and Yossi Matias. 2003. Spectral bloom filters. In Proceedings of the ACM SIGMOD International Conference on Management of Data. 241--252. Google ScholarDigital Library
F. J. Corbato. 1968. A Paging Experiment with the Multics System. Technical Report Project MAC Report MAC-M-384. MIT.Google Scholar
Graham Cormode and S. Muthukrishnan. 2004. An improved data stream summary: The count-min sketch and its applications. J. Algorithms 55 (2004), 29--38. Google ScholarDigital Library
Xenofontas Dimitropoulos, Marc Stoecklin, Paul Hurley, and Andreas Kind. 2008. The eternal sunshine of the sketch data structure. Comput. Netw. 52, 17 (Dec. 2008), 3248--3257. Google ScholarDigital Library
Gil Einziger, Benny Fellman, and Yaron Kassner. 2015. Independent counter estimation buckets. In Proceedings of the IEEE Conference on Computer Communications (INFOCOM’15). 2560--2568.Google ScholarCross Ref
Gil Einziger and Roy Friedman. 2015a. A formal analysis of conservative update based approximate counting. In Proceedings of the International Conference on Computing, Networking and Communications (ICNC’15). 255--259.Google ScholarCross Ref
Gil Einziger and Roy Friedman. 2015b. TinySet - An access efficient self adjusting bloom filter construction. In Proceedings of the 24th International Conference on Computer Communication and Networks (ICCCN’15).Google ScholarCross Ref
Gil Einziger and Roy Friedman. 2016. Counting with TinyTable: Every bit counts! In Proceedings of the 17th International Conference on Distributed Computing and Networking (ICDCN’16). Google ScholarDigital Library
Gil Einziger, Roy Friedman, and Yoav Kantor. 2016. Shades: Expediting Kademlias lookup process. Comput. Netw. 99 (April 2016), 37--50. Google ScholarDigital Library
Cristian Estan, Ken Keys, David Moore, and George Varghese. 2004. Building a better netflow. SIGCOMM Comput. Commun. Rev. 34, 4 (Aug. 2004), 323–336. Google ScholarDigital Library
Cristian Estan and George Varghese. 2002. New directions in traffic measurement and accounting. SIGCOMM Comput. Commun. Rev. 32, 4 (Aug. 2002), 323--336. Google ScholarDigital Library
Li Fan, Pei Cao, Jussara Almeida, and Andrei Z. Broder. 2000. Summary cache: A scalable wide-area web cache sharing protocol. IEEE/ACM Trans. Netw. 8, 3 ( June 2000), 281--293. Google ScholarDigital Library
Brad Fitzpatrick. 2004. Distributed caching with memcached. Linux J. 2004, 124 (Aug. 2004), 5. Google ScholarDigital Library
Phillipa Gill, Martin Arlitt, Zongpeng Li, and Anirban Mahanti. 2007. Youtube traffic characterization: A view from the edge. In Proceedings of the 7th ACM SIGCOMM Conference on Internet Measurement (IMC’07). 15--28. Google ScholarDigital Library
Amit Goyal, Hal Daumé, and Graham Cormode. 2012. Sketch algorithms for estimating point queries in NLP. In Proceedings of the Joint Conference on Empirical Methods in Natural Language Processing and Computational Natural Language Learning. 1093--1103. Google ScholarDigital Library
Amit Goyal and Hal Daumeaé. 2011. Lossy conservative update (LCU) sketch: Succinct approximate count storage. In Proceedings of the 25th AAAI Conference on Artificial Intelligence. Google ScholarDigital Library
Amit Goyal, Jagadeesh Jagarlamudi, Hal Daumé III, and Suresh Venkatasubramanian. 2010. Sketching techniques for large scale NLP. In Proceedings of the 6th NAACL HLT Web as Corpus Workshop (WAC’10). 17--25. Google ScholarDigital Library
John L. Hennessy and David A. Patterson. 2012. Computer Architecture - A Quantitative Approach. 5th ed. Morgan Kaufmann.Google Scholar
Chengchen Hu, Bin Liu, Hongbo Zhao, Kai Chen, Yan Chen, Chunming Wu, and Yu Cheng. 2010. DISCO: Memory efficient and accurate flow statistics for network measurement. In Proceedings of the 30th IEEE International Conference on Distributed Computing Systems (ICDCS’10). 665--674. Google ScholarDigital Library
Chengchen Hu, Sheng Wang, Jia Tian, Bin Liu, Yu Cheng, and Yan Chen. 2008. Accurate and efficient traffic monitoring using adaptive non-linear sampling method. In INFOCOM. IEEE, 26--30.Google Scholar
Qi Huang, Ken Birman, Robbert van Renesse, Wyatt Lloyd, Sanjeev Kumar, and Harry C. Li. 2013. An analysis of Facebook photo caching. In Proceedings of the 24th ACM Symposium on Operating Systems Principles (SOSP’13). 167--181. Google ScholarDigital Library
Song Jiang, Feng Chen, and Xiaodong Zhang. 2005. CLOCK-Pro: An effective improvement of the CLOCK replacement. In Proceedings of the Annual Conference on USENIX Annual Technical Conference (ATEC’05). 35--35. Google ScholarDigital Library
Song Jiang and Xiaodong Zhang. 2002. LIRS: An efficient low inter-reference recency set replacement policy to improve buffer cache performance. ACM SIGMETRICS (2002), 31--42. Google ScholarDigital Library
Theodore Johnson and Dennis Shasha. 1994. 2Q: A low overhead high performance buffer management replacement algorithm. In Proceedings of the 20th International Conference on Very Large Data Bases (VLDB’94). 439--450. Google ScholarDigital Library
George Karakostas and Dimitrios Serpanos. 2002. Exploitation of different types of locality for web caches. In Proceedings of the 7th International Symposium on Computers and Communications (ISCC’02). Google ScholarDigital Library
Ramakrishna Karedla, J. Spencer Love, and Bradley G. Wherry. 1994. Caching strategies to improve disk system performance. IEEE Computer 27, 3 (1994), 38–46. Google ScholarDigital Library
Anirban Ketan Shah and Mitra Dhruv Matani. 2010. An O(1) Algorithm for Implementing the LFU Cache Eviction Scheme. Technical Report. Retrieved from http://dhruvbird.com/lfu.pdf.Google Scholar
Piotr Kolaczkowski. 2007. Memory efficient algorithm for mining recent frequent items in a stream. In Proceedings of the International Conference on Rough Sets and Intelligent Systems Paradigms (RSEISP’07). 485--494. Google ScholarDigital Library
Marc Liberatore and Prashant Shenoy. 2016. UMass Trace Repository. Retrieved from http://traces.cs.umass.edu/index.php/Main/About.Google Scholar
Yi Lu, Andrea Montanari, Balaji Prabhakar, Sarang Dharmapurikar, and Abdul Kabbani. 2008. Counter braids: A novel counter architecture for per-flow measurement. In Proceedings of the ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems. 121--132. Google ScholarDigital Library
Ben Manes. 2016. Caffeine: A High Performance Caching Library for Java 8. Retrieved from https://github.com/ben-manes/caffeine.Google Scholar
Nimrod Megiddo and Dharmendra S. Modha. 2003. ARC: A self-tuning, low overhead replacement cache. In Proceedings of the 2nd USENIX Conference on File and Storage Technologies (FAST’03). 115--130. Google ScholarDigital Library
Nimrod Megiddo and Dharmendra S. Modha. 2006. System and method for implementing an adaptive replacement cache policy. US Patent 6996676, February 7.Google Scholar
Satish Narayanasamy, Timothy Sherwood, Suleyman Sair, Brad Calder, and George Varghese. 2003. Catching accurate profiles in hardware. In Proceedings of the 9th International Symposium on High-Performance Computer Architecture (HPCA’03). 269--280. Google ScholarDigital Library
Elizabeth J. O’Neil, Patrick E. O’Neil, and Gerhard Weikum. 1993. The LRU-K page replacement algorithm for database disk buffering. ACM SIGMOD Rec. 22, 2 ( June 1993), 297--306. Google ScholarDigital Library
OpenBSD. 2014. TuQueue. Retrieved from https://www.tedunangst.com/flak/post/2Q-buffer-cache-algorithm.Google Scholar
Stefan Podlipnig and Laszlo Böszörmenyi. 2003. A survey of web cache replacement strategies. ACM Comput. Surv. 35, 4 (Dec. 2003), 374--398. Google ScholarDigital Library
Ioannis Psaras, Wei Koong Chai, and George Pavlou. 2012. Probabilistic in-network caching for information-centric networks. In Proceedings of the 2nd Edition of the ICN Workshop on Information-Centric Networking. ACM, 55--60. Google ScholarDigital Library
Konstantinos Psounis and Balaji Prabhakar. 2002. Efficient randomized web-cache replacement schemes using samples from past eviction times. IEEE/ACM Trans. Netw. 10, 4 (Aug. 2002), 441--455. Google ScholarDigital Library
Frederic Raspall and Sebastia Sallent. 2008. Adaptive shared-state sampling. In Proceedings of the 8th ACM SIGCOMM Conference on Internet Measurement (IMC’08). 271--284. Google ScholarDigital Library
Dimitrios N. Serpanos and Wayne H. Wolf. 2000. Caching web objects using zipf’s law. In Proceedings of the IEEE International Conference on Multimedia and Expo. 727–730.Google Scholar
Rade Stanojevic. 2007. Small active counters. In Proceedings of the 26th IEEE International Conference on Computer Communications (INFOCOM’07). 2153--2161. Google ScholarDigital Library
Geetika Tewari and Kim Hazelwood. 2004. Adaptive Web Proxy Caching Algorithms. Technical Report TR-13-04. Harvard University.Google Scholar
Erez Tsidon, Iddo Hanniel, and Isaac Keslassy. 2012. Estimators also need shared values to grow together. In INFOCOM. 1889--1897.Google Scholar
Guido Urdaneta, Guillaume Pierre, and Maarten van Steen. 2009. Wikipedia workload analysis for decentralized hosting. Elsevier Comput. Netw. 53, 11 ( July 2009), 1830--1845. Google ScholarDigital Library
Stephen Williams, Marc Abrams, and Charles R. Standridge. 1996. Removal Policies in Network Caches for World-Wide Web Documents. In Proceedings of the ACM SIGCOMM Conference. 293–305. Google ScholarDigital Library
Neal Young. 1991. On-line caching as cache size varies. In Proceedings of the 2nd Annual ACM-SIAM Symposium on Discrete Algorithms (SODA’91). 241--250. Google ScholarDigital Library
Yuanyuan Zhou, James Philbin, and Kai Li. 2001. The multi-queue replacement algorithm for second level buffer caches. In Proceedings of the USENIX Annual Technical Conference. 91--104. Google ScholarDigital Library

Index Terms

TinyLFU: A Highly Efficient Cache Admission Policy
1. Information systems
  1. Information storage systems
    1. Storage management
      1. Hierarchical storage management

Recommendations

Improved Techniques for Caches of Search Engines Results
WISM '10: Proceedings of the 2010 International Conference on Web Information Systems and Mining - Volume 01

Result caching is an efficient technique for reducing the query processing load, hence it is commonly used in search engines. In this paper, we study query result caching and proposes a cache management policy for achieving higher hit ratios compared to ...
Read More
PAAP: prefetch-aware admission policies for query results cache in web search engines
SIGIR '14: Proceedings of the 37th international ACM SIGIR conference on Research & development in information retrieval

Caching query results is an efficient technique for Web search engines. Admission policy can prevent infrequent queries from taking space of more frequent queries in the cache. In this paper we present two novel admission policies tailored for query ...
Read More
Criticality aware tiered cache hierarchy: a fundamental relook at multi-level cache hierarchies
ISCA '18: Proceedings of the 45th Annual International Symposium on Computer Architecture

On-die caches are a popular method to help hide the main memory latency. However, it is difficult to build large caches without substantially increasing their access latency, which in turn hurts performance. To overcome this difficulty, on-die caches ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Article

Published in
ACM Transactions on Storage Volume 13, Issue 4
Special Issue on MSST 2017 and Regular Papers
November 2017
329 pages
ISSN:1553-3077
EISSN:1553-3093
DOI:10.1145/3160863
Editor:
Sam H. Noh
Ulsan National Institute of Science and Technology, Ulsan, Korea
Issue’s Table of Contents
Copyright © 2017 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 17 November 2017
- Revised: 1 September 2017
- Accepted: 1 September 2017
- Received: 1 December 2015
Published in tos Volume 13, Issue 4

Permissions
Request permissions about this article.
Request Permissions

Check for updates
Author Tags
Caching
admission policy
counting bloom filter
eviction policy
sketches
Qualifiers
- research-article
- Research
- Refereed
Conference
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 83
  Total Citations
  View Citations
- 2,379
  Total Downloads
- Downloads (Last 12 months)245
- Downloads (Last 6 weeks)30
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

TinyLFU: A Highly Efficient Cache Admission Policy

ACM Transactions on Storage

Abstract

References

Cited By

Index Terms

Recommendations

Improved Techniques for Caches of Search Engines Results

PAAP: prefetch-aware admission policies for query results cache in web search engines

Criticality aware tiered cache hierarchy: a fundamental relook at multi-level cache hierarchies

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

TinyLFU: A Highly Efficient Cache Admission Policy

ACM Transactions on Storage

Abstract

References

Cited By

Index Terms

Recommendations

Improved Techniques for Caches of Search Engines Results

PAAP: prefetch-aware admission policies for query results cache in web search engines

Criticality aware tiered cache hierarchy: a fundamental relook at multi-level cache hierarchies

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Author Tags

Qualifiers

Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media