skip to main content
research-article

Tools for Predicting the Reliability of Large-Scale Storage Systems

Published:16 August 2016Publication History
Skip Abstract Section

Abstract

Data-intensive applications require extreme scaling of their underlying storage systems. Such scaling, together with the fact that storage systems must be implemented in actual data centers, increases the risk of data loss from failures of underlying components. Accurate engineering requires quantitatively predicting reliability, but this remains challenging due to the need to account for extreme scale, redundancy scheme type and strength, distribution architecture, and component dependencies. This article introduces CQSim-R, a tool suite for predicting the reliability of large-scale storage system designs and deployments. CQSim-R includes (a) direct calculations based on an only-drives-fail failure model and (b) an event-based simulator for detailed prediction that handles failures of and failure dependencies among arbitrary (drive or nondrive) components. These are based on a common combinatorial framework for modeling placement strategies. The article demonstrates CQSim-R using models of common storage systems, including replicated and erasure coded designs. New results, such as the poor reliability scaling of spread-placed systems and a quantification of the impact of data center distribution and rack-awareness on reliability, demonstrate the usefulness and generality of the tools. Analysis and empirical studies show the tools’ soundness, performance, and scalability.

References

  1. John E. Angus. 1988. On computing MTBF for a k-out-of-n:G repairable system. IEEE Transactions on Reliability 37, 312--313.Google ScholarGoogle ScholarCross RefCross Ref
  2. Lakshmi N. Bairavasundaram, Garth R. Goodson, Shankar Pasupathy, and Jiri Schindler. 2007. An analysis of latent sector errors in disk drives. In Proceedings of the 2007 ACM International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS’07). Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. Dhruba Borthakur. 2007. The Hadoop Distributed File System: Architecture and Design. Technical Report. Apache Software Foundation.Google ScholarGoogle Scholar
  4. CERN-LHC. 2016. Computing. Retrieved June 28, 2016, from http://home.cern/about/computing.Google ScholarGoogle Scholar
  5. Peter M. Chen, Edward K. Lee, Garth A. Gibson, Randy H. Katz, and David A. Patterson. 1994. RAID: High-performance, reliable secondary storage. ACM Computing Surveys 26, 145--185. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. Asaf Cidon, Stephen Rumble, Ryan Stutsman, Sachin Katti, John Ousterhout, and Mendel Rosenblum. 2013. Copysets: Reducing the frequency of data loss in cloud storage. In Proceedings of the 2013 Usenix Annual Technical Conference. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. J. G. Elerath and M. Pecht. 2007. Enhanced reliability modeling of RAID storage systems. In Proceedings of the IEEE/IFIP International Conference on Dependable Systems and Networks. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. Jon G. Elerath and Jiri Schindler. 2014. Beyond MTTDL: A closed-form RAID 6 reliability equation. ACM Transactions on Storage 10, 2, Article No. 7. DOI:http://dx.doi.org/10.1145/2577386 Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Daniel Ford, Franois Labelle, Florentina I. Popovici, Murray Stokely, Van anh Truong, Luiz Barroso, Carrie Grimes, and Sean Quinlan. 2010. Availability in globally distributed storage systems. In Proceedings of the 9th Usenix Symposium on Operating Systems Design and Implementation. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. Kevin Greenan, James Plank, and Jay Wylie. 2010. Mean time to meaningless: MTTDL, Markov models, and storage system reliability. In Proceedings of the 2nd USENIX Conference on Hot Topics in Storage Systems. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. V. Guerriero. 2012. Power law distribution: Method of multi-scale inferential statistics. Journal of Modern Mathematics Frontier, 21--28.Google ScholarGoogle Scholar
  12. Ilias Iliadis and Vinodh Venkatesan. 2015. Rebuttal to beyond mttdl: A closed-form raid-6 reliability equation. ACM Transactions on Storage 11, 2, Article No. 9. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. H.-W. Kao, J.-F. Paris, T. Schwarz, and D. Long. 2013. A flexible simulation tool for estimating data loss risks in storage arrays. In Proceedings of the IEEE 29th Symposium on Mass Storage Systems and Technologies.Google ScholarGoogle Scholar
  14. Michael Ovsiannikov, Silvius Rus, Damian Reeves, Paul Sutter, Sriram Rao, and Jim Kelly. 2013. The quantcast file system. Proceedings of the VLDB Endowment 6, 11, 1092--1101. Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. Kestutis Patiejunas. 2014. Freezing exabytes of data at Facebook’s cold storage. In Proceedings of the Workshop on Designing Storage Architectures for Digital Collections 2014.Google ScholarGoogle Scholar
  16. Eduardo Pinheiro, Wolf-Dietrich Weber, and Luiz Andre Barroso. 2007. Failure trends in a large disk drive population. In Proceedings of the 5th USENIX Conference on File and Storage Technologies. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. K. K. Rao, J. L. Hafner, and R. A. Golding. 2006. Reliability for networked storage nodes. In Proceedings of the International Conference on Dependable Systems and Networks. Jason Resch and Ilya Volvovski. 2013. Reliability models for highly fault-tolerant storage systems. arXiv.org, arXiv:1310.4702. (2013). Google ScholarGoogle ScholarDigital LibraryDigital Library
  18. Biance Schroeder, Sotirios Damouras, and Phillipa Gill. 2010. Understanding latent sector errors and how to protect against them. ACM Transactions on Storage 6, 3, Article No. 9. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Bianca Schroeder and Garth A. Gibson. 2007. Disk failures in the real world: What does an MTTF of 1,000,000 hours mean to you? In Proceedings of the 5th Usenix Conference on File and Storage Technologies. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. Mark Storer, Kevin Greenan, Ethan Miller, and Kaladhar Voruganti. 2008. Pergamum: Replacing tape with energy efficient, reliable, disk-based archival storage. In Proceedings of the 6th USENIX Conference on File and Storage Technologies. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Vinodh Venkatesan and Ilias Iliadis. 2012. A general reliability model for data storage systems. In Proceedings of the 9th International Conference on Quantitative Evaluation of Systems. Google ScholarGoogle ScholarDigital LibraryDigital Library
  22. Vinodh Venkatesan, Ilias Iliadis, Christina Fragouli, and Rudiger Urbanke. 2011. Reliability of clustered vs. declustered replica placement in data storage systems. In Proceedings of the 19th International Symposium on Modeling, Analysis, & Simulation of Computer and Telecommunication Systems. Google ScholarGoogle ScholarDigital LibraryDigital Library
  23. Vinodh Venkatesan, Ilias Iliadis, and Robert Haas. 2012. Reliability of data storage systems under network rebuild bandwidth constraints. In Proceedings of the 20th International Symposium on Modeling, Analysis, & Simulation of Computer and Telecommunication Systems. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. Sage A. Weil, Scott A. Brandt, Ethan L. Miller, Darrell D. E. Long, and Carlos Maltzahn. 2006b. Ceph: A scalable, high-performance distributed file system. In Proceedings of the 7th USENIX Symposium on Operating Systems Design and Implementations. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. Sage A. Weil, Scott A. Brandt, Ethan L. Miller, and Carlos Maltzahn. 2006a. CRUSH: Controlled, scalable, decentralized placement of replicated data. In Proceedings of the 2006 ACM/IEEE Conference on Supercomputing. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. Qin Xin, Ethan L. Miller, Thomas Schwarz, Darrell D. E. Long, Scott A. Brandt, and Witold Litwin. 2003. Reliability mechanisms for very large storage systems. In Proceedings of the 20th IEEE Conference on Mass Storage Systems and Technologies. Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Tools for Predicting the Reliability of Large-Scale Storage Systems

        Recommendations

        Comments

        Login options

        Check if you have access through your login credentials or your institution to get full access on this article.

        Sign in

        Full Access

        • Published in

          cover image ACM Transactions on Storage
          ACM Transactions on Storage  Volume 12, Issue 4
          August 2016
          213 pages
          ISSN:1553-3077
          EISSN:1553-3093
          DOI:10.1145/2940403
          Issue’s Table of Contents

          Copyright © 2016 ACM

          Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

          Publisher

          Association for Computing Machinery

          New York, NY, United States

          Publication History

          • Published: 16 August 2016
          • Accepted: 1 April 2016
          • Revised: 1 November 2015
          • Received: 1 July 2015
          Published in tos Volume 12, Issue 4

          Permissions

          Request permissions about this article.

          Request Permissions

          Check for updates

          Qualifiers

          • research-article
          • Research
          • Refereed

        PDF Format

        View or Download as a PDF file.

        PDF

        eReader

        View online with eReader.

        eReader