skip to main content
research-article

LDM: Log Disk Mirroring with Improved Performance and Reliability for SSD-Based Disk Arrays

Published:20 May 2016Publication History
Skip Abstract Section

Abstract

With the explosive growth in data volume, the I/O bottleneck has become an increasingly daunting challenge for big data analytics. Economic forces, driven by the desire to introduce flash-based Solid-State Drives (SSDs) into the high-end storage market, have resulted in hybrid storage systems in the cloud. However, a single flash-based SSD cannot satisfy the performance, reliability, and capacity requirements of enterprise or HPC storage systems in the cloud. While an array of SSDs organized in a RAID structure, such as RAID5, provides the potential for high storage capacity and bandwidth, reliability and performance problems will likely result from the parity update operations. In this article, we propose a Log Disk Mirroring scheme (LDM) to improve the performance and reliability of SSD-based disk arrays. LDM is a hybrid disk array architecture that consists of several SSDs and two hard disk drives (HDDs). In an LDM array, the two HDDs are mirrored as a write buffer that temporally absorbs the small write requests. The small and random write data are written on the mirroring buffer by using the logging technique that sequentially appends new data. The small write data are merged and destaged to the SSD-based disk array during the system idle periods. Our prototype implementation of the LDM array and the performance evaluations show that the LDM array significantly outperforms the pure SSD-based disk arrays by a factor of 20.4 on average, and outperforms HPDA by a factor of 5.0 on average. The reliability analysis shows that the MTTDL of the LDM array is 2.7 times and 1.7 times better than that of pure SSD-based disk arrays and HPDA disk arrays.

References

  1. N. Agrawal, V. Prabhakaran, T. Wobber, J. Davis, M. Manasse, and R. Panigrahy. 2008. Design tradeoffs for SSD performance. In Proceedings of the 2008 USENIX Annual Technical Conference (USENIX’08). 57--70. Google ScholarGoogle ScholarDigital LibraryDigital Library
  2. M. Balakrishnan, A. Kadav, V. Prabhakaran, and D. Malkhi. 2010a. Differential RAID: Rethinking RAID for SSD reliability. In Proceedings of the 5th European Conference on Computer systems (EuroSys’10). 15--26. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. M. Balakrishnan, A. Kadav, V. Prabhakaran, and D. Malkhi. 2010b. Differential RAID: Rethinking RAID for SSD reliability. ACM Transactions on Storage 6, 2 (2010), 1--22. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. A. M. Caulfield, J. Coburn, T. Mollov, A. De, A. Akel, J. He, A. Jagatheesan, R. K. Gupta, A. Snavely, and S. Swanson. 2010. Understanding the impact of emerging non-volatile memories on high-performance, IO-intensive computing. In Proceedings of the 2010 International Conference for High Performance Computing, Networking, Storage and Analysis (SC’10). 1--11. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. J. C. W. Chan, Q. Ding, P. P. C. Lee, and H. H. W. Chan. 2014. Parity logging with reserved space: Towards efficient updates and recovery in erasure-coded clustered storage. In Proceedings of the 12th USENIX Conference on File and Storage Technologies (FAST’14). 163--176. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. F. Chen, D. A. Koufaty, and X. Zhang. 2009. Understanding intrinsic characteristics and system implications of flash memory based solid state drives. In Proceedings of the 11th ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS’09). 181--192. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. C. Chung and H. Hsu. 2014. Partial parity cache and data cache management method to improve the performance of an SSD-based RAID. IEEE Transactions on Very Large Scale Integration Systems 22, 7 (2014), 1470--1480.Google ScholarGoogle ScholarCross RefCross Ref
  8. C. Dirik and B. Jacob. 2009. The performance of PC solid-state disks as a function of bandwidth, concurrency, device architecture, and system organization. In Proceedings of the 36th International Symposium on Computer Architecture (ISCA’09). 279--289. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. SanDisk Solid State Driver. 2015. https://itblog.sandisk.com/truth-ssds-hdd-vendors-do-not-want-you-to-know/.Google ScholarGoogle Scholar
  10. Y. Du, F. Liu, Z. Chen, and X. Ma. 2011. WeLe-RAID: A SSD-based RAID for system endurance and performance. In Proceedings of the 8th IFIP International Conference on Network and Parallel Computing (NPC’11). 248--262. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. EMC Symmetrix DMX Architecture. 2010. Retrieved from https://www.emc.com/collateral/hardware/solution-overview/c1011-symm-dmx-architecture-prod-desc-gd.pdf.Google ScholarGoogle Scholar
  12. R. Golding, P. Bosch, and C. Staelin. 1995. Idleness is not sloth. In Proceedings of the USENIX Technical Conference (USENIX’95). 201--212. Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. K. Greenan, D. D. E. Long, E. L. Miller, T. Schwarz, and A. Wildani. 2009. Building flexible, fault-tolerant flash-based storage systems. In Proceedings of the 5th Workshop on Hot Topics in System Dependability (HotDep’09).Google ScholarGoogle Scholar
  14. L. M. Grupp, J. D. Davis, and S. Swanson. 2012. The bleak future of NAND flash memory. In Proceedings of the 10th USENIX Conference on File and Storage Technologies (FAST’12). Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. S. Im and D. Shin. 2011. Flash-aware RAID techniques for dependable and high-performance flash memory SSD. IEEE Transactions on Computers 1, 60 (2011), 80--92. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. N. Jeremic, G. Mühl, A. Busse, and J. Richling. 2011. The pitfalls of deploying solid-state drive RAIDs. In Proceedings of the 4th Annual International Conference on Systems and Storage (SYSTOR’11). 1--13. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. H. Kao, J. Paris, D. D. E. Long, and T. Schwarz. 2013. A flexible simulation tool for estimating data loss risks in storage arrays. In Proceedings of the 29th IEEE Symposium on Massive Storage Systems and Technologies (MSST’13). 1--5.Google ScholarGoogle Scholar
  18. Y. Kim, A. Gupta, B. Urgaonkar, P. Berman, and A. Sivasubramaniam. 2011a. HybridStore: A cost-efficient, high-performance storage system combining SSDs and HDDs. In Proceedings of the 19th Annual IEEE/ACM International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOTS’11). 227--236. Google ScholarGoogle ScholarDigital LibraryDigital Library
  19. Y. Kim, S. Oral, G. M. Shipman, J. Lee, D. A. Dillow, and F. Wang. 2011b. Harmonia: A globally coordinated garbage collector for arrays of solid-state drives. In Proceedings of the 27th IEEE Symposium on Mass Storage Systems and Technologies (MSST’11). 1--12. Google ScholarGoogle ScholarDigital LibraryDigital Library
  20. L. Lin, Y. Zhu, J. Yue, Z. Cai, and B. Segee. 2011. Hot random off-loading: A hybrid storage system with dynamic data migration. In Proceedings of the 19th Annual Meeting of the IEEE International Symposium on Modeling, Analysis and Simulation of Computer and Telecommunication Systems (MASCOSTS’11). 318--325. Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. Logical Volume Manager (Linux). 2015. Retrieved from https://en.wikipedia.org/wiki/Logical_Volume_Manager_ (Linux).Google ScholarGoogle Scholar
  22. B. Mao, H. Jiang, D. Feng, S. Wu, J. Chen, L. Zeng, and L. Tian. 2010. HPDA: A hybrid parity-based disk array for enhanced performance and reliability. In Proceedings of 24th International Parallel & Distributed Processing Symposium (IPDPS’’10). 1--12.Google ScholarGoogle Scholar
  23. B. Mao, H. Jiang, S. Wu, Y. Fu, and L. Tian. 2014. Read performance optimization for deduplication-based storage systems in the cloud. ACM Transactions on Storage 10, 2 (2014), 1--22. Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. B. Mao, H. Jiang, S. Wu, L. Tian, D. Feng, J. Chen, and L. Zeng. 2012. HPDA: A hybrid parity-based disk array for enhanced performance and reliability. ACM Transactions on Storage 8, 1 (2012), Article 4. Google ScholarGoogle ScholarDigital LibraryDigital Library
  25. B. Mao and S. Wu. 2015. Exploiting request characteristics and internal parallelism to improve SSD performance. In Proceedings of the 33rd IEEE International Conference on Computer Design (ICCD’15). 476--479. Google ScholarGoogle ScholarDigital LibraryDigital Library
  26. B. Mao, S. Wu, and H. Jiang. 2015. Improving storage availability in cloud-of-clouds with hybrid redundant data distribution. In Proceedings of the 29th IEEE International Parallel & Distributed Processing Symposium (IPDPS’’15). 633--642. Google ScholarGoogle ScholarDigital LibraryDigital Library
  27. J. Meza, Q. Wu, S. Kumar, and O. Mutlu. 2015. A large-scale study of flash memory failures in the field. In Proceedings of the 2015 ACM SIGMETRICS International Conference on Measurement and Modeling of Computer Systems (SIGMETRICS’15). 177--190. Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. Microsoft Enterprise Traces. 2009. Retrieved from http://iotta.snia.org/traces/list/BlockIO.Google ScholarGoogle Scholar
  29. C. Min, K. Kim, H. Cho, S. Lee, and Y. Eom. 2012. SFS: Random write considered harmful in solid state drives. In Proceedings of the 10th USENIX Conference on File and Storage Technologies (FAST’12). Google ScholarGoogle ScholarDigital LibraryDigital Library
  30. S. Moon and A. L. Narasimha Reddy. 2013. Don’t Let RAID raid the lifetime of your SSD array. In Proceedings of the 5th USENIX Workshop on Hot Topics in Storage and File Systems (HotStorage’13). Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. D. Narayanan, E. Thereska, A. Donnelly, S. Elnikety, and A. Rowstron. 2009. Migrating server storage to SSDs: Analysis of tradeoffs. In Proceedings of the 4th European Conference on Computer Systems (EuroSys’09). 145--158. Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. D. A. Patterson, G. Gibson, and R. H. Katz. 1988. A case for redundant arrays of inexpensive disks (RAID). In Proceedings of the International Conference on Management of Data (SIGMOD’88). 109--116. Google ScholarGoogle ScholarDigital LibraryDigital Library
  33. M. Rosenblum and J. Ousterhout. 1992. The design and implementation of a log-structured file system. ACM Transactions on Computer Systems 10, 1 (1992), 26--52. Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. Samsung Report. 2008. Retrieved from http://news.cnet.com/8301-13924_3-9876557-64.html.Google ScholarGoogle Scholar
  35. B. Schroeder and G. A. Gibson. 2007. Disk failures in the real world: What does an MTTF of 1,000,000 hours mean to you? In Proceedings of the 5th USENIX Conference on File and Storage Technologies (FAST’07). 1--16. Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. Solid State Storage Initiative. 2010. Retreived from http://www.snia.org/forums/sssi.Google ScholarGoogle Scholar
  37. G. Soundararajan, V. Prabhakaran, M. Balakrishnan, and T. Wobber. 2010. Extending SSD lifetimes with disk-based write caches. In Proceedings of the 8th USENIX Conference on File and Storage Technologies (FAST’10). 101--114. Google ScholarGoogle ScholarDigital LibraryDigital Library
  38. UMass Trace Repository. 2010. Retrieved from http://traces.cs.umass.edu/index.php/Storage/Storage.Google ScholarGoogle Scholar
  39. Virtualizing Storage for Scale, Resiliency, and Efficiency. 2012. Retrieved from http://blogs.msdn.com/b/b8/archive/2012/01/05/virtualizing-storage-for-scale-resiliency-and-efficiency.aspx.Google ScholarGoogle Scholar
  40. Y. Wang, W. Wang, T. Xie, W. Pan, Y. Gao, and Y. Ouyang. 2014. CR5M: A mirroring-powered channel-RAID5 architecture for an SSD. In Proceedings of 30th Symposium on Mass Storage Systems and Technologies (MSST’14). 1--10.Google ScholarGoogle Scholar
  41. S. Wu, H. Jiang, D. Feng, L. Tian, and B. Mao. 2009. WorkOut: I/O workload outsourcing for boosting the raid reconstruction performance. In Proceedings of the 7th USENIX Conference on File and Storage Technologies (FAST’09). 239--252. Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. S. Wu, H. Jiang, D. Feng, L. Tian, and B. Mao. 2011. Improving availability of RAID-structured storage systems by workload outsourcing. IEEE Transactions on Computers 60, 1 (2011), 64--79. Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. S. Wu, H. Jiang, and B. Mao. 2012. IDO: Intelligent data outsourcing with improved raid reconstruction performance in large-scale data centers. In Proceedings of the 26th USENIX Large Installation System Administration (LISA’12). 17--32. Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. S. Wu, H. Jiang, and B. Mao. 2015. Proactive data migration for improved storage availability in large-scale data centers. IEEE Transactions on Computers 64, 9 (2015), 2637--2651.Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. S. Wu, W. Yang, B. Mao, and Y. Lin. 2015. MC-RAIS: Multi-chunk redundant array of independent SSDs with improved performance. In Proceedings of the 15th International Conference on Algorithms and Architectures for Parallel Processing (ICA3PP’15). 18--32.Google ScholarGoogle Scholar
  46. T. Xie and Y. Sun. 2010. Dynamic data reallocation in hybrid disk arrays. IEEE Transactions on Parallel and Distributed Systems 21, 9 (2010), 1330--1341. Google ScholarGoogle ScholarDigital LibraryDigital Library
  47. Q. Yang and J. Ren. 2011. I-CASH: Intelligently coupled array of SSD and HDD. In Proceedings of the 17th International Symposium on High Performance Computer Architecture (HPCA’11). 278--289. Google ScholarGoogle ScholarDigital LibraryDigital Library
  48. L. Yi, J. Shu, J. Ou, and W. Zheng. 2013. CG-Resync: Conversion-guided resynchronization for a SSD-based RAID array. In Proceedings of the 31st International Conference on Computer Design (ICCD’13). 455--458.Google ScholarGoogle Scholar
  49. L. Zeng, D. Feng, J. Chen, Q. Wei, B. Veeravalli, and W. Liu. 2012. HRAID6ML: A hybrid RAID6 storage architecture with mirrored logging. In Proceedings of the 28th IEEE Conference on Massive Data Storage (MSST’12). 1--6.Google ScholarGoogle Scholar
  50. Y. Zhang, A. C. Arpaci-Dusseau, and R. H. Arpaci-Dusseau. 2013. Warped mirrors for flash. In Proceedings of the 29th IEEE Symposium on Massive Storage Systems and Technologies (MSST’13). 1--12.Google ScholarGoogle Scholar

Index Terms

  1. LDM: Log Disk Mirroring with Improved Performance and Reliability for SSD-Based Disk Arrays

            Recommendations

            Comments

            Login options

            Check if you have access through your login credentials or your institution to get full access on this article.

            Sign in

            Full Access

            • Published in

              cover image ACM Transactions on Storage
              ACM Transactions on Storage  Volume 12, Issue 4
              August 2016
              213 pages
              ISSN:1553-3077
              EISSN:1553-3093
              DOI:10.1145/2940403
              Issue’s Table of Contents

              Copyright © 2016 ACM

              Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

              Publisher

              Association for Computing Machinery

              New York, NY, United States

              Publication History

              • Published: 20 May 2016
              • Accepted: 1 February 2016
              • Revised: 1 November 2015
              • Received: 1 October 2014
              Published in tos Volume 12, Issue 4

              Permissions

              Request permissions about this article.

              Request Permissions

              Check for updates

              Qualifiers

              • research-article
              • Research
              • Refereed

            PDF Format

            View or Download as a PDF file.

            PDF

            eReader

            View online with eReader.

            eReader