ABSTRACT
Legacy archival workloads have a typical write-once-read-never pattern, which fits well for tape based archival systems. With the emergence of newer applications like Facebook, Yahoo! Flickr, Apple iTunes, demand for a new class of archives has risen, where archived data continues to get accessed, albeit at lesser frequency and relaxed latency requirements. We call these types of archival storage systems as active archives. However, keeping archived data on always spinning storage media to fulfill occasional read requests is not practical due to significant power costs. Using spin-down disks, having better latency characteristics as compared to tapes, for active archives can save significant power. In this paper, we present a two-tier architecture for active archives comprising of online and offline disks, and provide an access-aware intelligent data layout mechanism to bring power efficiency. We validate the proposed mechanism with real-world archival traces. Our results indicate that the proposed clustering and optimized data layout algorithms save upto 78% power over random placement.
- T. Achterberg. Scip: solving constraint integer programs. Mathematical Programming Computation, 1(1):1--41, 2009.Google ScholarCross Ref
- S. Balakrishnan, R. Black, A. Donnelly, P. England, A. Glass, D. Harper, S. Legtchenko, A. Ogus, E. Peterson, and A. Rowstron. Pelican: A building block for exascale cold data storage. In Proceedings of the 11th USENIX conference on Operating Systems Design and Implementation, pages 351--365. USENIX Association, 2014. Google ScholarDigital Library
- D. Beaver, S. Kumar, H. C. Li, J. Sobel, P. Vajgel, et al. Finding a needle in haystack: Facebook's photo storage. In OSDI, volume 10, pages 1--8, 2010. Google ScholarDigital Library
- E. V. Carrera, E. Pinheiro, and R. Bianchini. Conserving disk energy in network servers. In Proceedings of the 17th annual international conference on Supercomputing, pages 86--97. ACM, 2003. Google ScholarDigital Library
- M. Grawinkel, L. Nagel, M. Mäsker, F. Padua, A. Brinkmann, and L. Sorth. Analysis of the ecmwf storage landscape. In Proc. of the 13th USENIX Conference on File and Storage Technologies (FAST), 2015. Google ScholarDigital Library
- K. M. Greenan, D. D. Long, E. L. Miller, T. J. Schwarz, and J. J. Wylie. A spin-up saved is energy earned: Achieving power-efficient, erasure-coded storage. In HotDep, 2008. Google ScholarDigital Library
- P. Gupta, A. Wildani, E. L. Miller, D. Rosenthal, I. F. Adams, C. Strong, and A. Hospodor. An economic perspective of disk vs. flash media in archival storage. In IEEE 22nd International Symposium on Modelling, Analysis & Simulation of Computer and Telecommunication Systems (MASCOTS), 2014, pages 249--254. IEEE, 2014. Google ScholarDigital Library
- J. Huang, F. Zhang, X. Qin, and C. Xie. Exploiting redundancies and deferred writes to conserve energy in erasure-coded storage clusters. ACM Transactions on Storage (TOS), 9(2):4, 2013. Google ScholarDigital Library
- A. Kathpal and G. A. N. Yasa. Nakshatra: Towards running batch analytics on an archive. In IEEE 22nd International Symposium on Modelling, Analysis & Simulation of Computer and Telecommunication Systems (MASCOTS), 2014, pages 479--482. IEEE, 2014. Google ScholarDigital Library
- T. Xie and Y. Sun. Sacrificing reliability for energy saving: Is it worthwhile for disk arrays? In Parallel and Distributed Processing, 2008. IPDPS 2008. IEEE International Symposium on, pages 1--12. IEEE, 2008.Google ScholarCross Ref
- Q. Zhu, Z. Chen, L. Tan, Y. Zhou, K. Keeton, and J. Wilkes. Hibernator: helping disk arrays sleep through the winter. In ACM SIGOPS Operating Systems Review, volume 39, pages 177--190. ACM, 2005. Google ScholarDigital Library
Index Terms
- Data layout for power efficient archival storage systems
Comments