Understanding and coping with failures in large-scale storage systems
  • Author: Qin Xin
  • Chair: Ethan L. Miller
  • Publisher: University of California at Santa Cruz, Computer and Information Sciences Dept., 265 Applied Sciences Building, Santa Cruz, CA, United States
  • ISBN: 978-0-542-38519-3
  • Order Number: AAI3194083
  • Pages: 134
Abstract

Reliability has become increasingly important for very large-scale storage systems as the demand for storage has grown dramatically. New reliability phenomena emerge as systems scale up; in such systems, failures are the norm rather than the exception. To ensure high reliability for petabyte-scale storage systems in scientific applications, this thesis studies the characterization of failures and techniques for coping with them.

The thesis first describes the architecture of a petabyte-scale storage system and characterizes the challenges of achieving high reliability in such a system. Long disk recovery times and the large number of system components are identified as the main obstacles to high system reliability.

The thesis then presents a fast recovery mechanism, FARM, which greatly reduces data loss in the event of multiple disk failures. The reliability of a petabyte-scale system with and without FARM is evaluated, and various aspects of system reliability, such as failure detection latency, bandwidth utilization for recovery, disk space utilization, and system scale, are examined through simulation.
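FARM's actual mechanism is detailed in the thesis itself; as a back-of-envelope illustration of why fast, distributed recovery matters, the sketch below compares rebuild time when one spare disk absorbs a rebuild against rebuild spread across many disks in parallel. The function name, disk capacity, and per-disk recovery bandwidth are illustrative assumptions, not figures from the thesis.

```python
def rebuild_time_hours(disk_gb, per_disk_recovery_mbps, participants):
    """Estimate rebuild time (hours) for one failed disk's data when
    'participants' disks each contribute per_disk_recovery_mbps of
    recovery bandwidth in parallel. Purely illustrative arithmetic."""
    total_mbps = per_disk_recovery_mbps * participants
    return (disk_gb * 1024) / total_mbps / 3600

# Traditional rebuild onto a single spare vs. declustered parallel
# recovery across 100 disks (assumed 500 GB disk, 20 MB/s per disk).
single = rebuild_time_hours(500, 20, 1)       # roughly 7 hours
parallel = rebuild_time_hours(500, 20, 100)   # two orders of magnitude faster
```

Shrinking the recovery window this way shrinks the interval during which a second failure can cause data loss, which is the intuition behind the reliability gains reported for FARM.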

The overall system reliability is modeled and estimated through quantitative analysis based on Markov models and event-driven simulations. Disk failure models that account for infant mortality are found to yield more precise reliability estimates than the traditional model that assumes a constant failure rate, since infant mortality has a pronounced impact on petabyte-scale systems. To safeguard data against failures of young disk drives, an adaptive data redundancy scheme is presented and evaluated.
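A common way to capture infant mortality is a Weibull lifetime model: a shape parameter below 1 gives a failure rate that is high for young disks and decreases with age, while a shape parameter of exactly 1 reduces to the constant-rate exponential model. The sketch below (with illustrative parameter values, not the thesis's fitted ones) shows how much the two models can disagree early in a disk's life.

```python
def weibull_hazard(t, beta, eta):
    """Instantaneous failure rate h(t) for a Weibull(beta, eta) lifetime.
    beta < 1 models infant mortality (decreasing hazard with age);
    beta = 1 is the constant-rate exponential special case."""
    return (beta / eta) * (t / eta) ** (beta - 1)

ETA = 10000.0  # characteristic life in hours (illustrative assumption)

# At 100 hours of age, the infant-mortality model predicts a failure
# rate several times higher than the constant-rate model.
young = weibull_hazard(100.0, beta=0.7, eta=ETA)
const = weibull_hazard(100.0, beta=1.0, eta=ETA)
```

Underestimating the failure rate of young disks by using a constant-rate model is exactly the error that becomes significant when a system contains thousands of drives, which motivates the adaptive redundancy scheme described above.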

A petabyte-scale storage system is typically built from thousands of components in a complicated interconnect structure. The impact of various failures on the interconnection network is gauged, and performance and robustness under degraded modes are evaluated in a simulated petabyte-scale storage system with different network topology configurations.

This thesis is directed towards understanding and coping with failures in petabyte-scale storage systems. It addresses several emerging reliability challenges posed by the increasing scale of storage systems and studies methods for improving system reliability. The research is intended to help system architects design reliable storage systems at petabyte scale and beyond.

Contributors
  • University of California, Santa Cruz
