research-article

Public Access

Ares: a framework for quantifying the resilience of deep neural networks

Authors:
Brandon Reagen

Harvard University

Harvard University
View Profile

,
Udit Gupta

Harvard University

Harvard University
View Profile

,
Lillian Pentecost

Harvard University

Harvard University
View Profile

,
Paul Whatmough

Harvard University

Harvard University
View Profile

,
Sae Kyu Lee

Harvard University

Harvard University
View Profile

,
Niamh Mulholland

Harvard University

Harvard University
View Profile

,
David Brooks

Harvard University

Harvard University
View Profile

,
Gu-Yeon Wei

Harvard University

Harvard University
View Profile

DAC '18: Proceedings of the 55th Annual Design Automation ConferenceJune 2018Article No.: 17Pages 1–6https://doi.org/10.1145/3195970.3195997

Published:24 June 2018Publication History

DAC '18: Proceedings of the 55th Annual Design Automation Conference

Pages 1–6

ABSTRACT

As the use of deep neural networks continues to grow, so does the fraction of compute cycles devoted to their execution. This has led the CAD and architecture communities to devote considerable attention to building DNN hardware. Despite these efforts, the fault tolerance of DNNs has generally been overlooked. This paper is the first to conduct a large-scale, empirical study of DNN resilience. Motivated by the inherent algorithmic resilience of DNNs, we are interested in understanding the relationship between fault rate and model accuracy. To do so, we present Ares: a light-weight, DNN-specific fault injection framework validated within 12% of real hardware. We find that DNN fault tolerance varies by orders of magnitude with respect to model, layer type, and structure.

References

"Solid state drive (ssd) requirements and endurance test method." https://www.jedec.org/standards-documents/focus/flash/solid-state-drives, 2017.Google Scholar
B. Reagen, P. Whatmough, R. Adolf, S. Rama, H. Lee, S. K. Lee, J. M. Hernandez-Lobato, G.-Y. Wei, and D. Brooks, "Minerva: Enabling low-power, highly-accurate deep neural network accelerators," ISCA, 2016. Google ScholarDigital Library
S. K. S. Hari, T. Tsai, M. Stephenson, S. W. Keckler, and J. Emer, "Sassifi: An architecture-level fault injection tool for gpu application resilience evaluation," ISPASS, 2017.Google Scholar
P. N. Whatmough, S. K. Lee, H. Lee, S. Rama, D. Brooks, and G. Y. Wei, "A 28nm soc with a 1.2ghz 568nj/prediction sparse deep-neural-network engine with 0.1 timing error rate tolerance for iot applications," ISSCC, Feb 2017.Google Scholar
I. Goodfellow, Y. Bengio, and A. Courville in Deep Learning, MIT Press, 2016. Google ScholarDigital Library
P. Kerlirzin and F. Vallet, "Robustness in multilayer perceptrons," Neural Computation, 1993. Google ScholarDigital Library
Y. L. Cun, J. S. Denker, and S. A. Solla, "Optimal brain damage," NIPS, 1990. Google ScholarDigital Library
G. Li, S. Hari, M. Sullivan, T. Tsai, K. Pattabiraman, J. Emer, and S. W. Keckler, "Understanding error propagation in deep learning neural network (dnn) accelerators and applications," SC, 2017. Google ScholarDigital Library
O. Temam, "A defect-tolerant accelerator for emerging high-performance applications," ISCA, June 2012. Google ScholarDigital Library
B. Randell, P. Lee, and P. C. Treleaven, "Reliability issues in computing system design," ACM Comput. Surv., June 1978. Google ScholarDigital Library
"Keras: The python deep learning library." http://keras.io, 2018.Google Scholar
J. Bergstra, O. Breuleux, F. Bastien, P. Lamblin, R. Pascanu, G. Desjardins, J. Turian, D. Warde-Farley, and Y. Bengio, "Theano: a CPU and GPU math expression compiler," SciPy, 2010.Google Scholar
"Tensorflow: An open-source software library for machine intelligence." https://www.tensorflow.org/, 2018.Google Scholar

Ares: a framework for quantifying the resilience of deep neural networks
1. Hardware

Recommendations

Ares: A framework for quantifying the resilience of deep neural networks
2018 55th ACM/ESDA/IEEE Design Automation Conference (DAC)
As the use of deep neural networks continues to grow, so does the fraction of compute cycles devoted to their execution. This has led the CAD and architecture communities to devote considerable attention to building DNN hardware. Despite these efforts, ...
Read More
Reliability Measure of Hardware Redundancy Fault-Tolerant Digital Systems with Intermittent Faults

While significant results are available which allow estimation of reliability measure for systems with permanent faults, no generally applicable results are available for intermittent (transient) faults. Methods are presented here which allow ...
Read More
Lossless-constraint Denoising based Auto-encoders

In this paper, we address the poor generalization ability problem of traditional auto-encoder on noise data, and propose a Lossless-constraint Denoising (LD) method, which can enhance the anti-noise ability and robustness of auto-encoders. We ...
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in

DAC '18: Proceedings of the 55th Annual Design Automation Conference
June 2018
1089 pages
ISBN:9781450357005
DOI:10.1145/3195970

Copyright © 2018 ACM
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]
Sponsors
In-Cooperation
Publisher
Association for Computing Machinery
New York, NY, United States
Publication History
- Published: 24 June 2018
Permissions
Request permissions about this article.
Request Permissions

Check for updates
Qualifiers
- research-article
Conference

Acceptance Rates
Overall Acceptance Rate1,770of5,499submissions,32%
Upcoming Conference
DAC '24

Sponsor:

sigda

61st ACM/IEEE Design Automation Conference

June 23 - 27, 2024

San Francisco , CA , USA
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 142
  Total Citations
  View Citations
- 2,696
  Total Downloads
- Downloads (Last 12 months)585
- Downloads (Last 6 weeks)69
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Ares: a framework for quantifying the resilience of deep neural networks

DAC '18: Proceedings of the 55th Annual Design Automation Conference

ABSTRACT

References

Cited By

Recommendations

Ares: A framework for quantifying the resilience of deep neural networks

Reliability Measure of Hardware Redundancy Fault-Tolerant Digital Systems with Intermittent Faults

Lossless-constraint Denoising based Auto-encoders

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Other Metrics

Article Metrics

Other Metrics

Cited By

PDF Format

eReader

Digital Edition

Caption

Ares: a framework for quantifying the resilience of deep neural networks

DAC '18: Proceedings of the 55th Annual Design Automation Conference

ABSTRACT

References

Cited By

Recommendations

Ares: A framework for quantifying the resilience of deep neural networks

Reliability Measure of Hardware Redundancy Fault-Tolerant Digital Systems with Intermittent Faults

Lossless-constraint Denoising based Auto-encoders

Comments

Login options

Full Access

Published in

Sponsors

In-Cooperation

Publisher

Publication History

Permissions

Check for updates

Qualifiers

Conference

Acceptance Rates

Upcoming Conference

Funding Sources

Article Metrics

Other Metrics

PDF Format

eReader

Digital Edition

Share this Publication link

Share on Social Media