skip to main content
10.1145/3195970.3195997acmconferencesArticle/Chapter ViewAbstractPublication PagesdacConference Proceedingsconference-collections
research-article
Public Access

Ares: a framework for quantifying the resilience of deep neural networks

Published:24 June 2018Publication History

ABSTRACT

As the use of deep neural networks continues to grow, so does the fraction of compute cycles devoted to their execution. This has led the CAD and architecture communities to devote considerable attention to building DNN hardware. Despite these efforts, the fault tolerance of DNNs has generally been overlooked. This paper is the first to conduct a large-scale, empirical study of DNN resilience. Motivated by the inherent algorithmic resilience of DNNs, we are interested in understanding the relationship between fault rate and model accuracy. To do so, we present Ares: a light-weight, DNN-specific fault injection framework validated within 12% of real hardware. We find that DNN fault tolerance varies by orders of magnitude with respect to model, layer type, and structure.

References

  1. "Solid state drive (ssd) requirements and endurance test method." https://www.jedec.org/standards-documents/focus/flash/solid-state-drives, 2017.Google ScholarGoogle Scholar
  2. B. Reagen, P. Whatmough, R. Adolf, S. Rama, H. Lee, S. K. Lee, J. M. Hernandez-Lobato, G.-Y. Wei, and D. Brooks, "Minerva: Enabling low-power, highly-accurate deep neural network accelerators," ISCA, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. S. K. S. Hari, T. Tsai, M. Stephenson, S. W. Keckler, and J. Emer, "Sassifi: An architecture-level fault injection tool for gpu application resilience evaluation," ISPASS, 2017.Google ScholarGoogle Scholar
  4. P. N. Whatmough, S. K. Lee, H. Lee, S. Rama, D. Brooks, and G. Y. Wei, "A 28nm soc with a 1.2ghz 568nj/prediction sparse deep-neural-network engine with 0.1 timing error rate tolerance for iot applications," ISSCC, Feb 2017.Google ScholarGoogle Scholar
  5. I. Goodfellow, Y. Bengio, and A. Courville in Deep Learning, MIT Press, 2016. Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. P. Kerlirzin and F. Vallet, "Robustness in multilayer perceptrons," Neural Computation, 1993. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. Y. L. Cun, J. S. Denker, and S. A. Solla, "Optimal brain damage," NIPS, 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  8. G. Li, S. Hari, M. Sullivan, T. Tsai, K. Pattabiraman, J. Emer, and S. W. Keckler, "Understanding error propagation in deep learning neural network (dnn) accelerators and applications," SC, 2017. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. O. Temam, "A defect-tolerant accelerator for emerging high-performance applications," ISCA, June 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. B. Randell, P. Lee, and P. C. Treleaven, "Reliability issues in computing system design," ACM Comput. Surv., June 1978. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. "Keras: The python deep learning library." http://keras.io, 2018.Google ScholarGoogle Scholar
  12. J. Bergstra, O. Breuleux, F. Bastien, P. Lamblin, R. Pascanu, G. Desjardins, J. Turian, D. Warde-Farley, and Y. Bengio, "Theano: a CPU and GPU math expression compiler," SciPy, 2010.Google ScholarGoogle Scholar
  13. "Tensorflow: An open-source software library for machine intelligence." https://www.tensorflow.org/, 2018.Google ScholarGoogle Scholar
  1. Ares: a framework for quantifying the resilience of deep neural networks

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in
    • Published in

      cover image ACM Conferences
      DAC '18: Proceedings of the 55th Annual Design Automation Conference
      June 2018
      1089 pages
      ISBN:9781450357005
      DOI:10.1145/3195970

      Copyright © 2018 ACM

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 24 June 2018

      Permissions

      Request permissions about this article.

      Request Permissions

      Check for updates

      Qualifiers

      • research-article

      Acceptance Rates

      Overall Acceptance Rate1,770of5,499submissions,32%

      Upcoming Conference

      DAC '24
      61st ACM/IEEE Design Automation Conference
      June 23 - 27, 2024
      San Francisco , CA , USA

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader