skip to main content
10.1145/2486092.2486108acmconferencesArticle/Chapter ViewAbstractPublication PagespadsConference Proceedingsconference-collections
research-article

Optimizing parallel simulation of multicore systems using domain-specific knowledge

Published:19 May 2013Publication History

ABSTRACT

This paper presents two optimization techniques for the basic Null-message algorithm in the context of parallel simulation of multicore computer architectures. Unlike the general, application-independent optimization methods, these are application-specific optimizations that make use of system properties of the simulation application. We demonstrate in two aspects that the domain-specific knowledge offers great potential for optimization. First, it allows us to send Null-messages much less eagerly, thus greatly reducing the amount of Null-messages. Second, the internal state of the simulation application allows us to make conservative forecast of future outgoing events. This leads to the creation of an enhanced synchronization algorithm called Forecast Null-message algorithm, which, by combining the forecast from both sides of a link, can greatly improve the simulation look-ahead. Compared with the basic Null-message algorithm, our optimizations greatly reduce the number of Null-messages and increase simulation performance significantly as a result. On a subset of the PARSEC benchmarks, a maximum speedup of about 6 is achieved with 17 LPs.

References

  1. C. Bienia and K. Li. Parsec 2.0: A new benchmark suite for chip-multiprocessors. Proceedings of the 5th Annual Workshop on Modeling, Benchmarking and Simulation, 2009.Google ScholarGoogle Scholar
  2. K. Chandy and J. Misra. Distributed simulation: a case study in design and verification of distributed programs. IEEE Transactions on Software Engineering, SE-5(5):440--452, 1979. Google ScholarGoogle ScholarDigital LibraryDigital Library
  3. J. Chen, L. Dabbiru, D. Wong, M. Annavaram, and M. Dubois. Adaptive and speculative slack simulations of cmps on cmps. Proceedings of the 43rd Annual IEEE/ACM International Symposium on Microarchitecture, pages 523--534, 2010. Google ScholarGoogle ScholarDigital LibraryDigital Library
  4. M. Chidester and A. George. Parallel simulation of chip-multiprocessor architectures. ACM Transactions on Modeling and Computer Simulation, 12(3):176--200, July 2002. Google ScholarGoogle ScholarDigital LibraryDigital Library
  5. P. Clements, F. Bachmann, L. Bass, D. Garlan, J. Ivers, R. Little, P. Merson, R. Nord, and J. Stafford. Documenting Software Architectures: Views and Beyond. Addison-Wesley, 2nd edition, 2011.Google ScholarGoogle Scholar
  6. R. DeVries. Reducing null messages in misra's distributed discrete event simulation method. IEEE Transactions on Software Engineering, 16(1):82--91, January 1990. Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. J. Duato, S. Yalamanchili, and L. Ni. Interconnection Networks, an Engineering Approach. Morgan Kaufmann, 2003.Google ScholarGoogle Scholar
  8. R. Fujimoto. Parallel and Distributed Simulation Systems. John Wiley & Sons, 2000. Google ScholarGoogle ScholarDigital LibraryDigital Library
  9. Intel. Pin - a dynamic binary instrumentation tool. http://software.intel.com/en-us/articles/pin-a-dynamic-binary-instrumen%tation-tool.Google ScholarGoogle Scholar
  10. S. Keckler, K. Olukotun, and H. Hofstee, editors. Multicore Processors and Systems. Springer, 2009. Google ScholarGoogle ScholarDigital LibraryDigital Library
  11. C. Kersey, A. Rodrigues, and S. Yalamanchili. A universal parallel front-end for execution driven microarchitecture simulation. Proceedings of the 2012 Workshop on Rapid Simulation and Performance Evaluation Methods and Tools, pages 25--32, 2012. Google ScholarGoogle ScholarDigital LibraryDigital Library
  12. G. Loh, S. Subramaniam, and Y. Xie. Zesto: A cycle-level simulator for highly detailed microarchitecture exploration. International Symposium on Performance Analysis of Software and Systems, pages 53--64, 2009.Google ScholarGoogle ScholarCross RefCross Ref
  13. manifold.gatech.edu. Manifold. http://manifold.gatech.edu.Google ScholarGoogle Scholar
  14. J. Miller, H. Kasture, G.Kurian, C. Gruenwald, N. Beckmann, C. Celio, J. Eastep, and A. Agarwal. Graphite: A distributed parallel simulator for multicores. Proceedings of the 16th International Symposium on High-Performance Computer Architecture, pages 1--12, 2010.Google ScholarGoogle ScholarCross RefCross Ref
  15. J. Misra. Distributed discrete event simulation. ACM Computing Surveys, 18(1):39--65, March 1986. Google ScholarGoogle ScholarDigital LibraryDigital Library
  16. A. Rodrigues, K. Hemmert, B. Barrett, C. Kersey, R. Oldfield, M. Weston, R. Risen, J. Cook, P. Rosenfeld, E. CooperBalls, and B.Jacob. The structural simulation toolkit. ACM SIGMETRICS Performance Evaluation Review, 38(4):37--42, March 2011. Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. W.-K. Su and C. Seitz. Variants of the chandy-misra-bryant distributed discrete-event simulation algorithm. Technical Report Caltech-CS-TR-88-22, California Institute of Technology, 1988. Google ScholarGoogle Scholar

Index Terms

  1. Optimizing parallel simulation of multicore systems using domain-specific knowledge

      Recommendations

      Comments

      Login options

      Check if you have access through your login credentials or your institution to get full access on this article.

      Sign in
      • Published in

        cover image ACM Conferences
        SIGSIM PADS '13: Proceedings of the 1st ACM SIGSIM Conference on Principles of Advanced Discrete Simulation
        May 2013
        426 pages
        ISBN:9781450319201
        DOI:10.1145/2486092

        Copyright © 2013 ACM

        Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

        Publisher

        Association for Computing Machinery

        New York, NY, United States

        Publication History

        • Published: 19 May 2013

        Permissions

        Request permissions about this article.

        Request Permissions

        Check for updates

        Qualifiers

        • research-article

        Acceptance Rates

        SIGSIM PADS '13 Paper Acceptance Rate29of75submissions,39%Overall Acceptance Rate398of779submissions,51%

      PDF Format

      View or Download as a PDF file.

      PDF

      eReader

      View online with eReader.

      eReader