ABSTRACT
The design of high-performance computing architectures requires performance analysis of large-scale parallel applications to derive various parameters concerning hardware design and software development. The process of performance analysis and benchmarking an application can be done in several ways with varying degrees of fidelity. One of the most cost-effective ways is to do a coarse-grained study of large-scale parallel applications through the use of program skeletons. The concept of a "program skeleton" that we discuss in this paper is an abstracted program that is derived from a larger program where source code that is determined to be irrelevant is removed for the purposes of the skeleton. In this work, we develop a semi-automatic approach for extracting program skeletons based on compiler program analysis. We demonstrate correctness of our skeleton extraction process by comparing details from communication traces, as well as show the performance speedup of using skeletons by running simulations in the SST/macro simulator.
- V. S. Adve, R. Bagrodia, E. Deelman, and R. Sakellariou. Compiler-optimized simulation of large-scale applications on high performance architectures. Journal of Parallel and Distributed Computing, 62(3):393--426, March 2002. Google ScholarDigital Library
- D. H. Bailey, E. Barszcz, J. T. Barton, D. S. Browning, R. L. Carter, L. Dagum, R. A. Fatoohi, P. O. Frederickson, T. A. Lasinski, R. S. Schreiber, H. D. Simon, V. Venkatakrishnan, and S. K. Weeratunga. The NAS parallel benchmarks-summary and preliminary results. In Proceedings of the 1991 ACM/IEEE conference on Supercomputing, Supercomputing '91, pages 158--165, New York, NY, USA, 1991. ACM. Google ScholarDigital Library
- J. Dongarra, T. Hey, and E. Strohmaier. PARKBENCH: Methodology, Relations and Results. In H. M. Liddell, A. Colbrook, L. O. Hertzberger, and P. M. A. Sloot, editors, HPCN Europe, volume 1067 of Lecture Notes in Computer Science, pages 770--777. Springer, 1996. Google ScholarDigital Library
- C. L. Janssen, H. Adalsteinsson, S. Cranford, J. P. Kenny, A. Pinar, D. A. Evensky, and J. Mayo. A simulator for large-scale parallel architectures. International Journal of Parallel and Distributed Systems, 1(2):57--73, 2010. Google ScholarDigital Library
- Lawrence Livermore National Laboratory. ROSE Compiler. http://www.roseCompiler.org.Google Scholar
- R. Preissl, M. Schulz, D. Kranzlmüller, B. R. de Supinski, and D. J. Quinlan. Using MPI Communication Patterns to Guide Source Code Transformations. In Proceedings of the 8th International Conference on Computational Science (ICCS), pages 253--260. Springer-Verlag, 2008. Google ScholarDigital Library
- Sandia National Laboratories. HPCCG. http://bec.syr.edu/hpccg.html.Google Scholar
- Sandia National Laboratories. The Structural Simulation Toolkit. http://sst.sandia.gov/.Google Scholar
- J. Shalf, D. Quinlan, and C. Janssen. Rethinking hardware-software codesign for exascale systems. IEEE Computer, 44(11):22--30, Nov. 2011. Google ScholarDigital Library
- J. Subhlok and Q. Xu. Automatic construction of coordinated performance skeletons. In Proceedings of the 2008 IEEE International Symposium on Parallel and Distributed Processing (IPDPS), pages 1--5, April 2008.Google ScholarCross Ref
Index Terms
- Semi-automatic extraction of software skeletons for benchmarking large-scale parallel applications
Recommendations
Static Analysis Techniques for Semiautomatic Synthesis of Message Passing Software Skeletons
Special Issue on PADSThe design of high-performance computing architectures requires performance analysis of large-scale parallel applications to derive various parameters concerning hardware design and software development. The process of performance analysis and ...
Parallel Simulation of Large-Scale Parallel Applications
Accurate and efficient simulation of large parallel applications can be facilitated with the use of direct execution and parallel discrete-event simulation. This paper describes MPI-SIM, a direct execution-driven parallel simulator designed to predict ...
Comments