ABSTRACT
The BlueGene/L supercomputer is expected to deliver new levels of application performance by providing a combination of good single-node computational performance and high scalability. To achieve good single-node performance, the BlueGene/L design includes a special dual floating-point unit on each processor and the ability to use two processors per node. BlueGene/L also includes both a torus and a tree network to achieve high scalability. We demonstrate how benchmarks and applications can take advantage of these architectural features to get the most out of BlueGene/L.
- {1} N.R. Adiga et al. An overview of the BlueGene/L supercomputer. In SC2002 - High Performance Networking and Computing, Baltimore, MD, November 2002. Google ScholarDigital Library
- {2} ASCI Red Homepage. http://www.sandia.gov/ASCI/Red/.Google Scholar
- {3} S. Larsen and S. Amarasinghe, Exploiting superword level parallelism with multimedia instruction sets. In Proceedings of SIGPLAN 2004 Conference on Programming Language Design and Implementation, pages 145-156, June 2000. Google ScholarDigital Library
- {4} A. Eichenberger, P. Wu, and K. O'Brien. Vectorization for short SIMD architectures with alignment constraints. In Proceedings of SIGPLAN 2004 Conference on Programming Language Design and Implementation, Washington, D.C., June 2004. Google ScholarDigital Library
- {5} G. Almasi, L. Bachega, S. Chatterjee, D. Lieber, X. Martorell, and J.E. Moreira. Enabling dual-core mode in BlueGene/L: Challenges and solutions. In Proceedings of 15th Symposium on Computer Architecture and High Performance Computing, Sao Paulo, Brazil, November 2003. Google ScholarDigital Library
- {6} NAS Parallel Benchmarks. http://www.nas.nasa.gov/Software/NPB.Google Scholar
- {7} The Linpack Benchmark. http:/www.netlib.org/benchmark/top500/lists/linpack.html.Google Scholar
- {8} ASCI Purple Benchmark Page. http://www.llnl.gov/asci/purple/benchmarks/limited/code_list.html.Google Scholar
- {9} IBM Mathematical Acceleration Subsystem. http://techsupport.services.ibm.com/server/mass.Google Scholar
- {10} Metis home page. http://www-users.cs.umn.edu/~karypis/metis/index.html.Google Scholar
- {11} CPMD home page. http://www.cpmd.orgGoogle Scholar
- {12} Enzo Home Page, http://cosmos.ucsd.edu/enzo.Google Scholar
Recommendations
The BlueGene/L supercomputer and quantum ChromoDynamics
SC '06: Proceedings of the 2006 ACM/IEEE conference on SupercomputingWe describe our methods for performing quantum chromodynamics (QCD) simulations that sustain up to 20% of the peak performance on BlueGene supercomputers. We present our methods, scaling properties, and first cutting edge results relevant to QCD. We ...
Unlocking performance portability on LUMI-G supercomputer: A virtual screening case study
IWOCL '24: Proceedings of the 12th International Workshop on OpenCL and SYCLHigh-Performance Computing is the target system for virtual screening applications, which aim to suggest which candidates to test in the drug discovery process. The HPC heterogeneity of modern systems raises the functional and performance portability ...
Performance Optimization of the HPCG Benchmark on the Sunway TaihuLight Supercomputer
In this article, we present some key techniques for optimizing HPCG on Sunway TaihuLight and demonstrate how to achieve high performance in memory-bound applications by exploiting specific characteristics of the hardware architecture. In particular, we ...
Comments