The hybrid/heterogeneous nature of future microprocessors and large high-performance computing systems will result in a reliance on two major types of components: multicore/manycore central processing units and special purpose hardware/massively parallel accelerators. While these technologies have numerous benefits, they also pose substantial performance challenges for developers, including scalability, software tuning, and programming issues. Researchers at the Forefront Reveal Results from Their Own State-of-the-Art WorkEdited by some of the top researchers in the field and with contributions from a variety of international experts, Scientific Computing with Multicore and Accelerators focuses on the architectural design and implementation of multicore and manycore processors and accelerators, including graphics processing units (GPUs) and the Sony Toshiba IBM (STI) Cell Broadband Engine (BE) currently used in the Sony PlayStation 3. The book explains how numerical libraries, such as LAPACK, help solve computational science problems; explores the emerging area of hardware-oriented numerics; and presents the design of a fast Fourier transform (FFT) and a parallel list ranking algorithm for the Cell BE. It covers stencil computations, auto-tuning, optimizations of a computational kernel, sequence alignment and homology, and pairwise computations. The book also evaluates the portability of drug design applications to the Cell BE and illustrates how to successfully exploit the computational capabilities of GPUs for scientific applications. It concludes with chapters on dataflow frameworks, the Charm++ programming model, scan algorithms, and a portable intracore communication framework. Explores the New Computational Landscape of Hybrid Processors By offering insight into the process of constructing and effectively using the technology, this volume provides a thorough and practical introduction to the area of hybrid computing. It discusses introductory concepts and simple examples of parallel computing, logical and performance debugging for parallel computing, and advanced topics and issues related to the use and building of many applications.
Cited By
- Thomas A and Kumar A (2018). A comparative evaluation of systems for scalable linear algebra-based analytics, Proceedings of the VLDB Endowment, 11:13, (2168-2182), Online publication date: 1-Sep-2018.
- Thomas A and Kumar A (2019). A comparative evaluation of systems for scalable linear algebra-based analytics, Proceedings of the VLDB Endowment, 11:13, (2168-2182), Online publication date: 1-Sep-2018.
- Szustak L, Halbiniak K, Kuczynski L, Wrobel J and Kulawik A (2019). Porting and optimization of solidification application for CPU-MIC hybrid platforms, International Journal of High Performance Computing Applications, 32:4, (523-539), Online publication date: 1-Jul-2018.
- Benatia A, Ji W, Wang Y and Shi F (2018). BestSF, ACM Transactions on Architecture and Code Optimization, 15:3, (1-27), Online publication date: 8-Oct-2018.
- Morad A, Yavits L, Kvatinsky S and Ginosar R (2016). Resistive GP-SIMD Processing-In-Memory, ACM Transactions on Architecture and Code Optimization, 12:4, (1-22), Online publication date: 7-Jan-2016.
- Eddelbuettel D and Sanderson C (2014). RcppArmadillo, Computational Statistics & Data Analysis, 71:C, (1054-1063), Online publication date: 1-Mar-2014.
Recommendations
Petascale computing with accelerators
PPoPP '09: Proceedings of the 14th ACM SIGPLAN symposium on Principles and practice of parallel programmingA trend is developing in high performance computing in which commodity processors are coupled to various types of computational accelerators. Such systems are commonly called hybrid systems. In this paper, we describe our experience developing an ...
Comparing Hardware Accelerators in Scientific Applications: A Case Study
Multicore processors and a variety of accelerators have allowed scientific applications to scale to larger problem sizes. We present a performance, design methodology, platform, and architectural comparison of several application accelerators executing ...