skip to main content
Skip header Section
Using MPI: portable parallel programming with the message-passing interfaceAugust 1994
Publisher:
  • MIT Press
  • 55 Hayward St.
  • Cambridge
  • MA
  • United States
ISBN:978-0-262-57104-3
Published:01 August 1994
Pages:
307
Skip Bibliometrics Section
Bibliometrics
Abstract

No abstract available.

Cited By

  1. Zhou N, Zhou H and Hoppe D (2023). Containerization for High Performance Computing Systems: Survey and Prospects, IEEE Transactions on Software Engineering, 49:4, (2722-2740), Online publication date: 1-Apr-2023.
  2. Zhang K, Su H and Dou Y (2021). Multilevel parallelism optimization of stencil computations on SIMDlized NUMA architectures, The Journal of Supercomputing, 77:11, (13584-13600), Online publication date: 1-Nov-2021.
  3. ACM
    Williams B, Eker A, Chiu K and Ponomarev D High-Performance PDES on Manycore Clusters Proceedings of the 2021 ACM SIGSIM Conference on Principles of Advanced Discrete Simulation, (153-164)
  4. Memeti S, Pllana S, Binotto A, Kołodziej J and Brandic I (2019). Using meta-heuristics and machine learning for software optimization of parallel computing systems, Computing, 101:8, (893-936), Online publication date: 1-Aug-2019.
  5. ACM
    Hayashi A, Paul S and Sarkar V GPUIterator: bridging the gap between Chapel and GPU platforms Proceedings of the ACM SIGPLAN 6th on Chapel Implementers and Users Workshop, (2-11)
  6. Eckstein J, Goldberg N and Kagawa A Rule-enhanced penalized regression by column generation using rectangular maximum agreement Proceedings of the 34th International Conference on Machine Learning - Volume 70, (1059-1067)
  7. Salles T and Hardiman L (2016). Badlands, Computers & Geosciences, 91:C, (77-89), Online publication date: 1-Jun-2016.
  8. ACM
    Achour S and Rinard M (2015). Approximate computation with outlier detection in Topaz, ACM SIGPLAN Notices, 50:10, (711-730), Online publication date: 18-Dec-2015.
  9. ACM
    Hayashi A, Zhao J, Ferguson M and Sarkar V LLVM-based communication optimizations for PGAS programs Proceedings of the Second Workshop on the LLVM Compiler Infrastructure in HPC, (1-11)
  10. ACM
    Achour S and Rinard M Approximate computation with outlier detection in Topaz Proceedings of the 2015 ACM SIGPLAN International Conference on Object-Oriented Programming, Systems, Languages, and Applications, (711-730)
  11. Peretti Pezzi G, Vaissié E, Viala Y, Caromel D and Gourbesville P (2015). Parallel profiling of water distribution networks using the Clément formula, Applied Mathematics and Computation, 267:C, (83-95), Online publication date: 15-Sep-2015.
  12. Grelck C Single Assignment C (SAC) Central European Functional Programming School, (207-282)
  13. Solomonik E, Matthews D, Hammond J, Stanton J and Demmel J (2014). A massively parallel tensor contraction framework for coupled-cluster computations, Journal of Parallel and Distributed Computing, 74:12, (3176-3190), Online publication date: 1-Dec-2014.
  14. ACM
    Chan Y, Wellings A, Gray I and Audsley N On the Locality of Java 8 Streams in Real-Time Big Data Applications Proceedings of the 12th International Workshop on Java Technologies for Real-time and Embedded Systems, (20-28)
  15. ACM
    Sun Y, Lifflander J and Kalé L PICS Proceedings of the 4th International Workshop on Runtime and Operating Systems for Supercomputers, (1-8)
  16. Das D, Chen D, Martins A, Schneider N and Smith N (2014). Frame-semantic parsing, Computational Linguistics, 40:1, (9-56), Online publication date: 1-Mar-2014.
  17. ACM
    Dennis J Compiling Fresh Breeze Codelets Proceedings of Programming Models and Applications on Multicores and Manycores, (51-60)
  18. ACM
    Dennis J Compiling Fresh Breeze Codelets Proceedings of Programming Models and Applications on Multicores and Manycores, (51-60)
  19. Guo Z, Mi J, Xiong S and Grant P (2014). Phase field study of the tip operating state of a freely growing dendrite against convection using a novel parallel multigrid approach, Journal of Computational Physics, 257:PA, (278-297), Online publication date: 15-Jan-2014.
  20. Lotz J, Naumann U, Sagebaum M and Schanen M Discrete adjoints of PETSc through dco/c++ and adjoint MPI Proceedings of the 19th international conference on Parallel Processing, (497-507)
  21. Zhang Z, Zhang S, Wang Y, Jiang Y and Wang H (2013). Use of parallel deterministic dynamic programming and hierarchical adaptive genetic algorithm for reservoir operation optimization, Computers and Industrial Engineering, 65:2, (310-321), Online publication date: 1-Jun-2013.
  22. Wu Y, Li T, Sun L and Chen J (2013). Parallelization of a hydrological model using the message passing interface, Environmental Modelling & Software, 43, (124-132), Online publication date: 1-May-2013.
  23. Lin J, Zha L and Xu Z (2013). Consolidated cluster systems for data centers in the cloud age, Frontiers of Computer Science: Selected Publications from Chinese Universities, 7:1, (1-19), Online publication date: 1-Feb-2013.
  24. Esteban F, Díaz D, Hernández P, Caballero J, Dorado G and Gálvez S (2013). Direct approaches to exploit many-core architecture in bioinformatics, Future Generation Computer Systems, 29:1, (15-26), Online publication date: 1-Jan-2013.
  25. Zhu F, Gonzalez D, Carpenter T, Atkinson M and Wardlaw J (2012). Parallel perfusion imaging processing using GPGPU, Computer Methods and Programs in Biomedicine, 108:3, (1012-1021), Online publication date: 1-Dec-2012.
  26. Neves M, Ferreto T and De Rose C Scheduling mapreduce jobs in HPC clusters Proceedings of the 18th international conference on Parallel Processing, (179-190)
  27. ACM
    Feng M, Gupta R and Neamtiu I (2012). Effective parallelization of loops in the presence of I/O operations, ACM SIGPLAN Notices, 47:6, (487-498), Online publication date: 6-Aug-2012.
  28. Bordner J and Norman M Enzo-P / Cello Proceedings of the Extreme Scaling Workshop, (1-11)
  29. Wang J, Ponomarev D and Abu-Ghazaleh N Performance Analysis of a Multithreaded PDES Simulator on Multicore Clusters Proceedings of the 2012 ACM/IEEE/SCS 26th Workshop on Principles of Advanced and Distributed Simulation, (93-95)
  30. Stewart R, Trinder P and Maier P Supervised Workpools for Reliable Massively Parallel Computing Proceedings of the 2012 Conference on Trends in Functional Programming - Volume 7829, (247-262)
  31. ACM
    Feng M, Gupta R and Neamtiu I Effective parallelization of loops in the presence of I/O operations Proceedings of the 33rd ACM SIGPLAN Conference on Programming Language Design and Implementation, (487-498)
  32. Das D and Smith N Graph-based lexicon expansion with sparsity-inducing penalties Proceedings of the 2012 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies, (677-687)
  33. Brendel P, Dłotko P, Mrozek M and Żelazna N Homology computations via acyclic subspace Proceedings of the 4th international conference on Computational Topology in Image Context, (117-127)
  34. ACM
    Shi L, Zhao Y and Tang J (2012). Batch Mode Active Learning for Networked Data, ACM Transactions on Intelligent Systems and Technology, 3:2, (1-25), Online publication date: 1-Feb-2012.
  35. ACM
    Feng M, Lin C and Gupta R (2012). PLDS, ACM Transactions on Architecture and Code Optimization, 8:4, (1-21), Online publication date: 1-Jan-2012.
  36. Pacini E, Ribero M, Mateos C, Mirasso A and Garino C Simulation on cloud computing infrastructures of parametric studies of nonlinear solids problems Proceedings of the Second international conference on Advances in New Technologies, Interactive Interfaces and Communicability, (58-70)
  37. Pedemonte M, Nesmachnow S and Cancela H (2011). A survey on parallel ant colony optimization, Applied Soft Computing, 11:8, (5181-5197), Online publication date: 1-Dec-2011.
  38. Dwornik M and Pięta A Parallel implementation of stochastic inversion of seismic tomography data Proceedings of the 9th international conference on Parallel Processing and Applied Mathematics - Volume Part II, (353-360)
  39. ACM
    Feng M, Gupta R and Hu Y (2011). SpiceC, ACM SIGPLAN Notices, 46:8, (69-80), Online publication date: 7-Sep-2011.
  40. Solomonik E and Demmel J Communication-optimal parallel 2.5D matrix multiplication and LU factorization algorithms Proceedings of the 17th international conference on Parallel processing - Volume Part II, (90-109)
  41. Georgiev K, Ostromsky T and Zlatev Z New parallel implementation of an air pollution computer model --- performance study on an IBM blue gene/p computer Proceedings of the 8th international conference on Large-Scale Scientific Computing, (283-290)
  42. Bailey D and Borwein J (2011). High-precision numerical integration, Journal of Symbolic Computation, 46:7, (741-754), Online publication date: 1-Jul-2011.
  43. Grelck C Single assignment C (SAC) high productivity meets high performance Proceedings of the 4th Summer School conference on Central European Functional Programming School, (207-278)
  44. ACM
    Schnorr L, Legrand A and Vincent J Multi-scale analysis of large distributed computing systems Proceedings of the third international workshop on Large-scale system and application performance, (27-34)
  45. Abdennhader N, Belgacem M, Couturier R, Laiymani D, Miquée S, Niinimaki M and Sauget M Gridification of a radiotherapy dose computation application with the xtremweb-CH environment Proceedings of the 6th international conference on Advances in grid and pervasive computing, (188-197)
  46. ACM
    Gray I and Audsley N (2011). Targeting complex embedded architectures by combining the multicore communications API (mcapi) with compile-time virtualisation, ACM SIGPLAN Notices, 46:5, (51-60), Online publication date: 11-Apr-2011.
  47. ACM
    Gray I and Audsley N Targeting complex embedded architectures by combining the multicore communications API (mcapi) with compile-time virtualisation Proceedings of the 2011 SIGPLAN/SIGBED conference on Languages, compilers and tools for embedded systems, (51-60)
  48. ACM
    Feng M, Gupta R and Hu Y SpiceC Proceedings of the 16th ACM symposium on Principles and practice of parallel programming, (69-80)
  49. ACM
    Kulkarni A, Liu Y and Smith S (2010). Task types for pervasive atomicity, ACM SIGPLAN Notices, 45:10, (671-690), Online publication date: 17-Oct-2010.
  50. ACM
    Kulkarni A, Liu Y and Smith S Task types for pervasive atomicity Proceedings of the ACM international conference on Object oriented programming systems languages and applications, (671-690)
  51. ACM
    Kristensen M and Vinter B Numerical Python for scalable architectures Proceedings of the Fourth Conference on Partitioned Global Address Space Programming Model, (1-9)
  52. Schanen M, Förster M and Naumann U Second-order algorithmic differentiation by source transformation of MPI code Proceedings of the 17th European MPI users' group meeting conference on Recent advances in the message passing interface, (257-264)
  53. ACM
    Zhang Q, Zhang Y, Yu H and Huang X Efficient partial-duplicate detection based on sequence matching Proceedings of the 33rd international ACM SIGIR conference on Research and development in information retrieval, (675-682)
  54. ACM
    Gray I and Audsley N Supporting islands of coherency for highly-parallel embedded architectures using compile-time virtualisation Proceedings of the 13th International Workshop on Software & Compilers for Embedded Systems, (1-10)
  55. ACM
    Jardak C, Riihijärvi J, Oldewurtel F and Mähönen P Parallel processing of data from very large-scale wireless sensor networks Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, (787-794)
  56. ACM
    Yang L, Yu L, Tang J, Wang L, Zhao J and Li X Enabling multi-core based monitoring and fault tolerance in C++/Java Proceedings of the 3rd International Workshop on Multicore Software Engineering, (32-39)
  57. ACM
    Juve G and Deelman E (2010). Scientific workflows and clouds, XRDS: Crossroads, The ACM Magazine for Students, 16:3, (14-18), Online publication date: 1-Mar-2010.
  58. ACM
    Grelck C, Julku J and Penczek F S-Net for multi-memory multicores Proceedings of the 5th ACM SIGPLAN workshop on Declarative aspects of multicore programming, (25-34)
  59. Dorta T, Jiménez J, Martín J, Bidarte U and Astarloa A (2010). Reconfigurable multiprocessor systems, International Journal of Reconfigurable Computing, 2010, (1-10), Online publication date: 1-Jan-2010.
  60. ACM
    Vömel C (2010). ScaLAPACK's MRRR algorithm, ACM Transactions on Mathematical Software, 37:1, (1-35), Online publication date: 1-Jan-2010.
  61. ACM
    Agullo E, Hadri B, Ltaief H and Dongarrra J Comparative study of one-sided factorizations with multiple software packages on multi-core hardware Proceedings of the Conference on High Performance Computing Networking, Storage and Analysis, (1-12)
  62. Kashef R and Kamel M (2009). Enhanced bisecting k-means clustering using intermediate cooperation, Pattern Recognition, 42:11, (2557-2569), Online publication date: 1-Nov-2009.
  63. Lin Y and Hung L (2009). New families of computation-efficient parallel prefix algorithms, WSEAS Transactions on Computers, 8:10, (1651-1660), Online publication date: 1-Oct-2009.
  64. Hatcher P, Reno M, Antoniu G and Bouge L (2009). Cluster Computing with Java, Computing in Science and Engineering, 7:2, (34-39), Online publication date: 1-Mar-2009.
  65. Suresh S, Mani V, Omkar S, Kim H and Sundararajan N (2009). A new load distribution strategy for linear network with communication delays, Mathematics and Computers in Simulation, 79:5, (1488-1501), Online publication date: 1-Jan-2009.
  66. ACM
    Zhou K, Hou Q, Wang R and Guo B Real-time KD-tree construction on graphics hardware ACM SIGGRAPH Asia 2008 papers, (1-11)
  67. Alberto A, Benet J, Arias E, Cebrian D, Rojo T and Cuartero F (2008). A high performance tool for the simulation of the dynamic pantograph-catenary interaction, Mathematics and Computers in Simulation, 79:3, (652-667), Online publication date: 1-Dec-2008.
  68. Levesque M, Ichikawa K, Date S and Haga J Bringing flexibility to virtual screening for enzymatic inhibitors on the grid Proceedings of the 2008 9th IEEE/ACM International Conference on Grid Computing, (201-208)
  69. Vidal A, Garcia V, Alonso P and Bernabeu M (2008). Parallel computation of the eigenvalues of symmetric Toeplitz matrices through iterative methods, Journal of Parallel and Distributed Computing, 68:8, (1113-1121), Online publication date: 1-Aug-2008.
  70. Collette S, Cucu L and Goossens J (2008). Integrating job parallelism in real-time scheduling theory, Information Processing Letters, 106:5, (180-187), Online publication date: 1-May-2008.
  71. Hung L and Lin Y (2008). Parallel prefix algorithms on the multicomputer, WSEAS Transactions on Computer Research, 3:4, (213-223), Online publication date: 1-Apr-2008.
  72. Frattolillo F (2008). Supporting data management on cluster grids, Future Generation Computer Systems, 24:2, (166-176), Online publication date: 1-Feb-2008.
  73. Evans G, Keith J and Kroese D Parallel cross-entropy optimization Proceedings of the 39th conference on Winter simulation: 40 years! The best is yet to come, (2196-2202)
  74. Sánchez A, Pérez M, Gueant P, Montes J, Herrero P and Cortes T (2007). Improving GridFTP transfers by means of a multiagent parallel file system, Multiagent and Grid Systems, 3:4, (441-451), Online publication date: 1-Dec-2007.
  75. ACM
    Ananthanarayanan R and Modha D Anatomy of a cortical simulator Proceedings of the 2007 ACM/IEEE conference on Supercomputing, (1-12)
  76. ACM
    Nakashole N, Suleman H and Pedzai C Dynamic role allocation for small search engine clusters Proceedings of the 2007 annual research conference of the South African institute of computer scientists and information technologists on IT research in developing countries, (91-96)
  77. Ramadan O A parallel FDTD algorithm for the solution of Maxwell's equations with nearly PML absorbing boundary conditions Proceedings of the 2007 international conference on Computational science and its applications - Volume Part III, (421-430)
  78. ACM
    Meza J, Oliva R, Hough P and Williams P (2007). OPT++, ACM Transactions on Mathematical Software, 33:2, (12-es), Online publication date: 1-Jun-2007.
  79. Intrigila B, Melatti I, Tofani A and Macchiarelli G (2007). Computational models of myocardial endomysial collagen arrangement, Computer Methods and Programs in Biomedicine, 86:3, (232-244), Online publication date: 1-Jun-2007.
  80. Zhu W, Niu Y and Gao G (2007). Performance portability on EARTH: a case study across several parallel architectures, Cluster Computing, 10:2, (115-126), Online publication date: 1-Jun-2007.
  81. Knottenbelt W and Bradley J Tackling large state spaces in performance modelling Proceedings of the 7th international conference on Formal methods for performance evaluation, (318-370)
  82. Bounanos S, Fleury M, Nicolas S and Vickers A (2007). Regular Paper, International Journal of High Performance Computing Applications, 21:2, (222-245), Online publication date: 1-May-2007.
  83. Peng H, Wang M and Lai C (2007). Design of parallel algorithms for fractal video compression, International Journal of Computer Mathematics, 84:2, (193-202), Online publication date: 1-Feb-2007.
  84. Kim J, Lee J and Yeom H Experimental Assessment of the Practicality of a Fault-Tolerant System Proceedings of the 33rd conference on Current Trends in Theory and Practice of Computer Science, (878-887)
  85. Crane J, Crawford F and Nelson S (2006). Grid enabled magnetic resonance scanners for near real-time medical image processing, Journal of Parallel and Distributed Computing, 66:12, (1524-1533), Online publication date: 1-Dec-2006.
  86. Wang Z, Ghoniem N, Swaminarayan S and LeSar R (2006). A parallel algorithm for 3D dislocation dynamics, Journal of Computational Physics, 219:2, (608-621), Online publication date: 1-Dec-2006.
  87. Gnanendran K and Sundarraj R (2006). Alternative model representations and computing capacity, Decision Support Systems, 42:3, (1413-1430), Online publication date: 1-Dec-2006.
  88. Akaydin O and Ramadan O 3-D parallel implementation of the wave-equation PML algorithm using distributed system Proceedings of the 21st international conference on Computer and Information Sciences, (834-843)
  89. Sánchez A, Pérez M, Gueant P, Montes J and Herrero P A parallel data storage interface to GridFTP Proceedings of the 2006 Confederated international conference on On the Move to Meaningful Internet Systems: CoopIS, DOA, GADA, and ODBASE - Volume Part II, (1203-1212)
  90. Liao S Parallelizing user-defined and implicit reductions globally on multiprocessors Proceedings of the 11th Asia-Pacific conference on Advances in Computer Systems Architecture, (189-202)
  91. Yang L, Xu L, Lin M and Quinn J A parallel GNFS algorithm with the biorthogonal block lanczos method for integer factorization Proceedings of the Third international conference on Autonomic and Trusted Computing, (428-438)
  92. Ostromsky T and Zlatev Z Parallel and GRID implementation of a large scale air pollution model Proceedings of the 6th international conference on Numerical methods and applications, (475-482)
  93. Lin Y Computation-efficient parallel prefix Proceedings of the 6th WSEAS International Conference on Applied Informatics and Communications, (280-285)
  94. Yang L, Xu L, Lin M and Quinn J A parallel GNFS algorithm based on a reliable look-ahead block lanczos method for integer factorization Proceedings of the 2006 international conference on Embedded and Ubiquitous Computing, (110-120)
  95. Wooding K and Williams H Doubly-Focused enumeration of pseudosquares and pseudocubes Proceedings of the 7th international conference on Algorithmic Number Theory, (208-221)
  96. Lv X, Zhao Y, Huang X, Xia G and Wang Z (2006). An efficient parallel/unstructured-multigrid preconditioned implicit method for simulating 3D unsteady compressible flows with moving objects, Journal of Computational Physics, 215:2, (661-690), Online publication date: 1-Jul-2006.
  97. Abu-Khzam F, Langston M, Shanbhag P and Symons C (2006). Scalable Parallel Algorithms for FPT Problems, Algorithmica, 45:3, (269-284), Online publication date: 1-Jul-2006.
  98. Cera M, Pezzi G, Pilla M, Maillard N and Navaux P Scheduling dynamically spawned processes in MPI-2 Proceedings of the 12th international conference on Job scheduling strategies for parallel processing, (33-46)
  99. Flores-Becerra G, Garcia V and Vidal A Efficient parallel algorithm for constructing a unit triangular matrix with prescribed singular values Proceedings of the 7th international conference on High performance computing for computational science, (349-362)
  100. Alves J, Borges C and Filho A Distributed security constrained optimal power flow integrated to a DSM based energy management system for real time power systems security control Proceedings of the 7th international conference on High performance computing for computational science, (131-144)
  101. Ivanov L (2006). A modern course on parallel and distributed processing, Journal of Computing Sciences in Colleges, 21:6, (29-38), Online publication date: 1-Jun-2006.
  102. Jiang Z, Lin D, Xu L and Lin L Integrating grid with cryptographic computing Proceedings of the Second international conference on Information Security Practice and Experience, (321-331)
  103. Nakata K, Yamashita M, Fujisawa K and Kojima M (2006). A parallel primal-dual interior-point method for semidefinite programs using positive definite matrix completion, Parallel Computing, 32:1, (24-43), Online publication date: 1-Jan-2006.
  104. Lin Y and Su C (2005). Faster optimal parallel prefix circuits, Journal of Parallel and Distributed Computing, 65:12, (1585-1595), Online publication date: 1-Dec-2005.
  105. Allcock W, Bresnahan J, Kettimuthu R, Link M, Dumitrescu C, Raicu I and Foster I The Globus Striped GridFTP Framework and Server Proceedings of the 2005 ACM/IEEE conference on Supercomputing
  106. Mavriplis D, Aftosmis M and Berger M High Resolution Aerospace Applications using the NASA Columbia Supercomputer Proceedings of the 2005 ACM/IEEE conference on Supercomputing
  107. ACM
    Li F, Chen G, Kandemir M and Karakoy M Exploiting last idle periods of links for network power management Proceedings of the 5th ACM international conference on Embedded software, (134-137)
  108. Rokicki J, Krause M and Wichulski M Parallelization of numerical CFD codes in cluster and grid environments Proceedings of the 6th international conference on Parallel Processing and Applied Mathematics, (825-830)
  109. Dalcín L, Paz R and Storti M (2005). MPI for Python, Journal of Parallel and Distributed Computing, 65:9, (1108-1115), Online publication date: 1-Sep-2005.
  110. Krivulin N, Guster D and Hall C Parallel implementation of a random search procedure Proceedings of the 5th WSEAS international conference on Simulation, modelling and optimization, (367-372)
  111. Tai C, Zhao Y and Liew K (2005). Parallel computation of unsteady incompressible viscous flows around moving rigid bodies using an immersed object method with overlapping grids, Journal of Computational Physics, 207:1, (151-172), Online publication date: 20-Jul-2005.
  112. ACM
    Zhang Y, Wong D and Zheng W (2005). User-level checkpoint and recovery for LAM/MPI, ACM SIGOPS Operating Systems Review, 39:3, (72-81), Online publication date: 1-Jul-2005.
  113. Stanescu D, Kim D and Woyczynski W (2005). Numerical study of interacting particles approximation for integro-differential equations, Journal of Computational Physics, 206:2, (706-726), Online publication date: 1-Jul-2005.
  114. Numrich R (2005). Parallel numerical algorithms based on tensor notation and Co-Array Fortran syntax, Parallel Computing, 31:6, (588-607), Online publication date: 1-Jun-2005.
  115. Bahi J, Contassot-Vivier S and Couturier R (2005). Evaluation of the asynchronous iterative algorithms in the context of distant heterogeneous clusters, Parallel Computing, 31:5, (439-461), Online publication date: 1-May-2005.
  116. Kengue J, Valtchev P and Djamegni C A parallel algorithm for lattice construction Proceedings of the Third international conference on Formal Concept Analysis, (249-264)
  117. O'Cearbhaill E and O'Mahony M (2005). Parallel implementation of a transportation network model, Journal of Parallel and Distributed Computing, 65:1, (1-14), Online publication date: 1-Jan-2005.
  118. Becerra G and Maciá A Parallel global and local convergent algorithms for solving the iniverse additive singular value problem Proceedings of the 4th WSEAS International Conference on Systems Theory and Scientific Computation, (1-6)
  119. Dean J and Ghemawat S MapReduce Proceedings of the 6th conference on Symposium on Operating Systems Design & Implementation - Volume 6, (10-10)
  120. ACM
    Hammond L, Carlstrom B, Wong V, Hertzberg B, Chen M, Kozyrakis C and Olukotun K (2004). Programming with transactional coherence and consistency (TCC), ACM SIGOPS Operating Systems Review, 38:5, (1-13), Online publication date: 1-Dec-2004.
  121. ACM
    Hammond L, Carlstrom B, Wong V, Hertzberg B, Chen M, Kozyrakis C and Olukotun K (2004). Programming with transactional coherence and consistency (TCC), ACM SIGARCH Computer Architecture News, 32:5, (1-13), Online publication date: 1-Dec-2004.
  122. ACM
    Hammond L, Carlstrom B, Wong V, Hertzberg B, Chen M, Kozyrakis C and Olukotun K (2004). Programming with transactional coherence and consistency (TCC), ACM SIGPLAN Notices, 39:11, (1-13), Online publication date: 1-Nov-2004.
  123. ACM
    Hammond L, Carlstrom B, Wong V, Hertzberg B, Chen M, Kozyrakis C and Olukotun K Programming with transactional coherence and consistency (TCC) Proceedings of the 11th international conference on Architectural support for programming languages and operating systems, (1-13)
  124. Ostromsky T, Dimov I and Zlatev Z Parallel implementation and one year experiments with the danish eulerian model Proceedings of the Third international conference on Numerical Analysis and its Applications, (440-447)
  125. Arnal J, Migallón H, Migallón V and Penadés J Parallel newton iterative methods based on incomplete LU factorizations for solving nonlinear systems Proceedings of the 6th international conference on High Performance Computing for Computational Science, (716-729)
  126. Arias E and Hernández V Numerical integration of the differential riccati equation Proceedings of the 6th international conference on High Performance Computing for Computational Science, (671-684)
  127. Peinado J and Vidal A Three parallel algorithms for solving nonlinear systems and optimization problems Proceedings of the 6th international conference on High Performance Computing for Computational Science, (657-670)
  128. García V, Vidal V, Verdú G, Garayoa J and Miró R Parallel resolution of the two-group time dependent neutron diffusion equation with public domain ODE codes Proceedings of the 6th international conference on High Performance Computing for Computational Science, (368-381)
  129. Huang W and Tafti D (2004). A Parallel Adaptive Mesh Refinement Algorithm for Solving Nonlinear Dynamical Systems, International Journal of High Performance Computing Applications, 18:2, (171-181), Online publication date: 1-May-2004.
  130. Martin D, Wilsey P, Hoekstra R, Keiter E, Hutchinson S, Russo T and Waters L Scheduling Optimization on the Simbus Backplane Proceedings of the 37th annual symposium on Simulation
  131. Zomaya A, Patterson D and Olariu S (2004). Sequential and Parallel Meta-Heuristics for Solving the Single Row Routing Problem, Cluster Computing, 7:2, (123-139), Online publication date: 1-Apr-2004.
  132. Oliveira C and Pardalos P (2004). Randomized parallel algorithms for the multidimensional assignment problem, Applied Numerical Mathematics, 49:1, (117-133), Online publication date: 1-Apr-2004.
  133. Welsh M and Mainland G Programming sensor networks using abstract regions Proceedings of the 1st conference on Symposium on Networked Systems Design and Implementation - Volume 1, (3-3)
  134. Oian E, Garrido I, Chaib M, Fladmark G and Espedal M (2004). Modeling fractured and faulted regions, Computing and Visualization in Science, 6:2-3, (123-129), Online publication date: 1-Mar-2004.
  135. Grimshaw A, Humphrey M and Natrajan A (2004). A philosophical and technical comparison of Legion and Globus, IBM Journal of Research and Development, 48:2, (233-254), Online publication date: 1-Mar-2004.
  136. Øian E, Garrido I, Chaib M, Fladmark G and Espedal M (2004). Modeling fractured and faulted regions, Computing and Visualization in Science, 6:2-3, (123-129), Online publication date: 1-Mar-2004.
  137. Fernández J, Carazo J and García I (2004). Three-dimensional reconstruction of cellular structures by electron microscope tomography and parallel computing, Journal of Parallel and Distributed Computing, 64:2, (285-300), Online publication date: 1-Feb-2004.
  138. References Grid resource management, (507-566)
  139. Wolf F and Mohr B Specifying performance properties of parallel applications using compound events On-line monitoring systems and computer tool interoperability, (91-110)
  140. ACM
    Gorlatch S (2004). Send-receive considered harmful, ACM Transactions on Programming Languages and Systems, 26:1, (47-56), Online publication date: 1-Jan-2004.
  141. Li H, Rilling J and Goswami D (2004). Granularity-Driven Dynamic Predicate Slicing Algorithms for Message Passing Systems, Automated Software Engineering, 11:1, (63-89), Online publication date: 1-Jan-2004.
  142. Lin Y and Hsiao J (2004). A new approach to constructing optimal parallel prefix circuits with small depth, Journal of Parallel and Distributed Computing, 64:1, (97-107), Online publication date: 1-Jan-2004.
  143. ACM
    Kaiser T (2003). A methodology for creating large modules, ACM SIGPLAN Fortran Forum, 22:3, (11-24), Online publication date: 1-Dec-2003.
  144. Tai C and Zhao Y (2003). Parallel unsteady incompressible viscous flow computations using an unstructured multigrid method, Journal of Computational Physics, 192:1, (277-311), Online publication date: 20-Nov-2003.
  145. Michailidis P and Margaritis K (2003). Performance evaluation of load balancing strategies for approximate string matching application on an MPI cluster of heterogeneous workstations, Future Generation Computer Systems, 19:7, (1075-1104), Online publication date: 1-Oct-2003.
  146. Govett M, Hart L, Henderson T, Middlecoff J and Schaffer D (2003). The scalable modeling system, Parallel Computing, 29:8, (995-1020), Online publication date: 1-Aug-2003.
  147. Choi E and Cicci D (2003). Analysis of GPS static positioning problems, Applied Mathematics and Computation, 140:1, (37-51), Online publication date: 30-Jul-2003.
  148. Gerndt A, Reimersdahl T, Kuhlen T, Bischof C, Hörschler I, Meinke M and Schröder W (2003). Large-Scale CFD Data Handling in a VR-Based Otorhinolaryngological CAS-System using a Linux-Cluster, The Journal of Supercomputing, 25:2, (143-154), Online publication date: 1-Jun-2003.
  149. Shires D and Mohan R (2003). Optimization and Performance of a Fortran 90 MPI-Based Unstructured Code on Large-Scale Parallel Systems, The Journal of Supercomputing, 25:2, (131-141), Online publication date: 1-Jun-2003.
  150. Huang S, Aubanel E and Bhavsar V Mesh partitioners for computational grids Proceedings of the 2003 international conference on Computational science and its applications: PartIII, (60-68)
  151. Martin D, Wilsey P, Hoekstra R, Keiter E, Hutchinson S, Russo T and Waters L Redesigning the WARPED Simulation Kernel for Analysis and Application Development Proceedings of the 36th annual symposium on Simulation
  152. Lin Y, Hsu Y and Liu C (2003). Constructing H4, a Fast Depth-Size Optimal Parallel Prefix Circuit, The Journal of Supercomputing, 24:3, (279-304), Online publication date: 1-Mar-2003.
  153. Georgiou S, Koukouvinos C and Seberry J Hadamard matrices, orthogonal designs and construction algorithms DESIGNS 2002, (133-205)
  154. Gorlatch S SAT Patterns and skeletons for parallel and distributed computing, (29-63)
  155. Padua D and Hoeflinger J Supercomputers Encyclopedia of Computer Science, (1710-1718)
  156. Quinn M, Miller R, Miller R and Quinn M Parallel processing Encyclopedia of Computer Science, (1349-1365)
  157. Song Z and Roussopoulos N (2002). Using Hilbert curve in image storing and retrieving, Information Systems, 27:8, (523-536), Online publication date: 1-Dec-2002.
  158. Gong L, Sun X and Watson E (2002). Performance Modeling and Prediction of Nondedicated Network Computing, IEEE Transactions on Computers, 51:9, (1041-1055), Online publication date: 1-Sep-2002.
  159. Koulopoulos D, Papoutsis K, Goulas G and Housos E (2002). PLEIADES, Software—Practice & Experience, 32:11, (1035-1049), Online publication date: 1-Sep-2002.
  160. ACM
    Bergen D and Miller B (2002). Building an MPI cluster, XRDS: Crossroads, The ACM Magazine for Students, 8:5, (2-2), Online publication date: 1-Aug-2002.
  161. Taylor V, Wu X, Geisler J and Stevens R Using Kernel Couplings to Predict Parallel Application Performance Proceedings of the 11th IEEE International Symposium on High Performance Distributed Computing
  162. ACM
    Dingle N, Harrison P and Knottenbelt W Response time densities in generalised stochastic petri net models Proceedings of the 3rd international workshop on Software and performance, (46-54)
  163. Peinado J and Vidal A A parallel Newton-GMRES algorithm for solving large scale nonlinear systems Proceedings of the 5th international conference on High performance computing for computational science, (328-342)
  164. Fernández J, Cañas A, Díaz A, González J, Ortega J and Prieto A Performance of message-passing MATLAB toolboxes Proceedings of the 5th international conference on High performance computing for computational science, (228-242)
  165. ACM
    Hendriks E BProc Proceedings of the 16th international conference on Supercomputing, (129-136)
  166. ACM
    Harrison P and Knottenbelt W Passage time distributions in large Markov chains Proceedings of the 2002 ACM SIGMETRICS international conference on Measurement and modeling of computer systems, (77-85)
  167. Lampoudi S and Beazley D SWILL Proceedings of the FREENIX Track: 2002 USENIX Annual Technical Conference, (19-27)
  168. ACM
    Harrison P and Knottenbelt W (2002). Passage time distributions in large Markov chains, ACM SIGMETRICS Performance Evaluation Review, 30:1, (77-85), Online publication date: 1-Jun-2002.
  169. Stankovic N and Zhang K (2002). A Distributed Parallel Programming Framework, IEEE Transactions on Software Engineering, 28:5, (478-493), Online publication date: 1-May-2002.
  170. Németh Z and Sunderam V A Comparison of Conventional Distributed Computing Environments and Computational Grids Proceedings of the International Conference on Computational Science-Part II, (729-738)
  171. Dwivedula M, Hariri S and Parashar M A Software Design Model for Parallel Applications on Heterogeneous Systems Proceedings of the 16th International Parallel and Distributed Processing Symposium
  172. Silva J and Hughes A Monte Carlo Simulation of HIV-1 Evolution in Response to Selection by Antibodies Proceedings of the 16th International Parallel and Distributed Processing Symposium
  173. Goswami D, Singh A and Preiss B (2002). From Design Patterns to Parallel Architectural Skeletons, Journal of Parallel and Distributed Computing, 62:4, (669-695), Online publication date: 1-Apr-2002.
  174. Owczarz W and Zlatev Z (2002). Parallel matrix computations in air pollution modelling, Parallel Computing, 28:2, (355-368), Online publication date: 1-Feb-2002.
  175. Benner P, Byers R, Mayo R, Quintana-Ortí E and Hernández V (2002). Parallel Algorithms for LQ Optimal Control of Discrete-Time Periodic Linear Systems, Journal of Parallel and Distributed Computing, 62:2, (306-325), Online publication date: 1-Feb-2002.
  176. Zlatev Z Massive data set issues in air pollution modelling Handbook of massive data sets, (1169-1220)
  177. Evans D and Saman M Cluster computing with message-passing interface Highly parallel computaions, (3-42)
  178. ACM
    Gau C and Stadtherr M Parallel interval-Newton using message passing Proceedings of the 2001 ACM/IEEE conference on Supercomputing, (23-23)
  179. ACM
    Getov V, von Laszewski G, Philippsen M and Foster I (2001). Multiparadigm communications in Java for grid computing, Communications of the ACM, 44:10, (118-125), Online publication date: 1-Oct-2001.
  180. ACM
    Ostromsky T, Owczarz W and Zlatev Z Computational challenges in large-scale air pollution modelling Proceedings of the 15th international conference on Supercomputing, (407-418)
  181. ACM
    Mezher D A graphical tool for driving the parallel computation of pseudosprectra Proceedings of the 15th international conference on Supercomputing, (270-276)
  182. Demaine E, Foster I, Kesselman C and Snir M (2001). Generalized Communicators in the Message Passing Interface, IEEE Transactions on Parallel and Distributed Systems, 12:6, (610-616), Online publication date: 1-Jun-2001.
  183. Chakrabarti D and Banerjee P (2001). Static Single Assignment Form for Message-Passing Programs, International Journal of Parallel Programming, 29:2, (139-184), Online publication date: 1-Apr-2001.
  184. González P, Cabaleiro J and Pena T (2001). Parallel Computation of Wavelet Transforms Using the Lifting Scheme, The Journal of Supercomputing, 18:2, (141-152), Online publication date: 1-Feb-2001.
  185. Owczarx W and Zlatev Z Running a large air pollution model on an IBM SMP computer Practical parallel computing, (57-77)
  186. Gan B, Liu L, Jain S, Turner S, Cai W and Hsu W Manufacturing sypply chain management Proceedings of the 32nd conference on Winter simulation, (1245-1251)
  187. ACM
    Jian L, Yingjun L, Xiaoxing M, Min C, Xianping T, Guanqun Z and Jianzhong L (2000). A hierarchical framework for parallel seismic applications, Communications of the ACM, 43:10, (55-59), Online publication date: 1-Oct-2000.
  188. Kim V, Banerjee P and De K Fine-Grained Parallel VLSI Synthesis for Commercial CAD on a Network of Workstations Proceedings of the Proceedings of the 2000 International Conference on Parallel Processing
  189. Hereld M, Judson I and Stevens R (2000). Tutorial, IEEE Computer Graphics and Applications, 20:4, (22-28), Online publication date: 1-Jul-2000.
  190. Roerdink J and Meijster A (2000). The Watershed Transform: Definitions, Algorithms and Parallelization Strategies, Fundamenta Informaticae, 41:1,2, (187-228), Online publication date: 1-Apr-2000.
  191. Nyland L, Prins J, Goldberg A and Mills P (2000). A Design Methodology for Data-Parallel Applications, IEEE Transactions on Software Engineering, 26:4, (293-314), Online publication date: 1-Apr-2000.
  192. Gorlatch S (2000). Toward Formally-Based Design of Message Passing Programs, IEEE Transactions on Software Engineering, 26:3, (276-288), Online publication date: 1-Mar-2000.
  193. Benner P, Castillo M, Quintana-Ortí E and Hernández V (2000). Parallel Partial Stabilizing Algorithms for Large Linear Control Systems, The Journal of Supercomputing, 15:2, (193-206), Online publication date: 1-Feb-2000.
  194. Shang J, Wagner M, Pan Y and Blake D (2000). Strategies for Adopting FVTD on Multicomputers, Computing in Science and Engineering, 2:1, (10-21), Online publication date: 1-Jan-2000.
  195. Kipfer P and Slusallek P Transparent distributed processing for rendering Proceedings of the 1999 IEEE symposium on Parallel visualization and graphics, (39-46)
  196. ACM
    Andersen P (1999). The Texas Tech tornado cluster, XRDS: Crossroads, The ACM Magazine for Students, 6:1, (28-32), Online publication date: 1-Sep-1999.
  197. Zaki O, Lusk E, Gropp W and Swider D (1999). Toward Scalable Performance Visualization with Jumpshot, International Journal of High Performance Computing Applications, 13:3, (277-288), Online publication date: 1-Aug-1999.
  198. Darema F (1999). New Software Technologies for the Development and Runtime Support of Complex Applications, International Journal of High Performance Computing Applications, 13:3, (180-190), Online publication date: 1-Aug-1999.
  199. Shah M and Kant T (1999). Finite element analysis of fibre reinforced polymer shells using higher-order shear deformation theories on parallel distributed memory machines, International Journal of Computer Applications in Technology, 12:2-5, (206-210), Online publication date: 1-Jul-1999.
  200. Kaper H, Wiebel E and Tipei S (1999). Data Sonification and Sound Visualization, Computing in Science and Engineering, 1:4, (48-58), Online publication date: 1-Jul-1999.
  201. Prabhakaran P and Banerjee P (1999). Parallel Algorithms for Force Directed Scheduling of Flattened and Hierarchical Signal Flow Graphs, IEEE Transactions on Computers, 48:7, (762-768), Online publication date: 1-Jul-1999.
  202. Goudreau M, Lang K, Rao S, Suel T and Tsantilas T (1999). Portable and Efficient Parallel Computing Using the BSP Model, IEEE Transactions on Computers, 48:7, (670-689), Online publication date: 1-Jul-1999.
  203. ACM
    Baptist L and Cormen T Multidimensional, multiprocessor, out-of-core FFTs with distributed memory and parallel disks (extended abstract) Proceedings of the eleventh annual ACM symposium on Parallel algorithms and architectures, (242-250)
  204. Lawrence R, Almasi G and Rushmeier H (1999). A Scalable Parallel Algorithm for Self-Organizing Maps with Applicationsto Sparse Data Mining Problems, Data Mining and Knowledge Discovery, 3:2, (171-195), Online publication date: 1-Jun-1999.
  205. Burge L and George K JMAS Proceedings of the 5th conference on USENIX Conference on Object-Oriented Technologies & Systems - Volume 5, (9-9)
  206. Sharma G, Radhakrishnan R, Rajasekaran U, Abu-Ghazaleh N and Wilsey P Time Warp simulation on clumps Proceedings of the thirteenth workshop on Parallel and distributed simulation, (174-181)
  207. Baratloo A, Dasgupta P, Karamcheti V and Kedem Z Metacomputing with MILAN Proceedings of the Eighth Heterogeneous Computing Workshop
  208. ACM
    Karavanic K and Miller B Improving online performance diagnosis by the use of historical performance data Proceedings of the 1999 ACM/IEEE conference on Supercomputing, (42-es)
  209. Rao D, Thondugulam N, Radhakrishnan R and Wilsey P Unsynchronized parallel discrete event simulation Proceedings of the 30th conference on Winter simulation, (1563-1570)
  210. ACM
    Grimshaw A, Ferrari A, Lindahl G and Holcomb K (1998). Metasystems, Communications of the ACM, 41:11, (46-55), Online publication date: 1-Nov-1998.
  211. ACM
    Wei X and Ju J (1998). SFT, ACM SIGOPS Operating Systems Review, 32:4, (70-76), Online publication date: 1-Oct-1998.
  212. ACM
    Díaz A and Kaltofen E FOXBOX Proceedings of the 1998 international symposium on Symbolic and algebraic computation, (30-37)
  213. Cohen J, Fraigniaud P, König J and Raspaud A (1998). Optimized Broadcasting and Multicasting Protocols in Cut-Through Routed Networks, IEEE Transactions on Parallel and Distributed Systems, 9:8, (788-802), Online publication date: 1-Aug-1998.
  214. ACM
    Coddington P and Ko S Techniques for empirical testing of parallel random number generators Proceedings of the 12th international conference on Supercomputing, (282-288)
  215. ACM
    Kimura T and Takemiya H Local area metacomputing for multidisciplinary problems Proceedings of the 12th international conference on Supercomputing, (149-156)
  216. ACM
    Chetlur M, Abu-Gazaleh N, Radhakrishnan R and Wilsey P (1998). Optimizing communication in time-warp simulators, ACM SIGSIM Simulation Digest, 28:1, (64-71), Online publication date: 1-Jul-1998.
  217. Chetlur M, Abu-Gazaleh N, Radhakrishnan R and Wilsey P Optimizing communication in time-warp simulators Proceedings of the twelfth workshop on Parallel and distributed simulation, (64-71)
  218. ACM
    Kim V and Banerjee P Parallel algorithms for power estimation Proceedings of the 35th annual Design Automation Conference, (672-677)
  219. ACM
    Hung C, YarKhan A, Wong K, von Laven S and Coleman T Parallel implementation of an integrated edge-preserving smoothing algorithm in clusters of workstations Proceedings of the 36th annual Southeast regional conference, (20-22)
  220. ACM
    Davis T Generating computer animations with frame coherence in a distributed computing environment Proceedings of the 36th annual Southeast regional conference, (1-7)
  221. Kale L (1998). Programming Languages for CSE, IEEE Computational Science & Engineering, 5:2, (18-26), Online publication date: 1-Apr-1998.
  222. ACM
    Kurtz B, Kim C and Alsabbagh J (1998). Parallel computing in the undergraduate curriculum, ACM SIGCSE Bulletin, 30:1, (212-216), Online publication date: 1-Mar-1998.
  223. ACM
    Kurtz B, Kim C and Alsabbagh J Parallel computing in the undergraduate curriculum Proceedings of the twenty-ninth SIGCSE technical symposium on Computer science education, (212-216)
  224. ACM
    Yang C, Souter A and Pollock L (1998). All-du-path coverage for parallel programs, ACM SIGSOFT Software Engineering Notes, 23:2, (153-162), Online publication date: 1-Mar-1998.
  225. ACM
    Yang C, Souter A and Pollock L All-du-path coverage for parallel programs Proceedings of the 1998 ACM SIGSOFT international symposium on Software testing and analysis, (153-162)
  226. ACM
    Cormen T, Wegmann J and Nicol D Multiprocessor out-of-core FFTs with distributed memory and parallel disks (extended abstract) Proceedings of the fifth workshop on I/O in parallel and distributed systems, (68-78)
  227. ACM
    Hawick K and James H Distributed high-performance computation for remote sensing Proceedings of the 1997 ACM/IEEE conference on Supercomputing, (1-13)
  228. ACM
    Tobis M, Schafer C, Foster I, Jacob R and Anderson J FOAM Proceedings of the 1997 ACM/IEEE conference on Supercomputing, (1-15)
  229. ACM
    Blackston D and Suel T Highly portable and efficient implementations of parallel adaptive N-body methods Proceedings of the 1997 ACM/IEEE conference on Supercomputing, (1-20)
  230. Krishnaswamy D and Banerjeer P Exploiting task and data parallelism in parallel Hough and Radon transforms Proceedings of the international Conference on Parallel Processing
  231. ACM
    Cordsen J, Pohl H and Schröder-Preikschat W Performance considerations in software multicasts Proceedings of the 11th international conference on Supercomputing, (213-220)
  232. Smith D and El-Rewini H Static Scheduling of MPI Tasks Proceedings of the 2nd IEEE Symposium on Computers and Communications (ISCC '97)
  233. ACM
    Krishnaswamy D, Banerjee P, Rudnick E and Patel J (1997). Asynchronous parallel algorithms for test set partitioned fault simulation, ACM SIGSIM Simulation Digest, 27:1, (30-37), Online publication date: 1-Jul-1997.
  234. ACM
    Subhlok J and Yang B (1997). A new model for integrated nested task and data parallel programming, ACM SIGPLAN Notices, 32:7, (1-12), Online publication date: 1-Jul-1997.
  235. ACM
    Subhlok J and Yang B A new model for integrated nested task and data parallel programming Proceedings of the sixth ACM SIGPLAN symposium on Principles and practice of parallel programming, (1-12)
  236. Lu P Implementing optimized distributed data sharing using scoped behaviour and a class library Proceedings of the 3rd conference on USENIX Conference on Object-Oriented Technologies (COOTS) - Volume 3, (11-11)
  237. Krishnaswamy D, Banerjee P, Rudnick E and Patel J Asynchronous parallel algorithms for test set partitioned fault simulation Proceedings of the eleventh workshop on Parallel and distributed simulation, (30-37)
  238. ACM
    Barth D and Fraigniaud P Approximation algorithms for structured communication problems Proceedings of the ninth annual ACM symposium on Parallel algorithms and architectures, (180-188)
  239. Xu H, Gui Y and Ni L (1997). Optimal Software Multicast in Wormhole-Routed Multistage Networks, IEEE Transactions on Parallel and Distributed Systems, 8:6, (597-607), Online publication date: 1-Jun-1997.
  240. Moreira J and Naik V (1997). Dynamic resource management on distributed systems using reconfigurable applications, IBM Journal of Research and Development, 41:3, (303-330), Online publication date: 1-May-1997.
  241. Stankovic N and Zhang K Visual parallel programming with Visper Proceedings of the High-Performance Computing on the Information Superhighway, HPC-Asia '97
  242. Krishnaswamy D, Rudnick E, Patel J and Banerjee P SPITFIRE Proceedings of the 15th IEEE VLSI Test Symposium
  243. Melin E, Raffin B, Rebeuf X and Virot B SCL-chan Proceedings of the 1997 Workshop on High-Level Programming Models and Supportive Environments (HIPS '97)
  244. Cohen J, Fraigniaud P, König J and Raspaud A Broadcasting and Multicasting in Cut-through Routed Networks Proceedings of the 11th International Symposium on Parallel Processing, (734-738)
  245. Levine D, Facello M, Hallstrom P, Reeder G, Walenz B and Stevens F (1997). Stalk, IEEE Computational Science & Engineering, 4:2, (55-65), Online publication date: 1-Apr-1997.
  246. ACM
    Dillon E, Dos Santos C and Guyard J (1997). Teaching an engineering approach for network computing, ACM SIGCSE Bulletin, 29:1, (229-232), Online publication date: 1-Mar-1997.
  247. ACM
    Dillon E, Dos Santos C and Guyard J Teaching an engineering approach for network computing Proceedings of the twenty-eighth SIGCSE technical symposium on Computer science education, (229-232)
  248. Taylor V, Chen J, Disz T, Papka M and Stevens R (1996). Interactive Virtual Reality in Simulations, IEEE Computational Science & Engineering, 3:4, (46-54), Online publication date: 1-Dec-1996.
  249. Foster I, Geisler J, Kesselman C and Tuecke S Multimethod communication for high-performance metacomputing applications Proceedings of the 1996 ACM/IEEE conference on Supercomputing, (41-es)
  250. Ogawa H and Matsuoka S OMPI Proceedings of the 1996 ACM/IEEE conference on Supercomputing, (37-es)
  251. Foster I, Kohr D, Krishnaiyer R and Choudhary A Double standards Proceedings of the 1996 ACM/IEEE conference on Supercomputing, (36-es)
  252. Bokhari S (1996). Multiphase Complete Exchange on Paragon, SP2, and CS-2, IEEE Parallel & Distributed Technology: Systems & Technology, 4:3, (45-59), Online publication date: 1-Sep-1996.
  253. Sundaresan N and Gannon D (1996). Coir, Journal of Parallel and Distributed Computing, 37:1, (98-105), Online publication date: 25-Aug-1996.
  254. Foster I, Kesselman C and Tuecke S (1996). The Nexus Approach to Integrating Multithreading and Communication, Journal of Parallel and Distributed Computing, 37:1, (70-82), Online publication date: 25-Aug-1996.
  255. Foster I, Geisler J, Nickless B, Smith W and Tuecke S Software infrastructure for the I-WAY high-performance distributed computing experiment Proceedings of the 5th IEEE International Symposium on High Performance Distributed Computing
  256. Grimshaw A and Wulf W Legion-a view from 50,000 feet Proceedings of the 5th IEEE International Symposium on High Performance Distributed Computing
  257. ACM
    Dongarra J, Otto S, Snir M and Walker D (1996). A message passing standard for MPP and workstations, Communications of the ACM, 39:7, (84-90), Online publication date: 1-Jul-1996.
  258. Taylor V, Chen J, Huang M, Canfield T and Stevens R (1996). Identifying and Reducing Critical Lag in Finite Element Simulations, IEEE Computer Graphics and Applications, 16:4, (67-71), Online publication date: 1-Jul-1996.
  259. ACM
    Goudreau M, Lang K, Rao S, Suel T and Tsantilas T Towards efficiency and portability Proceedings of the eighth annual ACM symposium on Parallel Algorithms and Architectures, (1-12)
  260. Kwang K and Arakawa M (1996). Benchmark Evaluation of the IBM SP2 for Parallel Signal Processing, IEEE Transactions on Parallel and Distributed Systems, 7:5, (522-536), Online publication date: 1-May-1996.
  261. Beckman P and Gannon D Tulip Proceedings of the 10th International Parallel Processing Symposium, (532-536)
  262. Agarwal R, Balle S, Gustavson F, Joshi M and Palkar P (1995). A three-dimensional approach to parallel matrix multiplication, IBM Journal of Research and Development, 39:5, (575-582), Online publication date: 1-Sep-1995.
  263. Xu H, Gui Y and Ni L Optimal software multicast in wormhole-routed multistage networks Proceedings of the 1994 ACM/IEEE conference on Supercomputing, (703-712)
Contributors
  • University of Illinois Urbana-Champaign
  • Argonne National Laboratory
  • Tennessee Technological University

Recommendations

Lorie M. Liebrock

MPI is an attempt to bring together the best features of many existing message-passing systems, while showing that parallel processing need not compromise efficiency, portability, and functionality. In addition to providing standard message-passing capabilities, MPI extends the notion of a message tag to define a context that is allocated at runtime. Contexts allow user programs and libraries to maintain separate “message spaces.” A communicator is defined by the context and the group of processes. Communicators define precisely which processes are involved in a communication. Just as the pattern of communication, or application topology, is an integral part of an application, the definition of the virtual topology is an integral part of MPI. Cartesian topologies are predefined with special operations such as shift. Other MPI features include support for debugging, profiling, blocking and nonblocking communication, libraries, and heterogeneous networks of processors. In the first part of the book, beginning with a small but sufficient set of routines, MPI is introduced in terms of what is needed to solve particular problems. This approach is used through the first six chapters. More advanced features are introduced with the focus on the feature instead of the application, but still with coding examples. The authors alternate between the use of Fortran and C in the examples used to introduce MPI functionality. Only occasionally is this mixing of languages confusing or distracting. The MPI routine summary has two parts—one for C and one for Fortran. This book provides an easy introduction to parallel programming with MPI. Throughout the work, MPI implementors' choices are often pointed out to warn users of the possibility of variation in implementations. The chapter on implementation specifically discusses how MPI interacts with an abstract device interface. The porting sections show how to translate from another (source) language; for example, Intel NX to MPI (target), starting with basics such as initialization, sending, receiving, and termination. Collective operations, other features with counterparts, and features of the source language that are not included in the MPI are also discussed. Each porting section ends with comments on process startup. Overall, this material is easy to follow, but there are a few problems. Occasionally, incorrect function names are used. Also, a few of the examples use programming logic that is confusing. In one, there is a computed goto (with 8 branches), and 9 of 16 lines of code are labeled. In this example, only 2 of the unlabeled lines are not gotos. This confusing style, however, is the exception, rather than the rule, for the coding examples in the book. Other minor difficulties include not being able to distinguish patterns in the upshot figures, using C notation (!converged) in flowcharts, and giving an invalid address for the “frequently asked questions” page on the World Wide Web (a currently working address is http://www.erc.msstate.edu/mpi/mpi-faq.html).

Access critical reviews of Computing literature here

Become a reviewer for Computing Reviews.