MPI-The Complete Reference, Volume 1: The MPI Core
August 1998
Publisher: MIT Press, 55 Hayward St., Cambridge, MA, United States
ISBN: 978-0-262-69215-1
Published: 01 August 1998
Pages: 426
Abstract

From the Publisher:

Since its release in summer 1994, the Message Passing Interface (MPI) specification has become a standard for message-passing libraries for parallel computations. There exist more than a dozen implementations on a variety of computing platforms, from the IBM SP-2 supercomputer to PCs running Windows NT. The initial MPI Standard, known as MPI-1, has been modified over the last two years. This volume, the definitive reference manual for the latest version of MPI-1, contains a complete specification of the MPI Standard. It is annotated with comments that clarify complicated issues, including why certain design choices were made, how users are intended to use the interface, and how they should construct their version of MPI. The volume also provides many detailed, illustrative programming examples.

Cited By

  1. ACM
    Coullon H, Henrio L, Loulergue F and Robillard S (2023). Component-based Distributed Software Reconfiguration:A Verification-oriented Survey, ACM Computing Surveys, 56:1, (1-37), Online publication date: 31-Jan-2024.
  2. Loulergue F and Ed-Dbali A Verified High Performance Computing: The SyDPaCC Approach Verification and Evaluation of Computer and Communication Systems, (15-29)
  3. ACM
    Gévay G, Soto J and Markl V (2021). Handling Iterations in Distributed Dataflow Systems, ACM Computing Surveys, 54:9, (1-38), Online publication date: 31-Dec-2022.
  4. ACM
    Tarafdar N, Di Guglielmo G, Harris P, Krupa J, Loncar V, Rankin D, Tran N, Wu Z, Shen Q and Chow P (2021). AIgean: An Open Framework for Deploying Machine Learning on Heterogeneous Clusters, ACM Transactions on Reconfigurable Technology and Systems, 15:3, (1-32), Online publication date: 30-Sep-2022.
  5. Yu B, Feng G, Cao H, Li X, Sun Z, Wang H, Zhu X, Zheng W and Chen W (2021). Chukonu, Proceedings of the VLDB Endowment, 15:4, (872-885), Online publication date: 1-Dec-2021.
  6. Medeiros T, Berned G, Navarro A, Rossi F, Luizelli M, Brandalero M, Hübner M, Beck A and Lorenzon A (2021). Aging-Aware Parallel Execution, IEEE Embedded Systems Letters, 13:3, (122-125), Online publication date: 1-Sep-2021.
  7. ACM
    Wei W, D'Azevedo E, Huck K, Chatterjee A, Hernandez O and Kaiser H Memory reduction using a ring abstraction over GPU RDMA for distributed quantum Monte Carlo solver Proceedings of the Platform for Advanced Scientific Computing Conference, (1-9)
  8. Lirkov I (2020). Performance Analysis of a Scalable Algorithm for 3D Linear Transforms on Supercomputer with Intel Processors/Co-Processors, Cybernetics and Information Technologies, 20:6, (94-104), Online publication date: 1-Dec-2020.
  9. Slaughter E, Wu W, Fu Y, Brandenburg L, Garcia N, Kautz W, Marx E, Morris K, Cao Q, Bosilca G, Mirchandaney S, Lee W, Treichler S, McCormick P and Aiken A Task bench Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, (1-15)
  10. ACM
    Yu H, Chen Z, Fu X, Wang J, Su Z, Sun J, Huang C and Dong W Symbolic verification of message passing interface programs Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering, (1248-1260)
  11. ACM
    Bhimani J, Mi N, Leeser M and Yang Z (2019). New Performance Modeling Methods for Parallel Data Processing Applications, ACM Transactions on Modeling and Computer Simulation, 29:3, (1-24), Online publication date: 27-Jul-2019.
  12. ACM
    Castillo E, Jain N, Casas M, Moreto M, Schulz M, Beivide R, Valero M and Bhatele A Optimizing computation-communication overlap in asynchronous task-based programs Proceedings of the ACM International Conference on Supercomputing, (380-391)
  13. ACM
    Lee C and Ousterhout J Granular Computing Proceedings of the Workshop on Hot Topics in Operating Systems, (149-154)
  14. Mencagli G, França F, Bentes C, Justen Marzulo L, Lima Pilla M, Wyrzykowski R, Deelman E, Langr D, Dytrych T, Launey K and Draayer J (2020). Accelerating many-nucleon basis generation for high performance computing enabled ab initio nuclear structure studies, International Journal of High Performance Computing Applications, 33:3, (522-533), Online publication date: 1-May-2019.
  15. ACM
    Eskandari N, Tarafdar N, Ly-Ma D and Chow P A Modular Heterogeneous Stack for Deploying FPGAs and CPUs in the Data Center Proceedings of the 2019 ACM/SIGDA International Symposium on Field-Programmable Gate Arrays, (262-271)
  16. Nakao M, Murai H, Iwashita H, Boku T and Sato M (2019). Implementation and evaluation of the HPC challenge benchmark in the XcalableMP PGAS language, International Journal of High Performance Computing Applications, 33:1, (110-123), Online publication date: 1-Jan-2019.
  17. Nikolakopoulos A, Kalantzis V, Gallopoulos E and Garofalakis J (2019). EigenRec, Knowledge and Information Systems, 58:1, (59-81), Online publication date: 1-Jan-2019.
  18. Makatun D, Lauret J and Rudová H (2018). Planning of distributed data production for High Energy and Nuclear Physics, Cluster Computing, 21:4, (1949-1965), Online publication date: 1-Dec-2018.
  19. Hu Y, Liu H and Huang H TriCore Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis, (1-12)
  20. Hu Y, Liu H and Huang H TriCore Proceedings of the International Conference for High Performance Computing, Networking, Storage, and Analysis, (1-12)
  21. ACM
    Mashayekhi O, Shah C, Qu H, Lim A and Levis P (2018). Automatically Distributing Eulerian and Hybrid Fluid Simulations in the Cloud, ACM Transactions on Graphics, 37:2, (1-14), Online publication date: 3-Jul-2018.
  22. Miryala G and Ludwig S (2018). Comparing Spark with MapReduce, International Journal of Swarm Intelligence Research, 9:3, (1-22), Online publication date: 1-Jul-2018.
  23. Lunet T, Bodart J, Gratton S and Vasseur X (2018). Time-parallel simulation of the decay of homogeneous turbulence using Parareal with spatial coarsening, Computing and Visualization in Science, 19:1-2, (31-44), Online publication date: 1-Jun-2018.
  24. ACM
    Yu H Combining symbolic execution and model checking to verify MPI programs Proceedings of the 40th International Conference on Software Engineering: Companion Proceedings, (527-530)
  25. ACM
    Gianinazzi L, Kalvoda P, De Palma A, Besta M and Hoefler T (2018). Communication-avoiding parallel minimum cuts and connected components, ACM SIGPLAN Notices, 53:1, (219-232), Online publication date: 23-Mar-2018.
  26. ACM
    Gianinazzi L, Kalvoda P, De Palma A, Besta M and Hoefler T Communication-avoiding parallel minimum cuts and connected components Proceedings of the 23rd ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, (219-232)
  27. ACM
    De Falco I, Cioppa A, Scafuri U and Tarantino E Exploiting diversity in an asynchronous migration model for distributed differential evolution Proceedings of the Genetic and Evolutionary Computation Conference Companion, (1880-1887)
  28. Gutierrez-Alcoba A, Ortega G, Hendrix E and García I (2017). Accelerating an algorithm for perishable inventory control on heterogeneous platforms, Journal of Parallel and Distributed Computing, 104:C, (12-18), Online publication date: 1-Jun-2017.
  29. Jin J, Lai G, Lin X and Cai X (2017). DLRankSVM, The Journal of Supercomputing, 73:5, (2157-2186), Online publication date: 1-May-2017.
  30. Kouetcha D, Ramézani H and Cohaut N (2017). Ultrafast scalable parallel algorithm for the radial distribution function histogramming using MPI maps, The Journal of Supercomputing, 73:4, (1629-1653), Online publication date: 1-Apr-2017.
  31. Eijkhout V Teaching MPI from mental models Proceedings of the Workshop on Education for High Performance Computing, (14-18)
  32. ACM
    Steele G and Tristan J (2016). Adding approximate counters, ACM SIGPLAN Notices, 51:8, (1-12), Online publication date: 9-Nov-2016.
  33. Zhang Z, Lang M, Pakin S and Fu S (2016). TracSim, Parallel Computing, 57:C, (108-124), Online publication date: 1-Sep-2016.
  34. ACM
    Schaller M, Gonnet P, Chalk A and Draper P SWIFT Proceedings of the Platform for Advanced Scientific Computing Conference, (1-10)
  35. ACM
    Chatzikonstantis G, Rodopoulos D, Nomikou S, Strydis C, De Zeeuw C and Soudris D First impressions from detailed brain model simulations on a Xeon/Xeon-Phi node Proceedings of the ACM International Conference on Computing Frontiers, (361-364)
  36. ACM
    Steele G and Tristan J Adding approximate counters Proceedings of the 21st ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, (1-12)
  37. ACM
    Tallent N, Vishnu A, Van Dam H, Daily J, Kerbyson D and Hoisie A (2015). Diagnosing the causes and severity of one-sided message contention, ACM SIGPLAN Notices, 50:8, (130-139), Online publication date: 18-Dec-2015.
  38. ACM
    Slaughter E, Lee W, Treichler S, Bauer M and Aiken A Regent Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, (1-12)
  39. ACM
    Shook E, Wren C, Marean C, Potts A, Franklin J, Engelbrecht F, O'Neal D, Janssen M, Fisher E, Hill K, Esler K, Cowling R, Scheiter S and Moncrieff G Paleoscape model of coastal South Africa during modern human origins Proceedings of the 2015 XSEDE Conference: Scientific Advancements Enabled by Enhanced Cyberinfrastructure, (1-8)
  40. ACM
    Wolf F, Psaroudakis I, May N, Ailamaki A and Sattler K Extending database task schedulers for multi-threaded application code Proceedings of the 27th International Conference on Scientific and Statistical Database Management, (1-12)
  41. Triguero I, Peralta D, Bacardit J, García S and Herrera F (2015). MRPR, Neurocomputing, 150:PA, (331-345), Online publication date: 20-Feb-2015.
  42. ACM
    Tallent N, Vishnu A, Van Dam H, Daily J, Kerbyson D and Hoisie A Diagnosing the causes and severity of one-sided message contention Proceedings of the 20th ACM SIGPLAN Symposium on Principles and Practice of Parallel Programming, (130-139)
  43. Nakao M, Murai H, Shimosaka T, Tabuchi A, Hanawa T, Kodama Y, Boku T and Sato M XcalableACC Proceedings of the First Workshop on Accelerator Programming using Directives, (27-36)
  44. ACM
    Langr D, Tvrdík P, Dytrych T and Draayer J (2014). Algorithm 947, ACM Transactions on Mathematical Software, 41:1, (1-26), Online publication date: 27-Oct-2014.
  45. ACM
    Träff J and Rougier A Zero-copy, Hierarchical Gather is not possible with MPI Datatypes and Collectives Proceedings of the 21st European MPI Users' Group Meeting, (39-44)
  46. ACM
    Treichler S, Bauer M and Aiken A Realm Proceedings of the 23rd international conference on Parallel architectures and compilation, (263-276)
  47. ACM
    Wilson L and Fonner J Launcher Proceedings of the 2014 Annual Conference on Extreme Science and Engineering Discovery Environment, (1-8)
  48. ACM
    De Falco I, Della Cioppa A, Maisto D, Scafuri U and Tarantino E Using an adaptive invasion-based model for fast range image registration Proceedings of the 2014 Annual Conference on Genetic and Evolutionary Computation, (1095-1102)
  49. ACM
    Yu Z, Li M, Liu Y and Li X GatorCloud Proceedings of the 2014 ACM international workshop on Software-defined ecosystems, (13-20)
  50. Weber R, Jenkins D and Peterson G (2014). Specmaster, Concurrency and Computation: Practice & Experience, 26:6, (1234-1248), Online publication date: 25-Apr-2014.
  51. ACM
    Moreland K, Geveci B, Ma K and Maynard R A classification of scientific visualization algorithms for massive threading Proceedings of the 8th International Workshop on Ultrascale Visualization, (1-10)
  52. ACM
    Siddiqui J and Khurshid S (2012). Scaling symbolic execution using ranged analysis, ACM SIGPLAN Notices, 47:10, (523-536), Online publication date: 15-Nov-2012.
  53. ACM
    Siddiqui J and Khurshid S Scaling symbolic execution using ranged analysis Proceedings of the ACM international conference on Object oriented programming systems languages and applications, (523-536)
  54. ACM
    Böhm S and Běhálek M Usage of petri nets for high performance computing Proceedings of the 1st ACM SIGPLAN workshop on Functional high-performance computing, (37-48)
  55. Fredriksson O and Ghica D Seamless Distributed Computing from the Geometry of Interaction Revised Selected Papers of the 7th International Symposium on Trustworthy Global Computing - Volume 8191, (34-48)
  56. Prabhakar R, Govindarajan R and Thazhuthaveetil M CUDA-for-clusters Proceedings of the 18th international conference on Parallel Processing, (415-426)
  57. ACM
    Yu H and Li Z Multi-slicing: a compiler-supported parallel approach to data dependence profiling Proceedings of the 2012 International Symposium on Software Testing and Analysis, (23-33)
  58. ACM
    Kim J, Seo S, Lee J, Nah J, Jo G and Lee J SnuCL Proceedings of the 26th ACM international conference on Supercomputing, (341-352)
  59. ACM
    Weber R, Jenkins D, Lineback N and Peterson G Accelerating tandem MS protein database searches using OpenCL Proceedings of the 3rd international workshop on Emerging computational methods for the life sciences, (59-63)
  60. Rao V, Agrawal N and Maity S C-DAC's efforts Proceedings of the ATIP/A*CRC Workshop on Accelerator Technologies for High-Performance Computing: Does Asia Lead the Way?, (1-4)
  61. Dalle O and Mancini E Integrated tools for the simulation analysis of peer-to-peer backup systems Proceedings of the 5th International ICST Conference on Simulation Tools and Techniques, (178-183)
  62. Camposano R, Gope D, Grivet-Talocia S and Jandhyala V Moore meets Maxwell Proceedings of the Conference on Design, Automation and Test in Europe, (1275-1276)
  63. Aringhieri R, Catanzaro D and Di Summa M (2011). Optimal solutions for the balanced minimum evolution problem, Computers and Operations Research, 38:12, (1845-1854), Online publication date: 1-Dec-2011.
  64. ACM
    Huerta Yero E and Lucchese F Practical experiences on the gridification of financial applications Proceedings of the fourth workshop on High performance computational finance, (39-46)
  65. ACM
    Kennedy K, Koelbel C and Zima H (2011). The rise and fall of high performance Fortran, Communications of the ACM, 54:11, (74-82), Online publication date: 1-Nov-2011.
  66. ACM
    Cunningham D Distributed ray tracing in x10 Proceedings of the ACM international conference companion on Object oriented programming systems languages and applications companion, (11-12)
  67. ACM
    Lins R, de F. Pereira e Silva G and de A. Formiga A HistDoc v. 2.0 Proceedings of the 2011 Workshop on Historical Document Imaging and Processing, (169-176)
  68. ACM
    Bauer M, Clark J, Schkufza E and Aiken A (2011). Programming the memory hierarchy revisited, ACM SIGPLAN Notices, 46:8, (13-24), Online publication date: 7-Sep-2011.
  69. Subotic V, Ferrer R, Sancho J, Labarta J and Valero M Quantifying the potential task-based dataflow parallelism in MPI applications Proceedings of the 17th international conference on Parallel processing - Volume Part I, (39-51)
  70. ACM
    You H, Rekapalli B, Liu Q and Moore S Autotuned parallel I/O for highly scalable biosequence analysis Proceedings of the 2011 TeraGrid Conference: Extreme Digital Discovery, (1-8)
  71. ACM
    Viry P Parallel and distributed programming extensions for mainstream languages based on pi-calculus Proceedings of the 30th annual ACM SIGACT-SIGOPS symposium on Principles of distributed computing, (343-344)
  72. ACM
    Tyree S, Weinberger K, Agrawal K and Paykin J Parallel boosted regression trees for web search ranking Proceedings of the 20th international conference on World wide web, (387-396)
  73. ACM
    Radcliffe N, Watson L and Sosonkina M A comparison of alternatives for communicating with spawned processes Proceedings of the 49th Annual Southeast Regional Conference, (132-137)
  74. ACM
    Bauer M, Clark J, Schkufza E and Aiken A Programming the memory hierarchy revisited Proceedings of the 16th ACM symposium on Principles and practice of parallel programming, (13-24)
  75. ACM
    Tian X and Benkrid K (2010). High-Performance Quasi-Monte Carlo Financial Simulation, ACM Transactions on Reconfigurable Technology and Systems, 3:4, (1-22), Online publication date: 1-Nov-2010.
  76. ACM
    Cordes D, Marwedel P and Mallik A Automatic parallelization of embedded software using hierarchical task graphs and integer linear programming Proceedings of the eighth IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis, (267-276)
  77. ACM
    Blagojević F, Hargrove P, Iancu C and Yelick K Hybrid PGAS runtime support for multicore nodes Proceedings of the Fourth Conference on Partitioned Global Address Space Programming Model, (1-10)
  78. Da Costa U, De Medeiros Júnior I and Oliveira M Specification and verification of a MPI implementation for a MP-SoC Proceedings of the 7th International colloquium conference on Theoretical aspects of computing, (168-183)
  79. ACM
    D’Ambra P, Serafino D and Filippone S (2010). MLD2P4, ACM Transactions on Mathematical Software, 37:3, (7-23), Online publication date: 1-Sep-2010.
  80. Korch M, Rauber T and Scholtes C Scalability and locality of extrapolation methods for distributed-memory architectures Proceedings of the 16th international Euro-Par conference on Parallel processing: Part II, (65-76)
  81. ACM
    Verschelde J and Yoffe G Polynomial homotopies on multicore workstations Proceedings of the 4th International Workshop on Parallel and Symbolic Computation, (131-140)
  82. ACM
    Perry B and Swany M Improving MPI communication via data type fission Proceedings of the 19th ACM International Symposium on High Performance Distributed Computing, (352-355)
  83. Massetto F, Kakugawa F, De Paula Bianchini C, Sato L, Hsu C, Li K and Gomes A A message forward tool for integration of clusters of clusters based on MPI architecture Proceedings of the Second Russia-Taiwan conference on Methods and tools of parallel programming multicomputers, (105-114)
  84. Howison M, Bethel E and Childs H MPI-hybrid parallelism for volume rendering on large, multi-core systems Proceedings of the 10th Eurographics conference on Parallel Graphics and Visualization, (1-10)
  85. Tota S, Casu M, Roch M, Rostagno L and Zamboni M MEDEA Proceedings of the Conference on Design, Automation and Test in Europe, (45-50)
  86. Gesbert L, Gava F, Loulergue F and Dabrowski F (2010). Bulk synchronous parallel ML with exceptions, Future Generation Computer Systems, 26:3, (486-490), Online publication date: 1-Mar-2010.
  87. ACM
    Orchard D, Bolingbroke M and Mycroft A Ypnos Proceedings of the 5th ACM SIGPLAN workshop on Declarative aspects of multicore programming, (15-24)
  88. ACM
    Lesniak M PASTHA Proceedings of the 5th ACM SIGPLAN workshop on Declarative aspects of multicore programming, (5-14)
  89. ACM
    Bekas C, Curioni A and Fedulova I Low cost high performance uncertainty quantification Proceedings of the 2nd Workshop on High Performance Computational Finance, (1-8)
  90. ACM
    Hildebrand D, Nisar A and Haskin R pNFS, POSIX, and MPI-IO Proceedings of the 4th Annual Workshop on Petascale Data Storage, (32-36)
  91. ACM
    Luecke G, Coyle J, Hoekstra J, Kraeva M, Xu Y, Kleiman E and Weiss O Evaluating error detection capabilities of UPC run-time systems Proceedings of the Third Conference on Partitioned Global Address Space Programming Models, (1-4)
  92. Lirkov I, Vutov Y, Paprzycki M and Ganzha M Parallel performance evaluation of MIC(0) preconditioning algorithm for voxel µFE simulation Proceedings of the 8th international conference on Parallel processing and applied mathematics: Part II, (135-144)
  93. Numrich R A team object for CoArray Fortran Proceedings of the 8th international conference on Parallel processing and applied mathematics: Part II, (68-73)
  94. Dursun H, Nomura K, Peng L, Seymour R, Wang W, Kalia R, Nakano A and Vashishta P A Multilevel Parallelization Framework for High-Order Stencil Computations Proceedings of the 15th International Euro-Par Conference on Parallel Processing, (642-653)
  95. Setia A, Swarup V, Kumar S and Singh L A novel adaptive fuzzy load balancer for heterogeneous LAM/MPI clusters applied to evolutionary learning in neuro-fuzzy systems Proceedings of the 18th international conference on Fuzzy Systems, (68-73)
  96. Maddalena L, Petrosino A and Laccetti G (2009). A fusion-based approach to digital movie restoration, Pattern Recognition, 42:7, (1485-1495), Online publication date: 1-Jul-2009.
  97. Larsen P, Karlsson S and Madsen J Identifying Inter-task Communication in Shared Memory Programming Models Proceedings of the 5th International Workshop on OpenMP: Evolving OpenMP in an Age of Extreme Parallelism, (168-182)
  98. ACM
    Chen W, Chu J, Luan J, Bai H, Wang Y and Chang E Collaborative filtering for orkut communities Proceedings of the 18th international conference on World wide web, (681-690)
  99. Sijoy C and Chaturvedi S (2009). Finite difference time domain algorithm for electromagnetic problems involving material movement, Journal of Computational Physics, 228:6, (2282-2295), Online publication date: 1-Apr-2009.
  100. Velho P and Legrand A Accuracy study and improvement of network simulation in the SimGrid framework Proceedings of the 2nd International Conference on Simulation Tools and Techniques, (1-10)
  101. Barker K, Davis K, Hoisie A, Kerbyson D, Lang M, Pakin S and Sancho J Entering the petaflop era Proceedings of the 2008 ACM/IEEE conference on Supercomputing, (1-11)
  102. ACM
    Bekkerman R and Scholz M Data weaving Proceedings of the 17th ACM conference on Information and knowledge management, (1083-1092)
  103. Song Y, Chen W, Bai H, Lin C and Chang E Parallel spectral clustering Proceedings of the 2008 European Conference on Machine Learning and Knowledge Discovery in Databases - Volume Part II, (374-389)
  104. Sancho J and Kerbyson D Improving the Performance of Multiple Conjugate Gradient Solvers by Exploiting Overlap Proceedings of the 14th international Euro-Par conference on Parallel Processing, (688-697)
  105. ACM
    Vakkalanka S, DeLisi M, Gopalakrishnan G and Kirby R Scheduling considerations for building dynamic verification tools for MPI Proceedings of the 6th workshop on Parallel and distributed systems: testing, analysis, and debugging, (1-6)
  106. Vakkalanka S, Gopalakrishnan G and Kirby R Dynamic Verification of MPI Programs with Reductions in Presence of Split Operations and Relaxed Orderings Proceedings of the 20th international conference on Computer Aided Verification, (66-79)
  107. Sebepou Z, Magoutis K, Marazakis M and Bilas A A comparative experimental study of parallel file systems for large-scale data processing First USENIX Workshop on Large-Scale Computing, (1-10)
  108. Kononov A, Riyanti C, Leeuw S, Oosterlee C and Vuik C (2008). Numerical performance of a parallel solution method for a heterogeneous 2D Helmholtz equation, Computing and Visualization in Science, 11:3, (139-146), Online publication date: 1-May-2008.
  109. ACM
    Zea N, Sartori J and Kumar R (2008). Servo, ACM SIGARCH Computer Architecture News, 36:2, (28-37), Online publication date: 1-May-2008.
  110. Trifunović A and Knottenbelt W (2008). Parallel multilevel algorithms for hypergraph partitioning, Journal of Parallel and Distributed Computing, 68:5, (563-581), Online publication date: 1-May-2008.
  111. Dalcín L, Paz R, Storti M and D'Elía J (2008). MPI for Python, Journal of Parallel and Distributed Computing, 68:5, (655-662), Online publication date: 1-May-2008.
  112. ACM
    Siegel S, Mironova A, Avrunin G and Clarke L (2008). Combining symbolic execution with model checking to verify parallel numerical programs, ACM Transactions on Software Engineering and Methodology, 17:2, (1-34), Online publication date: 1-Apr-2008.
  113. ACM
    Hill C and Rudolph L "Zen" and the art of petascale ocean modeling Proceedings of the 2nd workshop on System-level virtualization for high performance computing, (32-39)
  114. ACM
    Mattos G, Lins R, de Araújo Formiga A and Junqueira Martins F BigBatch Proceedings of the 2008 ACM symposium on Applied computing, (434-441)
  115. Salem S and Nandi A A new scalable and efficient parallel algorithm (PRACAL) for clustering large datasets Proceedings of the 19th IASTED International Conference on Parallel and Distributed Computing and Systems, (79-84)
  116. Fujiwara K and Casanova H Speed and accuracy of network simulation in the SimGrid framework Proceedings of the 2nd international conference on Performance evaluation methodologies and tools, (1-10)
  117. Brown R and Sharapov I (2007). High-scalability parallelization of a molecular modeling application, International Journal of Parallel Programming, 35:5, (441-458), Online publication date: 1-Oct-2007.
  118. Siegel S and Avrunin G Verification of halting properties for MPI programs using nonblocking operations Proceedings of the 14th European conference on Recent Advances in Parallel Virtual Machine and Message Passing Interface, (326-334)
  119. Träff J, Gropp W and Thakur R Self-consistent MPI performance requirements Proceedings of the 14th European conference on Recent Advances in Parallel Virtual Machine and Message Passing Interface, (36-45)
  120. Jia B Process cooperation in multiple message broadcast Proceedings of the 14th European conference on Recent Advances in Parallel Virtual Machine and Message Passing Interface, (27-35)
  121. Sanders P, Speck J and Träff J Full bandwidth broadcast, reduction and scan with only two trees Proceedings of the 14th European conference on Recent Advances in Parallel Virtual Machine and Message Passing Interface, (17-26)
  122. Mereuta L and Renault É Checkpointing aided parallel execution model and analysis Proceedings of the Third international conference on High Performance Computing and Communications, (707-717)
  123. De Falco I, Cioppa A, Scafuri U and Tarantino E Multiobjective differential evolution for mapping in a grid environment Proceedings of the Third international conference on High Performance Computing and Communications, (322-333)
  124. De Falco I, Cioppa A, Scafuri U and Tarantino E A multiobjective evolutionary approach for multisite mapping on grids Proceedings of the 7th international conference on Parallel processing and applied mathematics, (991-1000)
  125. Laine J and Midorikawa E Using analytical models to load balancing in a heterogeneous network of computers Proceedings of the 9th international conference on Parallel Computing Technologies, (559-568)
  126. Renault É Toward a distributed implementation of openMP using CAPE Proceedings of the 9th international conference on Parallel Computing Technologies, (303-312)
  127. Krishna M, Kumar A, Jayam N, Senthilkumar G, Baruah P, Sharma R, Kapoor S and Srinivasan A A synchronous mode MPI implementation on the Cell BE™ architecture Proceedings of the 5th international conference on Parallel and Distributed Processing and Applications, (982-991)
  128. Yang Y, Chen X, Gopalakrishnan G and Kirby R Distributed dynamic partial order reduction based verification of threaded software Proceedings of the 14th international SPIN conference on Model checking software, (58-75)
  129. Jenks S, Kim K, Li Y, Liu S, Zheng L, Kim M, Youn H, Lee K and Seol D (2007). A middleware model supporting time-triggered message-triggered objects for standard Linux systems, Real-Time Systems, 36:1-2, (75-99), Online publication date: 1-Jul-2007.
  130. ACM
    Huang W, Liu J, Koop M, Abali B and Panda D Nomad Proceedings of the 3rd international conference on Virtual execution environments, (158-168)
  131. ACM
    Kennedy K, Koelbel C and Zima H The rise and fall of High Performance Fortran Proceedings of the third ACM SIGPLAN conference on History of programming languages, (7-1-7-22)
  132. Kumar A, Senthilkumar G, Krishna M, Jayam N, Baruah P, Sharma R, Srinivasan A and Kapoor S A Buffered-Mode MPI Implementation for the Cell BE™ Processor Proceedings of the 7th international conference on Computational Science, Part I: ICCS 2007, (603-610)
  133. Bounanos S, Fleury M, Nicolas S and Vickers A (2007). Regular Paper, International Journal of High Performance Computing Applications, 21:2, (222-245), Online publication date: 1-May-2007.
  134. Falzone C, Chan A, Lusk E and Gropp W (2007). A Portable Method for Finding User Errors in the Usage of MPI Collective Operations, International Journal of High Performance Computing Applications, 21:2, (155-165), Online publication date: 1-May-2007.
  135. Siegel S Model checking nonblocking MPI programs Proceedings of the 8th international conference on Verification, model checking, and abstract interpretation, (44-58)
  136. Sussman A (2006). Building complex coupled physical simulations on the grid with InterComm, Engineering with Computers, 22:3-4, (311-323), Online publication date: 1-Dec-2006.
  137. ACM
    Chrisochoides N, Fedorov A, Kot A, Archip N, Black P, Clatz O, Golby A, Kikinis R and Warfield S Toward real-time image guided neurosurgery using distributed and grid computing Proceedings of the 2006 ACM/IEEE conference on Supercomputing, (76-es)
  138. Díaz M, Garrido D, Romero S, Rubio B, Soler E and Troya J (2006). Experiences with component-oriented technologies in nuclear power plant simulators, Software—Practice & Experience, 36:13, (1489-1512), Online publication date: 1-Nov-2006.
  139. Díaz M, Romero S, Rubio B, Soler E and Troya J Dynamic reconfiguration of scientific components using aspect oriented programming Proceedings of the 2006 Confederated international conference on On the Move to Meaningful Internet Systems: CoopIS, DOA, GADA, and ODBASE - Volume Part II, (1351-1360)
  140. ACM
    Pazel D and Tibbitts B Intentional MPI programming in a visual development environment Proceedings of the 2006 ACM symposium on Software visualization, (169-170)
  141. Leykin A and Verschelde J Interfacing with the numerical homotopy algorithms in PHCpack Proceedings of the Second international conference on Mathematical Software, (354-360)
  142. Natvig T and Elster A Automatic and transparent optimizations of an application's MPI communication Proceedings of the 8th international conference on Applied parallel computing: state of the art in scientific computing, (208-217)
  143. Nitsche T Coordinating computation with communication Proceedings of the 8th international conference on Coordination Models and Languages, (212-227)
  144. Aparício G, Blanquer I and Hernández V A parallel implementation of the k nearest neighbours classifier in three levels Proceedings of the 7th international conference on High performance computing for computational science, (225-235)
  145. Díaz M, Romero S, Rubio B, Soler E and Troya J Using SBASCO to solve reaction-diffusion equations in two-dimensional irregular domains Proceedings of the 6th international conference on Computational Science - Volume Part II, (912-919)
  146. Massetto F, Gomes Junior A and Sato L HyMPI – a MPI implementation for heterogeneous high performance systems Proceedings of the First international conference on Advances in Grid and Pervasive Computing, (314-323)
  147. Ritzdorf H and Träff J Collective operations in NEC's high-performance MPI libraries Proceedings of the 20th international conference on Parallel and distributed processing, (100-100)
  148. Boukerche A, Al-Shaikh R and Notare M Towards building a highly-available cluster based model for high performance computing Proceedings of the 20th international conference on Parallel and distributed processing, (324-324)
  149. Mamidala A, Chai L, Jin H and Panda D Efficient SMP-aware MPI-level broadcast over InfiniBand's hardware multicast Proceedings of the 20th international conference on Parallel and distributed processing, (272-272)
  150. Chen B, Fofanov V, Bryant D, Dodson B, Kristensen D, Lisewski A, Kimmel M, Lichtarge O and Kavraki L Geometric sieving Proceedings of the 10th annual international conference on Research in Computational Molecular Biology, (500-515)
  151. Chan E, van de Geijn R, Gropp W and Thakur R Collective communication on architectures that support simultaneous communication over multiple links Proceedings of the eleventh ACM SIGPLAN symposium on Principles and practice of parallel programming, (2-11)
  152. Cooney D, Dumas M and Roe P GPSL Proceedings of the 9th international conference on Fundamental Approaches to Software Engineering, (3-17)
  153. Agerwala T and Gupta M (2006). Systems research challenges, IBM Journal of Research and Development, 50:2/3, (173-180), Online publication date: 1-Mar-2006.
  154. Tehranian S, Zhao Y, Harvey T, Swaroop A and Mckenzie K (2006). A robust framework for real-time distributed processing of satellite data, Journal of Parallel and Distributed Computing, 66:3, (403-418), Online publication date: 1-Mar-2006.
  155. Juhász S Broadcasting algorithm of constant complexity for fully-switched clusters Proceedings of the 5th WSEAS International Conference on Software Engineering, Parallel and Distributed Systems, (104-109)
  156. Legrand A, Marchal L and Robert Y (2005). Optimizing the steady-state throughput of scatter and reduce operations on heterogeneous platforms, Journal of Parallel and Distributed Computing, 65:12, (1497-1514), Online publication date: 1-Dec-2005.
  157. Chaisiri S, Pichitlamken J, Uthayopas P, Rojanapanpat T, Phakhawirotkul S and Vorakosit T Applying Web Service and Windows Clustering for High Volume Risk Analysis Proceedings of the Eighth International Conference on High-Performance Computing in Asia-Pacific Region
  158. Aldred L, van der Aalst W, Dumas M and ter Hofstede A On the notion of coupling in communication middleware Proceedings of the 2005 OTM Confederated international conference on On the Move to Meaningful Internet Systems: CoopIS, COA, and ODBASE - Volume Part II, (1015-1033)
  159. Chaudhry G and Cormen T Oblivious vs. distribution-based sorting Proceedings of the 13th annual European conference on Algorithms, (317-328)
  160. Wilson B (2005). Introduction to parallel programming using message-passing, Journal of Computing Sciences in Colleges, 21:1, (207-211), Online publication date: 1-Oct-2005.
  161. Roweth D and Addison D Optimised gather collectives on QsNetII Proceedings of the 12th European PVM/MPI users' group conference on Recent Advances in Parallel Virtual Machine and Message Passing Interface, (407-414)
  162. Mamidala A, Jin H and Panda D Efficient hardware multicast group management for multiple MPI communicators over infiniband Proceedings of the 12th European PVM/MPI users' group conference on Recent Advances in Parallel Virtual Machine and Message Passing Interface, (388-398)
  163. Chai L, Noronha R, Gupta P, Brown G and Panda D Designing a portable MPI-2 over modern interconnects using uDAPL interface Proceedings of the 12th European PVM/MPI users' group conference on Recent Advances in Parallel Virtual Machine and Message Passing Interface, (200-208)
  164. Huang W, Santhanaraman G, Jin H and Panda D Design alternatives and performance trade-offs for implementing MPI-2 over infiniband Proceedings of the 12th European PVM/MPI users' group conference on Recent Advances in Parallel Virtual Machine and Message Passing Interface, (191-199)
  165. Falzone C, Chan A, Lusk E and Gropp W Collective error detection for MPI collective operations Proceedings of the 12th European PVM/MPI users' group conference on Recent Advances in Parallel Virtual Machine and Message Passing Interface, (138-147)
  166. Träff J An improved algorithm for (non-commutative) reduce-scatter with an application Proceedings of the 12th European PVM/MPI users' group conference on Recent Advances in Parallel Virtual Machine and Message Passing Interface, (129-137)
  167. Träff J and Ripke A An optimal broadcast algorithm adapted to SMP clusters Proceedings of the 12th European PVM/MPI users' group conference on Recent Advances in Parallel Virtual Machine and Message Passing Interface, (48-56)
  168. Buenabad-Chávez J, Castro-García M and Román-Alonso G Simple, list-based parallel programming with transparent load balancing Proceedings of the 6th international conference on Parallel Processing and Applied Mathematics, (920-927)
  169. Barros A, Dumas M and ter Hofstede A Service interaction patterns Proceedings of the 3rd international conference on Business Process Management, (302-318)
  170. Tinetti F and De Giusti A Broadcast-Based parallel LU factorization Proceedings of the 11th international Euro-Par conference on Parallel Processing, (867-876)
  171. Hermanns M, Mohr B and Wolf F Event-Based measurement and analysis of one-sided communication Proceedings of the 11th international Euro-Par conference on Parallel Processing, (156-165)
  172. Beecroft J, Addison D, Hewson D, McLaren M, Roweth D, Petrini F and Nieplocha J (2005). QsNetII, IEEE Micro, 25:4, (34-47), Online publication date: 1-Jul-2005.
  173. Yoon I, Sussman A and Porter A And away we go Proceedings of the second international workshop on Software engineering for high performance computing system applications, (45-49)
  174. Díaz M, Garrido D, Romero S, Rubio B, Soler E and Troya J A CCA-compliant nuclear power plant simulator kernel Proceedings of the 8th international conference on Component-Based Software Engineering, (283-297)
  175. Lee J and Sussman A High Performance Communication between Parallel Programs Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 4 - Volume 05
  176. Kim J, Andrade H and Sussman A Comparing the Performance of High-Level Middleware Systems in Shared and Distributed Memory Parallel Environments Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Papers - Volume 01
  177. Pounds A, Nalluri R and Coleman B The development of a Tri-Use cluster for general computer education, high performance computing education, and computationally intensive research Proceedings of the 43rd annual Southeast regional conference - Volume 1, (345-348)
  178. O'Cearbhaill E and O'Mahony M (2005). Parallel implementation of a transportation network model, Journal of Parallel and Distributed Computing, 65:1, (1-14), Online publication date: 1-Jan-2005.
  179. Wu J and Sussman A Flexible Control of Data Transfers between Parallel Programs Proceedings of the 5th IEEE/ACM International Workshop on Grid Computing, (226-234)
  180. Smith J, Sampaio S, Watson P and Paton N (2004). The Design, Implementation and Evaluation of an ODMG Compliant, Parallel Object Database Server, Distributed and Parallel Databases, 16:3, (275-319), Online publication date: 1-Nov-2004.
  181. Dotsenko Y, Coarfa C and Mellor-Crummey J A Multi-Platform Co-Array Fortran Compiler Proceedings of the 13th International Conference on Parallel Architectures and Compilation Techniques, (29-40)
  182. Garzón E, Tabik S, Bretones A and García I Analysis of the interaction of electromagnetic signals with thin-wires structures. multiprocessing issues for an iterative method Proceedings of the 6th international conference on High Performance Computing for Computational Science, (78-89)
  183. Sussman A and Andrade H Enabling coupled scientific simulations on the grid Proceedings of the 7th international conference on Applied Parallel Computing: state of the Art in Scientific Computing, (217-224)
  184. Worringen J, Traff J and Ritzdorf H Fast Parallel Non-Contiguous File Access Proceedings of the 2003 ACM/IEEE conference on Supercomputing
  185. Liu J, Chandrasekaran B, Wu J, Jiang W, Kini S, Yu W, Buntinas D, Wyckoff P and Panda D Performance Comparison of MPI Implementations over InfiniBand, Myrinet and Quadrics Proceedings of the 2003 ACM/IEEE conference on Supercomputing
  186. Fernandez J, Frachtenberg E and Petrini F BCS-MPI Proceedings of the 2003 ACM/IEEE conference on Supercomputing
  187. Olson R, Schmidt M, Gordon M and Rendell A Enabling the Efficient Use of SMP Clusters Proceedings of the 2003 ACM/IEEE conference on Supercomputing
  188. Van Der Steen A (2003). An Evaluation of Some Beowulf Clusters, Cluster Computing, 6:4, (287-297), Online publication date: 1-Oct-2003.
  189. Vetter J and Mueller F (2003). Communication characteristics of large-scale scientific applications for contemporary cluster architectures, Journal of Parallel and Distributed Computing, 63:9, (853-865), Online publication date: 1-Sep-2003.
  190. Alves A, Pina A, Exposto J and Rufino J ToCL Proceedings of the 2003 international conference on Computational science: PartII, (1022-1031)
  191. Awhad V and Wallace C A unified formal specification and analysis of the new java memory models Proceedings of the abstract state machines 10th international conference on Advances in theory and practice, (166-185)
  192. Dorband E, Hemsendorf M and Merritt D (2003). Systolic and hyper-systolic algorithms for the gravitational N-body problem, with an application to Brownian motion, Journal of Computational Physics, 185:2, (484-511), Online publication date: 1-Mar-2003.
  193. Bergen D and Miller B (2002). Building an MPI cluster, XRDS: Crossroads, The ACM Magazine for Students, 8:5, (2-2), Online publication date: 1-Aug-2002.
  194. Alves A, Pina A, Exposto J and Rufino J Scalable multithreading in a low latency Myrinet cluster Proceedings of the 5th international conference on High performance computing for computational science, (579-593)
  195. Kulkarni D and Sosonkina M A framework for integrating network information into distributed iterative solution of sparse linear systems Proceedings of the 5th international conference on High performance computing for computational science, (436-447)
  196. Eleftheriou M, Chatterjee S and Moreira J A C++ Implementation of the Co-Array Programming Model for Blue Gene/L Proceedings of the 16th International Parallel and Distributed Processing Symposium
  197. Reussner R, Sanders P and Traff J (2002). SKaMPI: a comprehensive benchmark for public benchmarking of MPI, Scientific Programming, 10:1, (55-65), Online publication date: 1-Jan-2002.
  198. Petitet A, Blackford S, Dongarra J, Ellis B, Fagg G, Roche K and Vadhiyar S Numerical libraries and the grid Proceedings of the 2001 ACM/IEEE conference on Supercomputing, (14-14)
  199. Chaudhry G, Cormen T and Wisniewski L Columnsort lives! an efficient out-of-core sorting program Proceedings of the thirteenth annual ACM symposium on Parallel algorithms and architectures, (169-178)
  200. Larsson Traff J, Ritzdorf H and Hempel R The implementation of MPI-2 one-sided communication for the NEC SX-5 Proceedings of the 2000 ACM/IEEE conference on Supercomputing, (1-es)
  201. Zaki O, Lusk E, Gropp W and Swider D (1999). Toward Scalable Performance Visualization with Jumpshot, International Journal of High Performance Computing Applications, 13:3, (277-288), Online publication date: 1-Aug-1999.
  202. Stunkel C, Herring J, Abali B and Sivaram R A new switch chip for IBM RS/6000 SP systems Proceedings of the 1999 ACM/IEEE conference on Supercomputing, (16-es)
  203. Ho S, Itoh S, Ihara S and Schlichting R Agent middleware for heterogeneous scientific simulations Proceedings of the 1998 ACM/IEEE conference on Supercomputing, (1-7)
Contributors
  • University of Illinois Urbana-Champaign
  • Oregon Health & Science University
  • Beloit College
  • The University of Tennessee, Knoxville
