From the Publisher:
At last, a practitioner's guide to parallel programming! Students and professionals who use parallel or distributed computer systems will be able to solve real problems with Designing and Building Parallel Programs. This book provides a comprehensive introduction to parallel algorithm design, performance analysis, and program construction. It describes the tools needed to write parallel programs and provides numerous examples. A unique feature is the companion on-line version, accessible via the World Wide Web using browsers such as Mosaic. This provides a convenient hypertext version of the text with pointers to programming tools, example programs, and other resources on parallel and distributed computing.
Cited By
- Vasconcelos V, Martins F, López H and Yoshida N (2022). A Type Discipline for Message Passing Parallel Programs, ACM Transactions on Programming Languages and Systems, 44:4, (1-55), Online publication date: 31-Dec-2022.
- Niño A, Reyes S and Carbó-Dorca R (2021). An HPC hybrid parallel approach to the experimental analysis of Fermat’s theorem extension to arbitrary dimensions on heterogeneous computer systems, The Journal of Supercomputing, 77:10, (11328-11352), Online publication date: 1-Oct-2021.
- van Wagensveld R, Wägemann T, Hehenkamp N, Kolagari R, Margull U and Mader R Intra-Task Parallelism in Automotive Real-Time Systems Proceedings of the 9th International Workshop on Programming Models and Applications for Multicores and Manycores, (61-70)
- Frömmgen A, Stohr D, Koldehofe B and Rizk A Don't repeat yourself Proceedings of the 14th International Conference on emerging Networking EXperiments and Technologies, (20-26)
- Liu X, Qiu M, Wang X, Liu W and Cai K (2017). Energy Efficiency Optimization for Communication of Air-Based Information Network with Guaranteed Timing Constraints, Journal of Signal Processing Systems, 86:2-3, (299-312), Online publication date: 1-Mar-2017.
- Stetsenko I (2017). Parallel Algorithm for Petri Object Simulation, Cybernetics and Systems Analysis, 53:4, (605-614), Online publication date: 1-Jul-2017.
- Reps B and Weinzierl T (2017). Complex Additive Geometric Multilevel Solvers for Helmholtz Equations on Spacetrees, ACM Transactions on Mathematical Software, 44:1, (1-36), Online publication date: 24-Jul-2017.
- Zaikin O (2017). A parallel SAT solving algorithm based on improved handling of conflict clauses, Procedia Computer Science, 119:C, (103-111), Online publication date: 1-Dec-2017.
- Öztürk M and Zengin A (2016). Improved GUI Testing using Task Parallel Library, ACM SIGSOFT Software Engineering Notes, 41:2, (1-8), Online publication date: 11-May-2016.
- Feilhauer T and Sobotka M (2016). DEF - a programming language agnostic framework and execution environment for the parallel execution of library routines, Journal of Cloud Computing: Advances, Systems and Applications, 5:1, (1-17), Online publication date: 1-Dec-2016.
- Ungerer T, Bradatsch C, Frieb M, Kluge F, Mische J, Stegmeier A, Jahr R, Gerdes M, Zaykov P, Matusova L, Li Z, Petrov Z, Böddeker B, Kehr S, Regler H, Hugl A, Rochange C, Ozaktas H, Cassé H, Bonenfant A, Sainrat P, Lay N, George D, Broster I, Quiñones E, Panic M, Abella J, Hernandez C, Cazorla F, Uhrig S, Rohde M and Pyka A (2016). Parallelizing Industrial Hard Real-Time Applications for the parMERASA Multicore, ACM Transactions on Embedded Computing Systems, 15:3, (1-27), Online publication date: 21-Jul-2016.
- Rupp K, Weinbub J, Jüngel A and Grasser T (2016). Pipelined Iterative Solvers with Kernel Fusion for Graphics Processing Units, ACM Transactions on Mathematical Software, 43:2, (1-27), Online publication date: 2-Sep-2016.
- Perlasca P, Valentini G, Frasca M and Mesiti M Multi-species protein function prediction Proceedings of the 18th International Conference on Information Integration and Web-based Applications and Services, (489-493)
- Karácsony M and Claessen K Using fusion to enable late design decisions for pipelined computations Proceedings of the 5th International Workshop on Functional High-Performance Computing, (9-16)
- Öztürk M and Zengin A (2016). Improved GUI Testing using Task Parallel Library, ACM SIGSOFT Software Engineering Notes, 41:1, (1-8), Online publication date: 22-Feb-2016.
- Ababneh I, Bani-Mohammad S and Al Smadi M (2015). Corner-Boundary Processor Allocation for 3D Mesh-Connected Multicomputers, International Journal of Cloud Applications and Computing, 5:1, (1-13), Online publication date: 1-Jan-2015.
- Pereira M and Cruvinel P (2015). A model for soil computed tomography based on volumetric reconstruction, Wiener filtering and parallel processing, Computers and Electronics in Agriculture, 111:C, (151-163), Online publication date: 1-Feb-2015.
- Lorenzon A, Cera M and Schneider Beck A (2015). Performance and Energy Evaluation of Different Multi-Threading Interfaces in Embedded and General Purpose Systems, Journal of Signal Processing Systems, 80:3, (295-307), Online publication date: 1-Sep-2015.
- Prasad S, McDermott M, Puri S, Shah D, Aghajarian D, Shekhar S and Zhou X (2015). A vision for GPU-accelerated parallel computation on geo-spatial datasets, SIGSPATIAL Special, 6:3, (19-26), Online publication date: 22-Apr-2015.
- Bailey M Fundamentals seminar ACM SIGGRAPH 2015 Courses, (1-129)
- Yanhaona M and Grimshaw A A Roadmap for a Type Architecture Based Parallel Programming Language Proceedings of the 2015 International Workshop on Code Optimisation for Multi and Many Cores, (1-10)
- Götz M, Bodenstein C and Riedel M HPDBSCAN Proceedings of the Workshop on Machine Learning in High-Performance Computing Environments, (1-10)
- López H, Marques E, Martins F, Ng N, Santos C, Vasconcelos V and Yoshida N (2015). Protocol-based verification of message-passing parallel programs, ACM SIGPLAN Notices, 50:10, (280-298), Online publication date: 18-Dec-2015.
- Liu Y, Gao P, Wong B and Keshav S (2014). Quartz, ACM SIGCOMM Computer Communication Review, 44:4, (283-294), Online publication date: 25-Feb-2015.
- Feldman D, Sung C, Sugaya A and Rus D (2015). iDiary, ACM Transactions on Sensor Networks, 11:4, (1-41), Online publication date: 23-Dec-2015.
- Sato Y, Hozumi S and Chiba S Calculation coverage testing in scientific applications Proceedings of the 2015 International Symposium on Software Testing and Analysis, (350-360)
- López H, Marques E, Martins F, Ng N, Santos C, Vasconcelos V and Yoshida N Protocol-based verification of message-passing parallel programs Proceedings of the 2015 ACM SIGPLAN International Conference on Object-Oriented Programming, Systems, Languages, and Applications, (280-298)
- Arkin E and Tekinerdogan B Domain Specific Language for Deployment of Parallel Applications on Parallel Computing Platforms Proceedings of the 2014 European Conference on Software Architecture Workshops, (1-8)
- Rong R, Hao J and Liu J Performance Study of a Minimalistic Simulator on XSEDE Massively Parallel Systems Proceedings of the 2014 Annual Conference on Extreme Science and Engineering Discovery Environment, (1-8)
- Lin C, Nagarajan V and Gupta R Fence scoping Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis, (105-116)
- Borrmann A, Flurl M, Jubierre J, Mundani R and Rank E (2018). Synchronous collaborative tunnel design based on consistency-preserving multi-scale models, Advanced Engineering Informatics, 28:4, (499-517), Online publication date: 1-Oct-2014.
- Shao Y, Cui B, Chen L, Ma L, Yao J and Xu N Parallel subgraph listing in a large-scale graph Proceedings of the 2014 ACM SIGMOD International Conference on Management of Data, (625-636)
- Liu Y, Gao P, Wong B and Keshav S Quartz Proceedings of the 2014 ACM conference on SIGCOMM, (283-294)
- Mubarak M, Seol S, Lu Q and Shephard M (2018). A parallel ghosting algorithm for the flexible distributed mesh database, Scientific Programming, 21:1-2, (17-42), Online publication date: 1-Jan-2013.
- Prakash A, Chaudhury A and Ramachandran R (2018). Parallel simulation of population balance model-based particulate processes using multicore CPUs and GPUs, Modelling and Simulation in Engineering, 2013, (2-2), Online publication date: 1-Jan-2013.
- Plavec F, Vranesic Z and Brown S (2013). Exploiting Task- and Data-Level Parallelism in Streaming Applications Implemented in FPGAs, ACM Transactions on Reconfigurable Technology and Systems (TRETS), 6:4, (1-37), Online publication date: 1-Dec-2013.
- Jahr R, Gerdes M and Ungerer T A pattern-supported parallelization approach Proceedings of the 2013 International Workshop on Programming Models and Applications for Multicores and Manycores, (53-62)
- Zhang Z, Katz D, Wozniak J, Espinosa A and Foster I Design and analysis of data management in scalable parallel scripting Proceedings of the International Conference on High Performance Computing, Networking, Storage and Analysis, (1-11)
- Feldman D, Sung C and Rus D The single pixel GPS Proceedings of the 20th International Conference on Advances in Geographic Information Systems, (23-32)
- Böhm S and Běhálek M Usage of petri nets for high performance computing Proceedings of the 1st ACM SIGPLAN workshop on Functional high-performance computing, (37-48)
- Lobachev O Parallel computation skeletons with premature termination property Proceedings of the 11th international conference on Functional and Logic Programming, (197-212)
- Laarman A, Langerak R, Van De Pol J, Weber M and Wijs A Multi-core nested depth-first search Proceedings of the 9th international conference on Automated technology for verification and analysis, (321-335)
- Wijs A Towards informed swarm verification Proceedings of the Third international conference on NASA Formal methods, (422-437)
- Scherger M (2011). A course in advanced operating systems using MPI for projects, Journal of Computing Sciences in Colleges, 26:4, (57-63), Online publication date: 1-Apr-2011.
- Gergel V and Labutina A The paraLab system for investigating the parallel algorithms Proceedings of the Second Russia-Taiwan conference on Methods and tools of parallel programming multicomputers, (95-104)
- Belgacem M, Abdennadher N and Niinimaki M Virtual EZ grid Proceedings of the 5th international conference on Advances in Grid and Pervasive Computing, (385-394)
- Harman M Why the virtual nature of software makes it ideal for search based optimization Proceedings of the 13th international conference on Fundamental Approaches to Software Engineering, (1-12)
- Epicoco I, Mocavero S and Aloisio G Experience on the parallelization of the OASIS3 coupler Proceedings of the Eighth Australasian Symposium on Parallel and Distributed Computing - Volume 107, (51-60)
- Bukhamsin A, Sindi M and Al-Jallal J Using the Intel MPI benchmarks (IMB) to evaluate MPI implementations on an Infiniband Nehalem Linux cluster Proceedings of the 2010 Spring Simulation Multiconference, (1-4)
- Dieterle M, Berthold J and Loogen R A skeleton for distributed work pools in eden Proceedings of the 10th international conference on Functional and Logic Programming, (337-353)
- Digas B, Melnikova L and Rozenberg V Application of parallel technologies to modeling lithosphere dynamics and seismicity Proceedings of the 8th international conference on Parallel processing and applied mathematics: Part II, (340-349)
- dos Santos J, de Lima F, Magalhães R, de Melo J and Neto A A parallel hybrid implementation using genetic algorithm, GRASP and reinforcement learning Proceedings of the 2009 international joint conference on Neural Networks, (2502-2507)
- Lima J and Maillard N (2009). Online mapping of MPI-2 dynamic tasks to processes and threads, International Journal of High Performance Systems Architecture, 2:2, (81-89), Online publication date: 1-Mar-2009.
- Sahni O, Carothers C, Shephard M and Jansen K (2018). Strong scaling analysis of a parallel, unstructured, implicit solver and the influence of the operating system interference, Scientific Programming, 17:3, (261-274), Online publication date: 1-Aug-2009.
- Morihata A, Matsuzaki K, Hu Z and Takeichi M (2009). The third homomorphism theorem on trees, ACM SIGPLAN Notices, 44:1, (177-185), Online publication date: 21-Jan-2009.
- Llorà X Data-intensive computing for competent genetic algorithms Proceedings of the 11th Annual conference on Genetic and evolutionary computation, (1387-1394)
- Le-Phuoc D, Polleres A, Hauswirth M, Tummarello G and Morbidoni C Rapid prototyping of semantic mash-ups through semantic web pipes Proceedings of the 18th international conference on World wide web, (581-590)
- Qiu M and Sha E (2009). Cost minimization while satisfying hard/soft timing constraints for heterogeneous embedded systems, ACM Transactions on Design Automation of Electronic Systems (TODAES), 14:2, (1-30), Online publication date: 1-Mar-2009.
- Morihata A, Matsuzaki K, Hu Z and Takeichi M The third homomorphism theorem on trees Proceedings of the 36th annual ACM SIGPLAN-SIGACT symposium on Principles of programming languages, (177-185)
- Katz G and Kider J All-pairs shortest-paths for large graphs on the GPU Proceedings of the 23rd ACM SIGGRAPH/EUROGRAPHICS symposium on Graphics hardware, (47-55)
- Larson J and Norris B Component specification for parallel coupling infrastructure Proceedings of the 2007 international conference on Computational science and its applications - Volume Part III, (55-68)
- Kwiatkowski J, Pawlik M and Konieczny D Comparison of execution time decomposition methods for performance evaluation Proceedings of the 7th international conference on Parallel processing and applied mathematics, (1160-1169)
- Rungta N and Mercer E Generating counter-examples through randomized guided search Proceedings of the 14th international SPIN conference on Model checking software, (39-57)
- Rufinus J and Kortsarts Y (2007). One-dimensional heat distribution problem and parallel computing concepts, Journal of Computing Sciences in Colleges, 22:3, (74-81), Online publication date: 1-Jan-2007.
- Sobral J Pluggable grid services Proceedings of the 8th IEEE/ACM International Conference on Grid Computing, (113-120)
- Peng H, Wang M and Lai C (2007). Design of parallel algorithms for fractal video compression, International Journal of Computer Mathematics, 84:2, (193-202), Online publication date: 1-Feb-2007.
- Son H and Li X PARMI Proceedings of the Third international conference on High Performance Computing and Communications, (19-29)
- Grando M and Mitrana V (2019). A Possible Connection Between Two Theories: Grammar Systems and Concurrent Programming, Fundamenta Informaticae, 76:3, (325-336), Online publication date: 1-Aug-2007.
- Huang J, Jiao X, Fujimoto R and Zha H DAG-guided parallel asynchronous variational integrators with super-elements Proceedings of the 2007 Summer Computer Simulation Conference, (691-697)
- Grando M and Mitrana V (2019). A Possible Connection Between Two Theories: Grammar Systems and Concurrent Programming, Fundamenta Informaticae, 76:3, (325-336), Online publication date: 1-Mar-2007.
- Dwyer M, Elbaum S, Person S and Purandare R Parallel Randomized State-Space Search Proceedings of the 29th international conference on Software Engineering, (3-12)
- Nitsche T Coordinating computation with communication Proceedings of the 8th international conference on Coordination Models and Languages, (212-227)
- Zhang J, Meleis W, Kaeli D and Wu T Acceleration of Maximum Likelihood Estimation for Tomosynthesis Mammography Proceedings of the 12th International Conference on Parallel and Distributed Systems - Volume 1, (291-299)
- Paszyński M and Demkowicz L (2006). Parallel, fully automatic hp-adaptive 3D finite element package, Engineering with Computers, 22:3-4, (255-276), Online publication date: 1-Dec-2006.
- Čiegis R and Baravykaitė M Implementation of a black-box global optimization algorithm with a parallel branch and bound template Proceedings of the 8th international conference on Applied parallel computing: state of the art in scientific computing, (1115-1125)
- Paszyński M, Pardo D, Demkowicz L and Torres-Verdin C Parallel hp-Finite Element Simulations of 3D Resistivity Logging Instruments Proceedings of the 2006 conference on Leading the Web in Concurrent Engineering: Next Generation Concurrent Engineering, (635-642)
- Bell A and Haverkort B (2006). Distributed disk-based algorithms for model checking very large Markov chains, Formal Methods in System Design, 29:2, (177-196), Online publication date: 1-Sep-2006.
- Sonoda E and Travieso G The OOPS framework Companion to the 21st ACM SIGPLAN symposium on Object-oriented programming systems, languages, and applications, (659-660)
- Folino G, Mendicino G, Senatore A, Spezzano G and Straface S (2006). A model based on cellular automata for the parallel simulation of 3D unsaturated flow, Parallel Computing, 32:5-6, (357-376), Online publication date: 1-Jun-2006.
- Chen J, Watson III W, Edwards R and Mao W Message Passing for Linux Clusters with Gigabit Ethernet Mesh Connections Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 9 - Volume 10
- Zhu W, Niu Y and Gao G Performance Portability on EARTH Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 15 - Volume 16
- Yin Z, Yuan L and Tang T (2005). A new parallel strategy for two-dimensional incompressible flow simulations using pseudo-spectral methods, Journal of Computational Physics, 210:1, (325-341), Online publication date: 20-Nov-2005.
- Medina A and Ramos-Paz A Driving point impedance computation applying parallel processing techniques Proceedings of the 7th WSEAS International Conference on Mathematical Methods and Computational Techniques In Electrical Engineering, (229-234)
- Bataineh S (2005). Toward an analytical solution to task allocation, processor assignment, and performance evaluation of network processors, Journal of Parallel and Distributed Computing, 65:1, (29-47), Online publication date: 1-Jan-2005.
- Inggs C and Barringer H (2005). CTL* Model Checking on a Shared-Memory Architecture, Electronic Notes in Theoretical Computer Science (ENTCS), 128:3, (107-123), Online publication date: 19-Apr-2005.
- Kengue J, Valtchev P and Djamegni C A parallel algorithm for lattice construction Proceedings of the Third international conference on Formal Concept Analysis, (249-264)
- Figueira S and Reddi V Topology-Based hypercube structures for global communication in heterogeneous networks Proceedings of the 11th international Euro-Par conference on Parallel Processing, (994-1004)
- Lafreniere B and Sodan A ScoPred–scalable user-directed performance prediction using complexity modeling and historical data Proceedings of the 11th international conference on Job Scheduling Strategies for Parallel Processing, (62-90)
- Gan Z and Harrison R Calibrating quantum chemistry Proceedings of the 2005 ACM/IEEE conference on Supercomputing
- Lipnikov K and Vassilevski Y (2004). On control of adaptation in parallel mesh generation, Engineering with Computers, 20:3, (193-201), Online publication date: 1-Sep-2004.
- Nam Y and Hwang E Real-Time transcoding of MPEG videos in a distributed environment Proceedings of the 5th international conference on Parallel and Distributed Computing: applications and Technologies, (280-283)
- Harbulot B and Gurd J Using AspectJ to separate concerns in parallel scientific Java code Proceedings of the 3rd international conference on Aspect-oriented software development, (122-131)
- Juhász S and Charaf H Exploiting fast ethernet performance in multiplatform cluster environment Proceedings of the 2004 ACM symposium on Applied computing, (1407-1411)
- Zhao Y (2004). Granule-oriented programming, ACM SIGPLAN Notices, 39:12, (107-118), Online publication date: 1-Dec-2004.
- Niculescu V (2019). On Data Distributions in the Construction of Parallel Programs, The Journal of Supercomputing, 29:1, (5-25), Online publication date: 1-Jul-2004.
- Dattilo G and Spezzano G (2003). Simulation of a cellular landslide model with CAMELOT on high performance computers, Parallel Computing, 29:10, (1403-1418), Online publication date: 1-Oct-2003.
- Gatu C and Kontoghiorghes E (2003). Parallel algorithms for computing all possible subset regression models using the QR decomposition, Parallel Computing, 29:4, (505-521), Online publication date: 1-Apr-2003.
- Escalante R (2003). Parallel strategies for the step by step Tau method, Applied Mathematics and Computation, 137:2-3, (277-292), Online publication date: 25-May-2003.
- Berthold J Dynamic chunking in eden Proceedings of the 15th international conference on Implementation of Functional Languages, (102-117)
- Dongarra J, Foster I, Fox G, Gropp W, Kennedy K, Torczon L and White A References Sourcebook of parallel computing, (729-789)
- Gorlatch S SAT Patterns and skeletons for parallel and distributed computing, (29-63)
- Isreb M and Khan A Internet traffic congestion modelling and parallel distributed analysis Proceedings of the 2003 international conference on Parallel and distributed processing and applications, (145-151)
- Padua D and Hoeflinger J Supercomputers Encyclopedia of Computer Science, (1710-1718)
- Gunawan T and Cai W (2019). Performance Analysis of a Myrinet-Based Cluster, Cluster Computing, 6:4, (299-313), Online publication date: 1-Oct-2003.
- Seinstra F and Koelma D (2002). P-3PC, IEEE Transactions on Parallel and Distributed Systems, 13:7, (758-768), Online publication date: 1-Jul-2002.
- Cremonesi P and Gennaro C (2002). Integrated Performance Models for SPMD Applications and MIMD Architectures, IEEE Transactions on Parallel and Distributed Systems, 13:7, (745-757), Online publication date: 1-Jul-2002.
- Povitsky A (2002). Parallel ADI solver based on processor scheduling, Applied Mathematics and Computation, 133:1, (43-81), Online publication date: 25-Nov-2002.
- Randall M and Lewis A (2002). A parallel implementation of ant colony optimization, Journal of Parallel and Distributed Computing, 62:9, (1421-1432), Online publication date: 1-Sep-2002.
- Dwivedula M, Hariri S and Parashar M A Software Design Model for Parallel Applications on Heterogeneous Systems Proceedings of the 16th International Parallel and Distributed Processing Symposium
- Anvik J, MacDonald S, Szafron D, Schaeffer J, Bromling S and Tan K Generating Parallel Programs from the Wavefront Design Pattern Proceedings of the 16th International Parallel and Distributed Processing Symposium
- Bergen D and Miller B (2002). Building an MPI cluster, XRDS: Crossroads, The ACM Magazine for Students, 8:5, (2-2), Online publication date: 1-Aug-2002.
- Cremonesi P and Gennaro C (2002). Integrated Performance Models for SPMD Applications and MIMD Architectures, IEEE Transactions on Parallel and Distributed Systems, 13:12, (1320-1332), Online publication date: 1-Dec-2002.
- Hodzic E and Shang W (2002). On Time Optimal Supernode Shape, IEEE Transactions on Parallel and Distributed Systems, 13:12, (1220-1233), Online publication date: 1-Dec-2002.
- Mukerjee A and Mali A (2019). Modular Models of Intelligence – Review, Limitations and Prospects, Artificial Intelligence Review, 17:1, (39-64), Online publication date: 1-Feb-2002.
- Swann C (2019). Maximum Likelihood Estimation Using Parallel Computing, Computational Economics, 19:2, (145-178), Online publication date: 1-Apr-2002.
- Cotronis J Modular MPI components and the composition of grid applications Proceedings of the 10th Euromicro conference on Parallel, distributed and network-based processing, (154-161)
- De Palma P, Wiborg A and Withers A (2001). Super computing on a budget, Journal of Computing Sciences in Colleges, 17:2, (71-77), Online publication date: 1-Dec-2001.
- Peña R and Rubio F Parallel functional programming at two levels of abstraction Proceedings of the 3rd ACM SIGPLAN international conference on Principles and practice of declarative programming, (187-198)
- Gau C and Stadtherr M Parallel interval-Newton using message passing Proceedings of the 2001 ACM/IEEE conference on Supercomputing, (23-23)
- Vetter J and McCracken M (2001). Statistical scalability analysis of communication operations in distributed applications, ACM SIGPLAN Notices, 36:7, (123-132), Online publication date: 1-Jul-2001.
- Vetter J and McCracken M Statistical scalability analysis of communication operations in distributed applications Proceedings of the eighth ACM SIGPLAN symposium on Principles and practices of parallel programming, (123-132)
- Mills R, Stathopoulos A and Smirni E Algorithmic modifications to the Jacobi-Davidson parallel eigensolver to dynamically balance external CPU and memory load Proceedings of the 15th international conference on Supercomputing, (454-463)
- Brown R Maximizing beowulf performance Proceedings of the 4th annual Linux Showcase & Conference - Volume 4, (29-29)
- Vetter J Performance analysis of distributed applications using automatic classification of communication inefficiencies Proceedings of the 14th international conference on Supercomputing, (245-254)
- Prieto M, Llorente I and Tirado F (2000). Data Locality Exploitation in the Decomposition of Regular Domain Problems, IEEE Transactions on Parallel and Distributed Systems, 11:11, (1141-1150), Online publication date: 1-Nov-2000.
- Nyland L, Prins J, Goldberg A and Mills P (2000). A Design Methodology for Data-Parallel Applications, IEEE Transactions on Software Engineering, 26:4, (293-314), Online publication date: 1-Apr-2000.
- Tongsima S, Sha E, Chantrapornchai C, Surma D and Passos N (2000). Probabilistic Loop Scheduling for Applications with Uncertain Execution Time, IEEE Transactions on Computers, 49:1, (65-80), Online publication date: 1-Jan-2000.
- Kandemir M, Banerjee P, Choudhary A, Ramanujam J and Shenoy N (1999). A global communication optimization technique based on data-flow analysis and linear algebra, ACM Transactions on Programming Languages and Systems (TOPLAS), 21:6, (1251-1297), Online publication date: 1-Nov-1999.
- Hall M, Kogge P, Koller J, Diniz P, Chame J, Draper J, LaCoss J, Granacki J, Brockman J, Srivastava A, Athas W, Freeh V, Shin J and Park J Mapping irregular applications to DIVA, a PIM-based data-intensive architecture Proceedings of the 1999 ACM/IEEE conference on Supercomputing, (57-es)
- Brest J, Žumer V and Ojsteršek M Dynamic scheduling on a PC cluster Proceedings of the 1999 ACM symposium on Applied computing, (496-500)
- Chung Y and Prasanna V (1998). Parallelizing Image Feature Extraction on Coarse-Grain Machines, IEEE Transactions on Pattern Analysis and Machine Intelligence, 20:12, (1389-1394), Online publication date: 1-Dec-1998.
- Anglano C Predicting parallel applications performance on non-dedicated cluster platforms Proceedings of the 12th international conference on Supercomputing, (172-179)
- Sodan A (1998). Yin and yang in computer science, Communications of the ACM, 41:4, (103-114), Online publication date: 1-Apr-1998.
- Yang L and Misra M (2019). Coarse-Grained Parallel Algorithms for Multi-DimensionalWavelet Transforms, The Journal of Supercomputing, 12:1-2, (99-118), Online publication date: 1-Jan-1998.
- Tongsima S, Chantrapornchai C, Sha E and Passos N Probabilistic Rotation Proceedings of the international Conference on Parallel Processing
- Melin E, Raffin B, Rebeuf X and Virot B SCL-chan Proceedings of the 1997 Workshop on High-Level Programming Models and Supportive Environments (HIPS '97)
- He X and Yao W Translating hierarchical predicate transition nets to CC++ program skeletons Proceedings of the 21st International Computer Software and Applications Conference, (60-65)
- Diaz M, Rubio B and Troya J Logic Channels Proceedings of the 11th International Symposium on Parallel Processing, (436-442)
- Diao S Parallel Implementation of Chemically Reactive Euler Solver Proceedings of the High-Performance Computing on the Information Superhighway, HPC-Asia '97
- Jung K, Hong J and Yoo K Experiments with Java for Computing Intensive Problems Proceedings of the High-Performance Computing on the Information Superhighway, HPC-Asia '97
- Fiedler R Optimization and scaling of shared-memory and message-passing implementations of the ZEUS hydrodynamics algorithm Proceedings of the 1997 ACM/IEEE conference on Supercomputing, (1-16)
- Zeng N and Schach S Is the object-oriented paradigm appropriate for scientific software? Proceedings of the 35th Annual Southeast Regional Conference, (98-105)
- Dillon E, Dos Santos C and Guyard J (1997). Teaching an engineering approach for network computing, ACM SIGCSE Bulletin, 29:1, (229-232), Online publication date: 1-Mar-1997.
- Dillon E, Dos Santos C and Guyard J Teaching an engineering approach for network computing Proceedings of the twenty-eighth SIGCSE technical symposium on Computer science education, (229-232)
- Arredondo D, Errecalde M, Piccoli F, Printista M, Gallard R and Flores s (1997). Load distribution and balancing support in a workstation-based distributed system, ACM SIGOPS Operating Systems Review, 31:2, (46-59), Online publication date: 1-Apr-1997.
- Tseng Y, Lin T, Panda D and Gupta S (1997). Bandwidth-Optimal Complete Exchange on Wormhole-Routed 2D/3D Torus Networks, IEEE Transactions on Parallel and Distributed Systems, 8:4, (380-396), Online publication date: 1-Apr-1997.
- Fang N Engineering Parallel Algorithms Proceedings of the 5th IEEE International Symposium on High Performance Distributed Computing
- Yao W and He X Mapping Petri Nets to Parallel Programs in CC++ Proceedings of the 20th Conference on Computer Software and Applications
- Pancake C (1996). Is Parallelism for You?, IEEE Computational Science & Engineering, 3:2, (18-37), Online publication date: 1-Jun-1996.
- Bode P, Xu G and Cen R A parallel cosmological hydrodynamics code Proceedings of the 1996 ACM/IEEE conference on Supercomputing, (27-es)
- Hosseini A, Mavroidis D and Konas P Code generation and analysis for the functional verification of micro processors Proceedings of the 33rd annual Design Automation Conference, (305-310)
- Molitoris J and Taylor T Advanced simulation, battle managers, and visualization Proceedings of the 27th conference on Winter simulation, (1168-1175)
- Crandall P, Aydt R, Chien A and Reed D Input/output characteristics of scalable parallel applications Proceedings of the 1995 ACM/IEEE conference on Supercomputing, (59-es)
Recommendations
The NYU Ultracomputer Designing an MIMD Shared Memory Parallel Computer
We present the design for the NYU Ultracomputer, a shared-memory MIMD parallel machine composed of thousands of autonomous processing elements. This machine uses an enhanced message switching network with the geometry of an Omega-network to approximate ...
Compiling data-parallel programs for clusters of SMPs: Research Articles
Compilers for Parallel ComputersClusters of shared-memory multiprocessors (SMPs) have become the most promising parallel computing platforms for scientific computing. However, SMP clusters significantly increase the complexity of user application development when using the low-level ...
Compiling machine-independent parallel programs
Initial evidence is presented that explicitly parallel, machine-independent programs can automatically be translated into parallel machine code that is competitive in performance with hand-written code.The programming language used is Modula-2*, an ...