skip to main content
Skip header Section
Partitioning and Scheduling Parallel Programs for MultiprocessorsMarch 1989
Publisher:
  • MIT Press
  • 55 Hayward St.
  • Cambridge
  • MA
  • United States
ISBN:978-0-262-69130-7
Published:01 March 1989
Pages:
215
Skip Bibliometrics Section
Bibliometrics
Skip Abstract Section
Abstract

From the Publisher:

This book is one of the first to address the problem of forming useful parallelism from potential parallelism and to provide a general solution.

The book presents two approaches to automatic partitioning and scheduling so that the same parallel program can be made to execute efficiently on widely different multiprocessors. The first approach is based on a macro dataflow model in which the program is partitioned into tasks at compile time and the tasks are scheduled on processors at run time. The second approach is based on a compile time scheduling model, where both the partitioning and scheduling are performed at compile time.

Both approaches have been implemented in partition programs written in the single assignment language SISAL. The inputs to the partitioning and scheduling algorithms are a graphical representation of the parallel program and a list of parameters describing the target multiprocessor. Execution profile information is used to derive compile-time estimates of execution times and data sizes in the program. Both the macro dataflow and compile-time scheduling problems are expressed as optimization problems and are shown to be NP complete in the strong sense. Efficient approximation algorithms for these problems are presented. Finally, the effectiveness of the partitioning and scheduling algorithms is studied by multiprocessor simulations of various SISAL benchmark programs for different target multiprocessor parameters.

Vivek Sarkar is a Member of Research Staff at the IBM T. J. Watson Research Center. Partitioning and Scheduling Parallel Programs for Multiprocessing is included in the series Research Monographs in Parallel and DistributedComputing. Copublished with Pitman Publishing.

Cited By

  1. Bock A Static Partitioning of Spreadsheets for Parallel Execution Practical Aspects of Declarative Languages, (221-237)
  2. Izadkhah H and Li Y (2019). Learning Based Genetic Algorithm for Task Graph Scheduling, Applied Computational Intelligence and Soft Computing, 2019, Online publication date: 1-Jan-2019.
  3. ACM
    Schreiber E, Korf R and Moffitt M (2018). Optimal Multi-Way Number Partitioning, Journal of the ACM, 65:4, (1-61), Online publication date: 16-Aug-2018.
  4. Popa E, Iacono M and Pop F (2018). Adapting MCP and HLFET Algorithms to Multiple Simultaneous Scheduling, International Journal of Parallel Programming, 46:3, (607-629), Online publication date: 1-Jun-2018.
  5. Yoosefi A and Naji H (2017). A Clustering Algorithm for Communication-Aware Scheduling of Task Graphs on Multi-Core Reconfigurable Systems, IEEE Transactions on Parallel and Distributed Systems, 28:10, (2718-2732), Online publication date: 1-Oct-2017.
  6. Khan M (2017). Task scheduling for heterogeneous systems using an incremental approach, The Journal of Supercomputing, 73:5, (1905-1928), Online publication date: 1-May-2017.
  7. Deldari A, Naghibzadeh M and Abrishami S (2017). CCA, The Journal of Supercomputing, 73:2, (756-781), Online publication date: 1-Feb-2017.
  8. Kanemitsu H, Hanada M and Nakazato H (2016). Clustering-Based Task Scheduling in a Large Number of Heterogeneous Processors, IEEE Transactions on Parallel and Distributed Systems, 27:11, (3144-3157), Online publication date: 1-Nov-2016.
  9. Kliazovich D, Pecero J, Tchernykh A, Bouvry P, Khan S and Zomaya A (2016). CA-DAG, Journal of Grid Computing, 14:1, (23-39), Online publication date: 1-Mar-2016.
  10. ACM
    Madhu K, Rao A, Das S, Madhava K, Nandy S and Narayan R Flexible resource allocation and management for application graphs on ReNÉ MPSoC Proceedings of the 7th Workshop on Parallel Programming and Run-Time Management Techniques for Many-core Architectures and the 5th Workshop on Design Tools and Architectures For Multicore Embedded Computing Platforms, (13-18)
  11. ACM
    Park H, Malik A and Salcic Z (2015). Scheduling Globally Asynchronous Locally Synchronous Programs for Guaranteed Response Times, ACM Transactions on Design Automation of Electronic Systems, 20:3, (1-25), Online publication date: 24-Jun-2015.
  12. Wu J, Zeng J, Yu H and Kenny J CommGram Proceedings of the First Workshop on Visual Performance Analysis, (28-35)
  13. ACM
    Yang T and Gerasoulis A PYRROS ACM International Conference on Supercomputing 25th Anniversary Volume, (163-172)
  14. ACM
    Yang T and Gerasoulis A Author retrospective for PYRROS ACM International Conference on Supercomputing 25th Anniversary Volume, (18-20)
  15. ACM
    Stork S, Naden K, Sunshine J, Mohr M, Fonseca A, Marques P and Aldrich J (2014). Æminium, ACM Transactions on Programming Languages and Systems, 36:1, (1-42), Online publication date: 1-Mar-2014.
  16. ACM
    Cordes D, Engel M, Marwedel P and Neugebauer O Automatic extraction of multi-objective aware pipeline parallelism using genetic algorithms Proceedings of the eighth IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis, (73-82)
  17. David T, Jacquelin M and Marchal L (2012). Scheduling streaming applications on a complex multicore platform, Concurrency and Computation: Practice & Experience, 24:15, (1726-1750), Online publication date: 1-Oct-2012.
  18. Cordes D and Marwedel P Multi-objective aware extraction of task-level parallelism using genetic algorithms Proceedings of the Conference on Design, Automation and Test in Europe, (394-399)
  19. Colin J and Nakechbandi M Scheduling tasks and communications on a hierarchical system with message contention Proceedings of the 11th international conference on Algorithms and architectures for parallel processing - Volume Part II, (89-98)
  20. Berzins M, Meng Q, Schmidt J and Sutherland J DAG-Based software frameworks for PDEs Proceedings of the 2011 international conference on Parallel Processing, (324-333)
  21. ACM
    Hsu C, Pino J and Bhattacharyya S (2011). Multithreaded Simulation for Synchronous Dataflow Graphs, ACM Transactions on Design Automation of Electronic Systems, 16:3, (1-23), Online publication date: 1-Jun-2011.
  22. Raina A and Muthukumar V (2011). A unified design space simulation environment for network-on-chip: fuse-N, International Journal of High Performance Systems Architecture, 3:1, (23-32), Online publication date: 1-Jan-2011.
  23. ACM
    Cordes D, Marwedel P and Mallik A Automatic parallelization of embedded software using hierarchical task graphs and integer linear programming Proceedings of the eighth IEEE/ACM/IFIP international conference on Hardware/software codesign and system synthesis, (267-276)
  24. Cordasco G and Rosenberg A Area-maximizing schedules for series-parallel DAGs Proceedings of the 16th international Euro-Par conference on Parallel processing: Part II, (380-392)
  25. Wang L, von Laszewski G, Dayal J and Wang F Towards Energy Aware Scheduling for Precedence Constrained Parallel Tasks in a Cluster with DVFS Proceedings of the 2010 10th IEEE/ACM International Conference on Cluster, Cloud and Grid Computing, (368-377)
  26. Ghader H, KeyKhosravi D and HosseinAliPour A DAG scheduling on heterogeneous distributed systems using learning automata Proceedings of the Second international conference on Intelligent information and database systems: Part II, (247-257)
  27. Pecero J, Trystram D and Zomaya A A New Genetic Algorithm for Scheduling for Large Communication Delays Proceedings of the 15th International Euro-Par Conference on Parallel Processing, (241-252)
  28. ACM
    Baskaran M, Vydyanathan N, Bondhugula U, Ramanujam J, Rountev A and Sadayappan P (2009). Compiler-assisted dynamic scheduling for effective parallelization of loop nests on multicore processors, ACM SIGPLAN Notices, 44:4, (219-228), Online publication date: 14-Feb-2009.
  29. ACM
    Baskaran M, Vydyanathan N, Bondhugula U, Ramanujam J, Rountev A and Sadayappan P Compiler-assisted dynamic scheduling for effective parallelization of loop nests on multicore processors Proceedings of the 14th ACM SIGPLAN symposium on Principles and practice of parallel programming, (219-228)
  30. Quan D and Hsu D (2008). Mapping Heavy Communication Grid-Based Workflows Onto Grid Resources Within an SLA Context Using Metaheuristics, International Journal of High Performance Computing Applications, 22:3, (330-346), Online publication date: 1-Aug-2008.
  31. ACM
    Hsu C, Pino J and Bhattacharyya S Multithreaded simulation for synchronous dataflow graphs Proceedings of the 45th annual Design Automation Conference, (331-336)
  32. Roig C, Ripoll A and Guirado F (2007). A New Task Graph Model for Mapping Message Passing Applications, IEEE Transactions on Parallel and Distributed Systems, 18:12, (1740-1753), Online publication date: 1-Dec-2007.
  33. ACM
    Ayyub S and Abramson D GridRod Proceedings of the 21st annual international conference on Supercomputing, (43-52)
  34. Ruan Y, Liu G, Han J and Li Q Communication-Aware Scheduling Algorithm Based on Heterogeneous Computing Systems Proceedings of the 7th international conference on Computational Science, Part I: ICCS 2007, (426-429)
  35. Lee S, Kumara S and Gautam N (2007). Efficient scheduling algorithm for component-based networks, Future Generation Computer Systems, 23:4, (558-568), Online publication date: 1-May-2007.
  36. Liu L, Li X, Chen M and Ju R A throughput-driven task creation and mapping for network processors Proceedings of the 2nd international conference on High performance embedded architectures and compilers, (227-241)
  37. ACM
    Ko D and Bhattacharyya S The pipeline decomposition tree: Proceedings of the 4th international conference on Hardware/software codesign and system synthesis, (52-57)
  38. ACM
    Muhammad F, Muller F and Auguin M Contentions-conscious dynamic but deterministic scheduling of computational and communication tasks Proceedings of the 2006 ACM symposium on Applied computing, (1487-1492)
  39. Gautama H and van Gemund A (2006). Low-Cost Static Performance Prediction of Parallel Stochastic Task Compositions, IEEE Transactions on Parallel and Distributed Systems, 17:1, (78-91), Online publication date: 1-Jan-2006.
  40. Sekhar A, Manoj B and Murthy C A state-space search approach for optimizing reliability and cost of execution in distributed sensor networks Proceedings of the 7th international conference on Distributed Computing, (63-74)
  41. Baskiyar S and Dickinson C (2005). Scheduling directed a-cyclic task graphs on a bounded set of heterogeneous processors using task duplication, Journal of Parallel and Distributed Computing, 65:8, (911-921), Online publication date: 1-Aug-2005.
  42. Sinnen O and Sousa L (2005). Communication Contention in Task Scheduling, IEEE Transactions on Parallel and Distributed Systems, 16:6, (503-515), Online publication date: 1-Jun-2005.
  43. Hakem M and Butelle F Dynamic Critical Path Scheduling Parallel Programs onto Multiprocessors Proceedings of the 19th IEEE International Parallel and Distributed Processing Symposium (IPDPS'05) - Workshop 8 - Volume 09
  44. Muthuvelu N, Liu J, Soe N, Venugopal S, Sulistio A and Buyya R A dynamic job grouping-based scheduling for deploying applications with fine-grained tasks on global grids Proceedings of the 2005 Australasian workshop on Grid computing and e-research - Volume 44, (41-48)
  45. ACM
    Nanda M, Chandra S and Sarkar V Decentralizing execution of composite web services Proceedings of the 19th annual ACM SIGPLAN conference on Object-oriented programming, systems, languages, and applications, (170-187)
  46. ACM
    Nanda M, Chandra S and Sarkar V (2004). Decentralizing execution of composite web services, ACM SIGPLAN Notices, 39:10, (170-187), Online publication date: 1-Oct-2004.
  47. ACM
    Gruau F, Lhuillier Y, Reitz P and Temam O BLOB computing Proceedings of the 1st conference on Computing frontiers, (125-139)
  48. Bajaj R and Agrawal D (2004). Improving Scheduling of Tasks in a Heterogeneous Environment, IEEE Transactions on Parallel and Distributed Systems, 15:2, (107-118), Online publication date: 1-Feb-2004.
  49. Sinnen O and Sousa L (2004). On Task Scheduling Accuracy, The Journal of Supercomputing, 27:2, (177-194), Online publication date: 1-Feb-2004.
  50. Lai G, Fang J, Sung P and Pean D Scheduling parallel tasks onto NUMA multiprocessors with inter-processor communication overhead Proceedings of the 2003 international conference on Parallel and distributed processing and applications, (65-75)
  51. Lee H, Kim J, Hong S and Lee S (2003). Processor Allocation and Task Scheduling of Matrix Chain Products on Parallel Systems, IEEE Transactions on Parallel and Distributed Systems, 14:4, (394-407), Online publication date: 1-Apr-2003.
  52. Rosien M, Guo Y, Smit G and Krol T Mapping Applications to an FPFA Tile Proceedings of the conference on Design, Automation and Test in Europe - Volume 1
  53. van Gemund A (2003). Symbolic Performance Modeling of Parallel Systems, IEEE Transactions on Parallel and Distributed Systems, 14:2, (154-165), Online publication date: 1-Feb-2003.
  54. Bambha N, Kianzad V, Khandelia M and Bhattacharyya S (2002). Intermediate Representations for Design Automation of Multiprocessor DSP Systems, Design Automation for Embedded Systems, 7:4, (307-323), Online publication date: 1-Nov-2002.
  55. Radulescu A and van Gemund A (2002). Low-Cost Task Scheduling for Distributed-Memory Machines, IEEE Transactions on Parallel and Distributed Systems, 13:6, (648-658), Online publication date: 1-Jun-2002.
  56. Lepère R and Trystram D A New Clustering Algorithm for Large Communication Delays Proceedings of the 16th International Parallel and Distributed Processing Symposium
  57. Konstantinou D and Koziris N TOPPER Proceedings of the 8th Panhellenic conference on Informatics, (336-350)
  58. Gupta R and De Micheli G Hardware-software cosynthesis for digital systems Readings in hardware/software co-design, (5-17)
  59. Wu M, Shu W and Gu J (2001). Efficient Local Search for DAG Scheduling, IEEE Transactions on Parallel and Distributed Systems, 12:6, (617-627), Online publication date: 1-Jun-2001.
  60. Ahmad I, Kwok Y, Wu M and Shu W (2000). CASCH, IEEE Concurrency, 8:4, (21-33), Online publication date: 1-Oct-2000.
  61. Ranaweera S and Agrawal D A Scalable Task Duplication Based Scheduling Algorithm for Heterogeneous Systems Proceedings of the Proceedings of the 2000 International Conference on Parallel Processing
  62. Wu M, Shu W and Chen Y Runtime Parallel Incremental Scheduling of DAGs Proceedings of the Proceedings of the 2000 International Conference on Parallel Processing
  63. Kadamuddi D and Tsai J (2000). Clustering Algorithm for Parallelizing Software Systems in Multiprocessors Environment, IEEE Transactions on Software Engineering, 26:4, (340-361), Online publication date: 1-Apr-2000.
  64. ACM
    Lee H, Kim J, Hong S and Lee S Task scheduling using a block dependency DAG for block-oriented sparse Cholesky factorization Proceedings of the 2000 ACM symposium on Applied computing - Volume 2, (641-648)
  65. ACM
    Kwok Y and Ahmad I (1999). Static scheduling algorithms for allocating directed task graphs to multiprocessors, ACM Computing Surveys, 31:4, (406-471), Online publication date: 1-Dec-1999.
  66. Gao L, Rosenberg A and Sitaraman R (1999). Optimal Clustering of Tree-Sweep Computations for High-Latency Parallel Environments, IEEE Transactions on Parallel and Distributed Systems, 10:8, (813-824), Online publication date: 1-Aug-1999.
  67. ACM
    Grajcar M Genetic list scheduling algorithm for scheduling and allocation on a loosely coupled heterogeneous multiprocessor system Proceedings of the 36th annual ACM/IEEE Design Automation Conference, (280-285)
  68. Gajjala Purna K and Bhatia D (1999). Temporal Partitioning and Scheduling Data Flow Graphs for Reconfigurable Computers, IEEE Transactions on Computers, 48:6, (579-590), Online publication date: 1-Jun-1999.
  69. Ahmad I and Kwok Y (1999). On Parallelizing the Multiprocessor Scheduling Problem, IEEE Transactions on Parallel and Distributed Systems, 10:4, (414-432), Online publication date: 1-Apr-1999.
  70. ACM
    Grandpierre T, Lavarenne C and Sorel Y Optimized rapid prototyping for real-time embedded heterogeneous multiprocessors Proceedings of the seventh international workshop on Hardware/software codesign, (74-78)
  71. Kwok Y and Ahmad I (1999). FASTEST, IEEE Transactions on Parallel and Distributed Systems, 10:2, (147-159), Online publication date: 1-Feb-1999.
  72. ACM
    Lee W, Barua R, Frank M, Srikrishna D, Babb J, Sarkar V and Amarasinghe S (1998). Space-time scheduling of instruction-level parallelism on a raw machine, ACM SIGOPS Operating Systems Review, 32:5, (46-57), Online publication date: 1-Dec-1998.
  73. ACM
    Yang T and Fu C (1998). Space/time-efficient scheduling and execution of parallel irregular computations, ACM Transactions on Programming Languages and Systems, 20:6, (1195-1222), Online publication date: 1-Nov-1998.
  74. ACM
    Lee W, Barua R, Frank M, Srikrishna D, Babb J, Sarkar V and Amarasinghe S (1998). Space-time scheduling of instruction-level parallelism on a raw machine, ACM SIGPLAN Notices, 33:11, (46-57), Online publication date: 1-Nov-1998.
  75. ACM
    Lee W, Barua R, Frank M, Srikrishna D, Babb J, Sarkar V and Amarasinghe S Space-time scheduling of instruction-level parallelism on a raw machine Proceedings of the eighth international conference on Architectural support for programming languages and operating systems, (46-57)
  76. Darbha S and Pande S (1998). A Robust Compile Time Method for SchedulingTask Parallelism on Distributed Memory Machines, The Journal of Supercomputing, 12:4, (325-347), Online publication date: 1-Oct-1998.
  77. Klasing R (1998). Improved Compressions of Cube-Connected Cycles Networks, IEEE Transactions on Parallel and Distributed Systems, 9:8, (803-812), Online publication date: 1-Aug-1998.
  78. Chapin S and Spafford E (1998). Dissemination of state information in distributed autonomous systems, Computer Communications, 21:11, (969-979), Online publication date: 1-Aug-1998.
  79. Amoura A, Bampis E and König J (1998). Scheduling Algorithms for Parallel Gaussian Elimination With Communication Costs, IEEE Transactions on Parallel and Distributed Systems, 9:7, (679-686), Online publication date: 1-Jul-1998.
  80. Donaldson V and Ferrante J (1998). Analyzing Asynchronous Pipeline Schedules, International Journal of Parallel Programming, 26:1, (5-42), Online publication date: 1-Feb-1998.
  81. Darbha S and Agrawal D (1998). Optimal Scheduling Algorithm for Distributed-Memory Machines, IEEE Transactions on Parallel and Distributed Systems, 9:1, (87-95), Online publication date: 1-Jan-1998.
  82. Ramaswamy S, Sapatnekar S and Banerjee P (1997). A Framework for Exploiting Task and Data Parallelism on Distributed Memory Multicomputers, IEEE Transactions on Parallel and Distributed Systems, 8:11, (1098-1116), Online publication date: 1-Nov-1997.
  83. Ahmad I, Kwok Y, Wu M and Shu W Automatic Parallelization and Scheduling of Programs on Multiprocessors using CASCH Proceedings of the international Conference on Parallel Processing, (288-291)
  84. Wang Y, Amato N and Friesen D Hindsight Helps Proceedings of the international Conference on Parallel Processing, (170-173)
  85. Wu M and Shu W (1997). On Parallelization of Static Scheduling Algorithms, IEEE Transactions on Software Engineering, 23:8, (517-528), Online publication date: 1-Aug-1997.
  86. ACM
    Fu C and Yang T (1997). Space and time efficient execution of parallel irregular computations, ACM SIGPLAN Notices, 32:7, (57-68), Online publication date: 1-Jul-1997.
  87. ACM
    Fu C and Yang T Space and time efficient execution of parallel irregular computations Proceedings of the sixth ACM SIGPLAN symposium on Principles and practice of parallel programming, (57-68)
  88. ACM
    Megiddo N and Sarkar V Optimal weighted loop fusion for parallel programs Proceedings of the ninth annual ACM symposium on Parallel algorithms and architectures, (282-291)
  89. Yang T and Fu C (1997). Heuristic Algorithms for Scheduling Iterative Task Computations on Distributed Memory Machines, IEEE Transactions on Parallel and Distributed Systems, 8:6, (608-622), Online publication date: 1-Jun-1997.
  90. Liou J and Palis M A Comparison of General Approaches to Multiprocessor Scheduling Proceedings of the 11th International Symposium on Parallel Processing, (152-156)
  91. ACM
    Subhlok J and Vondran G Optimal latency-throughput tradeoffs for data parallel pipelines Proceedings of the eighth annual ACM symposium on Parallel Algorithms and Architectures, (62-71)
  92. Kwok Y and Ahmad I (1996). Dynamic Critical-Path Scheduling, IEEE Transactions on Parallel and Distributed Systems, 7:5, (506-521), Online publication date: 1-May-1996.
  93. Kim J and Shin K (1996). Execution Time Analysis of Communicating Tasks in Distributed Systems, IEEE Transactions on Computers, 45:5, (572-579), Online publication date: 1-May-1996.
  94. Yang T and Ibarra O (1996). Performance Prediction in Symbolic Scheduling of Partitioned Programs with Weight Variation, Journal of Parallel and Distributed Computing, 34:2, (171-182), Online publication date: 1-May-1996.
  95. Fu C and Yang T Efficient Run-Time Support for Irregular Task Computations with Mixed Granularities Proceedings of the 10th International Parallel Processing Symposium, (823-830)
  96. ACM
    Schumann M Automatic performance prediction to support cross development of parallel programs Proceedings of the SIGMETRICS symposium on Parallel and distributed tools, (88-97)
  97. ACM
    Fu C and Yang T Run-time compilation for parallel sparse matrix computations Proceedings of the 10th international conference on Supercomputing, (237-244)
  98. Palis M, Liou J and Wei D (1996). Task Clustering and Scheduling for Distributed Memory Parallel Architectures, IEEE Transactions on Parallel and Distributed Systems, 7:1, (46-55), Online publication date: 1-Jan-1996.
  99. ACM
    Subhlok J and Vondran G (1995). Optimal mapping of sequences of data parallel tasks, ACM SIGPLAN Notices, 30:8, (134-143), Online publication date: 1-Aug-1995.
  100. ACM
    Subhlok J and Vondran G Optimal mapping of sequences of data parallel tasks Proceedings of the fifth ACM SIGPLAN symposium on Principles and practice of parallel programming, (134-143)
  101. ACM
    Löwe W and Zimmermann W Upper time bounds for executing PRAM-programs on the LogP-machine Proceedings of the 9th international conference on Supercomputing, (41-50)
  102. Roy K and McCrosky C Transformation of functional specifications of finite difference methods to parallel distributed codes Proceedings of the IFIP WG10.3 working conference on Parallel architectures and compilation techniques, (268-272)
  103. Engelhardt D and Wendelborn A A partitioning-independent paradigm for nested data parallelism Proceedings of the IFIP WG10.3 working conference on Parallel architectures and compilation techniques, (224-233)
  104. Subhlok J, O'Hallaron D, Gross T, Dinda P and Webb J Communication and memory requirements as the basis for mapping task and data parallel programs Proceedings of the 1994 ACM/IEEE conference on Supercomputing, (330-339)
  105. Gross T, O'Hallaron D and Subhlok J (1994). Task Parallelism in a High Performance Fortran Framework, IEEE Parallel & Distributed Technology: Systems & Technology, 2:3, (16-26), Online publication date: 1-Sep-1994.
  106. ACM
    Wang K Precise compile-time performance prediction for superscalar-based computers Proceedings of the ACM SIGPLAN 1994 conference on Programming language design and implementation, (73-84)
  107. ACM
    van Gemund A Compiling performance models from parallel programs Proceedings of the 8th international conference on Supercomputing, (303-312)
  108. ACM
    Wang K (1994). Precise compile-time performance prediction for superscalar-based computers, ACM SIGPLAN Notices, 29:6, (73-84), Online publication date: 1-Jun-1994.
  109. Alverson G and Notkin D (1993). Program Structuring for Effective Parallel Portability, IEEE Transactions on Parallel and Distributed Systems, 4:9, (1041-1059), Online publication date: 1-Sep-1993.
  110. Tick E and Zhong X (1993). A compile-time granularity analysis algorithm and its performance evaluation, New Generation Computing, 11:3-4, (271-295), Online publication date: 1-Sep-1993.
  111. ACM
    van Gemund A Performance prediction of parallel processing systems Proceedings of the 7th international conference on Supercomputing, (318-327)
  112. ACM
    Fahringer T and Zima H A static parameter based performance prediction tool for parallel programs Proceedings of the 7th international conference on Supercomputing, (207-219)
  113. Gupta R and De Micheli G (1993). Hardware-Software Cosynthesis for Digital Systems, IEEE Design & Test, 10:3, (29-41), Online publication date: 1-Jul-1993.
  114. Sih G and Lee E (1993). Declustering, IEEE Transactions on Parallel and Distributed Systems, 4:6, (625-637), Online publication date: 1-Jun-1993.
  115. Gerasoulis A and Yang T (1993). On the Granularity and Clustering of Directed Acyclic Task Graphs, IEEE Transactions on Parallel and Distributed Systems, 4:6, (686-701), Online publication date: 1-Jun-1993.
  116. Sih G and Lee E (1993). A Compile-Time Scheduling Heuristic for Interconnection-Constrained Heterogeneous Processor Architectures, IEEE Transactions on Parallel and Distributed Systems, 4:2, (175-187), Online publication date: 1-Feb-1993.
  117. ACM
    Yang T and Gerasoulis A PYRROS Proceedings of the 6th international conference on Supercomputing, (428-437)
  118. ACM
    Lee Y and Ryder B A comprehensive approach to parallel data flow analysis Proceedings of the 6th international conference on Supercomputing, (236-247)
  119. ACM
    McCreary C and Gill D Automatic partitioning and virtual scheduling for efficient parallel execution Proceedings of the 30th annual Southeast regional conference, (29-36)
  120. Hassan K and Samadzadeh M An object-oriented environment for simulation and evaluation of architectures Proceedings of the 25th annual symposium on Simulation, (91-97)
  121. ACM
    Yang T and Gerasoulis A A fast static scheduling algorithm for DAGs on an unbounded number of processors Proceedings of the 1991 ACM/IEEE conference on Supercomputing, (633-642)
  122. ACM
    Hall M, Kennedy K and McKinley K Interprocedural transformations for parallel code generation Proceedings of the 1991 ACM/IEEE conference on Supercomputing, (424-434)
  123. ACM
    Ramanujam J and Sadayappan P Tiling multidimensional iteration spaces for nonshared memory machines Proceedings of the 1991 ACM/IEEE conference on Supercomputing, (111-120)
  124. ACM
    Gerasoulis A, Venugopal S and Yang T (1990). Clustering task graphs for message passing architectures, ACM SIGARCH Computer Architecture News, 18:3b, (447-456), Online publication date: 1-Sep-1990.
  125. ACM
    Sarkar V (1990). Instruction reordering for fork-join parallelism, ACM SIGPLAN Notices, 25:6, (322-336), Online publication date: 1-Jun-1990.
  126. ACM
    Debray S, Lin N and Hermnegildo M (1990). Task granularity analysis in logic programs, ACM SIGPLAN Notices, 25:6, (174-188), Online publication date: 1-Jun-1990.
  127. ACM
    Sarkar V Instruction reordering for fork-join parallelism Proceedings of the ACM SIGPLAN 1990 conference on Programming language design and implementation, (322-336)
  128. ACM
    Debray S, Lin N and Hermnegildo M Task granularity analysis in logic programs Proceedings of the ACM SIGPLAN 1990 conference on Programming language design and implementation, (174-188)
  129. ACM
    Gerasoulis A, Venugopal S and Yang T Clustering task graphs for message passing architectures Proceedings of the 4th international conference on Supercomputing, (447-456)
  130. ACM
    Simons B, Alpern D and Ferrante J A foundation for sequentializing parallel code Proceedings of the second annual ACM symposium on Parallel algorithms and architectures, (350-359)
  131. ACM
    Sarkar V (1989). Determining average program execution times and their variance, ACM SIGPLAN Notices, 24:7, (298-312), Online publication date: 1-Jul-1989.
  132. ACM
    Cytron R, Hind M and Hsieh W (1989). Automatic generation of DAG parallelism, ACM SIGPLAN Notices, 24:7, (54-68), Online publication date: 1-Jul-1989.
  133. ACM
    Sarkar V Determining average program execution times and their variance Proceedings of the ACM SIGPLAN 1989 conference on Programming language design and implementation, (298-312)
  134. ACM
    Cytron R, Hind M and Hsieh W Automatic generation of DAG parallelism Proceedings of the ACM SIGPLAN 1989 conference on Programming language design and implementation, (54-68)
Contributors
  • Georgia Institute of Technology

Recommendations