Parallel loop transformation techniques for vector-based multiprocessor systems

November 1995

Author:
Viet Nhu Ngo
Univ. of Minnesota

Publisher:

University of Minnesota
Computer Science Dept. 136 Lind Hall 207 Church Street Minneapolis, MN
United States

Order Number:UMI Order No. GAX94-33091

Bibliometrics

Abstract

A parallel vector processor (PVP) supercomputer relies on parallel processing and pipelines to deliver its enormous computational rate. Loop parallelization and vectorization together enable the performance of application programs to approach the theoretical peak rate of a PVP. This dissertation presents a series of powerful compiling techniques that broaden the classes of loops that can be vectorized and parallelized.

A serial loop can be executed in parallel by different processors if it does not have any loop-carried dependencies (LCD). We propose analysis and transformation techniques to detect and/or remove non-inherent LCD which include: geometric dependence testing methods to perform private array analysis, a code generation method to enable DOALL transformation of loops involving conditionally defined privatized variables, a symbolic analysis technique which in the presence of control constructs allows a compiler to remove false dependencies from irrelevant flow paths.

Loop vectorization has been found to be one of the most significant forms of parallelism. However, it has traditionally been limited to innermost loops and outer loops which are made innermost by the application of loop distribution and interchange. We propose a framework for direct vectorization of outer loops (OLV); i.e., vectorization of an outer loop without interchange or distribution. The framework includes: general vector execution modeling, legality of OLV, vector loop selection and an OLV vector code generator.

These techniques have been implemented in the Cray Fortran-90 compiling system. Implementation details are presented. The efficacy of these techniques are demonstrated by significant performance improvement of application programs.

Cited By

Contributors

Vietnhu Ngo
University of Minnesota Twin Cities
- Publication Years1995 - 1995
- Publication counts1
- Citation count6
- Available for Download0
- Downloads (cumulative)0
- Downloads (12 months)0
- Downloads (6 weeks)0
- Average Downloads per Article0
- Average Citation per Article6
View Full Profile

Index Terms

Parallel loop transformation techniques for vector-based multiprocessor systems
1. Computer systems organization
  1. Architectures
    1. Parallel architectures
      1. Systolic arrays

Recommendations

Nest-loop transformation techniques considering timing and memory optimization for embedded systems
Read More
Advanced loop parallelization theory and techniques
Read More
The I Test: An Improved Dependence Test for Automatic Parallelization and Vectorization

The I test is a subscript dependence test which extends both the range of applicability and the accuracy of the GCD and Banerjee tests (U. Banerjee, 1976), standard subscript dependence tests used to determine whether loops may be parallelized/...
Read More

Comments

Browse Theses

Sections

Cited By

Index Terms

Nest-loop transformation techniques considering timing and memory optimization for embedded systems

Advanced loop parallelization theory and techniques

The I Test: An Improved Dependence Test for Automatic Parallelization and Vectorization

Sections

Cited By

Save to Binder

Index Terms

Recommendations

Nest-loop transformation techniques considering timing and memory optimization for embedded systems

Advanced loop parallelization theory and techniques

The I Test: An Improved Dependence Test for Automatic Parallelization and Vectorization