Optimizing programs by data and control transformations

March 1998

Author:
Michael Jan Cierniak

Publisher:

University of Rochester
Dept. of Computer Science Rochester, NY
United States

Order Number:UMI Order No. GAX98-08872

Bibliometrics

Abstract

This dissertation presents new techniques designed to speed up the execution of computer programs by improving their memory locality. Locality is an important property for today's machines, because it hides the relatively high latency of computer memories.

Our techniques change the layout of multidimensional arrays by applying data transformations. We unify data transformations with code transformations which change the order of execution of loop nests. We solve related problems which would have been obstacles to the practical use of our techniques: we show how to detect and reduce array overlapping and how to recover structure from linearized arrays. Our optimizations reduce the execution times of sequential, scientific benchmarks by up to 50% over what is possible with previous techniques. Parallel programs are improved by as much as a factor of four.

In addition to implementing our techniques in a standard, off-line, compiler, we adapt our optimizations to Just-In-Time (JIT) compilation. The JIT translation becomes very important with the increasing popularity of mobile technologies such as Java. We argue that new, faster algorithms are needed in that context. We propose a collection of fast, approximate compiler techniques for data transformations and show that they are effective for Java programs.

Cited By

Contributors

Michał Jan Cierniak
Microsoft Corporation
- Publication Years1994 - 2005
- Publication counts23
- Citation count1,196
- Available for Download12
- Downloads (cumulative)12,230
- Downloads (12 months)835
- Downloads (6 weeks)166
- Average Downloads per Article1,019
- Average Citation per Article52
View Full Profile

Index Terms

Optimizing programs by data and control transformations
1. Software and its engineering
  1. Software notations and tools
    1. Compilers

Recommendations

An Iteration Partition Approach for Cache or Local Memory Thrashing on Parallel Processing

Parallel processing systems with cache or local memory in the memory hierarchies are considered. These systems have a local cache memory in each processor and usually employ a write-invalidate protocol for the cache coherence. In such systems, a problem ...
Read More
An algorithm to automate non-unimodular transformations of loop nests
SPDP '93: Proceedings of the 1993 5th IEEE Symposium on Parallel and Distributed Processing

This paper provides a solution to the open problem of automatic rewriting loop nests for non-unimodular transformations.We present an algorithm that rewrites a loop nest under any non-singular (unimodular or non-unimodular) transformation. The algorithm ...
Read More
Precise Data Locality Optimization of Nested Loops

A significant source for enhancing application performance and for reducing power consumption in embedded processor applications is to improve the usage of the memory hierarchy. In this paper, a temporal and spatial locality optimization framework of ...
Read More

Comments

Browse Theses

Sections

Cited By

Index Terms

An Iteration Partition Approach for Cache or Local Memory Thrashing on Parallel Processing

An algorithm to automate non-unimodular transformations of loop nests

Precise Data Locality Optimization of Nested Loops

Sections

Cited By

Save to Binder

Index Terms

Recommendations

An Iteration Partition Approach for Cache or Local Memory Thrashing on Parallel Processing

An algorithm to automate non-unimodular transformations of loop nests

Precise Data Locality Optimization of Nested Loops