Over-provisioned multicore systems

January 2008

Author: Koushik Chakraborty, The University of Wisconsin - Madison
Adviser: Gurindar S. Sohi, The University of Wisconsin - Madison

Publisher: University of Wisconsin at Madison, Engineering Experiment Station, Madison, WI, United States

ISBN: 978-0-549-80420-8

Order Number: AAI3327881

Pages: 187

Abstract

Technology scaling has provided system designers with an exploding transistor budget, far larger than what was available when the core principles behind many existing commodity microprocessors were conceived. This growth also brings a new set of engineering challenges involving power density, thermal efficiency, and related concerns. In particular, the power constraint is becoming a first-order consideration in microprocessor design. In the landscape of general-purpose processors, power-limited designs mark a significant paradigm shift from the area-limited designs of the past.

This dissertation proposes a model to capture the first-order impact of the power constraint. Its metric, the Simultaneously Active Fraction (SAF), represents the fraction of the entire chip's resources that can be active simultaneously, given a target power envelope. Because improvement in the energy efficiency of individual transistor devices lags behind the growth in their integration capacity, the dissertation finds that the SAF decreases monotonically with each successive technology generation.
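
As a rough illustration of the SAF idea, the following Python sketch computes the fraction of a chip that fits within a power budget and tracks it across generations. It is a toy model under stated assumptions, not the dissertation's model: the function names, the fixed budget, and the per-generation scaling factors (`density_growth`, `energy_scaling`) are hypothetical values chosen only to show the trend.

```python
def simultaneously_active_fraction(power_budget_w, full_chip_power_w):
    """Fraction of chip resources that can be active at once within the budget."""
    if power_budget_w < 0 or full_chip_power_w <= 0:
        raise ValueError("budget must be >= 0 and full-chip power must be > 0")
    # The whole chip can be active once the budget covers full-chip power.
    return min(1.0, power_budget_w / full_chip_power_w)


def saf_trend(generations, power_budget_w, base_full_chip_power_w,
              density_growth=2.0, energy_scaling=0.7):
    """SAF per generation under a fixed budget (hypothetical scaling factors)."""
    trend = []
    for g in range(generations):
        # Assumption: full-chip power grows by density_growth * energy_scaling
        # per generation (more transistors, each only somewhat more efficient).
        full_power = base_full_chip_power_w * (density_growth * energy_scaling) ** g
        trend.append(simultaneously_active_fraction(power_budget_w, full_power))
    return trend


# Fixed 100 W budget; a chip that is fully usable in generation 0.
trend = saf_trend(generations=4, power_budget_w=100.0, base_full_chip_power_w=100.0)
```

Whenever the product of the two assumed factors exceeds 1, the computed fraction shrinks from one generation to the next, which matches the qualitative claim above.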

In the context of a rapidly shrinking SAF, this dissertation investigates a novel multicore design paradigm: the Over-provisioned Multicore System (OPMS). An OPMS is a multicore that, by design, provisions more processing cores than can be kept active within its target Thermal Design Power (TDP). Since only a subset of the on-chip cores is active at any given time, this design paradigm affords considerable flexibility in assigning computation to processing cores, enabling many novel techniques within this broad framework.

To demonstrate a concrete application of this framework, the dissertation proposes Computation Spreading (CSP): a new model for distributing the collective work of multithreaded applications. CSP aims to collocate similar computation fragments from different threads on the same core, while distributing dissimilar computation fragments from the same thread across multiple cores. Under CSP, on-chip cores in an OPMS are dynamically specialized by retaining mutually exclusive predictive states. The dissertation demonstrates the effectiveness of CSP in an OPMS through a rigorous evaluation of performance, energy efficiency, and several design trade-offs.
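
The placement rule behind CSP can be sketched in a few lines of Python. This is a minimal illustration only, not the dissertation's mechanism: the `Fragment` type, the fragment kinds (`"parse"`, `"compute"`, `"io"`), the `spread` function, and the first-seen assignment policy are all hypothetical.

```python
from collections import defaultdict
from typing import NamedTuple


class Fragment(NamedTuple):
    thread_id: int
    kind: str  # hypothetical label for a class of similar computation


def spread(fragments, active_cores):
    """Map each fragment kind to one core, regardless of the issuing thread."""
    if not active_cores:
        raise ValueError("at least one active core is required")
    kind_to_core = {}
    schedule = defaultdict(list)
    for frag in fragments:
        if frag.kind not in kind_to_core:
            # Assumption: kinds claim cores in first-seen order; with more
            # kinds than cores, later kinds wrap around and share a core.
            core = active_cores[len(kind_to_core) % len(active_cores)]
            kind_to_core[frag.kind] = core
        schedule[kind_to_core[frag.kind]].append(frag)
    return dict(schedule)


fragments = [
    Fragment(0, "parse"), Fragment(0, "compute"),
    Fragment(1, "parse"), Fragment(1, "compute"),
    Fragment(0, "io"),
]
schedule = spread(fragments, active_cores=[0, 2, 5])
```

In this example both threads' `"parse"` fragments land on core 0, while thread 0's three fragment kinds are spread across cores 0, 2, and 5, so each core only sees one kind of work.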
