A Media-Enhanced Vector Architecture for Embedded Memory Systems

August 1999

1999 Technical Report

Author:
Christoforos Kozyrakis

Publisher:

University of California at Berkeley
Computer Science Division 571 Evans Hall Berkeley, CA
United States

Published: 27 August 1999

Abstract

Next-generation portable devices will require processors with both low energy consumption and high performance for media functions. At the same time, modern CMOS technology creates the need for highly scalable VLSI architectures. Conventional processor architectures fail to meet these requirements. This paper presents the architecture of Vector IRAM (VIRAM), a processor that combines vector processing with embedded DRAM technology. Vector processing achieves high multimedia performance with simple hardware, while embedded DRAM provides high memory bandwidth at low energy consumption. VIRAM provides flexible support for media data types, short vectors, and DSP features. The vector pipeline is enhanced to hide DRAM latency without using caches. The peak performance is 3.2 GFLOPS (single precision) and the maximum memory bandwidth is 25.6 GBytes/s. With a target power consumption of 2 Watts for the vector pipeline and the memory system, VIRAM delivers 1.6 GFLOPS/Watt. For a set of representative media kernels, VIRAM sustains on average 88% of its peak performance, outperforming conventional SIMD media extensions and DSPs by factors of 4.5 to 17. Using a clustered implementation approach, the modular design can be scaled without complicating control logic. We demonstrate that scaling the architecture leads to near-linear application speedup. We also evaluate the effect of scaling the capacity and parallelism of the on-chip memory system on die area and sustained performance.
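
The efficiency figure quoted in the abstract follows directly from the peak performance and the power target. The short Python sketch below reproduces that arithmetic and also derives the peak memory bandwidth available per floating-point operation. The constant and function names are illustrative and do not come from the report, and the bytes-per-FLOP ratio is a derived quantity that the abstract does not state.

```python
# Figures quoted in the abstract (names are illustrative, not from the report).
PEAK_GFLOPS = 3.2                # peak performance, single precision
PEAK_BANDWIDTH_GBYTES_S = 25.6   # maximum memory bandwidth
TARGET_POWER_WATTS = 2.0         # vector pipeline plus memory system


def gflops_per_watt(peak_gflops: float, power_watts: float) -> float:
    """Peak energy efficiency: peak GFLOPS divided by the power target."""
    return peak_gflops / power_watts


def bytes_per_flop(bandwidth_gbytes_s: float, peak_gflops: float) -> float:
    """Peak memory bytes available per floating-point operation.

    Derived ratio, not a figure stated in the abstract.
    """
    return bandwidth_gbytes_s / peak_gflops


if __name__ == "__main__":
    efficiency = gflops_per_watt(PEAK_GFLOPS, TARGET_POWER_WATTS)
    ratio = bytes_per_flop(PEAK_BANDWIDTH_GBYTES_S, PEAK_GFLOPS)
    print(f"Peak efficiency: {efficiency:.1f} GFLOPS/Watt")
    print(f"Peak bandwidth per operation: {ratio:.1f} bytes/FLOP")
```

Both values are peak figures; sustained numbers on real kernels depend on the workload.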

Recommendations

Scalable vector media-processors for embedded systems
Scalable Vector Media-processors for Embedded Systems
A hyperscalar dual-core architecture for embedded systems

This paper proposes a lightweight reconfigurable dual-core architecture for embedded systems, called hyperscalar dual-core architecture. The proposed architecture can play three different roles (a 2-issue statically scheduled superscalar processor, a ...
