Article

The Vector-Thread Architecture

Authors:
Ronny Krashinsky

MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA

MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA
View Profile

,
Christopher Batten

MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA

MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA
View Profile

,
Mark Hampton

MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA

MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA
View Profile

,
Steve Gerding

MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA

MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA
View Profile

,
Brian Pharris

MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA

MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA
View Profile

,
Jared Casper

MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA

MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA
View Profile

,
Krste Asanovic

MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA

MIT Computer Science and Artificial Intelligence Laboratory, Cambridge, MA
View Profile

Authors Info & Claims

ISCA '04: Proceedings of the 31st annual international symposium on Computer architectureJune 2004

Published:02 March 2004Publication History

ISCA '04: Proceedings of the 31st annual international symposium on Computer architecture

ABSTRACT

The vector-thread (VT) architectural paradigm unifies the vectorand multithreaded compute models. The VT abstraction providesthe programmer with a control processor and a vector of virtualprocessors (VPs). The control processor can use vector-fetch commandsto broadcast instructions to all the VPs or each VP can usethread-fetches to direct its own control flow. A seamless intermixingof the vector and threaded control mechanisms allows a VT architectureto flexibly and compactly encode application parallelismand locality, and a VT machine exploits these to improve performanceand efficiency. We present SCALE, an instantiation of theVT architecture designed for low-power and high-performance embeddedsystems. We evaluate the SCALE prototype design usingdetailed simulation of a broad range of embedded applications andshow that its performance is competitive with larger and more complexprocessors.

References

{1} T.-C. Chiueh. Multi-threaded vectorization. In ISCA-18, May 1991. Google ScholarDigital Library
{2} C. R. Jesshope. Implementing an efficient vector instruction set in a chip multi-processor using micro-threaded pipelines. Australia Computer Science Communications, 23(4):80-88, 2001. Google ScholarDigital Library
{3} K. Kitagawa, S. Tagaya, Y. Hagihara, and Y. Kanoh. A hardware overview of SX-6 and SX-7 supercomputer. NEC Research & Development Journal, 44(1):2-7, Jan 2003.Google Scholar
{4} C. Kozyrakis. Scalable vector media-processors for embedded systems. PhD thesis, University of California at Berkeley, May 2002. Google ScholarDigital Library
{5} C. Kozyrakis and D. Patterson. Overcoming the limitations of conventional vector processors. In ISCA-30, June 2003. Google ScholarDigital Library
{6} C. Kozyrakis, S. Perissakis, D. Patterson, T. Anderson, K. Asanovi¿, N. Cardwell, R. Fromm, J. Golbus, B. Gribstad, K. Keeton, R. Thomas, N. Treuhaft, and K. Yelick. Scalable Processors in the Billion-Transistor Era: IRAM. IEEE Computer, 30(9):75-78, Sept 1997. Google ScholarDigital Library
{7} K. Mai, T. Paaske, N. Jayasena, R. Ho, W. Dally, and M. Horowitz. Smart Memories: A modular reconfigurable architecture. In Proc. ISCA 27, pages 161-171, June 2000. Google ScholarDigital Library
{8} S. Rixner, W. Dally, U. Kapasi, B. Khailany, A. Lopez-Lagunas, P. Mattson, and J. Owens. A bandwidth-efficient architecture for media processing. In MICRO-31, Nov 1998. Google ScholarDigital Library
{9} R. M. Russel. The CRAY-1 computer system. Communications of the ACM, 21(1):63-72, Jan 1978. Google ScholarDigital Library
{10} K. Sankaralingam, R. Nagarajan, H. Liu, C. Kim, J. Huh, D. Burger, S. W. Keckler, and C. Moore. Exploiting ILP, TLP, and DLP with the polymorphous TRIPS architecture. In ISCA-30, June 2003. Google ScholarDigital Library
{11} J. E. Smith. Dynamic instruction scheduling and the Astronautics ZS-1. IEEE Computer, 22(7):21-35, July 1989. Google ScholarDigital Library
{12} G. S. Sohi, S. E. Breach, and T. N. Vijaykumar. Multiscalar processors. In ISCA-22, pages 414-425, June 1995. Google ScholarDigital Library
{13} E. Waingold, M. Taylor, D. Srikrishna, V. Sarkar, W. Lee, V. Lee, J. Kim, M. Frank, P. Finch, R. Barua, J. Babb, S. Amarasinghe, and A. Agarwal. Baring it all to software: Raw machines. IEEE Computer, 30(9):86-93, Sept 1997. Google ScholarDigital Library
{14} J. Wawrzynek, K. Asanovi¿, B. Kingsbury, J. Beck, D. Johnson, and N. Morgan. Spert-II: A vector microprocessor system. IEEE Computer, 29(3):79-86, Mar 1996. Google ScholarDigital Library
{15} M. Zhang and K. Asanovi¿. Highly-associative caches for low-power processors. In Kool Chips Workshop, MICRO-33, Dec 2000.Google Scholar

Recommendations

The Vector-Thread Architecture
ISCA 2004

The vector-thread (VT) architectural paradigm unifies the vectorand multithreaded compute models. The VT abstraction providesthe programmer with a control processor and a vector of virtualprocessors (VPs). The control processor can use vector-fetch ...
Read More
High-Performance and Low-Cost Dual-Thread VLIW Processor Using Weld Architecture Paradigm

This paper presents a cost-effective and high-performance dual-thread VLIW processor model. The dual-thread VLIW processor model is a low-cost subset of the Weld architecture paradigm. It supports one main thread and one speculative thread running ...
Read More
Vector-thread architecture and implementation
Read More

Comments

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

Published in
ISCA '04: Proceedings of the 31st annual international symposium on Computer architecture
June 2004
373 pages
ISBN:0769521436
ACM SIGARCH Computer Architecture News Volume 32, Issue 2
ISCA 2004
March 2004
373 pages
ISSN:0163-5964
DOI:10.1145/1028176
Issue’s Table of Contents
Sponsors
In-Cooperation
Publisher
IEEE Computer Society
United States
Publication History
- Published: 2 March 2004
Check for updates
Qualifiers
- Article
Conference

Acceptance Rates
ISCA '04 Paper Acceptance Rate31of217submissions,14%Overall Acceptance Rate543of3,203submissions,17%
More
Upcoming Conference
ISCA '24

Sponsor:

sigarch

ISCA '24: The 51st Annual International Symposium on Computer Architecture

June 29 - July 3, 2024

Buenos Aires , Argentina
Funding Sources
Other Metrics
View Article Metrics

Article Metrics
- 75
  Total Citations
  View Citations
- 1,120
  Total Downloads
- Downloads (Last 12 months)41
- Downloads (Last 6 weeks)5
Other Metrics
View Author Metrics
Cited By
View all

PDF Format

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

The Vector-Thread Architecture

ISCA '04: Proceedings of the 31st annual international symposium on Computer architecture

ABSTRACT

References

Cited By

Recommendations

The Vector-Thread Architecture

High-Performance and Low-Cost Dual-Thread VLIW Processor Using Weld Architecture Paradigm

Vector-thread architecture and implementation