A stream compiler for communication-exposed architectures
With the increasing miniaturization of transistors, wire delays are becoming a dominant factor in microprocessor performance. To address this issue, a number of emerging architectures contain replicated processing units with software-exposed communication between one unit and another (e.g., Raw, Sma...
Gespeichert in:
Veröffentlicht in: | Computer architecture news 2002-12, Vol.30 (5), p.291-303 |
---|---|
Hauptverfasser: | , , , , , , , , , , |
Format: | Artikel |
Sprache: | eng |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 303 |
---|---|
container_issue | 5 |
container_start_page | 291 |
container_title | Computer architecture news |
container_volume | 30 |
creator | Gordon, Michael I. Thies, William Karczmarek, Michal Lin, Jasper Meli, Ali S. Lamb, Andrew A. Leger, Chris Wong, Jeremy Hoffmann, Henry Maze, David Amarasinghe, Saman |
description | With the increasing miniaturization of transistors, wire delays are becoming a dominant factor in microprocessor performance. To address this issue, a number of emerging architectures contain replicated processing units with software-exposed communication between one unit and another (e.g., Raw, SmartMemories, TRIPS). However, for their use to be widespread, it will be necessary to develop compiler technology that enables a portable, high-level language to execute efficiently across a range of wire-exposed architectures.In this paper, we describe our compiler for StreamIt: a high-level, architecture-independent language for streaming applications. We focus on our backend for the Raw processor. Though StreamIt exposes the parallelism and communication patterns of stream programs, some analysis is needed to adapt a stream program to a software-exposed processor. We describe a partitioning algorithm that employs fission and fusion transformations to adjust the granularity of a stream graph, a layout algorithm that maps a stream graph to a given network topology, and a scheduling strategy that generates a fine-grained static communication pattern for each computational element.We have implemented a fully functional compiler that parallelizes StreamIt applications for Raw, including several load-balancing transformations. Using the cycle-accurate Raw simulator, we demonstrate that the StreamIt compiler can automatically map a high-level stream abstraction to Raw without losing performance. We consider this work to be a first step towards a portable programming model for communication-exposed architectures. |
doi_str_mv | 10.1145/635506.605428 |
format | Article |
fullrecord | <record><control><sourceid>crossref</sourceid><recordid>TN_cdi_crossref_primary_10_1145_635506_605428</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_1145_635506_605428</sourcerecordid><originalsourceid>FETCH-LOGICAL-c828-5c7e55d1adb698556921d9e3e5a4bbffb6fe069b32b29b8de752a7a8e0cc53f03</originalsourceid><addsrcrecordid>eNotz0tLAzEYheEsFKzVpfv5A6m5zJdJlqV4g0I33YdcvmCk0wzJFPTfaxlXh3dz4CHkibMN5z08KwnA1EYx6IW-ISvGlaRgVH9H7lv7Yn89SLYiZtu1uaIbu1DGKZ-wdqnUa4yXcw5uzuVM8XsqDWPnavjMM4b5UrE9kNvkTg0f_3dNjq8vx9073R_ePnbbPQ1aaAphQIDIXfTKaABlBI8GJYLrvU_Jq4RMGS-FF8briAMINziNLASQick1octtqKW1islONY-u_ljO7JVqF6pdqPIX_SZJNA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>A stream compiler for communication-exposed architectures</title><source>ACM Digital Library Complete</source><creator>Gordon, Michael I. ; Thies, William ; Karczmarek, Michal ; Lin, Jasper ; Meli, Ali S. ; Lamb, Andrew A. ; Leger, Chris ; Wong, Jeremy ; Hoffmann, Henry ; Maze, David ; Amarasinghe, Saman</creator><creatorcontrib>Gordon, Michael I. ; Thies, William ; Karczmarek, Michal ; Lin, Jasper ; Meli, Ali S. ; Lamb, Andrew A. ; Leger, Chris ; Wong, Jeremy ; Hoffmann, Henry ; Maze, David ; Amarasinghe, Saman</creatorcontrib><description>With the increasing miniaturization of transistors, wire delays are becoming a dominant factor in microprocessor performance. To address this issue, a number of emerging architectures contain replicated processing units with software-exposed communication between one unit and another (e.g., Raw, SmartMemories, TRIPS). However, for their use to be widespread, it will be necessary to develop compiler technology that enables a portable, high-level language to execute efficiently across a range of wire-exposed architectures.In this paper, we describe our compiler for StreamIt: a high-level, architecture-independent language for streaming applications. We focus on our backend for the Raw processor. Though StreamIt exposes the parallelism and communication patterns of stream programs, some analysis is needed to adapt a stream program to a software-exposed processor. We describe a partitioning algorithm that employs fission and fusion transformations to adjust the granularity of a stream graph, a layout algorithm that maps a stream graph to a given network topology, and a scheduling strategy that generates a fine-grained static communication pattern for each computational element.We have implemented a fully functional compiler that parallelizes StreamIt applications for Raw, including several load-balancing transformations. Using the cycle-accurate Raw simulator, we demonstrate that the StreamIt compiler can automatically map a high-level stream abstraction to Raw without losing performance. We consider this work to be a first step towards a portable programming model for communication-exposed architectures.</description><identifier>ISSN: 0163-5964</identifier><identifier>DOI: 10.1145/635506.605428</identifier><language>eng</language><ispartof>Computer architecture news, 2002-12, Vol.30 (5), p.291-303</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c828-5c7e55d1adb698556921d9e3e5a4bbffb6fe069b32b29b8de752a7a8e0cc53f03</citedby><cites>FETCH-LOGICAL-c828-5c7e55d1adb698556921d9e3e5a4bbffb6fe069b32b29b8de752a7a8e0cc53f03</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,778,782,27913,27914</link.rule.ids></links><search><creatorcontrib>Gordon, Michael I.</creatorcontrib><creatorcontrib>Thies, William</creatorcontrib><creatorcontrib>Karczmarek, Michal</creatorcontrib><creatorcontrib>Lin, Jasper</creatorcontrib><creatorcontrib>Meli, Ali S.</creatorcontrib><creatorcontrib>Lamb, Andrew A.</creatorcontrib><creatorcontrib>Leger, Chris</creatorcontrib><creatorcontrib>Wong, Jeremy</creatorcontrib><creatorcontrib>Hoffmann, Henry</creatorcontrib><creatorcontrib>Maze, David</creatorcontrib><creatorcontrib>Amarasinghe, Saman</creatorcontrib><title>A stream compiler for communication-exposed architectures</title><title>Computer architecture news</title><description>With the increasing miniaturization of transistors, wire delays are becoming a dominant factor in microprocessor performance. To address this issue, a number of emerging architectures contain replicated processing units with software-exposed communication between one unit and another (e.g., Raw, SmartMemories, TRIPS). However, for their use to be widespread, it will be necessary to develop compiler technology that enables a portable, high-level language to execute efficiently across a range of wire-exposed architectures.In this paper, we describe our compiler for StreamIt: a high-level, architecture-independent language for streaming applications. We focus on our backend for the Raw processor. Though StreamIt exposes the parallelism and communication patterns of stream programs, some analysis is needed to adapt a stream program to a software-exposed processor. We describe a partitioning algorithm that employs fission and fusion transformations to adjust the granularity of a stream graph, a layout algorithm that maps a stream graph to a given network topology, and a scheduling strategy that generates a fine-grained static communication pattern for each computational element.We have implemented a fully functional compiler that parallelizes StreamIt applications for Raw, including several load-balancing transformations. Using the cycle-accurate Raw simulator, we demonstrate that the StreamIt compiler can automatically map a high-level stream abstraction to Raw without losing performance. We consider this work to be a first step towards a portable programming model for communication-exposed architectures.</description><issn>0163-5964</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2002</creationdate><recordtype>article</recordtype><recordid>eNotz0tLAzEYheEsFKzVpfv5A6m5zJdJlqV4g0I33YdcvmCk0wzJFPTfaxlXh3dz4CHkibMN5z08KwnA1EYx6IW-ISvGlaRgVH9H7lv7Yn89SLYiZtu1uaIbu1DGKZ-wdqnUa4yXcw5uzuVM8XsqDWPnavjMM4b5UrE9kNvkTg0f_3dNjq8vx9073R_ePnbbPQ1aaAphQIDIXfTKaABlBI8GJYLrvU_Jq4RMGS-FF8briAMINziNLASQick1octtqKW1islONY-u_ljO7JVqF6pdqPIX_SZJNA</recordid><startdate>200212</startdate><enddate>200212</enddate><creator>Gordon, Michael I.</creator><creator>Thies, William</creator><creator>Karczmarek, Michal</creator><creator>Lin, Jasper</creator><creator>Meli, Ali S.</creator><creator>Lamb, Andrew A.</creator><creator>Leger, Chris</creator><creator>Wong, Jeremy</creator><creator>Hoffmann, Henry</creator><creator>Maze, David</creator><creator>Amarasinghe, Saman</creator><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>200212</creationdate><title>A stream compiler for communication-exposed architectures</title><author>Gordon, Michael I. ; Thies, William ; Karczmarek, Michal ; Lin, Jasper ; Meli, Ali S. ; Lamb, Andrew A. ; Leger, Chris ; Wong, Jeremy ; Hoffmann, Henry ; Maze, David ; Amarasinghe, Saman</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c828-5c7e55d1adb698556921d9e3e5a4bbffb6fe069b32b29b8de752a7a8e0cc53f03</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2002</creationdate><toplevel>online_resources</toplevel><creatorcontrib>Gordon, Michael I.</creatorcontrib><creatorcontrib>Thies, William</creatorcontrib><creatorcontrib>Karczmarek, Michal</creatorcontrib><creatorcontrib>Lin, Jasper</creatorcontrib><creatorcontrib>Meli, Ali S.</creatorcontrib><creatorcontrib>Lamb, Andrew A.</creatorcontrib><creatorcontrib>Leger, Chris</creatorcontrib><creatorcontrib>Wong, Jeremy</creatorcontrib><creatorcontrib>Hoffmann, Henry</creatorcontrib><creatorcontrib>Maze, David</creatorcontrib><creatorcontrib>Amarasinghe, Saman</creatorcontrib><collection>CrossRef</collection><jtitle>Computer architecture news</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Gordon, Michael I.</au><au>Thies, William</au><au>Karczmarek, Michal</au><au>Lin, Jasper</au><au>Meli, Ali S.</au><au>Lamb, Andrew A.</au><au>Leger, Chris</au><au>Wong, Jeremy</au><au>Hoffmann, Henry</au><au>Maze, David</au><au>Amarasinghe, Saman</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A stream compiler for communication-exposed architectures</atitle><jtitle>Computer architecture news</jtitle><date>2002-12</date><risdate>2002</risdate><volume>30</volume><issue>5</issue><spage>291</spage><epage>303</epage><pages>291-303</pages><issn>0163-5964</issn><abstract>With the increasing miniaturization of transistors, wire delays are becoming a dominant factor in microprocessor performance. To address this issue, a number of emerging architectures contain replicated processing units with software-exposed communication between one unit and another (e.g., Raw, SmartMemories, TRIPS). However, for their use to be widespread, it will be necessary to develop compiler technology that enables a portable, high-level language to execute efficiently across a range of wire-exposed architectures.In this paper, we describe our compiler for StreamIt: a high-level, architecture-independent language for streaming applications. We focus on our backend for the Raw processor. Though StreamIt exposes the parallelism and communication patterns of stream programs, some analysis is needed to adapt a stream program to a software-exposed processor. We describe a partitioning algorithm that employs fission and fusion transformations to adjust the granularity of a stream graph, a layout algorithm that maps a stream graph to a given network topology, and a scheduling strategy that generates a fine-grained static communication pattern for each computational element.We have implemented a fully functional compiler that parallelizes StreamIt applications for Raw, including several load-balancing transformations. Using the cycle-accurate Raw simulator, we demonstrate that the StreamIt compiler can automatically map a high-level stream abstraction to Raw without losing performance. We consider this work to be a first step towards a portable programming model for communication-exposed architectures.</abstract><doi>10.1145/635506.605428</doi><tpages>13</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0163-5964 |
ispartof | Computer architecture news, 2002-12, Vol.30 (5), p.291-303 |
issn | 0163-5964 |
language | eng |
recordid | cdi_crossref_primary_10_1145_635506_605428 |
source | ACM Digital Library Complete |
title | A stream compiler for communication-exposed architectures |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T09%3A23%3A06IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20stream%20compiler%20for%20communication-exposed%20architectures&rft.jtitle=Computer%20architecture%20news&rft.au=Gordon,%20Michael%20I.&rft.date=2002-12&rft.volume=30&rft.issue=5&rft.spage=291&rft.epage=303&rft.pages=291-303&rft.issn=0163-5964&rft_id=info:doi/10.1145/635506.605428&rft_dat=%3Ccrossref%3E10_1145_635506_605428%3C/crossref%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |