Compiler transformations for effectively exploiting a zero overhead loop buffer
A Zero Overhead Loop Buffer (ZOLB) is an architectural feature that is commonly found in DSP processors. This buffer can be viewed as a compiler managed cache that contains a sequence of instructions that will be executed a specified number of times without incurring any loop overhead. Unlike loop u...
Published in: | Software, practice & experience, 2005-04, Vol.35 (4), p.393-412 |
---|---|
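Main authors: | Uh, Gang-Ryung ; Wang, Yuhong ; Whalley, David ; Jinturkar, Sanjay ; Paek, Yunheung ; Cao, Vincent ; Burns, Chris |
Format: | Article |
Language: | eng |
Subjects: | compiler ; DSP ; optimization ; ZOLB |
Online access: | Full text |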
container_end_page | 412 |
---|---|
container_issue | 4 |
container_start_page | 393 |
container_title | Software, practice & experience |
container_volume | 35 |
creator | Uh, Gang-Ryung ; Wang, Yuhong ; Whalley, David ; Jinturkar, Sanjay ; Paek, Yunheung ; Cao, Vincent ; Burns, Chris |
description | A Zero Overhead Loop Buffer (ZOLB) is an architectural feature that is commonly found in DSP processors. This buffer can be viewed as a compiler managed cache that contains a sequence of instructions that will be executed a specified number of times without incurring any loop overhead. Unlike loop unrolling, a loop buffer can be used to minimize loop overhead without the penalty of increasing code size. In addition, a ZOLB requires relatively little space and power, which are both important considerations for most DSP applications. This paper describes strategies for generating code to effectively use a ZOLB. We have found that many common code improving transformations used by optimizing compilers on conventional architectures can be easily used to (1) allow more loops to be placed in a ZOLB, (2) further reduce loop overhead of the loops placed in a ZOLB, and (3) avoid redundant loading of ZOLB loops. The results given in this paper demonstrate that this architectural feature can often be exploited with substantial improvements in execution time and slight reductions in code size for various signal processing applications. Copyright © 2004 John Wiley & Sons, Ltd. |
doi_str_mv | 10.1002/spe.642 |
format | Article |
date | 2005-04-10 |
cop | Chichester, UK |
pub | John Wiley & Sons, Ltd |
rights | Copyright © 2004 John Wiley & Sons, Ltd. |
tpages | 20 |
toplevel | peer_reviewed ; online_resources |
collection | Istex ; CrossRef ; Technology Research Database ; ANTE: Abstracts in New Technology & Engineering ; Engineering Research Database |
linktopdf | https://onlinelibrary.wiley.com/doi/pdf/10.1002%2Fspe.642 |
linktohtml | https://onlinelibrary.wiley.com/doi/full/10.1002%2Fspe.642 |
fulltext | fulltext |
identifier | ISSN: 0038-0644 |
ispartof | Software, practice & experience, 2005-04, Vol.35 (4), p.393-412 |
issn | 0038-0644 ; 1097-024X |
language | eng |
recordid | cdi_proquest_miscellaneous_28528990 |
source | Wiley Online Library Journals Frontfile Complete |
subjects | compiler ; DSP ; optimization ; ZOLB |
title | Compiler transformations for effectively exploiting a zero overhead loop buffer |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-15T06%3A17%3A14IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Compiler%20transformations%20for%20effectively%20exploiting%20a%20zero%20overhead%20loop%20buffer&rft.jtitle=Software,%20practice%20&%20experience&rft.au=Uh,%20Gang-Ryung&rft.date=2005-04-10&rft.volume=35&rft.issue=4&rft.spage=393&rft.epage=412&rft.pages=393-412&rft.issn=0038-0644&rft.eissn=1097-024X&rft_id=info:doi/10.1002/spe.642&rft_dat=%3Cproquest_cross%3E28528990%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=28528990&rft_id=info:pmid/&rfr_iscdi=true |
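
The abstract contrasts a ZOLB with loop unrolling: both reduce loop overhead, but only unrolling increases code size. The sketch below is a toy cost model of that trade-off, not the paper's method. The `Loop` type, the constants `CONVENTIONAL_OVERHEAD` and `ZOLB_SETUP`, and all function names are hypothetical, and the instruction counts are illustrative assumptions rather than figures from any real DSP.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Loop:
    body_instructions: int  # instructions doing useful work per iteration
    iterations: int         # trip count, assumed known before loop entry


# Assumed per-iteration overhead of a conventional loop:
# counter update + compare + conditional branch.
CONVENTIONAL_OVERHEAD = 3
# Assumed one-time cost of loading a loop into the buffer.
ZOLB_SETUP = 1


def conventional_cost(loop: Loop) -> int:
    """Executed instructions when every iteration pays the loop overhead."""
    return loop.iterations * (loop.body_instructions + CONVENTIONAL_OVERHEAD)


def zolb_cost(loop: Loop) -> int:
    """Executed instructions when the body repeats from the buffer."""
    return ZOLB_SETUP + loop.iterations * loop.body_instructions


def unrolled_code_size(loop: Loop, factor: int) -> int:
    """Static size after unrolling: `factor` body copies plus one overhead."""
    return factor * loop.body_instructions + CONVENTIONAL_OVERHEAD


def zolb_code_size(loop: Loop) -> int:
    """Static size with a ZOLB: one body copy plus the setup instruction."""
    return loop.body_instructions + ZOLB_SETUP


if __name__ == "__main__":
    loop = Loop(body_instructions=4, iterations=100)
    print("conventional:", conventional_cost(loop))
    print("zolb:", zolb_cost(loop))
    print("unrolled x4 size:", unrolled_code_size(loop, 4))
    print("zolb size:", zolb_code_size(loop))
```

Under these assumed constants, the buffered loop executes fewer instructions than the conventional one while its static size stays close to a single copy of the body, whereas the unrolled size grows with the unroll factor.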