A high performance router with dynamic buffer allocation for on-chip interconnect networks

With the number of processor cores increasing in chip multi-processors (CMPs) and global wire delays increasing, networks on chip have been gaining wide acceptance for on-chip inter-core communication. This paper introduces a low latency Dynamic Virtual Output Queues Router (DVOQR), which can reduce...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Shubo Qi, Minxuan Zhang, Jinwen Li, Tianlei Zhao, Chengyi Zhang, Shaoqing Li
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 467
container_issue
container_start_page 462
container_title
container_volume
creator Shubo Qi
Minxuan Zhang
Jinwen Li
Tianlei Zhao
Chengyi Zhang
Shaoqing Li
description With the number of processor cores increasing in chip multi-processors (CMPs) and global wire delays increasing, networks on chip have been gaining wide acceptance for on-chip inter-core communication. This paper introduces a low latency Dynamic Virtual Output Queues Router (DVOQR), which can reduce the router latency to two cycles by leveraging look-ahead routing computation and virtual output address queues scheme. Simulation results show that network throughput on a 4×4 mesh increases by up to 46.9% and 28.6%, compared to wormhole router and virtual channel router, and that DVOQR outperforms doubled buffer virtual channel router by 1.9% under same input speedup. Network zero-load-latency also decreases by 25.6% and 41% respectively under random traffic. The results with place and route used by Cadence Encounter in TSMC 65nm technology display that the frequency of DVOQR can reach 1.4 GHz, the cell area of the router is only 0.424mm 2 and the power consumption is 274 mw under the 50% injection rate.
doi_str_mv 10.1109/ICCD.2010.5647657
format Conference Proceeding
fullrecord <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_5647657</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>5647657</ieee_id><sourcerecordid>5647657</sourcerecordid><originalsourceid>FETCH-LOGICAL-i175t-c4ae760eb40d43cfe8a42f020de1db5364276d90a073b5967ab3f2f1d85f95f03</originalsourceid><addsrcrecordid>eNpFUElOwzAUNZNEWjgAYuMLpHzP8bIKU6VKbGDDpnIcmxgau3JSVb09kajE6umNi4fQHYEFIaAfVnX9uKAwUSG5kkKdoRnhlPNKMwHnqKBCyVJqLS_-DakvUUFAslJy4NdoNgzfAFAxogr0ucRd-OrwzmWfcm-idTin_egyPoSxw-0xmj5Y3Oy9nzSz3SZrxpAinuI4xdJ2YYdDnAo2xejsiKMbDyn_DDfoypvt4G5POEcfz0_v9Wu5fntZ1ct1GYgSY2m5cUqCazi0nFnvKsOpBwqtI20jmORUyVaDAcUaoaUyDfPUk7YSXgsPbI7u_3aDc26zy6E3-bg5HcR-AaHQVzU</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>A high performance router with dynamic buffer allocation for on-chip interconnect networks</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Shubo Qi ; Minxuan Zhang ; Jinwen Li ; Tianlei Zhao ; Chengyi Zhang ; Shaoqing Li</creator><creatorcontrib>Shubo Qi ; Minxuan Zhang ; Jinwen Li ; Tianlei Zhao ; Chengyi Zhang ; Shaoqing Li</creatorcontrib><description>With the number of processor cores increasing in chip multi-processors (CMPs) and global wire delays increasing, networks on chip have been gaining wide acceptance for on-chip inter-core communication. This paper introduces a low latency Dynamic Virtual Output Queues Router (DVOQR), which can reduce the router latency to two cycles by leveraging look-ahead routing computation and virtual output address queues scheme. Simulation results show that network throughput on a 4×4 mesh increases by up to 46.9% and 28.6%, compared to wormhole router and virtual channel router, and that DVOQR outperforms doubled buffer virtual channel router by 1.9% under same input speedup. Network zero-load-latency also decreases by 25.6% and 41% respectively under random traffic. The results with place and route used by Cadence Encounter in TSMC 65nm technology display that the frequency of DVOQR can reach 1.4 GHz, the cell area of the router is only 0.424mm 2 and the power consumption is 274 mw under the 50% injection rate.</description><identifier>ISSN: 1063-6404</identifier><identifier>ISBN: 1424489369</identifier><identifier>ISBN: 9781424489367</identifier><identifier>EISSN: 2576-6996</identifier><identifier>EISBN: 1424489350</identifier><identifier>EISBN: 1424489377</identifier><identifier>EISBN: 9781424489350</identifier><identifier>EISBN: 9781424489374</identifier><identifier>DOI: 10.1109/ICCD.2010.5647657</identifier><language>eng</language><publisher>IEEE</publisher><subject>Delay ; DVOQR ; flow control ; Network on chip ; Pipelines ; Resource management ; router ; Routing ; Switches ; Throughput ; Traffic control ; zero-load latency</subject><ispartof>2010 IEEE International Conference on Computer Design, 2010, p.462-467</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/5647657$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,2058,27925,54920</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/5647657$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Shubo Qi</creatorcontrib><creatorcontrib>Minxuan Zhang</creatorcontrib><creatorcontrib>Jinwen Li</creatorcontrib><creatorcontrib>Tianlei Zhao</creatorcontrib><creatorcontrib>Chengyi Zhang</creatorcontrib><creatorcontrib>Shaoqing Li</creatorcontrib><title>A high performance router with dynamic buffer allocation for on-chip interconnect networks</title><title>2010 IEEE International Conference on Computer Design</title><addtitle>ICCD</addtitle><description>With the number of processor cores increasing in chip multi-processors (CMPs) and global wire delays increasing, networks on chip have been gaining wide acceptance for on-chip inter-core communication. This paper introduces a low latency Dynamic Virtual Output Queues Router (DVOQR), which can reduce the router latency to two cycles by leveraging look-ahead routing computation and virtual output address queues scheme. Simulation results show that network throughput on a 4×4 mesh increases by up to 46.9% and 28.6%, compared to wormhole router and virtual channel router, and that DVOQR outperforms doubled buffer virtual channel router by 1.9% under same input speedup. Network zero-load-latency also decreases by 25.6% and 41% respectively under random traffic. The results with place and route used by Cadence Encounter in TSMC 65nm technology display that the frequency of DVOQR can reach 1.4 GHz, the cell area of the router is only 0.424mm 2 and the power consumption is 274 mw under the 50% injection rate.</description><subject>Delay</subject><subject>DVOQR</subject><subject>flow control</subject><subject>Network on chip</subject><subject>Pipelines</subject><subject>Resource management</subject><subject>router</subject><subject>Routing</subject><subject>Switches</subject><subject>Throughput</subject><subject>Traffic control</subject><subject>zero-load latency</subject><issn>1063-6404</issn><issn>2576-6996</issn><isbn>1424489369</isbn><isbn>9781424489367</isbn><isbn>1424489350</isbn><isbn>1424489377</isbn><isbn>9781424489350</isbn><isbn>9781424489374</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2010</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNpFUElOwzAUNZNEWjgAYuMLpHzP8bIKU6VKbGDDpnIcmxgau3JSVb09kajE6umNi4fQHYEFIaAfVnX9uKAwUSG5kkKdoRnhlPNKMwHnqKBCyVJqLS_-DakvUUFAslJy4NdoNgzfAFAxogr0ucRd-OrwzmWfcm-idTin_egyPoSxw-0xmj5Y3Oy9nzSz3SZrxpAinuI4xdJ2YYdDnAo2xejsiKMbDyn_DDfoypvt4G5POEcfz0_v9Wu5fntZ1ct1GYgSY2m5cUqCazi0nFnvKsOpBwqtI20jmORUyVaDAcUaoaUyDfPUk7YSXgsPbI7u_3aDc26zy6E3-bg5HcR-AaHQVzU</recordid><startdate>201010</startdate><enddate>201010</enddate><creator>Shubo Qi</creator><creator>Minxuan Zhang</creator><creator>Jinwen Li</creator><creator>Tianlei Zhao</creator><creator>Chengyi Zhang</creator><creator>Shaoqing Li</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>201010</creationdate><title>A high performance router with dynamic buffer allocation for on-chip interconnect networks</title><author>Shubo Qi ; Minxuan Zhang ; Jinwen Li ; Tianlei Zhao ; Chengyi Zhang ; Shaoqing Li</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i175t-c4ae760eb40d43cfe8a42f020de1db5364276d90a073b5967ab3f2f1d85f95f03</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2010</creationdate><topic>Delay</topic><topic>DVOQR</topic><topic>flow control</topic><topic>Network on chip</topic><topic>Pipelines</topic><topic>Resource management</topic><topic>router</topic><topic>Routing</topic><topic>Switches</topic><topic>Throughput</topic><topic>Traffic control</topic><topic>zero-load latency</topic><toplevel>online_resources</toplevel><creatorcontrib>Shubo Qi</creatorcontrib><creatorcontrib>Minxuan Zhang</creatorcontrib><creatorcontrib>Jinwen Li</creatorcontrib><creatorcontrib>Tianlei Zhao</creatorcontrib><creatorcontrib>Chengyi Zhang</creatorcontrib><creatorcontrib>Shaoqing Li</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Shubo Qi</au><au>Minxuan Zhang</au><au>Jinwen Li</au><au>Tianlei Zhao</au><au>Chengyi Zhang</au><au>Shaoqing Li</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>A high performance router with dynamic buffer allocation for on-chip interconnect networks</atitle><btitle>2010 IEEE International Conference on Computer Design</btitle><stitle>ICCD</stitle><date>2010-10</date><risdate>2010</risdate><spage>462</spage><epage>467</epage><pages>462-467</pages><issn>1063-6404</issn><eissn>2576-6996</eissn><isbn>1424489369</isbn><isbn>9781424489367</isbn><eisbn>1424489350</eisbn><eisbn>1424489377</eisbn><eisbn>9781424489350</eisbn><eisbn>9781424489374</eisbn><abstract>With the number of processor cores increasing in chip multi-processors (CMPs) and global wire delays increasing, networks on chip have been gaining wide acceptance for on-chip inter-core communication. This paper introduces a low latency Dynamic Virtual Output Queues Router (DVOQR), which can reduce the router latency to two cycles by leveraging look-ahead routing computation and virtual output address queues scheme. Simulation results show that network throughput on a 4×4 mesh increases by up to 46.9% and 28.6%, compared to wormhole router and virtual channel router, and that DVOQR outperforms doubled buffer virtual channel router by 1.9% under same input speedup. Network zero-load-latency also decreases by 25.6% and 41% respectively under random traffic. The results with place and route used by Cadence Encounter in TSMC 65nm technology display that the frequency of DVOQR can reach 1.4 GHz, the cell area of the router is only 0.424mm 2 and the power consumption is 274 mw under the 50% injection rate.</abstract><pub>IEEE</pub><doi>10.1109/ICCD.2010.5647657</doi><tpages>6</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1063-6404
ispartof 2010 IEEE International Conference on Computer Design, 2010, p.462-467
issn 1063-6404
2576-6996
language eng
recordid cdi_ieee_primary_5647657
source IEEE Electronic Library (IEL) Conference Proceedings
subjects Delay
DVOQR
flow control
Network on chip
Pipelines
Resource management
router
Routing
Switches
Throughput
Traffic control
zero-load latency
title A high performance router with dynamic buffer allocation for on-chip interconnect networks
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-20T10%3A22%3A55IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=A%20high%20performance%20router%20with%20dynamic%20buffer%20allocation%20for%20on-chip%20interconnect%20networks&rft.btitle=2010%20IEEE%20International%20Conference%20on%20Computer%20Design&rft.au=Shubo%20Qi&rft.date=2010-10&rft.spage=462&rft.epage=467&rft.pages=462-467&rft.issn=1063-6404&rft.eissn=2576-6996&rft.isbn=1424489369&rft.isbn_list=9781424489367&rft_id=info:doi/10.1109/ICCD.2010.5647657&rft_dat=%3Cieee_6IE%3E5647657%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=1424489350&rft.eisbn_list=1424489377&rft.eisbn_list=9781424489350&rft.eisbn_list=9781424489374&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=5647657&rfr_iscdi=true