A high performance router with dynamic buffer allocation for on-chip interconnect networks

With the number of processor cores increasing in chip multi-processors (CMPs) and global wire delays increasing, networks on chip have been gaining wide acceptance for on-chip inter-core communication. This paper introduces a low latency Dynamic Virtual Output Queues Router (DVOQR), which can reduce...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Shubo Qi, Minxuan Zhang, Jinwen Li, Tianlei Zhao, Chengyi Zhang, Shaoqing Li
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Delay DVOQR flow control Network on chip Pipelines Resource management router Routing Switches Throughput Traffic control zero-load latency
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	467
container_issue
container_start_page	462
container_title
container_volume
creator	Shubo Qi Minxuan Zhang Jinwen Li Tianlei Zhao Chengyi Zhang Shaoqing Li
description	With the number of processor cores increasing in chip multi-processors (CMPs) and global wire delays increasing, networks on chip have been gaining wide acceptance for on-chip inter-core communication. This paper introduces a low latency Dynamic Virtual Output Queues Router (DVOQR), which can reduce the router latency to two cycles by leveraging look-ahead routing computation and virtual output address queues scheme. Simulation results show that network throughput on a 4×4 mesh increases by up to 46.9% and 28.6%, compared to wormhole router and virtual channel router, and that DVOQR outperforms doubled buffer virtual channel router by 1.9% under same input speedup. Network zero-load-latency also decreases by 25.6% and 41% respectively under random traffic. The results with place and route used by Cadence Encounter in TSMC 65nm technology display that the frequency of DVOQR can reach 1.4 GHz, the cell area of the router is only 0.424mm 2 and the power consumption is 274 mw under the 50% injection rate.
doi_str_mv	10.1109/ICCD.2010.5647657
format	Conference Proceeding
fullrecord	<record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_5647657</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>5647657</ieee_id><sourcerecordid>5647657</sourcerecordid><originalsourceid>FETCH-LOGICAL-i175t-c4ae760eb40d43cfe8a42f020de1db5364276d90a073b5967ab3f2f1d85f95f03</originalsourceid><addsrcrecordid>eNpFUElOwzAUNZNEWjgAYuMLpHzP8bIKU6VKbGDDpnIcmxgau3JSVb09kajE6umNi4fQHYEFIaAfVnX9uKAwUSG5kkKdoRnhlPNKMwHnqKBCyVJqLS_-DakvUUFAslJy4NdoNgzfAFAxogr0ucRd-OrwzmWfcm-idTin_egyPoSxw-0xmj5Y3Oy9nzSz3SZrxpAinuI4xdJ2YYdDnAo2xejsiKMbDyn_DDfoypvt4G5POEcfz0_v9Wu5fntZ1ct1GYgSY2m5cUqCazi0nFnvKsOpBwqtI20jmORUyVaDAcUaoaUyDfPUk7YSXgsPbI7u_3aDc26zy6E3-bg5HcR-AaHQVzU</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>A high performance router with dynamic buffer allocation for on-chip interconnect networks</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Shubo Qi ; Minxuan Zhang ; Jinwen Li ; Tianlei Zhao ; Chengyi Zhang ; Shaoqing Li</creator><creatorcontrib>Shubo Qi ; Minxuan Zhang ; Jinwen Li ; Tianlei Zhao ; Chengyi Zhang ; Shaoqing Li</creatorcontrib><description>With the number of processor cores increasing in chip multi-processors (CMPs) and global wire delays increasing, networks on chip have been gaining wide acceptance for on-chip inter-core communication. This paper introduces a low latency Dynamic Virtual Output Queues Router (DVOQR), which can reduce the router latency to two cycles by leveraging look-ahead routing computation and virtual output address queues scheme. Simulation results show that network throughput on a 4×4 mesh increases by up to 46.9% and 28.6%, compared to wormhole router and virtual channel router, and that DVOQR outperforms doubled buffer virtual channel router by 1.9% under same input speedup. Network zero-load-latency also decreases by 25.6% and 41% respectively under random traffic. The results with place and route used by Cadence Encounter in TSMC 65nm technology display that the frequency of DVOQR can reach 1.4 GHz, the cell area of the router is only 0.424mm 2 and the power consumption is 274 mw under the 50% injection rate.</description><identifier>ISSN: 1063-6404</identifier><identifier>ISBN: 1424489369</identifier><identifier>ISBN: 9781424489367</identifier><identifier>EISSN: 2576-6996</identifier><identifier>EISBN: 1424489350</identifier><identifier>EISBN: 1424489377</identifier><identifier>EISBN: 9781424489350</identifier><identifier>EISBN: 9781424489374</identifier><identifier>DOI: 10.1109/ICCD.2010.5647657</identifier><language>eng</language><publisher>IEEE</publisher><subject>Delay ; DVOQR ; flow control ; Network on chip ; Pipelines ; Resource management ; router ; Routing ; Switches ; Throughput ; Traffic control ; zero-load latency</subject><ispartof>2010 IEEE International Conference on Computer Design, 2010, p.462-467</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/5647657$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,2058,27925,54920</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/5647657$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Shubo Qi</creatorcontrib><creatorcontrib>Minxuan Zhang</creatorcontrib><creatorcontrib>Jinwen Li</creatorcontrib><creatorcontrib>Tianlei Zhao</creatorcontrib><creatorcontrib>Chengyi Zhang</creatorcontrib><creatorcontrib>Shaoqing Li</creatorcontrib><title>A high performance router with dynamic buffer allocation for on-chip interconnect networks</title><title>2010 IEEE International Conference on Computer Design</title><addtitle>ICCD</addtitle><description>With the number of processor cores increasing in chip multi-processors (CMPs) and global wire delays increasing, networks on chip have been gaining wide acceptance for on-chip inter-core communication. This paper introduces a low latency Dynamic Virtual Output Queues Router (DVOQR), which can reduce the router latency to two cycles by leveraging look-ahead routing computation and virtual output address queues scheme. Simulation results show that network throughput on a 4×4 mesh increases by up to 46.9% and 28.6%, compared to wormhole router and virtual channel router, and that DVOQR outperforms doubled buffer virtual channel router by 1.9% under same input speedup. Network zero-load-latency also decreases by 25.6% and 41% respectively under random traffic. The results with place and route used by Cadence Encounter in TSMC 65nm technology display that the frequency of DVOQR can reach 1.4 GHz, the cell area of the router is only 0.424mm 2 and the power consumption is 274 mw under the 50% injection rate.</description><subject>Delay</subject><subject>DVOQR</subject><subject>flow control</subject><subject>Network on chip</subject><subject>Pipelines</subject><subject>Resource management</subject><subject>router</subject><subject>Routing</subject><subject>Switches</subject><subject>Throughput</subject><subject>Traffic control</subject><subject>zero-load latency</subject><issn>1063-6404</issn><issn>2576-6996</issn><isbn>1424489369</isbn><isbn>9781424489367</isbn><isbn>1424489350</isbn><isbn>1424489377</isbn><isbn>9781424489350</isbn><isbn>9781424489374</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2010</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNpFUElOwzAUNZNEWjgAYuMLpHzP8bIKU6VKbGDDpnIcmxgau3JSVb09kajE6umNi4fQHYEFIaAfVnX9uKAwUSG5kkKdoRnhlPNKMwHnqKBCyVJqLS_-DakvUUFAslJy4NdoNgzfAFAxogr0ucRd-OrwzmWfcm-idTin_egyPoSxw-0xmj5Y3Oy9nzSz3SZrxpAinuI4xdJ2YYdDnAo2xejsiKMbDyn_DDfoypvt4G5POEcfz0_v9Wu5fntZ1ct1GYgSY2m5cUqCazi0nFnvKsOpBwqtI20jmORUyVaDAcUaoaUyDfPUk7YSXgsPbI7u_3aDc26zy6E3-bg5HcR-AaHQVzU</recordid><startdate>201010</startdate><enddate>201010</enddate><creator>Shubo Qi</creator><creator>Minxuan Zhang</creator><creator>Jinwen Li</creator><creator>Tianlei Zhao</creator><creator>Chengyi Zhang</creator><creator>Shaoqing Li</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>201010</creationdate><title>A high performance router with dynamic buffer allocation for on-chip interconnect networks</title><author>Shubo Qi ; Minxuan Zhang ; Jinwen Li ; Tianlei Zhao ; Chengyi Zhang ; Shaoqing Li</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i175t-c4ae760eb40d43cfe8a42f020de1db5364276d90a073b5967ab3f2f1d85f95f03</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2010</creationdate><topic>Delay</topic><topic>DVOQR</topic><topic>flow control</topic><topic>Network on chip</topic><topic>Pipelines</topic><topic>Resource management</topic><topic>router</topic><topic>Routing</topic><topic>Switches</topic><topic>Throughput</topic><topic>Traffic control</topic><topic>zero-load latency</topic><toplevel>online_resources</toplevel><creatorcontrib>Shubo Qi</creatorcontrib><creatorcontrib>Minxuan Zhang</creatorcontrib><creatorcontrib>Jinwen Li</creatorcontrib><creatorcontrib>Tianlei Zhao</creatorcontrib><creatorcontrib>Chengyi Zhang</creatorcontrib><creatorcontrib>Shaoqing Li</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Shubo Qi</au><au>Minxuan Zhang</au><au>Jinwen Li</au><au>Tianlei Zhao</au><au>Chengyi Zhang</au><au>Shaoqing Li</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>A high performance router with dynamic buffer allocation for on-chip interconnect networks</atitle><btitle>2010 IEEE International Conference on Computer Design</btitle><stitle>ICCD</stitle><date>2010-10</date><risdate>2010</risdate><spage>462</spage><epage>467</epage><pages>462-467</pages><issn>1063-6404</issn><eissn>2576-6996</eissn><isbn>1424489369</isbn><isbn>9781424489367</isbn><eisbn>1424489350</eisbn><eisbn>1424489377</eisbn><eisbn>9781424489350</eisbn><eisbn>9781424489374</eisbn><abstract>With the number of processor cores increasing in chip multi-processors (CMPs) and global wire delays increasing, networks on chip have been gaining wide acceptance for on-chip inter-core communication. This paper introduces a low latency Dynamic Virtual Output Queues Router (DVOQR), which can reduce the router latency to two cycles by leveraging look-ahead routing computation and virtual output address queues scheme. Simulation results show that network throughput on a 4×4 mesh increases by up to 46.9% and 28.6%, compared to wormhole router and virtual channel router, and that DVOQR outperforms doubled buffer virtual channel router by 1.9% under same input speedup. Network zero-load-latency also decreases by 25.6% and 41% respectively under random traffic. The results with place and route used by Cadence Encounter in TSMC 65nm technology display that the frequency of DVOQR can reach 1.4 GHz, the cell area of the router is only 0.424mm 2 and the power consumption is 274 mw under the 50% injection rate.</abstract><pub>IEEE</pub><doi>10.1109/ICCD.2010.5647657</doi><tpages>6</tpages></addata></record>
fulltext	fulltext_linktorsrc
identifier	ISSN: 1063-6404
ispartof	2010 IEEE International Conference on Computer Design, 2010, p.462-467
issn	1063-6404 2576-6996
language	eng
recordid	cdi_ieee_primary_5647657
source	IEEE Electronic Library (IEL) Conference Proceedings
subjects	Delay DVOQR flow control Network on chip Pipelines Resource management router Routing Switches Throughput Traffic control zero-load latency
title	A high performance router with dynamic buffer allocation for on-chip interconnect networks
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-20T10%3A22%3A55IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=A%20high%20performance%20router%20with%20dynamic%20buffer%20allocation%20for%20on-chip%20interconnect%20networks&rft.btitle=2010%20IEEE%20International%20Conference%20on%20Computer%20Design&rft.au=Shubo%20Qi&rft.date=2010-10&rft.spage=462&rft.epage=467&rft.pages=462-467&rft.issn=1063-6404&rft.eissn=2576-6996&rft.isbn=1424489369&rft.isbn_list=9781424489367&rft_id=info:doi/10.1109/ICCD.2010.5647657&rft_dat=%3Cieee_6IE%3E5647657%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=1424489350&rft.eisbn_list=1424489377&rft.eisbn_list=9781424489350&rft.eisbn_list=9781424489374&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=5647657&rfr_iscdi=true