A scalable 0.128-to-1Tb/s 0.8-to-2.6pJ/b 64-lane parallel I/O in 32nm CMOS
High-performance computing (HPC) systems demand aggressive scaling of memory and I/O to achieve multiple terabits/sec of bandwidth. Minimizing I/O cost, area and power are crucial to achieving a practically realizable system with such large bandwidth. To meet these needs, we developed a low-power de...
Gespeichert in:
Hauptverfasser: | , , , , , , , , , |
---|---|
Format: | Tagungsbericht |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 403 |
---|---|
container_issue | |
container_start_page | 402 |
container_title | |
container_volume | |
creator | Mansuri, M. Jaussi, J. E. Kennedy, J. T. Hsueh, T. Shekhar, S. Balamurugan, G. O'Mahony, F. Roberts, C. Mooney, R. Casper, B. |
description | High-performance computing (HPC) systems demand aggressive scaling of memory and I/O to achieve multiple terabits/sec of bandwidth. Minimizing I/O cost, area and power are crucial to achieving a practically realizable system with such large bandwidth. To meet these needs, we developed a low-power dense 64-lane I/O system with per-port aggregate bandwidth up to 1Tb/s and 2.6pJ/bit power efficiency. We developed a high-density connector and cable, attached to the top side of the package that enables this high interconnect density. A lane-failover mechanism provides design robustness for fault-tolerance. To further optimize power efficiency, the lane data rate scales from 2 to 16Gb/s with non-linear power efficiency of 0.8 to 2.6pJ/bit, providing scalable aggregate bandwidth of 0.128 to 1Tb/s. Highly power scalable circuits such as CMOS clocking and reconfigurable current-mode (CM) or voltage-mode (VM) TX driver enable the 8× bandwidth and 3× power efficiency scalability with aggressive supply voltage scaling (0.6 to 1.08V). |
doi_str_mv | 10.1109/ISSCC.2013.6487788 |
format | Conference Proceeding |
fullrecord | <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_6487788</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>6487788</ieee_id><sourcerecordid>6487788</sourcerecordid><originalsourceid>FETCH-LOGICAL-i175t-c2cb9c331fa03952797df0eba8e743da8b4206bd64c109e6bd50c3338b3881083</originalsourceid><addsrcrecordid>eNo1UMtuwjAQdF9SA-UH2ot_wMna68f6iKI-gqhygJ6RHYyUylBEuPTvG7X0NDOa0ax2GHuUUEoJvmpWq7ouFUgsrSbniK7YRGrrUBup6ZoVCp0VZMHesJl39O8Ze8sKkB6FNQj3bDIMnwBgvKWCLeZ86EIOMSc-HlIkzl9CrmM1jPJXqNIeF1XkVoscDokfwynknDJvqpb3B47qsOf1e7t6YHe7kIc0u-CUfbw8r-s3sWxfm3q-FL105iw61UXfIcpdAPRGOe-2O0gxUHIat4GiVmDj1upufDuNzMAYR4pIJIFwyp7-evuU0uZ46vfh9L25bII_UQ9LTQ</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>A scalable 0.128-to-1Tb/s 0.8-to-2.6pJ/b 64-lane parallel I/O in 32nm CMOS</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Mansuri, M. ; Jaussi, J. E. ; Kennedy, J. T. ; Hsueh, T. ; Shekhar, S. ; Balamurugan, G. ; O'Mahony, F. ; Roberts, C. ; Mooney, R. ; Casper, B.</creator><creatorcontrib>Mansuri, M. ; Jaussi, J. E. ; Kennedy, J. T. ; Hsueh, T. ; Shekhar, S. ; Balamurugan, G. ; O'Mahony, F. ; Roberts, C. ; Mooney, R. ; Casper, B.</creatorcontrib><description>High-performance computing (HPC) systems demand aggressive scaling of memory and I/O to achieve multiple terabits/sec of bandwidth. Minimizing I/O cost, area and power are crucial to achieving a practically realizable system with such large bandwidth. To meet these needs, we developed a low-power dense 64-lane I/O system with per-port aggregate bandwidth up to 1Tb/s and 2.6pJ/bit power efficiency. We developed a high-density connector and cable, attached to the top side of the package that enables this high interconnect density. A lane-failover mechanism provides design robustness for fault-tolerance. To further optimize power efficiency, the lane data rate scales from 2 to 16Gb/s with non-linear power efficiency of 0.8 to 2.6pJ/bit, providing scalable aggregate bandwidth of 0.128 to 1Tb/s. Highly power scalable circuits such as CMOS clocking and reconfigurable current-mode (CM) or voltage-mode (VM) TX driver enable the 8× bandwidth and 3× power efficiency scalability with aggressive supply voltage scaling (0.6 to 1.08V).</description><identifier>ISSN: 0193-6530</identifier><identifier>ISBN: 9781467345156</identifier><identifier>ISBN: 1467345156</identifier><identifier>EISSN: 2376-8606</identifier><identifier>EISBN: 1467345148</identifier><identifier>EISBN: 9781467345149</identifier><identifier>EISBN: 9781467345163</identifier><identifier>EISBN: 1467345164</identifier><identifier>DOI: 10.1109/ISSCC.2013.6487788</identifier><language>eng</language><publisher>IEEE</publisher><subject>Aggregates ; Bandwidth ; Clocks ; CMOS integrated circuits ; Connectors ; Integrated circuit interconnections ; Solid state circuits</subject><ispartof>2013 IEEE International Solid-State Circuits Conference Digest of Technical Papers, 2013, p.402-403</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/6487788$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,2057,27924,54919</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/6487788$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Mansuri, M.</creatorcontrib><creatorcontrib>Jaussi, J. E.</creatorcontrib><creatorcontrib>Kennedy, J. T.</creatorcontrib><creatorcontrib>Hsueh, T.</creatorcontrib><creatorcontrib>Shekhar, S.</creatorcontrib><creatorcontrib>Balamurugan, G.</creatorcontrib><creatorcontrib>O'Mahony, F.</creatorcontrib><creatorcontrib>Roberts, C.</creatorcontrib><creatorcontrib>Mooney, R.</creatorcontrib><creatorcontrib>Casper, B.</creatorcontrib><title>A scalable 0.128-to-1Tb/s 0.8-to-2.6pJ/b 64-lane parallel I/O in 32nm CMOS</title><title>2013 IEEE International Solid-State Circuits Conference Digest of Technical Papers</title><addtitle>ISSCC</addtitle><description>High-performance computing (HPC) systems demand aggressive scaling of memory and I/O to achieve multiple terabits/sec of bandwidth. Minimizing I/O cost, area and power are crucial to achieving a practically realizable system with such large bandwidth. To meet these needs, we developed a low-power dense 64-lane I/O system with per-port aggregate bandwidth up to 1Tb/s and 2.6pJ/bit power efficiency. We developed a high-density connector and cable, attached to the top side of the package that enables this high interconnect density. A lane-failover mechanism provides design robustness for fault-tolerance. To further optimize power efficiency, the lane data rate scales from 2 to 16Gb/s with non-linear power efficiency of 0.8 to 2.6pJ/bit, providing scalable aggregate bandwidth of 0.128 to 1Tb/s. Highly power scalable circuits such as CMOS clocking and reconfigurable current-mode (CM) or voltage-mode (VM) TX driver enable the 8× bandwidth and 3× power efficiency scalability with aggressive supply voltage scaling (0.6 to 1.08V).</description><subject>Aggregates</subject><subject>Bandwidth</subject><subject>Clocks</subject><subject>CMOS integrated circuits</subject><subject>Connectors</subject><subject>Integrated circuit interconnections</subject><subject>Solid state circuits</subject><issn>0193-6530</issn><issn>2376-8606</issn><isbn>9781467345156</isbn><isbn>1467345156</isbn><isbn>1467345148</isbn><isbn>9781467345149</isbn><isbn>9781467345163</isbn><isbn>1467345164</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2013</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNo1UMtuwjAQdF9SA-UH2ot_wMna68f6iKI-gqhygJ6RHYyUylBEuPTvG7X0NDOa0ax2GHuUUEoJvmpWq7ouFUgsrSbniK7YRGrrUBup6ZoVCp0VZMHesJl39O8Ze8sKkB6FNQj3bDIMnwBgvKWCLeZ86EIOMSc-HlIkzl9CrmM1jPJXqNIeF1XkVoscDokfwynknDJvqpb3B47qsOf1e7t6YHe7kIc0u-CUfbw8r-s3sWxfm3q-FL105iw61UXfIcpdAPRGOe-2O0gxUHIat4GiVmDj1upufDuNzMAYR4pIJIFwyp7-evuU0uZ46vfh9L25bII_UQ9LTQ</recordid><startdate>201302</startdate><enddate>201302</enddate><creator>Mansuri, M.</creator><creator>Jaussi, J. E.</creator><creator>Kennedy, J. T.</creator><creator>Hsueh, T.</creator><creator>Shekhar, S.</creator><creator>Balamurugan, G.</creator><creator>O'Mahony, F.</creator><creator>Roberts, C.</creator><creator>Mooney, R.</creator><creator>Casper, B.</creator><general>IEEE</general><scope>6IE</scope><scope>6IH</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIO</scope></search><sort><creationdate>201302</creationdate><title>A scalable 0.128-to-1Tb/s 0.8-to-2.6pJ/b 64-lane parallel I/O in 32nm CMOS</title><author>Mansuri, M. ; Jaussi, J. E. ; Kennedy, J. T. ; Hsueh, T. ; Shekhar, S. ; Balamurugan, G. ; O'Mahony, F. ; Roberts, C. ; Mooney, R. ; Casper, B.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i175t-c2cb9c331fa03952797df0eba8e743da8b4206bd64c109e6bd50c3338b3881083</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2013</creationdate><topic>Aggregates</topic><topic>Bandwidth</topic><topic>Clocks</topic><topic>CMOS integrated circuits</topic><topic>Connectors</topic><topic>Integrated circuit interconnections</topic><topic>Solid state circuits</topic><toplevel>online_resources</toplevel><creatorcontrib>Mansuri, M.</creatorcontrib><creatorcontrib>Jaussi, J. E.</creatorcontrib><creatorcontrib>Kennedy, J. T.</creatorcontrib><creatorcontrib>Hsueh, T.</creatorcontrib><creatorcontrib>Shekhar, S.</creatorcontrib><creatorcontrib>Balamurugan, G.</creatorcontrib><creatorcontrib>O'Mahony, F.</creatorcontrib><creatorcontrib>Roberts, C.</creatorcontrib><creatorcontrib>Mooney, R.</creatorcontrib><creatorcontrib>Casper, B.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan (POP) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP) 1998-present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Mansuri, M.</au><au>Jaussi, J. E.</au><au>Kennedy, J. T.</au><au>Hsueh, T.</au><au>Shekhar, S.</au><au>Balamurugan, G.</au><au>O'Mahony, F.</au><au>Roberts, C.</au><au>Mooney, R.</au><au>Casper, B.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>A scalable 0.128-to-1Tb/s 0.8-to-2.6pJ/b 64-lane parallel I/O in 32nm CMOS</atitle><btitle>2013 IEEE International Solid-State Circuits Conference Digest of Technical Papers</btitle><stitle>ISSCC</stitle><date>2013-02</date><risdate>2013</risdate><spage>402</spage><epage>403</epage><pages>402-403</pages><issn>0193-6530</issn><eissn>2376-8606</eissn><isbn>9781467345156</isbn><isbn>1467345156</isbn><eisbn>1467345148</eisbn><eisbn>9781467345149</eisbn><eisbn>9781467345163</eisbn><eisbn>1467345164</eisbn><abstract>High-performance computing (HPC) systems demand aggressive scaling of memory and I/O to achieve multiple terabits/sec of bandwidth. Minimizing I/O cost, area and power are crucial to achieving a practically realizable system with such large bandwidth. To meet these needs, we developed a low-power dense 64-lane I/O system with per-port aggregate bandwidth up to 1Tb/s and 2.6pJ/bit power efficiency. We developed a high-density connector and cable, attached to the top side of the package that enables this high interconnect density. A lane-failover mechanism provides design robustness for fault-tolerance. To further optimize power efficiency, the lane data rate scales from 2 to 16Gb/s with non-linear power efficiency of 0.8 to 2.6pJ/bit, providing scalable aggregate bandwidth of 0.128 to 1Tb/s. Highly power scalable circuits such as CMOS clocking and reconfigurable current-mode (CM) or voltage-mode (VM) TX driver enable the 8× bandwidth and 3× power efficiency scalability with aggressive supply voltage scaling (0.6 to 1.08V).</abstract><pub>IEEE</pub><doi>10.1109/ISSCC.2013.6487788</doi><tpages>2</tpages></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | ISSN: 0193-6530 |
ispartof | 2013 IEEE International Solid-State Circuits Conference Digest of Technical Papers, 2013, p.402-403 |
issn | 0193-6530 2376-8606 |
language | eng |
recordid | cdi_ieee_primary_6487788 |
source | IEEE Electronic Library (IEL) Conference Proceedings |
subjects | Aggregates Bandwidth Clocks CMOS integrated circuits Connectors Integrated circuit interconnections Solid state circuits |
title | A scalable 0.128-to-1Tb/s 0.8-to-2.6pJ/b 64-lane parallel I/O in 32nm CMOS |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-09T04%3A04%3A31IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=A%20scalable%200.128-to-1Tb/s%200.8-to-2.6pJ/b%2064-lane%20parallel%20I/O%20in%2032nm%20CMOS&rft.btitle=2013%20IEEE%20International%20Solid-State%20Circuits%20Conference%20Digest%20of%20Technical%20Papers&rft.au=Mansuri,%20M.&rft.date=2013-02&rft.spage=402&rft.epage=403&rft.pages=402-403&rft.issn=0193-6530&rft.eissn=2376-8606&rft.isbn=9781467345156&rft.isbn_list=1467345156&rft_id=info:doi/10.1109/ISSCC.2013.6487788&rft_dat=%3Cieee_6IE%3E6487788%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=1467345148&rft.eisbn_list=9781467345149&rft.eisbn_list=9781467345163&rft.eisbn_list=1467345164&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=6487788&rfr_iscdi=true |