Architecture and Early Performance of the New IBM HPS Fabric and Adapter

In this paper we describe the architecture, design, and performance of the new cluster switch fabric and adapter called HPS (High Performance Switch). HPS delivers very low latency and very high bandwidth. We demonstrate latency of less than 4.3us MPI library; 1.8GB/s of delivered unidirectional ban...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:High Performance Computing - HiPC 2004 2004-01, p.156-165
Hauptverfasser: Govindaraju, Rama K, Hochschild, Peter, Grice, Don, Gildea, Kevin, Blackmore, Robert, Bender, Carl A, Kim, Chulho, Chaudhary, Piyush, Goscinski, Jason, Herring, Jay, Martin, Steven, Houston, John
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 165
container_issue
container_start_page 156
container_title High Performance Computing - HiPC 2004
container_volume
creator Govindaraju, Rama K
Hochschild, Peter
Grice, Don
Gildea, Kevin
Blackmore, Robert
Bender, Carl A
Kim, Chulho
Chaudhary, Piyush
Goscinski, Jason
Herring, Jay
Martin, Steven
Houston, John
description In this paper we describe the architecture, design, and performance of the new cluster switch fabric and adapter called HPS (High Performance Switch). HPS delivers very low latency and very high bandwidth. We demonstrate latency of less than 4.3us MPI library; 1.8GB/s of delivered unidirectional bandwidth and 2.9GB/s of bidirectional bandwidth between 2 MPI tasks running on 1.9GHz Power 4+ IH based nodes. HPS also supports RDMA (remote direct memory access capability). A unique capability of RDMA over HPS is that reliable RDMA is supported over an underlying unreliable transport (unlike Infiniband and other RDMA transport protocols which depend on the underlying transport being reliable). We profile the performance of RDMA and its impact on striping for systems in which multiple network adapters are available to tasks of parallel jobs.
doi_str_mv 10.1007/978-3-540-30474-6_21
format Article
fullrecord <record><control><sourceid>pascalfrancis_sprin</sourceid><recordid>TN_cdi_pascalfrancis_primary_16398361</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>16398361</sourcerecordid><originalsourceid>FETCH-LOGICAL-p228t-d18395cf3da7942c8078673bf8fa705d97b129a4d543a27f251b5f03cc9917873</originalsourceid><addsrcrecordid>eNotkE1PAyEQhvErsdb-Aw9cPKLAsAsca9PaJlWbqGfCsmBX2-4G1pj-e2l1LpO875PJ5EHohtE7Rqm811IRIIWgBKiQgpSGsxN0BTk5BuUpGrCSMQIg9BkaZf7QccG4FudokClOtBRwiUYpfdI8TFIlygGaj6NbN713_Xf02O5qPLVxs8crH0Mbt3bnPG4D7tceP_sfvHh4wvPVK57ZKjbuyI9r2_U-XqOLYDfJj_73EL3Ppm-TOVm-PC4m4yXpOFc9qZkCXbgAtZVacKeoVKWEKqhgJS1qLav8tBV1IcByGXjBqiJQcE5rJpWEIbr9u9vZ5OwmxPxik0wXm62Ne8NK0ApKljn-x6Vc7T58NFXbfiXDqDk4NdmRAZMtmaNCc3AKvxjLYfw</addsrcrecordid><sourcetype>Index Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Architecture and Early Performance of the New IBM HPS Fabric and Adapter</title><source>Springer Books</source><creator>Govindaraju, Rama K ; Hochschild, Peter ; Grice, Don ; Gildea, Kevin ; Blackmore, Robert ; Bender, Carl A ; Kim, Chulho ; Chaudhary, Piyush ; Goscinski, Jason ; Herring, Jay ; Martin, Steven ; Houston, John</creator><contributor>Bougé, Luc ; Prasanna, Viktor K.</contributor><creatorcontrib>Govindaraju, Rama K ; Hochschild, Peter ; Grice, Don ; Gildea, Kevin ; Blackmore, Robert ; Bender, Carl A ; Kim, Chulho ; Chaudhary, Piyush ; Goscinski, Jason ; Herring, Jay ; Martin, Steven ; Houston, John ; Bougé, Luc ; Prasanna, Viktor K.</creatorcontrib><description>In this paper we describe the architecture, design, and performance of the new cluster switch fabric and adapter called HPS (High Performance Switch). HPS delivers very low latency and very high bandwidth. We demonstrate latency of less than 4.3us MPI library; 1.8GB/s of delivered unidirectional bandwidth and 2.9GB/s of bidirectional bandwidth between 2 MPI tasks running on 1.9GHz Power 4+ IH based nodes. HPS also supports RDMA (remote direct memory access capability). A unique capability of RDMA over HPS is that reliable RDMA is supported over an underlying unreliable transport (unlike Infiniband and other RDMA transport protocols which depend on the underlying transport being reliable). We profile the performance of RDMA and its impact on striping for systems in which multiple network adapters are available to tasks of parallel jobs.</description><identifier>ISSN: 0302-9743</identifier><identifier>ISBN: 9783540241294</identifier><identifier>ISBN: 3540241299</identifier><identifier>EISSN: 1611-3349</identifier><identifier>EISBN: 3540304746</identifier><identifier>EISBN: 9783540304746</identifier><identifier>DOI: 10.1007/978-3-540-30474-6_21</identifier><language>eng</language><publisher>Berlin, Heidelberg: Springer Berlin Heidelberg</publisher><subject>Algorithmics. Computability. Computer arithmetics ; Applied sciences ; Cache Line ; Computer science; control theory; systems ; Early Performance ; Exact sciences and technology ; Message Size ; Switch Adapter ; Theoretical computing ; User Space</subject><ispartof>High Performance Computing - HiPC 2004, 2004-01, p.156-165</ispartof><rights>Springer-Verlag Berlin Heidelberg 2004</rights><rights>2005 INIST-CNRS</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/978-3-540-30474-6_21$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/978-3-540-30474-6_21$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>309,310,779,780,784,789,790,793,27925,38255,41442,42511</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=16398361$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><contributor>Bougé, Luc</contributor><contributor>Prasanna, Viktor K.</contributor><creatorcontrib>Govindaraju, Rama K</creatorcontrib><creatorcontrib>Hochschild, Peter</creatorcontrib><creatorcontrib>Grice, Don</creatorcontrib><creatorcontrib>Gildea, Kevin</creatorcontrib><creatorcontrib>Blackmore, Robert</creatorcontrib><creatorcontrib>Bender, Carl A</creatorcontrib><creatorcontrib>Kim, Chulho</creatorcontrib><creatorcontrib>Chaudhary, Piyush</creatorcontrib><creatorcontrib>Goscinski, Jason</creatorcontrib><creatorcontrib>Herring, Jay</creatorcontrib><creatorcontrib>Martin, Steven</creatorcontrib><creatorcontrib>Houston, John</creatorcontrib><title>Architecture and Early Performance of the New IBM HPS Fabric and Adapter</title><title>High Performance Computing - HiPC 2004</title><description>In this paper we describe the architecture, design, and performance of the new cluster switch fabric and adapter called HPS (High Performance Switch). HPS delivers very low latency and very high bandwidth. We demonstrate latency of less than 4.3us MPI library; 1.8GB/s of delivered unidirectional bandwidth and 2.9GB/s of bidirectional bandwidth between 2 MPI tasks running on 1.9GHz Power 4+ IH based nodes. HPS also supports RDMA (remote direct memory access capability). A unique capability of RDMA over HPS is that reliable RDMA is supported over an underlying unreliable transport (unlike Infiniband and other RDMA transport protocols which depend on the underlying transport being reliable). We profile the performance of RDMA and its impact on striping for systems in which multiple network adapters are available to tasks of parallel jobs.</description><subject>Algorithmics. Computability. Computer arithmetics</subject><subject>Applied sciences</subject><subject>Cache Line</subject><subject>Computer science; control theory; systems</subject><subject>Early Performance</subject><subject>Exact sciences and technology</subject><subject>Message Size</subject><subject>Switch Adapter</subject><subject>Theoretical computing</subject><subject>User Space</subject><issn>0302-9743</issn><issn>1611-3349</issn><isbn>9783540241294</isbn><isbn>3540241299</isbn><isbn>3540304746</isbn><isbn>9783540304746</isbn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2004</creationdate><recordtype>article</recordtype><recordid>eNotkE1PAyEQhvErsdb-Aw9cPKLAsAsca9PaJlWbqGfCsmBX2-4G1pj-e2l1LpO875PJ5EHohtE7Rqm811IRIIWgBKiQgpSGsxN0BTk5BuUpGrCSMQIg9BkaZf7QccG4FudokClOtBRwiUYpfdI8TFIlygGaj6NbN713_Xf02O5qPLVxs8crH0Mbt3bnPG4D7tceP_sfvHh4wvPVK57ZKjbuyI9r2_U-XqOLYDfJj_73EL3Ppm-TOVm-PC4m4yXpOFc9qZkCXbgAtZVacKeoVKWEKqhgJS1qLav8tBV1IcByGXjBqiJQcE5rJpWEIbr9u9vZ5OwmxPxik0wXm62Ne8NK0ApKljn-x6Vc7T58NFXbfiXDqDk4NdmRAZMtmaNCc3AKvxjLYfw</recordid><startdate>20040101</startdate><enddate>20040101</enddate><creator>Govindaraju, Rama K</creator><creator>Hochschild, Peter</creator><creator>Grice, Don</creator><creator>Gildea, Kevin</creator><creator>Blackmore, Robert</creator><creator>Bender, Carl A</creator><creator>Kim, Chulho</creator><creator>Chaudhary, Piyush</creator><creator>Goscinski, Jason</creator><creator>Herring, Jay</creator><creator>Martin, Steven</creator><creator>Houston, John</creator><general>Springer Berlin Heidelberg</general><general>Springer</general><scope>IQODW</scope></search><sort><creationdate>20040101</creationdate><title>Architecture and Early Performance of the New IBM HPS Fabric and Adapter</title><author>Govindaraju, Rama K ; Hochschild, Peter ; Grice, Don ; Gildea, Kevin ; Blackmore, Robert ; Bender, Carl A ; Kim, Chulho ; Chaudhary, Piyush ; Goscinski, Jason ; Herring, Jay ; Martin, Steven ; Houston, John</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-p228t-d18395cf3da7942c8078673bf8fa705d97b129a4d543a27f251b5f03cc9917873</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2004</creationdate><topic>Algorithmics. Computability. Computer arithmetics</topic><topic>Applied sciences</topic><topic>Cache Line</topic><topic>Computer science; control theory; systems</topic><topic>Early Performance</topic><topic>Exact sciences and technology</topic><topic>Message Size</topic><topic>Switch Adapter</topic><topic>Theoretical computing</topic><topic>User Space</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Govindaraju, Rama K</creatorcontrib><creatorcontrib>Hochschild, Peter</creatorcontrib><creatorcontrib>Grice, Don</creatorcontrib><creatorcontrib>Gildea, Kevin</creatorcontrib><creatorcontrib>Blackmore, Robert</creatorcontrib><creatorcontrib>Bender, Carl A</creatorcontrib><creatorcontrib>Kim, Chulho</creatorcontrib><creatorcontrib>Chaudhary, Piyush</creatorcontrib><creatorcontrib>Goscinski, Jason</creatorcontrib><creatorcontrib>Herring, Jay</creatorcontrib><creatorcontrib>Martin, Steven</creatorcontrib><creatorcontrib>Houston, John</creatorcontrib><collection>Pascal-Francis</collection><jtitle>High Performance Computing - HiPC 2004</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Govindaraju, Rama K</au><au>Hochschild, Peter</au><au>Grice, Don</au><au>Gildea, Kevin</au><au>Blackmore, Robert</au><au>Bender, Carl A</au><au>Kim, Chulho</au><au>Chaudhary, Piyush</au><au>Goscinski, Jason</au><au>Herring, Jay</au><au>Martin, Steven</au><au>Houston, John</au><au>Bougé, Luc</au><au>Prasanna, Viktor K.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Architecture and Early Performance of the New IBM HPS Fabric and Adapter</atitle><jtitle>High Performance Computing - HiPC 2004</jtitle><date>2004-01-01</date><risdate>2004</risdate><spage>156</spage><epage>165</epage><pages>156-165</pages><issn>0302-9743</issn><eissn>1611-3349</eissn><isbn>9783540241294</isbn><isbn>3540241299</isbn><eisbn>3540304746</eisbn><eisbn>9783540304746</eisbn><abstract>In this paper we describe the architecture, design, and performance of the new cluster switch fabric and adapter called HPS (High Performance Switch). HPS delivers very low latency and very high bandwidth. We demonstrate latency of less than 4.3us MPI library; 1.8GB/s of delivered unidirectional bandwidth and 2.9GB/s of bidirectional bandwidth between 2 MPI tasks running on 1.9GHz Power 4+ IH based nodes. HPS also supports RDMA (remote direct memory access capability). A unique capability of RDMA over HPS is that reliable RDMA is supported over an underlying unreliable transport (unlike Infiniband and other RDMA transport protocols which depend on the underlying transport being reliable). We profile the performance of RDMA and its impact on striping for systems in which multiple network adapters are available to tasks of parallel jobs.</abstract><cop>Berlin, Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/978-3-540-30474-6_21</doi><tpages>10</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0302-9743
ispartof High Performance Computing - HiPC 2004, 2004-01, p.156-165
issn 0302-9743
1611-3349
language eng
recordid cdi_pascalfrancis_primary_16398361
source Springer Books
subjects Algorithmics. Computability. Computer arithmetics
Applied sciences
Cache Line
Computer science
control theory
systems
Early Performance
Exact sciences and technology
Message Size
Switch Adapter
Theoretical computing
User Space
title Architecture and Early Performance of the New IBM HPS Fabric and Adapter
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-22T20%3A45%3A09IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-pascalfrancis_sprin&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Architecture%20and%20Early%20Performance%20of%20the%20New%20IBM%20HPS%20Fabric%20and%20Adapter&rft.jtitle=High%20Performance%20Computing%20-%20HiPC%202004&rft.au=Govindaraju,%20Rama%20K&rft.date=2004-01-01&rft.spage=156&rft.epage=165&rft.pages=156-165&rft.issn=0302-9743&rft.eissn=1611-3349&rft.isbn=9783540241294&rft.isbn_list=3540241299&rft_id=info:doi/10.1007/978-3-540-30474-6_21&rft_dat=%3Cpascalfrancis_sprin%3E16398361%3C/pascalfrancis_sprin%3E%3Curl%3E%3C/url%3E&rft.eisbn=3540304746&rft.eisbn_list=9783540304746&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true