Design Alternatives and Performance Trade-Offs for Implementing MPI-2 over InfiniBand

MPICH2 provides a layered architecture to achieve both portability and performance. For implementations of MPI-2 over InfiniBand, it provides the flexibility for researchers at the RDMA channel, CH3 or ADI3 layer. In this paper we analyze the performance and complexity trade-offs associated with imp...

Bibliographic details
Main authors: Huang, Wei; Santhanaraman, Gopalakrishnan; Jin, Hyun-Wook; Panda, Dhabaleswar K.
Format: Conference proceedings
Language: eng
Subjects:
Online access: Full text
container_end_page 199
container_issue
container_start_page 191
container_title
container_volume
creator Huang, Wei
Santhanaraman, Gopalakrishnan
Jin, Hyun-Wook
Panda, Dhabaleswar K.
description MPICH2 provides a layered architecture to achieve both portability and performance. For implementations of MPI-2 over InfiniBand, it provides the flexibility for researchers at the RDMA channel, CH3 or ADI3 layer. In this paper we analyze the performance and complexity trade-offs associated with implementations at these layers. We describe our designs and implementations, as well as optimizations at each layer. To show the performance impacts of these design schemes and optimizations, we evaluate our implementations with different micro-benchmarks and with the HPCC and NAS test suites. Our experiments show that although the ADI3 layer adds complexity in implementation, the benefits achieved through optimizations justify moving to the ADI layer to extract the best performance.
doi_str_mv 10.1007/11557265_27
format Conference Proceeding
fullrecord <record><control><sourceid>pascalfrancis_sprin</sourceid><recordid>TN_cdi_pascalfrancis_primary_17182697</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>17182697</sourcerecordid><originalsourceid>FETCH-LOGICAL-p219t-6eb1b9ae07e4021771330d3459778e493d9f61f053dc1b5cb83ef2fc24fc6fa3</originalsourceid><addsrcrecordid>eNpNUMtOwzAQNC-JUnriB3LhwCHg9TpxfCzlVamoPZRz5CTrKpA4kV1V4u8xKgf2MtI8VqNh7Ab4PXCuHgCyTIk8K4U6YVeYSY6gJeIpm0AOkCJKfcZmWhW_mtCca3nOJhy5SLWSeMlmIXzyeDGn8mLCPp4otDuXzLs9eWf27YFCYlyTbMjbwffG1ZRsvWkoXVsbksgly37sqCe3b90ued8sU5EMB4q8s61rH2P6ml1Y0wWa_eGUbV-et4u3dLV-XS7mq3QUoPdpThVU2hBXFNuCUoDIG5SZVqogqbHRNgfLM2xqqLK6KpCssLWQts6twSm7Pb4dTahNZ31s24Zy9G1v_HcJCgqRaxV9d0dfiJLbkS-rYfgKJfDyd9by36z4A6VfY9c</addsrcrecordid><sourcetype>Index Database</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Design Alternatives and Performance Trade-Offs for Implementing MPI-2 over InfiniBand</title><source>Springer Books</source><creator>Huang, Wei ; Santhanaraman, Gopalakrishnan ; Jin, Hyun-Wook ; Panda, Dhabaleswar K.</creator><contributor>Dongarra, Jack ; Kranzlmüller, Dieter ; Di Martino, Beniamino</contributor><creatorcontrib>Huang, Wei ; Santhanaraman, Gopalakrishnan ; Jin, Hyun-Wook ; Panda, Dhabaleswar K. ; Dongarra, Jack ; Kranzlmüller, Dieter ; Di Martino, Beniamino</creatorcontrib><description>MPICH2 provides a layered architecture to achieve both portability and performance. For implementations of MPI-2 over InfiniBand, it provides the flexibility for researchers at the RDMA channel, CH3 or ADI3 layer. In this paper we analyze the performance and complexity trade-offs associated with implementations at these layers. We describe our designs and implementations, as well as optimizations at each layer. 
To show the performance impacts of these design schemes and optimizations, we evaluate our implementations with different micro-benchmarks, HPCC and NAS test suite. Our experiments show that although the ADI3 layers adds complexity in implementation, the benefits achieved through optimizations justify moving to the ADI layer to extract the best performance.</description><identifier>ISSN: 0302-9743</identifier><identifier>ISBN: 9783540290094</identifier><identifier>ISBN: 3540290095</identifier><identifier>EISSN: 1611-3349</identifier><identifier>EISBN: 3540319433</identifier><identifier>EISBN: 9783540319436</identifier><identifier>DOI: 10.1007/11557265_27</identifier><language>eng</language><publisher>Berlin, Heidelberg: Springer Berlin Heidelberg</publisher><subject>ADI3 ; Applied sciences ; CH3 ; Computer science; control theory; systems ; Computer systems and distributed systems. User interface ; Exact sciences and technology ; InfiniBand ; MPI-2 ; RDMA channel ; Software</subject><ispartof>Lecture notes in computer science, 2005, p.191-199</ispartof><rights>Springer-Verlag Berlin Heidelberg 2005</rights><rights>2005 INIST-CNRS</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/11557265_27$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/11557265_27$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>309,310,779,780,784,789,790,793,4048,4049,27924,38254,41441,42510</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=17182697$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><contributor>Dongarra, Jack</contributor><contributor>Kranzlmüller, Dieter</contributor><contributor>Di Martino, 
Beniamino</contributor><creatorcontrib>Huang, Wei</creatorcontrib><creatorcontrib>Santhanaraman, Gopalakrishnan</creatorcontrib><creatorcontrib>Jin, Hyun-Wook</creatorcontrib><creatorcontrib>Panda, Dhabaleswar K.</creatorcontrib><title>Design Alternatives and Performance Trade-Offs for Implementing MPI-2 over InfiniBand</title><title>Lecture notes in computer science</title><description>MPICH2 provides a layered architecture to achieve both portability and performance. For implementations of MPI-2 over InfiniBand, it provides the flexibility for researchers at the RDMA channel, CH3 or ADI3 layer. In this paper we analyze the performance and complexity trade-offs associated with implementations at these layers. We describe our designs and implementations, as well as optimizations at each layer. To show the performance impacts of these design schemes and optimizations, we evaluate our implementations with different micro-benchmarks, HPCC and NAS test suite. Our experiments show that although the ADI3 layers adds complexity in implementation, the benefits achieved through optimizations justify moving to the ADI layer to extract the best performance.</description><subject>ADI3</subject><subject>Applied sciences</subject><subject>CH3</subject><subject>Computer science; control theory; systems</subject><subject>Computer systems and distributed systems. 
User interface</subject><subject>Exact sciences and technology</subject><subject>InfiniBand</subject><subject>MPI-2</subject><subject>RDMA channel</subject><subject>Software</subject><issn>0302-9743</issn><issn>1611-3349</issn><isbn>9783540290094</isbn><isbn>3540290095</isbn><isbn>3540319433</isbn><isbn>9783540319436</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2005</creationdate><recordtype>conference_proceeding</recordtype><recordid>eNpNUMtOwzAQNC-JUnriB3LhwCHg9TpxfCzlVamoPZRz5CTrKpA4kV1V4u8xKgf2MtI8VqNh7Ab4PXCuHgCyTIk8K4U6YVeYSY6gJeIpm0AOkCJKfcZmWhW_mtCca3nOJhy5SLWSeMlmIXzyeDGn8mLCPp4otDuXzLs9eWf27YFCYlyTbMjbwffG1ZRsvWkoXVsbksgly37sqCe3b90ued8sU5EMB4q8s61rH2P6ml1Y0wWa_eGUbV-et4u3dLV-XS7mq3QUoPdpThVU2hBXFNuCUoDIG5SZVqogqbHRNgfLM2xqqLK6KpCssLWQts6twSm7Pb4dTahNZ31s24Zy9G1v_HcJCgqRaxV9d0dfiJLbkS-rYfgKJfDyd9by36z4A6VfY9c</recordid><startdate>2005</startdate><enddate>2005</enddate><creator>Huang, Wei</creator><creator>Santhanaraman, Gopalakrishnan</creator><creator>Jin, Hyun-Wook</creator><creator>Panda, Dhabaleswar K.</creator><general>Springer Berlin Heidelberg</general><general>Springer</general><scope>IQODW</scope></search><sort><creationdate>2005</creationdate><title>Design Alternatives and Performance Trade-Offs for Implementing MPI-2 over InfiniBand</title><author>Huang, Wei ; Santhanaraman, Gopalakrishnan ; Jin, Hyun-Wook ; Panda, Dhabaleswar K.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-p219t-6eb1b9ae07e4021771330d3459778e493d9f61f053dc1b5cb83ef2fc24fc6fa3</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2005</creationdate><topic>ADI3</topic><topic>Applied sciences</topic><topic>CH3</topic><topic>Computer science; control theory; systems</topic><topic>Computer systems and distributed systems. 
User interface</topic><topic>Exact sciences and technology</topic><topic>InfiniBand</topic><topic>MPI-2</topic><topic>RDMA channel</topic><topic>Software</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Huang, Wei</creatorcontrib><creatorcontrib>Santhanaraman, Gopalakrishnan</creatorcontrib><creatorcontrib>Jin, Hyun-Wook</creatorcontrib><creatorcontrib>Panda, Dhabaleswar K.</creatorcontrib><collection>Pascal-Francis</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Huang, Wei</au><au>Santhanaraman, Gopalakrishnan</au><au>Jin, Hyun-Wook</au><au>Panda, Dhabaleswar K.</au><au>Dongarra, Jack</au><au>Kranzlmüller, Dieter</au><au>Di Martino, Beniamino</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Design Alternatives and Performance Trade-Offs for Implementing MPI-2 over InfiniBand</atitle><btitle>Lecture notes in computer science</btitle><date>2005</date><risdate>2005</risdate><spage>191</spage><epage>199</epage><pages>191-199</pages><issn>0302-9743</issn><eissn>1611-3349</eissn><isbn>9783540290094</isbn><isbn>3540290095</isbn><eisbn>3540319433</eisbn><eisbn>9783540319436</eisbn><abstract>MPICH2 provides a layered architecture to achieve both portability and performance. For implementations of MPI-2 over InfiniBand, it provides the flexibility for researchers at the RDMA channel, CH3 or ADI3 layer. In this paper we analyze the performance and complexity trade-offs associated with implementations at these layers. We describe our designs and implementations, as well as optimizations at each layer. To show the performance impacts of these design schemes and optimizations, we evaluate our implementations with different micro-benchmarks, HPCC and NAS test suite. 
Our experiments show that although the ADI3 layers adds complexity in implementation, the benefits achieved through optimizations justify moving to the ADI layer to extract the best performance.</abstract><cop>Berlin, Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/11557265_27</doi><tpages>9</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0302-9743
ispartof Lecture notes in computer science, 2005, p.191-199
issn 0302-9743
1611-3349
language eng
recordid cdi_pascalfrancis_primary_17182697
source Springer Books
subjects ADI3
Applied sciences
CH3
Computer science; control theory; systems
Computer systems and distributed systems. User interface
Exact sciences and technology
InfiniBand
MPI-2
RDMA channel
Software
title Design Alternatives and Performance Trade-Offs for Implementing MPI-2 over InfiniBand
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-10T10%3A50%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-pascalfrancis_sprin&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Design%20Alternatives%20and%20Performance%20Trade-Offs%20for%20Implementing%20MPI-2%20over%20InfiniBand&rft.btitle=Lecture%20notes%20in%20computer%20science&rft.au=Huang,%20Wei&rft.date=2005&rft.spage=191&rft.epage=199&rft.pages=191-199&rft.issn=0302-9743&rft.eissn=1611-3349&rft.isbn=9783540290094&rft.isbn_list=3540290095&rft_id=info:doi/10.1007/11557265_27&rft_dat=%3Cpascalfrancis_sprin%3E17182697%3C/pascalfrancis_sprin%3E%3Curl%3E%3C/url%3E&rft.eisbn=3540319433&rft.eisbn_list=9783540319436&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true