Design Alternatives and Performance Trade-Offs for Implementing MPI-2 over InfiniBand
MPICH2 provides a layered architecture to achieve both portability and performance. For implementations of MPI-2 over InfiniBand, it provides the flexibility for researchers at the RDMA channel, CH3 or ADI3 layer. In this paper we analyze the performance and complexity trade-offs associated with imp...
Gespeichert in:
Hauptverfasser: | , , , |
---|---|
Format: | Tagungsbericht |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 199 |
---|---|
container_issue | |
container_start_page | 191 |
container_title | |
container_volume | |
creator | Huang, Wei Santhanaraman, Gopalakrishnan Jin, Hyun-Wook Panda, Dhabaleswar K. |
description | MPICH2 provides a layered architecture to achieve both portability and performance. For implementations of MPI-2 over InfiniBand, it provides the flexibility for researchers at the RDMA channel, CH3 or ADI3 layer. In this paper we analyze the performance and complexity trade-offs associated with implementations at these layers. We describe our designs and implementations, as well as optimizations at each layer. To show the performance impacts of these design schemes and optimizations, we evaluate our implementations with different micro-benchmarks, HPCC and NAS test suite. Our experiments show that although the ADI3 layers adds complexity in implementation, the benefits achieved through optimizations justify moving to the ADI layer to extract the best performance. |
doi_str_mv | 10.1007/11557265_27 |
format | Conference Proceeding |
fullrecord | <record><control><sourceid>pascalfrancis_sprin</sourceid><recordid>TN_cdi_pascalfrancis_primary_17182697</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>17182697</sourcerecordid><originalsourceid>FETCH-LOGICAL-p219t-6eb1b9ae07e4021771330d3459778e493d9f61f053dc1b5cb83ef2fc24fc6fa3</originalsourceid><addsrcrecordid>eNpNUMtOwzAQNC-JUnriB3LhwCHg9TpxfCzlVamoPZRz5CTrKpA4kV1V4u8xKgf2MtI8VqNh7Ab4PXCuHgCyTIk8K4U6YVeYSY6gJeIpm0AOkCJKfcZmWhW_mtCca3nOJhy5SLWSeMlmIXzyeDGn8mLCPp4otDuXzLs9eWf27YFCYlyTbMjbwffG1ZRsvWkoXVsbksgly37sqCe3b90ued8sU5EMB4q8s61rH2P6ml1Y0wWa_eGUbV-et4u3dLV-XS7mq3QUoPdpThVU2hBXFNuCUoDIG5SZVqogqbHRNgfLM2xqqLK6KpCssLWQts6twSm7Pb4dTahNZ31s24Zy9G1v_HcJCgqRaxV9d0dfiJLbkS-rYfgKJfDyd9by36z4A6VfY9c</addsrcrecordid><sourcetype>Index Database</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Design Alternatives and Performance Trade-Offs for Implementing MPI-2 over InfiniBand</title><source>Springer Books</source><creator>Huang, Wei ; Santhanaraman, Gopalakrishnan ; Jin, Hyun-Wook ; Panda, Dhabaleswar K.</creator><contributor>Dongarra, Jack ; Kranzlmüller, Dieter ; Di Martino, Beniamino</contributor><creatorcontrib>Huang, Wei ; Santhanaraman, Gopalakrishnan ; Jin, Hyun-Wook ; Panda, Dhabaleswar K. ; Dongarra, Jack ; Kranzlmüller, Dieter ; Di Martino, Beniamino</creatorcontrib><description>MPICH2 provides a layered architecture to achieve both portability and performance. For implementations of MPI-2 over InfiniBand, it provides the flexibility for researchers at the RDMA channel, CH3 or ADI3 layer. In this paper we analyze the performance and complexity trade-offs associated with implementations at these layers. We describe our designs and implementations, as well as optimizations at each layer. To show the performance impacts of these design schemes and optimizations, we evaluate our implementations with different micro-benchmarks, HPCC and NAS test suite. Our experiments show that although the ADI3 layers adds complexity in implementation, the benefits achieved through optimizations justify moving to the ADI layer to extract the best performance.</description><identifier>ISSN: 0302-9743</identifier><identifier>ISBN: 9783540290094</identifier><identifier>ISBN: 3540290095</identifier><identifier>EISSN: 1611-3349</identifier><identifier>EISBN: 3540319433</identifier><identifier>EISBN: 9783540319436</identifier><identifier>DOI: 10.1007/11557265_27</identifier><language>eng</language><publisher>Berlin, Heidelberg: Springer Berlin Heidelberg</publisher><subject>ADI3 ; Applied sciences ; CH3 ; Computer science; control theory; systems ; Computer systems and distributed systems. User interface ; Exact sciences and technology ; InfiniBand ; MPI-2 ; RDMA channel ; Software</subject><ispartof>Lecture notes in computer science, 2005, p.191-199</ispartof><rights>Springer-Verlag Berlin Heidelberg 2005</rights><rights>2005 INIST-CNRS</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/11557265_27$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/11557265_27$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>309,310,779,780,784,789,790,793,4048,4049,27924,38254,41441,42510</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=17182697$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><contributor>Dongarra, Jack</contributor><contributor>Kranzlmüller, Dieter</contributor><contributor>Di Martino, Beniamino</contributor><creatorcontrib>Huang, Wei</creatorcontrib><creatorcontrib>Santhanaraman, Gopalakrishnan</creatorcontrib><creatorcontrib>Jin, Hyun-Wook</creatorcontrib><creatorcontrib>Panda, Dhabaleswar K.</creatorcontrib><title>Design Alternatives and Performance Trade-Offs for Implementing MPI-2 over InfiniBand</title><title>Lecture notes in computer science</title><description>MPICH2 provides a layered architecture to achieve both portability and performance. For implementations of MPI-2 over InfiniBand, it provides the flexibility for researchers at the RDMA channel, CH3 or ADI3 layer. In this paper we analyze the performance and complexity trade-offs associated with implementations at these layers. We describe our designs and implementations, as well as optimizations at each layer. To show the performance impacts of these design schemes and optimizations, we evaluate our implementations with different micro-benchmarks, HPCC and NAS test suite. Our experiments show that although the ADI3 layers adds complexity in implementation, the benefits achieved through optimizations justify moving to the ADI layer to extract the best performance.</description><subject>ADI3</subject><subject>Applied sciences</subject><subject>CH3</subject><subject>Computer science; control theory; systems</subject><subject>Computer systems and distributed systems. User interface</subject><subject>Exact sciences and technology</subject><subject>InfiniBand</subject><subject>MPI-2</subject><subject>RDMA channel</subject><subject>Software</subject><issn>0302-9743</issn><issn>1611-3349</issn><isbn>9783540290094</isbn><isbn>3540290095</isbn><isbn>3540319433</isbn><isbn>9783540319436</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2005</creationdate><recordtype>conference_proceeding</recordtype><recordid>eNpNUMtOwzAQNC-JUnriB3LhwCHg9TpxfCzlVamoPZRz5CTrKpA4kV1V4u8xKgf2MtI8VqNh7Ab4PXCuHgCyTIk8K4U6YVeYSY6gJeIpm0AOkCJKfcZmWhW_mtCca3nOJhy5SLWSeMlmIXzyeDGn8mLCPp4otDuXzLs9eWf27YFCYlyTbMjbwffG1ZRsvWkoXVsbksgly37sqCe3b90ued8sU5EMB4q8s61rH2P6ml1Y0wWa_eGUbV-et4u3dLV-XS7mq3QUoPdpThVU2hBXFNuCUoDIG5SZVqogqbHRNgfLM2xqqLK6KpCssLWQts6twSm7Pb4dTahNZ31s24Zy9G1v_HcJCgqRaxV9d0dfiJLbkS-rYfgKJfDyd9by36z4A6VfY9c</recordid><startdate>2005</startdate><enddate>2005</enddate><creator>Huang, Wei</creator><creator>Santhanaraman, Gopalakrishnan</creator><creator>Jin, Hyun-Wook</creator><creator>Panda, Dhabaleswar K.</creator><general>Springer Berlin Heidelberg</general><general>Springer</general><scope>IQODW</scope></search><sort><creationdate>2005</creationdate><title>Design Alternatives and Performance Trade-Offs for Implementing MPI-2 over InfiniBand</title><author>Huang, Wei ; Santhanaraman, Gopalakrishnan ; Jin, Hyun-Wook ; Panda, Dhabaleswar K.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-p219t-6eb1b9ae07e4021771330d3459778e493d9f61f053dc1b5cb83ef2fc24fc6fa3</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2005</creationdate><topic>ADI3</topic><topic>Applied sciences</topic><topic>CH3</topic><topic>Computer science; control theory; systems</topic><topic>Computer systems and distributed systems. User interface</topic><topic>Exact sciences and technology</topic><topic>InfiniBand</topic><topic>MPI-2</topic><topic>RDMA channel</topic><topic>Software</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Huang, Wei</creatorcontrib><creatorcontrib>Santhanaraman, Gopalakrishnan</creatorcontrib><creatorcontrib>Jin, Hyun-Wook</creatorcontrib><creatorcontrib>Panda, Dhabaleswar K.</creatorcontrib><collection>Pascal-Francis</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Huang, Wei</au><au>Santhanaraman, Gopalakrishnan</au><au>Jin, Hyun-Wook</au><au>Panda, Dhabaleswar K.</au><au>Dongarra, Jack</au><au>Kranzlmüller, Dieter</au><au>Di Martino, Beniamino</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Design Alternatives and Performance Trade-Offs for Implementing MPI-2 over InfiniBand</atitle><btitle>Lecture notes in computer science</btitle><date>2005</date><risdate>2005</risdate><spage>191</spage><epage>199</epage><pages>191-199</pages><issn>0302-9743</issn><eissn>1611-3349</eissn><isbn>9783540290094</isbn><isbn>3540290095</isbn><eisbn>3540319433</eisbn><eisbn>9783540319436</eisbn><abstract>MPICH2 provides a layered architecture to achieve both portability and performance. For implementations of MPI-2 over InfiniBand, it provides the flexibility for researchers at the RDMA channel, CH3 or ADI3 layer. In this paper we analyze the performance and complexity trade-offs associated with implementations at these layers. We describe our designs and implementations, as well as optimizations at each layer. To show the performance impacts of these design schemes and optimizations, we evaluate our implementations with different micro-benchmarks, HPCC and NAS test suite. Our experiments show that although the ADI3 layers adds complexity in implementation, the benefits achieved through optimizations justify moving to the ADI layer to extract the best performance.</abstract><cop>Berlin, Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/11557265_27</doi><tpages>9</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0302-9743 |
ispartof | Lecture notes in computer science, 2005, p.191-199 |
issn | 0302-9743 1611-3349 |
language | eng |
recordid | cdi_pascalfrancis_primary_17182697 |
source | Springer Books |
subjects | ADI3 Applied sciences CH3 Computer science control theory systems Computer systems and distributed systems. User interface Exact sciences and technology InfiniBand MPI-2 RDMA channel Software |
title | Design Alternatives and Performance Trade-Offs for Implementing MPI-2 over InfiniBand |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-10T10%3A50%3A19IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-pascalfrancis_sprin&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Design%20Alternatives%20and%20Performance%20Trade-Offs%20for%20Implementing%20MPI-2%20over%20InfiniBand&rft.btitle=Lecture%20notes%20in%20computer%20science&rft.au=Huang,%20Wei&rft.date=2005&rft.spage=191&rft.epage=199&rft.pages=191-199&rft.issn=0302-9743&rft.eissn=1611-3349&rft.isbn=9783540290094&rft.isbn_list=3540290095&rft_id=info:doi/10.1007/11557265_27&rft_dat=%3Cpascalfrancis_sprin%3E17182697%3C/pascalfrancis_sprin%3E%3Curl%3E%3C/url%3E&rft.eisbn=3540319433&rft.eisbn_list=9783540319436&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |