Evaluation of the Partitioned Global Address Space (PGAS) model for an inviscid Euler solver

In this paper we evaluate the performance of Unified Parallel C (which implements the partitioned global address space programming model) using a numerical method that is widely used in fluid dynamics. In order to evaluate the incremental approach to parallelization (which is possible with UPC) and...


Bibliographic details
Published in: arXiv.org 2016-11
Main authors: Prugger, Martina; Einkemmer, Lukas; Ostermann, Alexander
Format: Article
Language: eng
Subjects:
Online access: Full text
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Prugger, Martina
Einkemmer, Lukas
Ostermann, Alexander
description In this paper we evaluate the performance of Unified Parallel C (which implements the partitioned global address space programming model) using a numerical method that is widely used in fluid dynamics. In order to evaluate the incremental approach to parallelization (which is possible with UPC) and its performance characteristics, we implement different levels of optimization of the UPC code and compare it with an MPI parallelization on four different clusters of the Austrian HPC infrastructure (LEO3, LEO3E, VSC2, VSC3) and on an Intel Xeon Phi. We find that UPC is significantly easier to develop in compared to MPI and that the performance achieved is comparable to MPI in most situations. The obtained results show worse performance (on VSC2), competitive performance (on LEO3, LEO3E and VSC3), and superior performance (on the Intel Xeon Phi).
doi_str_mv 10.48550/arxiv.1601.03623
format Article
fullrecord <record><control><sourceid>proquest_arxiv</sourceid><recordid>TN_cdi_arxiv_primary_1601_03623</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2075392301</sourcerecordid><originalsourceid>FETCH-LOGICAL-a521-f869e38bdf52683a70177db7a3c5a920dab47a180d452ff2e171918a5eae7b5f3</originalsourceid><addsrcrecordid>eNotkE1LAzEYhIMgWGp_gCcDXvSwNR-bTfZYSl2FgoX2KCzvNgmmpJua7C767_vlaWB4GGYGoQdKprkSgrxC_HXDlBaETgkvGL9BI8Y5zVTO2B2apLQjhLBCMiH4CH0tBvA9dC60OFjcfRu8gti5s2E0rnxowOOZ1tGkhNcH2Br8vKpm6xe8D9p4bEPE0GLXDi5tncaL3puIU_CDiffo1oJPZvKvY7R5W2zm79nys_qYz5YZCEYzq4rScNVoK1ihOEhCpdSNBL4VUDKiocklUEV0Lpi1zFBJS6pAGDCyEZaP0eM19jK9PkS3h_hXny-oLxeciKcrcYjhpzepq3ehj-2pU82IFLxknFB-BFttXcI</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2075392301</pqid></control><display><type>article</type><title>Evaluation of the Partitioned Global Address Space (PGAS) model for an inviscid Euler solver</title><source>arXiv.org</source><source>Free E- Journals</source><creator>Prugger, Martina ; Einkemmer, Lukas ; Ostermann, Alexander</creator><creatorcontrib>Prugger, Martina ; Einkemmer, Lukas ; Ostermann, Alexander</creatorcontrib><description>In this paper we evaluate the performance of Unified Parallel C (which implements the partitioned global address space programming model) using a numerical method that is widely used in fluid dynamics. In order to evaluate the incremental approach to parallelization (which is possible with UPC) and its performance characteristics, we implement different levels of optimization of the UPC code and compare it with an MPI parallelization on four different clusters of the Austrian HPC infrastructure (LEO3, LEO3E, VSC2, VSC3) and on an Intel Xeon Phi. We find that UPC is significantly easier to develop in compared to MPI and that the performance achieved is comparable to MPI in most situations. 
The obtained results show worse performance (on VSC2), competitive performance (on LEO3, LEO3E and VSC3), and superior performance (on the Intel Xeon Phi).</description><identifier>EISSN: 2331-8422</identifier><identifier>DOI: 10.48550/arxiv.1601.03623</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Computational fluid dynamics ; Computer Science - Distributed, Parallel, and Cluster Computing ; Computer Science - Mathematical Software ; Euler solver ; Mathematical models ; Numerical methods ; Optimization ; Performance evaluation</subject><ispartof>arXiv.org, 2016-11</ispartof><rights>2016. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,778,782,883,27908</link.rule.ids><backlink>$$Uhttps://doi.org/10.1016/j.parco.2016.11.001$$DView published paper (Access to full text may be restricted)$$Hfree_for_read</backlink><backlink>$$Uhttps://doi.org/10.48550/arXiv.1601.03623$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Prugger, Martina</creatorcontrib><creatorcontrib>Einkemmer, Lukas</creatorcontrib><creatorcontrib>Ostermann, Alexander</creatorcontrib><title>Evaluation of the Partitioned Global Address Space (PGAS) model for an inviscid Euler solver</title><title>arXiv.org</title><description>In this paper we evaluate the performance of Unified Parallel C (which implements the partitioned global address space programming model) using a numerical method that is widely used 
in fluid dynamics. In order to evaluate the incremental approach to parallelization (which is possible with UPC) and its performance characteristics, we implement different levels of optimization of the UPC code and compare it with an MPI parallelization on four different clusters of the Austrian HPC infrastructure (LEO3, LEO3E, VSC2, VSC3) and on an Intel Xeon Phi. We find that UPC is significantly easier to develop in compared to MPI and that the performance achieved is comparable to MPI in most situations. The obtained results show worse performance (on VSC2), competitive performance (on LEO3, LEO3E and VSC3), and superior performance (on the Intel Xeon Phi).</description><subject>Computational fluid dynamics</subject><subject>Computer Science - Distributed, Parallel, and Cluster Computing</subject><subject>Computer Science - Mathematical Software</subject><subject>Euler solver</subject><subject>Mathematical models</subject><subject>Numerical methods</subject><subject>Optimization</subject><subject>Performance evaluation</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GOX</sourceid><recordid>eNotkE1LAzEYhIMgWGp_gCcDXvSwNR-bTfZYSl2FgoX2KCzvNgmmpJua7C767_vlaWB4GGYGoQdKprkSgrxC_HXDlBaETgkvGL9BI8Y5zVTO2B2apLQjhLBCMiH4CH0tBvA9dC60OFjcfRu8gti5s2E0rnxowOOZ1tGkhNcH2Br8vKpm6xe8D9p4bEPE0GLXDi5tncaL3puIU_CDiffo1oJPZvKvY7R5W2zm79nys_qYz5YZCEYzq4rScNVoK1ihOEhCpdSNBL4VUDKiocklUEV0Lpi1zFBJS6pAGDCyEZaP0eM19jK9PkS3h_hXny-oLxeciKcrcYjhpzepq3ehj-2pU82IFLxknFB-BFttXcI</recordid><startdate>20161112</startdate><enddate>20161112</enddate><creator>Prugger, Martina</creator><creator>Einkemmer, Lukas</creator><creator>Ostermann, Alexander</creator><general>Cornell University Library, 
arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20161112</creationdate><title>Evaluation of the Partitioned Global Address Space (PGAS) model for an inviscid Euler solver</title><author>Prugger, Martina ; Einkemmer, Lukas ; Ostermann, Alexander</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a521-f869e38bdf52683a70177db7a3c5a920dab47a180d452ff2e171918a5eae7b5f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Computational fluid dynamics</topic><topic>Computer Science - Distributed, Parallel, and Cluster Computing</topic><topic>Computer Science - Mathematical Software</topic><topic>Euler solver</topic><topic>Mathematical models</topic><topic>Numerical methods</topic><topic>Optimization</topic><topic>Performance evaluation</topic><toplevel>online_resources</toplevel><creatorcontrib>Prugger, Martina</creatorcontrib><creatorcontrib>Einkemmer, Lukas</creatorcontrib><creatorcontrib>Ostermann, Alexander</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech 
Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection><collection>arXiv Computer Science</collection><collection>arXiv.org</collection><jtitle>arXiv.org</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Prugger, Martina</au><au>Einkemmer, Lukas</au><au>Ostermann, Alexander</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Evaluation of the Partitioned Global Address Space (PGAS) model for an inviscid Euler solver</atitle><jtitle>arXiv.org</jtitle><date>2016-11-12</date><risdate>2016</risdate><eissn>2331-8422</eissn><abstract>In this paper we evaluate the performance of Unified Parallel C (which implements the partitioned global address space programming model) using a numerical method that is widely used in fluid dynamics. In order to evaluate the incremental approach to parallelization (which is possible with UPC) and its performance characteristics, we implement different levels of optimization of the UPC code and compare it with an MPI parallelization on four different clusters of the Austrian HPC infrastructure (LEO3, LEO3E, VSC2, VSC3) and on an Intel Xeon Phi. We find that UPC is significantly easier to develop in compared to MPI and that the performance achieved is comparable to MPI in most situations. 
The obtained results show worse performance (on VSC2), competitive performance (on LEO3, LEO3E and VSC3), and superior performance (on the Intel Xeon Phi).</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><doi>10.48550/arxiv.1601.03623</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2016-11
issn 2331-8422
language eng
recordid cdi_arxiv_primary_1601_03623
source arXiv.org; Free E- Journals
subjects Computational fluid dynamics
Computer Science - Distributed, Parallel, and Cluster Computing
Computer Science - Mathematical Software
Euler solver
Mathematical models
Numerical methods
Optimization
Performance evaluation
title Evaluation of the Partitioned Global Address Space (PGAS) model for an inviscid Euler solver
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-16T23%3A40%3A38IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_arxiv&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Evaluation%20of%20the%20Partitioned%20Global%20Address%20Space%20(PGAS)%20model%20for%20an%20inviscid%20Euler%20solver&rft.jtitle=arXiv.org&rft.au=Prugger,%20Martina&rft.date=2016-11-12&rft.eissn=2331-8422&rft_id=info:doi/10.48550/arxiv.1601.03623&rft_dat=%3Cproquest_arxiv%3E2075392301%3C/proquest_arxiv%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2075392301&rft_id=info:pmid/&rfr_iscdi=true