Scalable Cloning on Large-Scale GPU Platforms with Application to Time-Stepped Simulations on Grids
Cloning is a technique to efficiently simulate a tree of multiple what-if scenarios that are unraveled during the course of a base simulation. However, cloned execution is highly challenging to realize on large, distributed memory computing platforms, due to the dynamic nature of the computational l...
Gespeichert in:
Veröffentlicht in: | ACM transactions on modeling and computer simulation 2018-01, Vol.28 (1), p.11-26 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 26 |
---|---|
container_issue | 1 |
container_start_page | 11 |
container_title | ACM transactions on modeling and computer simulation |
container_volume | 28 |
creator | Yoginath, Srikanth B. Perumalla, Kalyan S. |
description | Cloning is a technique to efficiently simulate a tree of multiple what-if scenarios that are unraveled during the course of a base simulation. However, cloned execution is highly challenging to realize on large, distributed memory computing platforms, due to the dynamic nature of the computational load across clones, and due to the complex dependencies spanning the clone tree. We present the conceptual simulation framework, algorithmic foundations, and runtime interface of C
lone
X, a new system we designed for scalable simulation cloning. It efficiently and dynamically creates whole
logical
copies of a dynamic tree of simulations across a large parallel system without full
physical
duplication of computation and memory. The performance of a prototype implementation executed on up to 1,024 graphical processing units of a supercomputing system has been evaluated with three benchmarks—heat diffusion, forest fire, and disease propagation models—delivering a speed up of over two orders of magnitude compared to replicated runs. The results demonstrate a significantly faster and scalable way to execute many what-if scenario ensembles of large simulations via cloning using the C
lone
X interface. |
doi_str_mv | 10.1145/3158669 |
format | Article |
fullrecord | <record><control><sourceid>crossref_osti_</sourceid><recordid>TN_cdi_osti_scitechconnect_1424471</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_1145_3158669</sourcerecordid><originalsourceid>FETCH-LOGICAL-c285t-b662d60fb6a4b4695839c54667bd05a342b4c0bfbabfcba3495d11aafb83efd3</originalsourceid><addsrcrecordid>eNotkE1LAzEYhIMoWKv4F4IXT6t5Nx_dPZaiq1Cw0HpekmzSRrKbJYmI_96t7WlmmIc5DEL3QJ4AGH-mwCsh6gs0A86rAqDml5MnrC4oJXCNblL6IgQoKcsZ0lstvVTe4JUPgxv2OAx4LePeFMfG4GbziTdeZhtin_CPywe8HEfvtMxuQnPAO9dPcDbjaDq8df23_6_ScamJrku36MpKn8zdWedo9_qyW70V64_mfbVcF7qseC6UEGUniFVCMsVEzStaa86EWKiOcElZqZgmyiqprFZTrnkHIKVVFTW2o3P0cJoNKbs2aZeNPugwDEbnFljJ2AIm6PEE6RhSisa2Y3S9jL8tkPb4X3v-j_4BfJ9iXQ</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Scalable Cloning on Large-Scale GPU Platforms with Application to Time-Stepped Simulations on Grids</title><source>Access via ACM Digital Library</source><creator>Yoginath, Srikanth B. ; Perumalla, Kalyan S.</creator><creatorcontrib>Yoginath, Srikanth B. ; Perumalla, Kalyan S. ; Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF)</creatorcontrib><description>Cloning is a technique to efficiently simulate a tree of multiple what-if scenarios that are unraveled during the course of a base simulation. However, cloned execution is highly challenging to realize on large, distributed memory computing platforms, due to the dynamic nature of the computational load across clones, and due to the complex dependencies spanning the clone tree. We present the conceptual simulation framework, algorithmic foundations, and runtime interface of C
lone
X, a new system we designed for scalable simulation cloning. It efficiently and dynamically creates whole
logical
copies of a dynamic tree of simulations across a large parallel system without full
physical
duplication of computation and memory. The performance of a prototype implementation executed on up to 1,024 graphical processing units of a supercomputing system has been evaluated with three benchmarks—heat diffusion, forest fire, and disease propagation models—delivering a speed up of over two orders of magnitude compared to replicated runs. The results demonstrate a significantly faster and scalable way to execute many what-if scenario ensembles of large simulations via cloning using the C
lone
X interface.</description><identifier>ISSN: 1049-3301</identifier><identifier>EISSN: 1558-1195</identifier><identifier>DOI: 10.1145/3158669</identifier><language>eng</language><publisher>United States: Association for Computing Machinery</publisher><subject>algorithms ; CUDA ; design ; experimentation ; graphical processing units ; load balancing ; MATHEMATICS AND COMPUTING ; supercomputing ; time synchronization ; what-if decision tree</subject><ispartof>ACM transactions on modeling and computer simulation, 2018-01, Vol.28 (1), p.11-26</ispartof><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c285t-b662d60fb6a4b4695839c54667bd05a342b4c0bfbabfcba3495d11aafb83efd3</citedby><cites>FETCH-LOGICAL-c285t-b662d60fb6a4b4695839c54667bd05a342b4c0bfbabfcba3495d11aafb83efd3</cites><orcidid>0000-0002-7458-0832</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,780,784,885,27924,27925</link.rule.ids><backlink>$$Uhttps://www.osti.gov/servlets/purl/1424471$$D View this record in Osti.gov$$Hfree_for_read</backlink></links><search><creatorcontrib>Yoginath, Srikanth B.</creatorcontrib><creatorcontrib>Perumalla, Kalyan S.</creatorcontrib><creatorcontrib>Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF)</creatorcontrib><title>Scalable Cloning on Large-Scale GPU Platforms with Application to Time-Stepped Simulations on Grids</title><title>ACM transactions on modeling and computer simulation</title><description>Cloning is a technique to efficiently simulate a tree of multiple what-if scenarios that are unraveled during the course of a base simulation. However, cloned execution is highly challenging to realize on large, distributed memory computing platforms, due to the dynamic nature of the computational load across clones, and due to the complex dependencies spanning the clone tree. We present the conceptual simulation framework, algorithmic foundations, and runtime interface of C
lone
X, a new system we designed for scalable simulation cloning. It efficiently and dynamically creates whole
logical
copies of a dynamic tree of simulations across a large parallel system without full
physical
duplication of computation and memory. The performance of a prototype implementation executed on up to 1,024 graphical processing units of a supercomputing system has been evaluated with three benchmarks—heat diffusion, forest fire, and disease propagation models—delivering a speed up of over two orders of magnitude compared to replicated runs. The results demonstrate a significantly faster and scalable way to execute many what-if scenario ensembles of large simulations via cloning using the C
lone
X interface.</description><subject>algorithms</subject><subject>CUDA</subject><subject>design</subject><subject>experimentation</subject><subject>graphical processing units</subject><subject>load balancing</subject><subject>MATHEMATICS AND COMPUTING</subject><subject>supercomputing</subject><subject>time synchronization</subject><subject>what-if decision tree</subject><issn>1049-3301</issn><issn>1558-1195</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><recordid>eNotkE1LAzEYhIMoWKv4F4IXT6t5Nx_dPZaiq1Cw0HpekmzSRrKbJYmI_96t7WlmmIc5DEL3QJ4AGH-mwCsh6gs0A86rAqDml5MnrC4oJXCNblL6IgQoKcsZ0lstvVTe4JUPgxv2OAx4LePeFMfG4GbziTdeZhtin_CPywe8HEfvtMxuQnPAO9dPcDbjaDq8df23_6_ScamJrku36MpKn8zdWedo9_qyW70V64_mfbVcF7qseC6UEGUniFVCMsVEzStaa86EWKiOcElZqZgmyiqprFZTrnkHIKVVFTW2o3P0cJoNKbs2aZeNPugwDEbnFljJ2AIm6PEE6RhSisa2Y3S9jL8tkPb4X3v-j_4BfJ9iXQ</recordid><startdate>20180131</startdate><enddate>20180131</enddate><creator>Yoginath, Srikanth B.</creator><creator>Perumalla, Kalyan S.</creator><general>Association for Computing Machinery</general><scope>AAYXX</scope><scope>CITATION</scope><scope>OIOZB</scope><scope>OTOTI</scope><orcidid>https://orcid.org/0000-0002-7458-0832</orcidid></search><sort><creationdate>20180131</creationdate><title>Scalable Cloning on Large-Scale GPU Platforms with Application to Time-Stepped Simulations on Grids</title><author>Yoginath, Srikanth B. ; Perumalla, Kalyan S.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c285t-b662d60fb6a4b4695839c54667bd05a342b4c0bfbabfcba3495d11aafb83efd3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>algorithms</topic><topic>CUDA</topic><topic>design</topic><topic>experimentation</topic><topic>graphical processing units</topic><topic>load balancing</topic><topic>MATHEMATICS AND COMPUTING</topic><topic>supercomputing</topic><topic>time synchronization</topic><topic>what-if decision tree</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Yoginath, Srikanth B.</creatorcontrib><creatorcontrib>Perumalla, Kalyan S.</creatorcontrib><creatorcontrib>Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF)</creatorcontrib><collection>CrossRef</collection><collection>OSTI.GOV - Hybrid</collection><collection>OSTI.GOV</collection><jtitle>ACM transactions on modeling and computer simulation</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Yoginath, Srikanth B.</au><au>Perumalla, Kalyan S.</au><aucorp>Oak Ridge National Lab. (ORNL), Oak Ridge, TN (United States). Oak Ridge Leadership Computing Facility (OLCF)</aucorp><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Scalable Cloning on Large-Scale GPU Platforms with Application to Time-Stepped Simulations on Grids</atitle><jtitle>ACM transactions on modeling and computer simulation</jtitle><date>2018-01-31</date><risdate>2018</risdate><volume>28</volume><issue>1</issue><spage>11</spage><epage>26</epage><pages>11-26</pages><issn>1049-3301</issn><eissn>1558-1195</eissn><abstract>Cloning is a technique to efficiently simulate a tree of multiple what-if scenarios that are unraveled during the course of a base simulation. However, cloned execution is highly challenging to realize on large, distributed memory computing platforms, due to the dynamic nature of the computational load across clones, and due to the complex dependencies spanning the clone tree. We present the conceptual simulation framework, algorithmic foundations, and runtime interface of C
lone
X, a new system we designed for scalable simulation cloning. It efficiently and dynamically creates whole
logical
copies of a dynamic tree of simulations across a large parallel system without full
physical
duplication of computation and memory. The performance of a prototype implementation executed on up to 1,024 graphical processing units of a supercomputing system has been evaluated with three benchmarks—heat diffusion, forest fire, and disease propagation models—delivering a speed up of over two orders of magnitude compared to replicated runs. The results demonstrate a significantly faster and scalable way to execute many what-if scenario ensembles of large simulations via cloning using the C
lone
X interface.</abstract><cop>United States</cop><pub>Association for Computing Machinery</pub><doi>10.1145/3158669</doi><tpages>16</tpages><orcidid>https://orcid.org/0000-0002-7458-0832</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1049-3301 |
ispartof | ACM transactions on modeling and computer simulation, 2018-01, Vol.28 (1), p.11-26 |
issn | 1049-3301 1558-1195 |
language | eng |
recordid | cdi_osti_scitechconnect_1424471 |
source | Access via ACM Digital Library |
subjects | algorithms CUDA design experimentation graphical processing units load balancing MATHEMATICS AND COMPUTING supercomputing time synchronization what-if decision tree |
title | Scalable Cloning on Large-Scale GPU Platforms with Application to Time-Stepped Simulations on Grids |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-20T10%3A17%3A15IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref_osti_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Scalable%20Cloning%20on%20Large-Scale%20GPU%20Platforms%20with%20Application%20to%20Time-Stepped%20Simulations%20on%20Grids&rft.jtitle=ACM%20transactions%20on%20modeling%20and%20computer%20simulation&rft.au=Yoginath,%20Srikanth%20B.&rft.aucorp=Oak%20Ridge%20National%20Lab.%20(ORNL),%20Oak%20Ridge,%20TN%20(United%20States).%20Oak%20Ridge%20Leadership%20Computing%20Facility%20(OLCF)&rft.date=2018-01-31&rft.volume=28&rft.issue=1&rft.spage=11&rft.epage=26&rft.pages=11-26&rft.issn=1049-3301&rft.eissn=1558-1195&rft_id=info:doi/10.1145/3158669&rft_dat=%3Ccrossref_osti_%3E10_1145_3158669%3C/crossref_osti_%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |