New system software for parallel programming models on the Intel SCC many-core processor

SummarySince the beginning of the multicore era, parallel processing has become prevalent across the board. On a traditional multicore system, a single operating system manages all cores and schedules threads and processes among them, inherently supported by hardware‐implemented cache coherence prot...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Concurrency and computation 2015-06, Vol.27 (9), p.2235-2259
Hauptverfasser: Clauss, Carsten, Lankes, Stefan, Reble, Pablo, Bemmerl, Thomas
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 2259
container_issue 9
container_start_page 2235
container_title Concurrency and computation
container_volume 27
creator Clauss, Carsten
Lankes, Stefan
Reble, Pablo
Bemmerl, Thomas
description SummarySince the beginning of the multicore era, parallel processing has become prevalent across the board. On a traditional multicore system, a single operating system manages all cores and schedules threads and processes among them, inherently supported by hardware‐implemented cache coherence protocols. However, a further growth of the number of cores per system implies an increasing chip complexity, especially with respect to the cache coherence protocols. Therefore, a very attractive alternative for future many‐core systems is to waive the hardware‐based cache coherency and to introduce a software‐oriented message‐passing based architecture instead: a so‐called Cluster‐on‐Chip architecture. Intel's Single‐chip Cloud Computer (SCC), a many‐core research processor with 48 non‐coherent memory‐coupled cores, is a very recent example for such a cluster‐on‐chip architecture. The SCC can be configured to run one operating system instance per core by partitioning the shared main memory in a strict manner. However, it is also possible to access the shared main memory in an unsplit and concurrent manner, provided that either the caches are disabled or the cache coherency is then ensured by software. In this article, we detail our experiences gained while developing low‐level software for message‐passing and shared‐memory programming on the SCC. We present an SCC‐customized MPI library (called SCC‐MPICH) as well as a shared virtual memory system (called MetalSVM) for the SCC. In doing so, we evaluate the potential of both programming models and we show how these models can be improved especially with respect to the SCC's many‐core architecture. Copyright © 2013 John Wiley & Sons, Ltd.
doi_str_mv 10.1002/cpe.3033
format Article
fullrecord <record><control><sourceid>istex_cross</sourceid><recordid>TN_cdi_crossref_primary_10_1002_cpe_3033</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>ark_67375_WNG_ZZHMDNN0_J</sourcerecordid><originalsourceid>FETCH-LOGICAL-c3363-ecb1bcb9f986a523067b03f2d6be9222e8be2a3978cf8948c4af158ff55c5e153</originalsourceid><addsrcrecordid>eNp1kE1Lw0AQhhdRsFbBn7BHL6n70XwdJda2UqOgovSybLazNZpkw24g5t-7pVLw4GkG5pkX3gehS0omlBB2rVqYcML5ERrRkLOARHx6fNhZdIrOnPskhFLC6Qi959BjN7gOauyM7nppAWtjcSutrCqocGvN1sq6Lpstrs0GKodNg7sPwMum8_fnLMO1bIZAGf_qaQXOGXuOTrSsHFz8zjF6vZu9ZItg9ThfZjerQHEe8QBUQQtVpDpNIhkyTqK4IFyzTVRAyhiDpAAmeRonSifpNFFTqWmYaB2GKgTfaoyu9rnKGucsaNHaspZ2EJSInRHhjYidEY8Ge7QvKxj-5UT2NPvLl17P94GX9ktEMY9D8ZbPxXq9eLjNcyLu-Q8e0nIS</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>New system software for parallel programming models on the Intel SCC many-core processor</title><source>Wiley Online Library All Journals</source><creator>Clauss, Carsten ; Lankes, Stefan ; Reble, Pablo ; Bemmerl, Thomas</creator><creatorcontrib>Clauss, Carsten ; Lankes, Stefan ; Reble, Pablo ; Bemmerl, Thomas</creatorcontrib><description>SummarySince the beginning of the multicore era, parallel processing has become prevalent across the board. On a traditional multicore system, a single operating system manages all cores and schedules threads and processes among them, inherently supported by hardware‐implemented cache coherence protocols. However, a further growth of the number of cores per system implies an increasing chip complexity, especially with respect to the cache coherence protocols. Therefore, a very attractive alternative for future many‐core systems is to waive the hardware‐based cache coherency and to introduce a software‐oriented message‐passing based architecture instead: a so‐called Cluster‐on‐Chip architecture. Intel's Single‐chip Cloud Computer (SCC), a many‐core research processor with 48 non‐coherent memory‐coupled cores, is a very recent example for such a cluster‐on‐chip architecture. The SCC can be configured to run one operating system instance per core by partitioning the shared main memory in a strict manner. However, it is also possible to access the shared main memory in an unsplit and concurrent manner, provided that either the caches are disabled or the cache coherency is then ensured by software. In this article, we detail our experiences gained while developing low‐level software for message‐passing and shared‐memory programming on the SCC. We present an SCC‐customized MPI library (called SCC‐MPICH) as well as a shared virtual memory system (called MetalSVM) for the SCC. In doing so, we evaluate the potential of both programming models and we show how these models can be improved especially with respect to the SCC's many‐core architecture. Copyright © 2013 John Wiley &amp; Sons, Ltd.</description><identifier>ISSN: 1532-0626</identifier><identifier>EISSN: 1532-0634</identifier><identifier>DOI: 10.1002/cpe.3033</identifier><language>eng</language><publisher>Chichester, UK: John Wiley &amp; Sons, Ltd</publisher><subject>many-core ; message-passing ; MPI ; non-cache-coherent shared-memory ; SCC ; shared virtual memory</subject><ispartof>Concurrency and computation, 2015-06, Vol.27 (9), p.2235-2259</ispartof><rights>Copyright © 2013 John Wiley &amp; Sons, Ltd.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c3363-ecb1bcb9f986a523067b03f2d6be9222e8be2a3978cf8948c4af158ff55c5e153</citedby><cites>FETCH-LOGICAL-c3363-ecb1bcb9f986a523067b03f2d6be9222e8be2a3978cf8948c4af158ff55c5e153</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://onlinelibrary.wiley.com/doi/pdf/10.1002%2Fcpe.3033$$EPDF$$P50$$Gwiley$$H</linktopdf><linktohtml>$$Uhttps://onlinelibrary.wiley.com/doi/full/10.1002%2Fcpe.3033$$EHTML$$P50$$Gwiley$$H</linktohtml><link.rule.ids>314,780,784,1416,27923,27924,45573,45574</link.rule.ids></links><search><creatorcontrib>Clauss, Carsten</creatorcontrib><creatorcontrib>Lankes, Stefan</creatorcontrib><creatorcontrib>Reble, Pablo</creatorcontrib><creatorcontrib>Bemmerl, Thomas</creatorcontrib><title>New system software for parallel programming models on the Intel SCC many-core processor</title><title>Concurrency and computation</title><addtitle>Concurrency Computat.: Pract. Exper</addtitle><description>SummarySince the beginning of the multicore era, parallel processing has become prevalent across the board. On a traditional multicore system, a single operating system manages all cores and schedules threads and processes among them, inherently supported by hardware‐implemented cache coherence protocols. However, a further growth of the number of cores per system implies an increasing chip complexity, especially with respect to the cache coherence protocols. Therefore, a very attractive alternative for future many‐core systems is to waive the hardware‐based cache coherency and to introduce a software‐oriented message‐passing based architecture instead: a so‐called Cluster‐on‐Chip architecture. Intel's Single‐chip Cloud Computer (SCC), a many‐core research processor with 48 non‐coherent memory‐coupled cores, is a very recent example for such a cluster‐on‐chip architecture. The SCC can be configured to run one operating system instance per core by partitioning the shared main memory in a strict manner. However, it is also possible to access the shared main memory in an unsplit and concurrent manner, provided that either the caches are disabled or the cache coherency is then ensured by software. In this article, we detail our experiences gained while developing low‐level software for message‐passing and shared‐memory programming on the SCC. We present an SCC‐customized MPI library (called SCC‐MPICH) as well as a shared virtual memory system (called MetalSVM) for the SCC. In doing so, we evaluate the potential of both programming models and we show how these models can be improved especially with respect to the SCC's many‐core architecture. Copyright © 2013 John Wiley &amp; Sons, Ltd.</description><subject>many-core</subject><subject>message-passing</subject><subject>MPI</subject><subject>non-cache-coherent shared-memory</subject><subject>SCC</subject><subject>shared virtual memory</subject><issn>1532-0626</issn><issn>1532-0634</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2015</creationdate><recordtype>article</recordtype><recordid>eNp1kE1Lw0AQhhdRsFbBn7BHL6n70XwdJda2UqOgovSybLazNZpkw24g5t-7pVLw4GkG5pkX3gehS0omlBB2rVqYcML5ERrRkLOARHx6fNhZdIrOnPskhFLC6Qi959BjN7gOauyM7nppAWtjcSutrCqocGvN1sq6Lpstrs0GKodNg7sPwMum8_fnLMO1bIZAGf_qaQXOGXuOTrSsHFz8zjF6vZu9ZItg9ThfZjerQHEe8QBUQQtVpDpNIhkyTqK4IFyzTVRAyhiDpAAmeRonSifpNFFTqWmYaB2GKgTfaoyu9rnKGucsaNHaspZ2EJSInRHhjYidEY8Ge7QvKxj-5UT2NPvLl17P94GX9ktEMY9D8ZbPxXq9eLjNcyLu-Q8e0nIS</recordid><startdate>20150625</startdate><enddate>20150625</enddate><creator>Clauss, Carsten</creator><creator>Lankes, Stefan</creator><creator>Reble, Pablo</creator><creator>Bemmerl, Thomas</creator><general>John Wiley &amp; Sons, Ltd</general><scope>BSCLL</scope><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20150625</creationdate><title>New system software for parallel programming models on the Intel SCC many-core processor</title><author>Clauss, Carsten ; Lankes, Stefan ; Reble, Pablo ; Bemmerl, Thomas</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c3363-ecb1bcb9f986a523067b03f2d6be9222e8be2a3978cf8948c4af158ff55c5e153</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2015</creationdate><topic>many-core</topic><topic>message-passing</topic><topic>MPI</topic><topic>non-cache-coherent shared-memory</topic><topic>SCC</topic><topic>shared virtual memory</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Clauss, Carsten</creatorcontrib><creatorcontrib>Lankes, Stefan</creatorcontrib><creatorcontrib>Reble, Pablo</creatorcontrib><creatorcontrib>Bemmerl, Thomas</creatorcontrib><collection>Istex</collection><collection>CrossRef</collection><jtitle>Concurrency and computation</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Clauss, Carsten</au><au>Lankes, Stefan</au><au>Reble, Pablo</au><au>Bemmerl, Thomas</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>New system software for parallel programming models on the Intel SCC many-core processor</atitle><jtitle>Concurrency and computation</jtitle><addtitle>Concurrency Computat.: Pract. Exper</addtitle><date>2015-06-25</date><risdate>2015</risdate><volume>27</volume><issue>9</issue><spage>2235</spage><epage>2259</epage><pages>2235-2259</pages><issn>1532-0626</issn><eissn>1532-0634</eissn><abstract>SummarySince the beginning of the multicore era, parallel processing has become prevalent across the board. On a traditional multicore system, a single operating system manages all cores and schedules threads and processes among them, inherently supported by hardware‐implemented cache coherence protocols. However, a further growth of the number of cores per system implies an increasing chip complexity, especially with respect to the cache coherence protocols. Therefore, a very attractive alternative for future many‐core systems is to waive the hardware‐based cache coherency and to introduce a software‐oriented message‐passing based architecture instead: a so‐called Cluster‐on‐Chip architecture. Intel's Single‐chip Cloud Computer (SCC), a many‐core research processor with 48 non‐coherent memory‐coupled cores, is a very recent example for such a cluster‐on‐chip architecture. The SCC can be configured to run one operating system instance per core by partitioning the shared main memory in a strict manner. However, it is also possible to access the shared main memory in an unsplit and concurrent manner, provided that either the caches are disabled or the cache coherency is then ensured by software. In this article, we detail our experiences gained while developing low‐level software for message‐passing and shared‐memory programming on the SCC. We present an SCC‐customized MPI library (called SCC‐MPICH) as well as a shared virtual memory system (called MetalSVM) for the SCC. In doing so, we evaluate the potential of both programming models and we show how these models can be improved especially with respect to the SCC's many‐core architecture. Copyright © 2013 John Wiley &amp; Sons, Ltd.</abstract><cop>Chichester, UK</cop><pub>John Wiley &amp; Sons, Ltd</pub><doi>10.1002/cpe.3033</doi><tpages>25</tpages></addata></record>
fulltext fulltext
identifier ISSN: 1532-0626
ispartof Concurrency and computation, 2015-06, Vol.27 (9), p.2235-2259
issn 1532-0626
1532-0634
language eng
recordid cdi_crossref_primary_10_1002_cpe_3033
source Wiley Online Library All Journals
subjects many-core
message-passing
MPI
non-cache-coherent shared-memory
SCC
shared virtual memory
title New system software for parallel programming models on the Intel SCC many-core processor
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-12T12%3A15%3A38IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-istex_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=New%20system%20software%20for%20parallel%20programming%20models%20on%20the%20Intel%20SCC%20many-core%20processor&rft.jtitle=Concurrency%20and%20computation&rft.au=Clauss,%20Carsten&rft.date=2015-06-25&rft.volume=27&rft.issue=9&rft.spage=2235&rft.epage=2259&rft.pages=2235-2259&rft.issn=1532-0626&rft.eissn=1532-0634&rft_id=info:doi/10.1002/cpe.3033&rft_dat=%3Cistex_cross%3Eark_67375_WNG_ZZHMDNN0_J%3C/istex_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true