Evaluation of OpenMP for the Cyclops Multithreaded Architecture

Multithreaded architectures have the potential of tolerating large memory and functional unit latencies and increase resource utilization. The Blue Gene/Cyclops architecture, being developed at the IBM T. J. Watson Research Center, is one such systems that offers massive intra-chip parallelism. Alth...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Almasi, George, Ayguadé, Eduard, Caşcaval, Călin, Castaños, José, Labarta, Jesús, Martínez, Francisco, Martorell, Xavier, Moreira, José
Format: Buchkapitel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 83
container_issue
container_start_page 69
container_title
container_volume 2716
creator Almasi, George
Ayguadé, Eduard
Caşcaval, Călin
Castaños, José
Labarta, Jesús
Martínez, Francisco
Martorell, Xavier
Moreira, José
description Multithreaded architectures have the potential of tolerating large memory and functional unit latencies and increase resource utilization. The Blue Gene/Cyclops architecture, being developed at the IBM T. J. Watson Research Center, is one such systems that offers massive intra-chip parallelism. Although the BG/C architecture was initially designed to execute specific applications, we believe that it can be effectively used on a broad range of parallel numerical applications. Programming such applications for this unconventional design requires a significant porting effort when using the basic built-in mechanisms for thread management and synchronization. In this paper, we describe the implementation of an OpenMP environment for parallelizing applications, currently under development at the CEPBA-IBM Research Institute, targeting BG/C. The environment is evaluated with a set of simple numerical kernels and a subset of the NAS OpenMP benchmarks. We identify issues that were not initially considered in the design of the BG/C architecture to support a programming model such as OpenMP. We also evaluate features currently offered by the BG/C architecture that should be considered in the implementation of an efficient OpenMP layer for massive intra-chip parallel architectures.
doi_str_mv 10.1007/3-540-45009-2_6
format Book Chapter
fullrecord <record><control><sourceid>proquest_csuc_</sourceid><recordid>TN_cdi_csuc_recercat_oai_recercat_cat_2072_251448</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>EBC6806070_83_78</sourcerecordid><originalsourceid>FETCH-LOGICAL-c349t-549eb76696f6c8b5f4d675b74085d0e6ef7956830bccd717b2dad9594c37f9543</originalsourceid><addsrcrecordid>eNpFUU1PHDEMDYVWbOmee50LxwHnOzlVaEULEggOrdSblclkulOGmWmSQeLfk91FqiXLsv2eZfsR8pXCBQXQl7yWAmohAWzNUB2RtdWGl9q-xD6QFVWU1pwLe0w-7xsguPx9QlbAgdVWC_6JnErGNQVjTsk6pb9QjIOk3K7It-sXNywu99NYTV31MIfx_rHqpljlbag2r36Y5lTdL0Pu8zYG14a2uop-2-fg8xLDF_Kxc0MK6_d4Rn59v_65uanvHn7cbq7ual9Wy-UKGxqtlFWd8qaRnWiVlo0WYGQLQYVOW6kMh8b7VlPdsNa1Vlrhue6sFPyM0MNcnxaPMfgQvcs4uf5_snMGmiGTVAhTOOcHzuySd0MX3ej7hHPsn118RSqVpRx4wdUHXCqt8U-I2EzTU0IKuBMBOZa_4v7jWEQoePY-N07_lpAyhh3BhzFHN_itm3OICZUBBRrQcCyavQHttYLc</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>book_chapter</recordtype><pqid>EBC6806070_83_78</pqid></control><display><type>book_chapter</type><title>Evaluation of OpenMP for the Cyclops Multithreaded Architecture</title><source>Springer Books</source><source>Recercat</source><creator>Almasi, George ; Ayguadé, Eduard ; Caşcaval, Călin ; Castaños, José ; Labarta, Jesús ; Martínez, Francisco ; Martorell, Xavier ; Moreira, José</creator><contributor>Voss, Michael J ; Voss, Michael J.</contributor><creatorcontrib>Almasi, George ; Ayguadé, Eduard ; Caşcaval, Călin ; Castaños, José ; Labarta, Jesús ; Martínez, Francisco ; Martorell, Xavier ; Moreira, José ; Voss, Michael J ; Voss, Michael J.</creatorcontrib><description>Multithreaded architectures have the potential of tolerating large memory and functional unit latencies and increase resource utilization. The Blue Gene/Cyclops architecture, being developed at the IBM T. J. Watson Research Center, is one such systems that offers massive intra-chip parallelism. Although the BG/C architecture was initially designed to execute specific applications, we believe that it can be effectively used on a broad range of parallel numerical applications. Programming such applications for this unconventional design requires a significant porting effort when using the basic built-in mechanisms for thread management and synchronization. In this paper, we describe the implementation of an OpenMP environment for parallelizing applications, currently under development at the CEPBA-IBM Research Institute, targeting BG/C. The environment is evaluated with a set of simple numerical kernels and a subset of the NAS OpenMP benchmarks. We identify issues that were not initially considered in the design of the BG/C architecture to support a programming model such as OpenMP. We also evaluate features currently offered by the BG/C architecture that should be considered in the implementation of an efficient OpenMP layer for massive intra-chip parallel architectures.</description><identifier>ISSN: 0302-9743</identifier><identifier>ISBN: 354040435X</identifier><identifier>ISBN: 9783540404354</identifier><identifier>EISSN: 1611-3349</identifier><identifier>EISBN: 9783540450092</identifier><identifier>EISBN: 3540450092</identifier><identifier>DOI: 10.1007/3-540-45009-2_6</identifier><identifier>OCLC: 52371088</identifier><identifier>LCCallNum: TK7885-7895</identifier><language>eng</language><publisher>Germany: Springer Berlin / Heidelberg</publisher><subject>Applied sciences ; Arquitectura de computadors ; Arquitectures paral·leles ; Artificial intelligence ; Computer science; control theory; systems ; Data Cache ; Exact sciences and technology ; Global Queue ; Hardware Thread ; Informàtica ; OpenMP ; Parallel programming (Computer science) ; Pattern recognition. Digital image processing. Computational geometry ; Programació en paral·lel (Informàtica) ; Software ; Software Thread ; Speech and sound recognition and synthesis. Linguistics ; Thread Creation ; Àrees temàtiques de la UPC</subject><ispartof>Lecture notes in computer science, 2003, Vol.2716, p.69-83</ispartof><rights>Springer-Verlag Berlin Heidelberg 2003</rights><rights>2004 INIST-CNRS</rights><rights>Attribution-NonCommercial-NoDerivs 3.0 Spain info:eu-repo/semantics/openAccess &lt;a href="http://creativecommons.org/licenses/by-nc-nd/3.0/es/"&gt;http://creativecommons.org/licenses/by-nc-nd/3.0/es/&lt;/a&gt;</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><relation>Lecture Notes in Computer Science</relation></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Uhttps://ebookcentral.proquest.com/covers/6806070-l.jpg</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/3-540-45009-2_6$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/3-540-45009-2_6$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>230,309,310,779,780,784,789,790,793,885,26974,27925,38255,41442,42511</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=15691303$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><contributor>Voss, Michael J</contributor><contributor>Voss, Michael J.</contributor><creatorcontrib>Almasi, George</creatorcontrib><creatorcontrib>Ayguadé, Eduard</creatorcontrib><creatorcontrib>Caşcaval, Călin</creatorcontrib><creatorcontrib>Castaños, José</creatorcontrib><creatorcontrib>Labarta, Jesús</creatorcontrib><creatorcontrib>Martínez, Francisco</creatorcontrib><creatorcontrib>Martorell, Xavier</creatorcontrib><creatorcontrib>Moreira, José</creatorcontrib><title>Evaluation of OpenMP for the Cyclops Multithreaded Architecture</title><title>Lecture notes in computer science</title><description>Multithreaded architectures have the potential of tolerating large memory and functional unit latencies and increase resource utilization. The Blue Gene/Cyclops architecture, being developed at the IBM T. J. Watson Research Center, is one such systems that offers massive intra-chip parallelism. Although the BG/C architecture was initially designed to execute specific applications, we believe that it can be effectively used on a broad range of parallel numerical applications. Programming such applications for this unconventional design requires a significant porting effort when using the basic built-in mechanisms for thread management and synchronization. In this paper, we describe the implementation of an OpenMP environment for parallelizing applications, currently under development at the CEPBA-IBM Research Institute, targeting BG/C. The environment is evaluated with a set of simple numerical kernels and a subset of the NAS OpenMP benchmarks. We identify issues that were not initially considered in the design of the BG/C architecture to support a programming model such as OpenMP. We also evaluate features currently offered by the BG/C architecture that should be considered in the implementation of an efficient OpenMP layer for massive intra-chip parallel architectures.</description><subject>Applied sciences</subject><subject>Arquitectura de computadors</subject><subject>Arquitectures paral·leles</subject><subject>Artificial intelligence</subject><subject>Computer science; control theory; systems</subject><subject>Data Cache</subject><subject>Exact sciences and technology</subject><subject>Global Queue</subject><subject>Hardware Thread</subject><subject>Informàtica</subject><subject>OpenMP</subject><subject>Parallel programming (Computer science)</subject><subject>Pattern recognition. Digital image processing. Computational geometry</subject><subject>Programació en paral·lel (Informàtica)</subject><subject>Software</subject><subject>Software Thread</subject><subject>Speech and sound recognition and synthesis. Linguistics</subject><subject>Thread Creation</subject><subject>Àrees temàtiques de la UPC</subject><issn>0302-9743</issn><issn>1611-3349</issn><isbn>354040435X</isbn><isbn>9783540404354</isbn><isbn>9783540450092</isbn><isbn>3540450092</isbn><fulltext>true</fulltext><rsrctype>book_chapter</rsrctype><creationdate>2003</creationdate><recordtype>book_chapter</recordtype><sourceid>XX2</sourceid><recordid>eNpFUU1PHDEMDYVWbOmee50LxwHnOzlVaEULEggOrdSblclkulOGmWmSQeLfk91FqiXLsv2eZfsR8pXCBQXQl7yWAmohAWzNUB2RtdWGl9q-xD6QFVWU1pwLe0w-7xsguPx9QlbAgdVWC_6JnErGNQVjTsk6pb9QjIOk3K7It-sXNywu99NYTV31MIfx_rHqpljlbag2r36Y5lTdL0Pu8zYG14a2uop-2-fg8xLDF_Kxc0MK6_d4Rn59v_65uanvHn7cbq7ual9Wy-UKGxqtlFWd8qaRnWiVlo0WYGQLQYVOW6kMh8b7VlPdsNa1Vlrhue6sFPyM0MNcnxaPMfgQvcs4uf5_snMGmiGTVAhTOOcHzuySd0MX3ej7hHPsn118RSqVpRx4wdUHXCqt8U-I2EzTU0IKuBMBOZa_4v7jWEQoePY-N07_lpAyhh3BhzFHN_itm3OICZUBBRrQcCyavQHttYLc</recordid><startdate>20030101</startdate><enddate>20030101</enddate><creator>Almasi, George</creator><creator>Ayguadé, Eduard</creator><creator>Caşcaval, Călin</creator><creator>Castaños, José</creator><creator>Labarta, Jesús</creator><creator>Martínez, Francisco</creator><creator>Martorell, Xavier</creator><creator>Moreira, José</creator><general>Springer Berlin / Heidelberg</general><general>Springer Berlin Heidelberg</general><general>Springer</general><scope>FFUUA</scope><scope>IQODW</scope><scope>XX2</scope></search><sort><creationdate>20030101</creationdate><title>Evaluation of OpenMP for the Cyclops Multithreaded Architecture</title><author>Almasi, George ; Ayguadé, Eduard ; Caşcaval, Călin ; Castaños, José ; Labarta, Jesús ; Martínez, Francisco ; Martorell, Xavier ; Moreira, José</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c349t-549eb76696f6c8b5f4d675b74085d0e6ef7956830bccd717b2dad9594c37f9543</frbrgroupid><rsrctype>book_chapters</rsrctype><prefilter>book_chapters</prefilter><language>eng</language><creationdate>2003</creationdate><topic>Applied sciences</topic><topic>Arquitectura de computadors</topic><topic>Arquitectures paral·leles</topic><topic>Artificial intelligence</topic><topic>Computer science; control theory; systems</topic><topic>Data Cache</topic><topic>Exact sciences and technology</topic><topic>Global Queue</topic><topic>Hardware Thread</topic><topic>Informàtica</topic><topic>OpenMP</topic><topic>Parallel programming (Computer science)</topic><topic>Pattern recognition. Digital image processing. Computational geometry</topic><topic>Programació en paral·lel (Informàtica)</topic><topic>Software</topic><topic>Software Thread</topic><topic>Speech and sound recognition and synthesis. Linguistics</topic><topic>Thread Creation</topic><topic>Àrees temàtiques de la UPC</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Almasi, George</creatorcontrib><creatorcontrib>Ayguadé, Eduard</creatorcontrib><creatorcontrib>Caşcaval, Călin</creatorcontrib><creatorcontrib>Castaños, José</creatorcontrib><creatorcontrib>Labarta, Jesús</creatorcontrib><creatorcontrib>Martínez, Francisco</creatorcontrib><creatorcontrib>Martorell, Xavier</creatorcontrib><creatorcontrib>Moreira, José</creatorcontrib><collection>ProQuest Ebook Central - Book Chapters - Demo use only</collection><collection>Pascal-Francis</collection><collection>Recercat</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Almasi, George</au><au>Ayguadé, Eduard</au><au>Caşcaval, Călin</au><au>Castaños, José</au><au>Labarta, Jesús</au><au>Martínez, Francisco</au><au>Martorell, Xavier</au><au>Moreira, José</au><au>Voss, Michael J</au><au>Voss, Michael J.</au><format>book</format><genre>bookitem</genre><ristype>CHAP</ristype><atitle>Evaluation of OpenMP for the Cyclops Multithreaded Architecture</atitle><btitle>Lecture notes in computer science</btitle><seriestitle>Lecture Notes in Computer Science</seriestitle><date>2003-01-01</date><risdate>2003</risdate><volume>2716</volume><spage>69</spage><epage>83</epage><pages>69-83</pages><issn>0302-9743</issn><eissn>1611-3349</eissn><isbn>354040435X</isbn><isbn>9783540404354</isbn><eisbn>9783540450092</eisbn><eisbn>3540450092</eisbn><abstract>Multithreaded architectures have the potential of tolerating large memory and functional unit latencies and increase resource utilization. The Blue Gene/Cyclops architecture, being developed at the IBM T. J. Watson Research Center, is one such systems that offers massive intra-chip parallelism. Although the BG/C architecture was initially designed to execute specific applications, we believe that it can be effectively used on a broad range of parallel numerical applications. Programming such applications for this unconventional design requires a significant porting effort when using the basic built-in mechanisms for thread management and synchronization. In this paper, we describe the implementation of an OpenMP environment for parallelizing applications, currently under development at the CEPBA-IBM Research Institute, targeting BG/C. The environment is evaluated with a set of simple numerical kernels and a subset of the NAS OpenMP benchmarks. We identify issues that were not initially considered in the design of the BG/C architecture to support a programming model such as OpenMP. We also evaluate features currently offered by the BG/C architecture that should be considered in the implementation of an efficient OpenMP layer for massive intra-chip parallel architectures.</abstract><cop>Germany</cop><pub>Springer Berlin / Heidelberg</pub><doi>10.1007/3-540-45009-2_6</doi><oclcid>52371088</oclcid><tpages>15</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0302-9743
ispartof Lecture notes in computer science, 2003, Vol.2716, p.69-83
issn 0302-9743
1611-3349
language eng
recordid cdi_csuc_recercat_oai_recercat_cat_2072_251448
source Springer Books; Recercat
subjects Applied sciences
Arquitectura de computadors
Arquitectures paral·leles
Artificial intelligence
Computer science
control theory
systems
Data Cache
Exact sciences and technology
Global Queue
Hardware Thread
Informàtica
OpenMP
Parallel programming (Computer science)
Pattern recognition. Digital image processing. Computational geometry
Programació en paral·lel (Informàtica)
Software
Software Thread
Speech and sound recognition and synthesis. Linguistics
Thread Creation
Àrees temàtiques de la UPC
title Evaluation of OpenMP for the Cyclops Multithreaded Architecture
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-26T07%3A00%3A50IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_csuc_&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=bookitem&rft.atitle=Evaluation%20of%20OpenMP%20for%20the%20Cyclops%20Multithreaded%20Architecture&rft.btitle=Lecture%20notes%20in%20computer%20science&rft.au=Almasi,%20George&rft.date=2003-01-01&rft.volume=2716&rft.spage=69&rft.epage=83&rft.pages=69-83&rft.issn=0302-9743&rft.eissn=1611-3349&rft.isbn=354040435X&rft.isbn_list=9783540404354&rft_id=info:doi/10.1007/3-540-45009-2_6&rft_dat=%3Cproquest_csuc_%3EEBC6806070_83_78%3C/proquest_csuc_%3E%3Curl%3E%3C/url%3E&rft.eisbn=9783540450092&rft.eisbn_list=3540450092&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=EBC6806070_83_78&rft_id=info:pmid/&rfr_iscdi=true