Optimizing XML processing for grid applications using an emulation framework

Chip multi-processors (CMPs), commonly referred to as multi-core processors, are being widely adopted for deployment as part of the grid infrastructure. This change in computer architecture requires corresponding design modifications in programming paradigms, including grid middleware tools, to harn...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Bhowmik, R., Gupta, C., Govindaraju, M., Aggarwal, A.
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 11
container_issue
container_start_page 1
container_title
container_volume
creator Bhowmik, R.
Gupta, C.
Govindaraju, M.
Aggarwal, A.
description Chip multi-processors (CMPs), commonly referred to as multi-core processors, are being widely adopted for deployment as part of the grid infrastructure. This change in computer architecture requires corresponding design modifications in programming paradigms, including grid middleware tools, to harness the opportunities presented by multi-core processors. Simple and naive implementations of grid middleware on multi-core systems can severely impact performance. This is because programming for CMPs requires special consideration for issues such as limitations of shared bus bandwidth, cache size and coherency, and communication between threads. The goal of developing an optimized multi-threaded grid middleware for emerging multi-core processors will be realized only if researchers and developers have access to an in-depth analysis of the impact of several low level microarchitectural parameters on performance. None of the current grid simulators and emulators provide feedback at the microarchitectural level, which is essential for such an analysis. In earlier work we presented our initial results on the design and implementation of such an emulation framework, Multi- core Grid (McGrid). In this paper we extend that work and present a performance study on the effect of cache coherency, scheduling of processing threads to take advantage of data available in the cache of each core, and read and write access patterns for shared data structures. We present the performance results, analysis, and recommendations based on experiments conducted using the McGrid framework for processing XML-based grid data and documents.
doi_str_mv 10.1109/IPDPS.2008.4536304
format Conference Proceeding
fullrecord <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_4536304</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>4536304</ieee_id><sourcerecordid>4536304</sourcerecordid><originalsourceid>FETCH-LOGICAL-i90t-cd1200c1668826ed993a57254f6702ea2d2c5eec5b1f4d836e5d8d52e2e4afa13</originalsourceid><addsrcrecordid>eNpFUNtKAzEUjKhgrf0BfckP7Jr7bh6lWi2stGAffCsxOSnRvZFsEf1611pwXoaZYQ6HQeiakpxSom-X6_v1S84IKXMhueJEnKBLKpgQVGmhT_8FJ2doQiUnGSOFvECzlN7JiLHGBJ-gatUPoQnfod3h1-cK97GzkNKv9F3EuxgcNn1fB2uG0LUJ7w-ZaTE0-_rgYR9NA59d_LhC597UCWZHnqLN4mEzf8qq1eNyfldlQZMhs46On1uqVFkyBU5rbmTBpPCqIAwMc8xKACvfqBeu5AqkK51kwEAYbyifopu_swEAtn0MjYlf2-MQ_Aexg1C3</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Optimizing XML processing for grid applications using an emulation framework</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Bhowmik, R. ; Gupta, C. ; Govindaraju, M. ; Aggarwal, A.</creator><creatorcontrib>Bhowmik, R. ; Gupta, C. ; Govindaraju, M. ; Aggarwal, A.</creatorcontrib><description>Chip multi-processors (CMPs), commonly referred to as multi-core processors, are being widely adopted for deployment as part of the grid infrastructure. This change in computer architecture requires corresponding design modifications in programming paradigms, including grid middleware tools, to harness the opportunities presented by multi-core processors. Simple and naive implementations of grid middleware on multi-core systems can severely impact performance. This is because programming for CMPs requires special consideration for issues such as limitations of shared bus bandwidth, cache size and coherency, and communication between threads. The goal of developing an optimized multi-threaded grid middleware for emerging multi-core processors will be realized only if researchers and developers have access to an in-depth analysis of the impact of several low level microarchitectural parameters on performance. None of the current grid simulators and emulators provide feedback at the microarchitectural level, which is essential for such an analysis. In earlier work we presented our initial results on the design and implementation of such an emulation framework, Multi- core Grid (McGrid). In this paper we extend that work and present a performance study on the effect of cache coherency, scheduling of processing threads to take advantage of data available in the cache of each core, and read and write access patterns for shared data structures. We present the performance results, analysis, and recommendations based on experiments conducted using the McGrid framework for processing XML-based grid data and documents.</description><identifier>ISSN: 1530-2075</identifier><identifier>ISBN: 1424416930</identifier><identifier>ISBN: 9781424416936</identifier><identifier>EISBN: 1424416949</identifier><identifier>EISBN: 9781424416943</identifier><identifier>DOI: 10.1109/IPDPS.2008.4536304</identifier><language>eng</language><publisher>IEEE</publisher><subject>Application software ; Bandwidth ; Computer architecture ; Emulation ; Grid Applications ; Microarchitecture ; Middleware ; Multi-core ; Multicore processing ; Performance analysis ; XML ; Yarn</subject><ispartof>2008 IEEE International Symposium on Parallel and Distributed Processing, 2008, p.1-11</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/4536304$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,2056,27916,54911</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/4536304$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Bhowmik, R.</creatorcontrib><creatorcontrib>Gupta, C.</creatorcontrib><creatorcontrib>Govindaraju, M.</creatorcontrib><creatorcontrib>Aggarwal, A.</creatorcontrib><title>Optimizing XML processing for grid applications using an emulation framework</title><title>2008 IEEE International Symposium on Parallel and Distributed Processing</title><addtitle>IPDPS</addtitle><description>Chip multi-processors (CMPs), commonly referred to as multi-core processors, are being widely adopted for deployment as part of the grid infrastructure. This change in computer architecture requires corresponding design modifications in programming paradigms, including grid middleware tools, to harness the opportunities presented by multi-core processors. Simple and naive implementations of grid middleware on multi-core systems can severely impact performance. This is because programming for CMPs requires special consideration for issues such as limitations of shared bus bandwidth, cache size and coherency, and communication between threads. The goal of developing an optimized multi-threaded grid middleware for emerging multi-core processors will be realized only if researchers and developers have access to an in-depth analysis of the impact of several low level microarchitectural parameters on performance. None of the current grid simulators and emulators provide feedback at the microarchitectural level, which is essential for such an analysis. In earlier work we presented our initial results on the design and implementation of such an emulation framework, Multi- core Grid (McGrid). In this paper we extend that work and present a performance study on the effect of cache coherency, scheduling of processing threads to take advantage of data available in the cache of each core, and read and write access patterns for shared data structures. We present the performance results, analysis, and recommendations based on experiments conducted using the McGrid framework for processing XML-based grid data and documents.</description><subject>Application software</subject><subject>Bandwidth</subject><subject>Computer architecture</subject><subject>Emulation</subject><subject>Grid Applications</subject><subject>Microarchitecture</subject><subject>Middleware</subject><subject>Multi-core</subject><subject>Multicore processing</subject><subject>Performance analysis</subject><subject>XML</subject><subject>Yarn</subject><issn>1530-2075</issn><isbn>1424416930</isbn><isbn>9781424416936</isbn><isbn>1424416949</isbn><isbn>9781424416943</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2008</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNpFUNtKAzEUjKhgrf0BfckP7Jr7bh6lWi2stGAffCsxOSnRvZFsEf1611pwXoaZYQ6HQeiakpxSom-X6_v1S84IKXMhueJEnKBLKpgQVGmhT_8FJ2doQiUnGSOFvECzlN7JiLHGBJ-gatUPoQnfod3h1-cK97GzkNKv9F3EuxgcNn1fB2uG0LUJ7w-ZaTE0-_rgYR9NA59d_LhC597UCWZHnqLN4mEzf8qq1eNyfldlQZMhs46On1uqVFkyBU5rbmTBpPCqIAwMc8xKACvfqBeu5AqkK51kwEAYbyifopu_swEAtn0MjYlf2-MQ_Aexg1C3</recordid><startdate>200804</startdate><enddate>200804</enddate><creator>Bhowmik, R.</creator><creator>Gupta, C.</creator><creator>Govindaraju, M.</creator><creator>Aggarwal, A.</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>200804</creationdate><title>Optimizing XML processing for grid applications using an emulation framework</title><author>Bhowmik, R. ; Gupta, C. ; Govindaraju, M. ; Aggarwal, A.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i90t-cd1200c1668826ed993a57254f6702ea2d2c5eec5b1f4d836e5d8d52e2e4afa13</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2008</creationdate><topic>Application software</topic><topic>Bandwidth</topic><topic>Computer architecture</topic><topic>Emulation</topic><topic>Grid Applications</topic><topic>Microarchitecture</topic><topic>Middleware</topic><topic>Multi-core</topic><topic>Multicore processing</topic><topic>Performance analysis</topic><topic>XML</topic><topic>Yarn</topic><toplevel>online_resources</toplevel><creatorcontrib>Bhowmik, R.</creatorcontrib><creatorcontrib>Gupta, C.</creatorcontrib><creatorcontrib>Govindaraju, M.</creatorcontrib><creatorcontrib>Aggarwal, A.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Xplore</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Bhowmik, R.</au><au>Gupta, C.</au><au>Govindaraju, M.</au><au>Aggarwal, A.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Optimizing XML processing for grid applications using an emulation framework</atitle><btitle>2008 IEEE International Symposium on Parallel and Distributed Processing</btitle><stitle>IPDPS</stitle><date>2008-04</date><risdate>2008</risdate><spage>1</spage><epage>11</epage><pages>1-11</pages><issn>1530-2075</issn><isbn>1424416930</isbn><isbn>9781424416936</isbn><eisbn>1424416949</eisbn><eisbn>9781424416943</eisbn><abstract>Chip multi-processors (CMPs), commonly referred to as multi-core processors, are being widely adopted for deployment as part of the grid infrastructure. This change in computer architecture requires corresponding design modifications in programming paradigms, including grid middleware tools, to harness the opportunities presented by multi-core processors. Simple and naive implementations of grid middleware on multi-core systems can severely impact performance. This is because programming for CMPs requires special consideration for issues such as limitations of shared bus bandwidth, cache size and coherency, and communication between threads. The goal of developing an optimized multi-threaded grid middleware for emerging multi-core processors will be realized only if researchers and developers have access to an in-depth analysis of the impact of several low level microarchitectural parameters on performance. None of the current grid simulators and emulators provide feedback at the microarchitectural level, which is essential for such an analysis. In earlier work we presented our initial results on the design and implementation of such an emulation framework, Multi- core Grid (McGrid). In this paper we extend that work and present a performance study on the effect of cache coherency, scheduling of processing threads to take advantage of data available in the cache of each core, and read and write access patterns for shared data structures. We present the performance results, analysis, and recommendations based on experiments conducted using the McGrid framework for processing XML-based grid data and documents.</abstract><pub>IEEE</pub><doi>10.1109/IPDPS.2008.4536304</doi><tpages>11</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1530-2075
ispartof 2008 IEEE International Symposium on Parallel and Distributed Processing, 2008, p.1-11
issn 1530-2075
language eng
recordid cdi_ieee_primary_4536304
source IEEE Electronic Library (IEL) Conference Proceedings
subjects Application software
Bandwidth
Computer architecture
Emulation
Grid Applications
Microarchitecture
Middleware
Multi-core
Multicore processing
Performance analysis
XML
Yarn
title Optimizing XML processing for grid applications using an emulation framework
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-14T20%3A11%3A48IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Optimizing%20XML%20processing%20for%20grid%20applications%20using%20an%20emulation%20framework&rft.btitle=2008%20IEEE%20International%20Symposium%20on%20Parallel%20and%20Distributed%20Processing&rft.au=Bhowmik,%20R.&rft.date=2008-04&rft.spage=1&rft.epage=11&rft.pages=1-11&rft.issn=1530-2075&rft.isbn=1424416930&rft.isbn_list=9781424416936&rft_id=info:doi/10.1109/IPDPS.2008.4536304&rft_dat=%3Cieee_6IE%3E4536304%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&rft.eisbn=1424416949&rft.eisbn_list=9781424416943&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=4536304&rfr_iscdi=true