Specializing Coherence, Consistency, and Push/Pull for GPU Graph Analytics

This work provides the first study to explore the interaction of update propagation with and without fine-grained synchronization (push vs. pull), emerging coherence protocols (GPU vs. DeNovo coherence), and software-centric consistency models (DRF0, DRF1, and DRFrlx) for graph workloads on emerging...
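
The push vs. pull distinction named in the summary above can be illustrated with a small sketch. This is not the paper's code: the function names `push_step` and `pull_step`, the adjacency-dict graph, and the PageRank-style "share" update are assumptions chosen for illustration. In push, each source vertex writes into its neighbors' entries, which on a GPU would typically require fine-grained synchronization such as atomic updates. In pull, each vertex reads its in-neighbors and writes only its own entry.

```python
from collections import defaultdict


def push_step(out_edges, values):
    """Push: every source vertex scatters an equal share to its out-neighbors.

    Several sources may update the same destination, so a parallel version
    would need an atomic add (fine-grained synchronization) at the marked line.
    """
    nxt = defaultdict(float)
    for src, dsts in out_edges.items():
        if not dsts:
            continue
        share = values[src] / len(dsts)
        for dst in dsts:
            nxt[dst] += share  # would be an atomic update when run in parallel
    return {v: nxt[v] for v in values}


def pull_step(out_edges, values):
    """Pull: every vertex gathers shares from its in-neighbors.

    Each vertex writes only its own entry, so no atomics are needed,
    at the cost of reading neighbor data that other threads own.
    """
    in_edges = defaultdict(list)
    for src, dsts in out_edges.items():
        for dst in dsts:
            in_edges[dst].append(src)
    return {
        v: sum(values[s] / len(out_edges[s]) for s in in_edges[v])
        for v in values
    }


# Hypothetical 3-vertex graph: 0 -> 1, 0 -> 2, 1 -> 2, 2 -> 0
graph = {0: [1, 2], 1: [2], 2: [0]}
start = {0: 1.0, 1: 1.0, 2: 1.0}
pushed = push_step(graph, start)
pulled = pull_step(graph, start)
```

Both variants compute the same result on this input; they differ in where the writes land, which is why the choice interacts with the coherence protocol and consistency model the hardware supports.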

Bibliographic details
Main authors:	Salvador, Giordano; Darvin, Wesley H; Huzaifa, Muhammad; Alsop, Johnathan; Sinclair, Matthew D; Adve, Sarita V
Format:	Article
Language:	eng
Subjects:	Computer Science - Distributed, Parallel, and Cluster Computing
Online access:	Order full text

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Salvador, Giordano ; Darvin, Wesley H ; Huzaifa, Muhammad ; Alsop, Johnathan ; Sinclair, Matthew D ; Adve, Sarita V
description	This work provides the first study to explore the interaction of update propagation with and without fine-grained synchronization (push vs. pull), emerging coherence protocols (GPU vs. DeNovo coherence), and software-centric consistency models (DRF0, DRF1, and DRFrlx) for graph workloads on emerging integrated GPU-CPU systems with native unified shared memory. We study 6 graph applications with 6 graph inputs for a total of 36 workloads running on 12 system (hardware+software) configurations reflecting the above design space of update propagation, coherence, and memory consistency. We make three key contributions. First, we show that there is no single best system configuration for all workloads, motivating systems with flexible coherence and consistency support. Second, we develop a model to accurately predict the best system configuration -- this model can be used by software designers to decide on push vs. pull and the consistency model and by flexible hardware to invoke the appropriate coherence and consistency configuration for the given workload. Third, we show that the design dimensions explored here are inter-dependent, reinforcing the need for software-hardware co-design in the above design dimensions. For example, software designers deciding on push vs. pull must consider the consistency model supported by hardware -- in some cases, push may be better if hardware supports DRFrlx while pull may be better if hardware does not support DRFrlx.
doi_str_mv	10.48550/arxiv.2002.10245
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2002_10245</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2002_10245</sourcerecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Specializing Coherence, Consistency, and Push/Pull for GPU Graph Analytics</title><source>arXiv.org</source><creator>Salvador, Giordano ; Darvin, Wesley H ; Huzaifa, Muhammad ; Alsop, Johnathan ; Sinclair, Matthew D ; Adve, Sarita V</creator><identifier>DOI: 10.48550/arxiv.2002.10245</identifier><language>eng</language><subject>Computer Science - Distributed, Parallel, and Cluster Computing</subject><creationdate>2020-02</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa></display><links><linktorsrc>$$Uhttps://arxiv.org/abs/2002.10245$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2002.10245$$DView paper in arXiv$$Hfree_for_read</backlink></links><facets><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><format>journal</format><genre>article</genre><ristype>JOUR</ristype><date>2020-02-19</date><doi>10.48550/arxiv.2002.10245</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.2002.10245
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_2002_10245
source	arXiv.org
subjects	Computer Science - Distributed, Parallel, and Cluster Computing
title	Specializing Coherence, Consistency, and Push/Pull for GPU Graph Analytics
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-20T21%3A01%3A18IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Specializing%20Coherence,%20Consistency,%20and%20Push/Pull%20for%20GPU%20Graph%20Analytics&rft.au=Salvador,%20Giordano&rft.date=2020-02-19&rft_id=info:doi/10.48550/arxiv.2002.10245&rft_dat=%3Carxiv_GOX%3E2002_10245%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true