ConnectIt: a framework for static and incremental parallel graph connectivity algorithms
Connected components is a fundamental kernel in graph applications. The fastest existing multicore algorithms for solving graph connectivity are based on some form of edge sampling and/or linking and compressing trees. However, many combinations of these design choices have been left unexplored. In...
Gespeichert in:
Veröffentlicht in: | Proceedings of the VLDB Endowment 2020-12, Vol.14 (4), p.653-667 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 667 |
---|---|
container_issue | 4 |
container_start_page | 653 |
container_title | Proceedings of the VLDB Endowment |
container_volume | 14 |
creator | Dhulipala, Laxman Hong, Changwan Shun, Julian |
description | Connected components is a fundamental kernel in graph applications. The fastest existing multicore algorithms for solving graph connectivity are based on some form of edge sampling and/or linking and compressing trees. However, many combinations of these design choices have been left unexplored. In this paper, we design the ConnectIt framework, which provides different sampling strategies as well as various tree linking and compression schemes. ConnectIt enables us to obtain several hundred new variants of connectivity algorithms, most of which extend to computing spanning forest. In addition to static graphs, we also extend ConnectIt to support mixes of insertions and connectivity queries in the concurrent setting.
We present an experimental evaluation of ConnectIt on a 72-core machine, which we believe is the most comprehensive evaluation of parallel connectivity algorithms to date. Compared to a collection of state-of-the-art static multicore algorithms, we obtain an average speedup of 12.4x (2.36x average speedup over the fastest existing implementation for each graph). Using ConnectIt, we are able to compute connectivity on the largest publicly-available graph (with over 3.5 billion vertices and 128 billion edges) in under 10 seconds using a 72-core machine, providing a 3.1x speedup over the fastest existing connectivity result for this graph, in any computational setting. For our incremental algorithms, we show that our algorithms can ingest graph updates at up to several billion edges per second. To guide the user in selecting the best variants in ConnectIt for different situations, we provide a detailed analysis of the different strategies. Finally, we show how the techniques in ConnectIt can be used to speed up two important graph applications: approximate minimum spanning forest and SCAN clustering. |
doi_str_mv | 10.14778/3436905.3436923 |
format | Article |
fullrecord | <record><control><sourceid>crossref_osti_</sourceid><recordid>TN_cdi_osti_scitechconnect_1852729</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>10_14778_3436905_3436923</sourcerecordid><originalsourceid>FETCH-LOGICAL-c138t-d20454c458e1c510026942302332b0e581c1ef08701de5ff371d07b1640aa3193</originalsourceid><addsrcrecordid>eNpNj7tOw0AURFcIJEygp6Z3uI99uUQWj0iR0iT1yrneFUYQI-82_D0idkF1phiN5ih1j7BG7Zx_ZM22AbM-k_hCVYQGag-Nu_yXr9VNzh8A1lv0lara8XSKUjblVl2l7jPHu4UrdXh53rdv9Xb3ummftrUg-1L3BNpo0cZHFIMAZBtNDMRMR4jGo2BM4B1gH01K7LAHd0SroesYG16ph3l3zGUIWYYS5V3mFwG9IUd_JZhLMo05TzGF72n46qafgBDOvmHxDYsv_wJZ7UOB</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>ConnectIt: a framework for static and incremental parallel graph connectivity algorithms</title><source>ACM Digital Library Complete</source><creator>Dhulipala, Laxman ; Hong, Changwan ; Shun, Julian</creator><creatorcontrib>Dhulipala, Laxman ; Hong, Changwan ; Shun, Julian ; Massachusetts Inst. of Technology (MIT), Cambridge, MA (United States)</creatorcontrib><description>Connected components is a fundamental kernel in graph applications. The fastest existing multicore algorithms for solving graph connectivity are based on some form of edge sampling and/or linking and compressing trees. However, many combinations of these design choices have been left unexplored. In this paper, we design the ConnectIt framework, which provides different sampling strategies as well as various tree linking and compression schemes. ConnectIt enables us to obtain several hundred new variants of connectivity algorithms, most of which extend to computing spanning forest. In addition to static graphs, we also extend ConnectIt to support mixes of insertions and connectivity queries in the concurrent setting.
We present an experimental evaluation of ConnectIt on a 72-core machine, which we believe is the most comprehensive evaluation of parallel connectivity algorithms to date. Compared to a collection of state-of-the-art static multicore algorithms, we obtain an average speedup of 12.4x (2.36x average speedup over the fastest existing implementation for each graph). Using ConnectIt, we are able to compute connectivity on the largest publicly-available graph (with over 3.5 billion vertices and 128 billion edges) in under 10 seconds using a 72-core machine, providing a 3.1x speedup over the fastest existing connectivity result for this graph, in any computational setting. For our incremental algorithms, we show that our algorithms can ingest graph updates at up to several billion edges per second. To guide the user in selecting the best variants in ConnectIt for different situations, we provide a detailed analysis of the different strategies. Finally, we show how the techniques in ConnectIt can be used to speed up two important graph applications: approximate minimum spanning forest and SCAN clustering.</description><identifier>ISSN: 2150-8097</identifier><identifier>EISSN: 2150-8097</identifier><identifier>DOI: 10.14778/3436905.3436923</identifier><language>eng</language><publisher>United States: Association for Computing Machinery (ACM)</publisher><subject>Computer Science</subject><ispartof>Proceedings of the VLDB Endowment, 2020-12, Vol.14 (4), p.653-667</ispartof><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c138t-d20454c458e1c510026942302332b0e581c1ef08701de5ff371d07b1640aa3193</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,776,780,881,27901,27902</link.rule.ids><backlink>$$Uhttps://www.osti.gov/biblio/1852729$$D View this record in Osti.gov$$Hfree_for_read</backlink></links><search><creatorcontrib>Dhulipala, Laxman</creatorcontrib><creatorcontrib>Hong, Changwan</creatorcontrib><creatorcontrib>Shun, Julian</creatorcontrib><creatorcontrib>Massachusetts Inst. of Technology (MIT), Cambridge, MA (United States)</creatorcontrib><title>ConnectIt: a framework for static and incremental parallel graph connectivity algorithms</title><title>Proceedings of the VLDB Endowment</title><description>Connected components is a fundamental kernel in graph applications. The fastest existing multicore algorithms for solving graph connectivity are based on some form of edge sampling and/or linking and compressing trees. However, many combinations of these design choices have been left unexplored. In this paper, we design the ConnectIt framework, which provides different sampling strategies as well as various tree linking and compression schemes. ConnectIt enables us to obtain several hundred new variants of connectivity algorithms, most of which extend to computing spanning forest. In addition to static graphs, we also extend ConnectIt to support mixes of insertions and connectivity queries in the concurrent setting.
We present an experimental evaluation of ConnectIt on a 72-core machine, which we believe is the most comprehensive evaluation of parallel connectivity algorithms to date. Compared to a collection of state-of-the-art static multicore algorithms, we obtain an average speedup of 12.4x (2.36x average speedup over the fastest existing implementation for each graph). Using ConnectIt, we are able to compute connectivity on the largest publicly-available graph (with over 3.5 billion vertices and 128 billion edges) in under 10 seconds using a 72-core machine, providing a 3.1x speedup over the fastest existing connectivity result for this graph, in any computational setting. For our incremental algorithms, we show that our algorithms can ingest graph updates at up to several billion edges per second. To guide the user in selecting the best variants in ConnectIt for different situations, we provide a detailed analysis of the different strategies. Finally, we show how the techniques in ConnectIt can be used to speed up two important graph applications: approximate minimum spanning forest and SCAN clustering.</description><subject>Computer Science</subject><issn>2150-8097</issn><issn>2150-8097</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><recordid>eNpNj7tOw0AURFcIJEygp6Z3uI99uUQWj0iR0iT1yrneFUYQI-82_D0idkF1phiN5ih1j7BG7Zx_ZM22AbM-k_hCVYQGag-Nu_yXr9VNzh8A1lv0lara8XSKUjblVl2l7jPHu4UrdXh53rdv9Xb3ummftrUg-1L3BNpo0cZHFIMAZBtNDMRMR4jGo2BM4B1gH01K7LAHd0SroesYG16ph3l3zGUIWYYS5V3mFwG9IUd_JZhLMo05TzGF72n46qafgBDOvmHxDYsv_wJZ7UOB</recordid><startdate>20201201</startdate><enddate>20201201</enddate><creator>Dhulipala, Laxman</creator><creator>Hong, Changwan</creator><creator>Shun, Julian</creator><general>Association for Computing Machinery (ACM)</general><scope>AAYXX</scope><scope>CITATION</scope><scope>OTOTI</scope></search><sort><creationdate>20201201</creationdate><title>ConnectIt</title><author>Dhulipala, Laxman ; Hong, Changwan ; Shun, Julian</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c138t-d20454c458e1c510026942302332b0e581c1ef08701de5ff371d07b1640aa3193</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>Computer Science</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Dhulipala, Laxman</creatorcontrib><creatorcontrib>Hong, Changwan</creatorcontrib><creatorcontrib>Shun, Julian</creatorcontrib><creatorcontrib>Massachusetts Inst. of Technology (MIT), Cambridge, MA (United States)</creatorcontrib><collection>CrossRef</collection><collection>OSTI.GOV</collection><jtitle>Proceedings of the VLDB Endowment</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Dhulipala, Laxman</au><au>Hong, Changwan</au><au>Shun, Julian</au><aucorp>Massachusetts Inst. of Technology (MIT), Cambridge, MA (United States)</aucorp><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>ConnectIt: a framework for static and incremental parallel graph connectivity algorithms</atitle><jtitle>Proceedings of the VLDB Endowment</jtitle><date>2020-12-01</date><risdate>2020</risdate><volume>14</volume><issue>4</issue><spage>653</spage><epage>667</epage><pages>653-667</pages><issn>2150-8097</issn><eissn>2150-8097</eissn><abstract>Connected components is a fundamental kernel in graph applications. The fastest existing multicore algorithms for solving graph connectivity are based on some form of edge sampling and/or linking and compressing trees. However, many combinations of these design choices have been left unexplored. In this paper, we design the ConnectIt framework, which provides different sampling strategies as well as various tree linking and compression schemes. ConnectIt enables us to obtain several hundred new variants of connectivity algorithms, most of which extend to computing spanning forest. In addition to static graphs, we also extend ConnectIt to support mixes of insertions and connectivity queries in the concurrent setting.
We present an experimental evaluation of ConnectIt on a 72-core machine, which we believe is the most comprehensive evaluation of parallel connectivity algorithms to date. Compared to a collection of state-of-the-art static multicore algorithms, we obtain an average speedup of 12.4x (2.36x average speedup over the fastest existing implementation for each graph). Using ConnectIt, we are able to compute connectivity on the largest publicly-available graph (with over 3.5 billion vertices and 128 billion edges) in under 10 seconds using a 72-core machine, providing a 3.1x speedup over the fastest existing connectivity result for this graph, in any computational setting. For our incremental algorithms, we show that our algorithms can ingest graph updates at up to several billion edges per second. To guide the user in selecting the best variants in ConnectIt for different situations, we provide a detailed analysis of the different strategies. Finally, we show how the techniques in ConnectIt can be used to speed up two important graph applications: approximate minimum spanning forest and SCAN clustering.</abstract><cop>United States</cop><pub>Association for Computing Machinery (ACM)</pub><doi>10.14778/3436905.3436923</doi><tpages>15</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2150-8097 |
ispartof | Proceedings of the VLDB Endowment, 2020-12, Vol.14 (4), p.653-667 |
issn | 2150-8097 2150-8097 |
language | eng |
recordid | cdi_osti_scitechconnect_1852729 |
source | ACM Digital Library Complete |
subjects | Computer Science |
title | ConnectIt: a framework for static and incremental parallel graph connectivity algorithms |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-28T17%3A39%3A29IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-crossref_osti_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=ConnectIt:%20a%20framework%20for%20static%20and%20incremental%20parallel%20graph%20connectivity%20algorithms&rft.jtitle=Proceedings%20of%20the%20VLDB%20Endowment&rft.au=Dhulipala,%20Laxman&rft.aucorp=Massachusetts%20Inst.%20of%20Technology%20(MIT),%20Cambridge,%20MA%20(United%20States)&rft.date=2020-12-01&rft.volume=14&rft.issue=4&rft.spage=653&rft.epage=667&rft.pages=653-667&rft.issn=2150-8097&rft.eissn=2150-8097&rft_id=info:doi/10.14778/3436905.3436923&rft_dat=%3Ccrossref_osti_%3E10_14778_3436905_3436923%3C/crossref_osti_%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |