Selective Parallel Loading of Large-Scale Compressed Graphs with ParaGrapher

Comprehensive evaluation is one of the basis of experimental science. In High-Performance Graph Processing, a thorough evaluation of contributions becomes more achievable by supporting common input formats over different frameworks. However, each framework creates its specific format, which may not...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:arXiv.org 2024-06
Hauptverfasser: Esfahani, Mohsen Koohi, D'Antonio, Marco, Syed, Ibtisam Tauhidi, Thai Son Mai, Vandierendonck, Hans
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title arXiv.org
container_volume
creator Esfahani, Mohsen Koohi
D'Antonio, Marco
Syed, Ibtisam Tauhidi
Thai Son Mai
Vandierendonck, Hans
description Comprehensive evaluation is one of the basis of experimental science. In High-Performance Graph Processing, a thorough evaluation of contributions becomes more achievable by supporting common input formats over different frameworks. However, each framework creates its specific format, which may not support reading large-scale real-world graph datasets. This shows a demand for high-performance libraries capable of loading graphs to (i) accelerate designing new graph algorithms, (ii) to evaluate the contributions on a wide range of graph algorithms, and (iii) to facilitate easy and fast comparison over different graph frameworks. To that end, we present ParaGrapher, a high-performance API and library for loading large-scale and compressed graphs. ParaGrapher supports different types of requests for accessing graphs in shared- and distributed-memory and out-of-core graph processing. We explain the design of ParaGrapher and present a performance model of graph decompression, which is used for evaluation of ParaGrapher over three storage types. Our evaluation shows that by decompressing compressed graphs in WebGraph format, ParaGrapher delivers up to 3.2 times speedup in loading and up to 5.2 times speedup in end-to-end execution in comparison to the binary and textual formats. ParaGrapher is available online on https://blogs.qub.ac.uk/DIPSA/ParaGrapher/.
format Article
fullrecord <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_3049794020</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3049794020</sourcerecordid><originalsourceid>FETCH-proquest_journals_30497940203</originalsourceid><addsrcrecordid>eNqNi80KgkAURocgSMp3uNBamGY0cy39LFwEto-LXv-YHJur9fqF9ACtPg7nfAvhKa13wSFUaiV85k5KqfaxiiLtiSwnQ8XYvgiu6NAYMpBZLNu-BltBhq6mIC_QEKT2MThiphLODoeG4d2OzXybmdxGLCs0TP5v12J7Ot7SSzA4-5yIx3tnJ9d_1V3LMImTUCqp_6s-EXc9BA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3049794020</pqid></control><display><type>article</type><title>Selective Parallel Loading of Large-Scale Compressed Graphs with ParaGrapher</title><source>Free E- Journals</source><creator>Esfahani, Mohsen Koohi ; D'Antonio, Marco ; Syed, Ibtisam Tauhidi ; Thai Son Mai ; Vandierendonck, Hans</creator><creatorcontrib>Esfahani, Mohsen Koohi ; D'Antonio, Marco ; Syed, Ibtisam Tauhidi ; Thai Son Mai ; Vandierendonck, Hans</creatorcontrib><description>Comprehensive evaluation is one of the basis of experimental science. In High-Performance Graph Processing, a thorough evaluation of contributions becomes more achievable by supporting common input formats over different frameworks. However, each framework creates its specific format, which may not support reading large-scale real-world graph datasets. This shows a demand for high-performance libraries capable of loading graphs to (i) accelerate designing new graph algorithms, (ii) to evaluate the contributions on a wide range of graph algorithms, and (iii) to facilitate easy and fast comparison over different graph frameworks. To that end, we present ParaGrapher, a high-performance API and library for loading large-scale and compressed graphs. ParaGrapher supports different types of requests for accessing graphs in shared- and distributed-memory and out-of-core graph processing. We explain the design of ParaGrapher and present a performance model of graph decompression, which is used for evaluation of ParaGrapher over three storage types. Our evaluation shows that by decompressing compressed graphs in WebGraph format, ParaGrapher delivers up to 3.2 times speedup in loading and up to 5.2 times speedup in end-to-end execution in comparison to the binary and textual formats. ParaGrapher is available online on https://blogs.qub.ac.uk/DIPSA/ParaGrapher/.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Algorithms ; Decompression ; Distributed memory ; Format ; Graphs</subject><ispartof>arXiv.org, 2024-06</ispartof><rights>2024. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>780,784</link.rule.ids></links><search><creatorcontrib>Esfahani, Mohsen Koohi</creatorcontrib><creatorcontrib>D'Antonio, Marco</creatorcontrib><creatorcontrib>Syed, Ibtisam Tauhidi</creatorcontrib><creatorcontrib>Thai Son Mai</creatorcontrib><creatorcontrib>Vandierendonck, Hans</creatorcontrib><title>Selective Parallel Loading of Large-Scale Compressed Graphs with ParaGrapher</title><title>arXiv.org</title><description>Comprehensive evaluation is one of the basis of experimental science. In High-Performance Graph Processing, a thorough evaluation of contributions becomes more achievable by supporting common input formats over different frameworks. However, each framework creates its specific format, which may not support reading large-scale real-world graph datasets. This shows a demand for high-performance libraries capable of loading graphs to (i) accelerate designing new graph algorithms, (ii) to evaluate the contributions on a wide range of graph algorithms, and (iii) to facilitate easy and fast comparison over different graph frameworks. To that end, we present ParaGrapher, a high-performance API and library for loading large-scale and compressed graphs. ParaGrapher supports different types of requests for accessing graphs in shared- and distributed-memory and out-of-core graph processing. We explain the design of ParaGrapher and present a performance model of graph decompression, which is used for evaluation of ParaGrapher over three storage types. Our evaluation shows that by decompressing compressed graphs in WebGraph format, ParaGrapher delivers up to 3.2 times speedup in loading and up to 5.2 times speedup in end-to-end execution in comparison to the binary and textual formats. ParaGrapher is available online on https://blogs.qub.ac.uk/DIPSA/ParaGrapher/.</description><subject>Algorithms</subject><subject>Decompression</subject><subject>Distributed memory</subject><subject>Format</subject><subject>Graphs</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNqNi80KgkAURocgSMp3uNBamGY0cy39LFwEto-LXv-YHJur9fqF9ACtPg7nfAvhKa13wSFUaiV85k5KqfaxiiLtiSwnQ8XYvgiu6NAYMpBZLNu-BltBhq6mIC_QEKT2MThiphLODoeG4d2OzXybmdxGLCs0TP5v12J7Ot7SSzA4-5yIx3tnJ9d_1V3LMImTUCqp_6s-EXc9BA</recordid><startdate>20240617</startdate><enddate>20240617</enddate><creator>Esfahani, Mohsen Koohi</creator><creator>D'Antonio, Marco</creator><creator>Syed, Ibtisam Tauhidi</creator><creator>Thai Son Mai</creator><creator>Vandierendonck, Hans</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20240617</creationdate><title>Selective Parallel Loading of Large-Scale Compressed Graphs with ParaGrapher</title><author>Esfahani, Mohsen Koohi ; D'Antonio, Marco ; Syed, Ibtisam Tauhidi ; Thai Son Mai ; Vandierendonck, Hans</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_30497940203</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Algorithms</topic><topic>Decompression</topic><topic>Distributed memory</topic><topic>Format</topic><topic>Graphs</topic><toplevel>online_resources</toplevel><creatorcontrib>Esfahani, Mohsen Koohi</creatorcontrib><creatorcontrib>D'Antonio, Marco</creatorcontrib><creatorcontrib>Syed, Ibtisam Tauhidi</creatorcontrib><creatorcontrib>Thai Son Mai</creatorcontrib><creatorcontrib>Vandierendonck, Hans</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection (ProQuest)</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Esfahani, Mohsen Koohi</au><au>D'Antonio, Marco</au><au>Syed, Ibtisam Tauhidi</au><au>Thai Son Mai</au><au>Vandierendonck, Hans</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>Selective Parallel Loading of Large-Scale Compressed Graphs with ParaGrapher</atitle><jtitle>arXiv.org</jtitle><date>2024-06-17</date><risdate>2024</risdate><eissn>2331-8422</eissn><abstract>Comprehensive evaluation is one of the basis of experimental science. In High-Performance Graph Processing, a thorough evaluation of contributions becomes more achievable by supporting common input formats over different frameworks. However, each framework creates its specific format, which may not support reading large-scale real-world graph datasets. This shows a demand for high-performance libraries capable of loading graphs to (i) accelerate designing new graph algorithms, (ii) to evaluate the contributions on a wide range of graph algorithms, and (iii) to facilitate easy and fast comparison over different graph frameworks. To that end, we present ParaGrapher, a high-performance API and library for loading large-scale and compressed graphs. ParaGrapher supports different types of requests for accessing graphs in shared- and distributed-memory and out-of-core graph processing. We explain the design of ParaGrapher and present a performance model of graph decompression, which is used for evaluation of ParaGrapher over three storage types. Our evaluation shows that by decompressing compressed graphs in WebGraph format, ParaGrapher delivers up to 3.2 times speedup in loading and up to 5.2 times speedup in end-to-end execution in comparison to the binary and textual formats. ParaGrapher is available online on https://blogs.qub.ac.uk/DIPSA/ParaGrapher/.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2024-06
issn 2331-8422
language eng
recordid cdi_proquest_journals_3049794020
source Free E- Journals
subjects Algorithms
Decompression
Distributed memory
Format
Graphs
title Selective Parallel Loading of Large-Scale Compressed Graphs with ParaGrapher
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-09T14%3A05%3A59IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Selective%20Parallel%20Loading%20of%20Large-Scale%20Compressed%20Graphs%20with%20ParaGrapher&rft.jtitle=arXiv.org&rft.au=Esfahani,%20Mohsen%20Koohi&rft.date=2024-06-17&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E3049794020%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3049794020&rft_id=info:pmid/&rfr_iscdi=true