Closed Walk Sampler: An Efficient Method for Estimating Eigenvalues of Large Graphs

Eigenvalues of a graph are of high interest in graph analytics for Big Data due to their relevance to many important properties of the graph including network resilience, community detection and the speed of viral propagation. Accurate computation of eigenvalues of extremely large graphs is usually...


Bibliographic details
Main authors: Han, Guyue; Sethu, Harish
Format: Article
Language: eng
Subjects:
Online access: Order full text
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Han, Guyue
Sethu, Harish
description Eigenvalues of a graph are of high interest in graph analytics for Big Data due to their relevance to many important properties of the graph including network resilience, community detection and the speed of viral propagation. Accurate computation of eigenvalues of extremely large graphs is usually not feasible due to the prohibitive computational and storage costs and also because full access to many social network graphs is often restricted to most researchers. In this paper, we present a series of new sampling algorithms which solve both of the above-mentioned problems and estimate the two largest eigenvalues of a large graph efficiently and with high accuracy. Unlike previous methods which try to extract a subgraph with the most influential nodes, our algorithms sample only a small portion of the large graph via a simple random walk, and arrive at estimates of the two largest eigenvalues by estimating the number of closed walks of a certain length. Our experimental results using real graphs show that our algorithms are substantially faster while also achieving significantly better accuracy on most graphs than the current state-of-the-art algorithms.
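The description ties the largest eigenvalues to the number of closed walks of a given length. The following is a minimal sketch of that relationship only, not the authors' sampling algorithm. It relies on the standard identity that the number of closed walks of length k in a graph with adjacency matrix A equals trace(A^k), which is the sum of the k-th powers of the eigenvalues, so for large even k the k-th root of that count approaches the largest eigenvalue. The function names and the small example graph are assumptions introduced for illustration; the count is computed exactly here, whereas the paper estimates it from a random-walk sample.

```python
import numpy as np


def closed_walk_count(adj: np.ndarray, k: int) -> int:
    """Exact number of closed walks of length k, computed as trace(A^k)."""
    powered = np.linalg.matrix_power(adj.astype(np.int64), k)
    return int(np.trace(powered))


def top_eigenvalue_from_closed_walks(adj: np.ndarray, k: int) -> float:
    """Approximate the largest eigenvalue as (closed walk count)^(1/k).

    Use an even k so that every eigenvalue contributes a non-negative term.
    """
    return closed_walk_count(adj, k) ** (1.0 / k)


if __name__ == "__main__":
    # Hypothetical example: the complete graph on 4 nodes (eigenvalues 3, -1, -1, -1).
    k4 = np.ones((4, 4), dtype=np.int64) - np.eye(4, dtype=np.int64)
    print(closed_walk_count(k4, 2))                  # equals twice the number of edges
    print(top_eigenvalue_from_closed_walks(k4, 20))  # close to the true value 3
    print(max(np.linalg.eigvalsh(k4)))               # reference value from a full eigensolver
```

On a large graph the matrix power is exactly the cost the description calls prohibitive, which is why an estimate of the closed walk count from a sampled portion of the graph is attractive.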
doi_str_mv 10.48550/arxiv.1805.07448
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_1805_07448</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1805_07448</sourcerecordid><originalsourceid>FETCH-LOGICAL-a678-ef4cd8300f6cf216e4e49b6041c21ba40380eb96271f45ffd13348143ebe90a03</originalsourceid><addsrcrecordid>eNotz8FOg0AUheHZuDCtD-DKeQHwDjPA0F1DsJrQuGgTl-QC99KJFMiAjb69Wl2d3Z_zCXGvIDQ2juER_ae7hMpCHEJqjL0Vh7wfZ2rlG_bv8oDnqSe_kdtBFsyucTQsck_LaWwlj14W8-LOuLihk4XraLhg_0GzHFmW6DuSO4_TaV6LG8Z-prv_XYnjU3HMn4PydfeSb8sAk9QGxKZprQbgpOFIJWTIZHUCRjWRqtGAtkB1lkSpYhMzt0prY5XRVFMGCHolHv6yV1U1-Z9r_qv61VVXnf4GoE9Jkg</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Closed Walk Sampler: An Efficient Method for Estimating Eigenvalues of Large Graphs</title><source>arXiv.org</source><creator>Han, Guyue ; Sethu, Harish</creator><creatorcontrib>Han, Guyue ; Sethu, Harish</creatorcontrib><description>Eigenvalues of a graph are of high interest in graph analytics for Big Data due to their relevance to many important properties of the graph including network resilience, community detection and the speed of viral propagation. Accurate computation of eigenvalues of extremely large graphs is usually not feasible due to the prohibitive computational and storage costs and also because full access to many social network graphs is often restricted to most researchers. In this paper, we present a series of new sampling algorithms which solve both of the above-mentioned problems and estimate the two largest eigenvalues of a large graph efficiently and with high accuracy. 
Unlike previous methods which try to extract a subgraph with the most influential nodes, our algorithms sample only a small portion of the large graph via a simple random walk, and arrive at estimates of the two largest eigenvalues by estimating the number of closed walks of a certain length. Our experimental results using real graphs show that our algorithms are substantially faster while also achieving significantly better accuracy on most graphs than the current state-of-the-art algorithms.</description><identifier>DOI: 10.48550/arxiv.1805.07448</identifier><language>eng</language><subject>Computer Science - Social and Information Networks</subject><creationdate>2018-05</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,780,885</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/1805.07448$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.1805.07448$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Han, Guyue</creatorcontrib><creatorcontrib>Sethu, Harish</creatorcontrib><title>Closed Walk Sampler: An Efficient Method for Estimating Eigenvalues of Large Graphs</title><description>Eigenvalues of a graph are of high interest in graph analytics for Big Data due to their relevance to many important properties of the graph including network resilience, community detection and the speed of viral propagation. Accurate computation of eigenvalues of extremely large graphs is usually not feasible due to the prohibitive computational and storage costs and also because full access to many social network graphs is often restricted to most researchers. 
In this paper, we present a series of new sampling algorithms which solve both of the above-mentioned problems and estimate the two largest eigenvalues of a large graph efficiently and with high accuracy. Unlike previous methods which try to extract a subgraph with the most influential nodes, our algorithms sample only a small portion of the large graph via a simple random walk, and arrive at estimates of the two largest eigenvalues by estimating the number of closed walks of a certain length. Our experimental results using real graphs show that our algorithms are substantially faster while also achieving significantly better accuracy on most graphs than the current state-of-the-art algorithms.</description><subject>Computer Science - Social and Information Networks</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotz8FOg0AUheHZuDCtD-DKeQHwDjPA0F1DsJrQuGgTl-QC99KJFMiAjb69Wl2d3Z_zCXGvIDQ2juER_ae7hMpCHEJqjL0Vh7wfZ2rlG_bv8oDnqSe_kdtBFsyucTQsck_LaWwlj14W8-LOuLihk4XraLhg_0GzHFmW6DuSO4_TaV6LG8Z-prv_XYnjU3HMn4PydfeSb8sAk9QGxKZprQbgpOFIJWTIZHUCRjWRqtGAtkB1lkSpYhMzt0prY5XRVFMGCHolHv6yV1U1-Z9r_qv61VVXnf4GoE9Jkg</recordid><startdate>20180518</startdate><enddate>20180518</enddate><creator>Han, Guyue</creator><creator>Sethu, Harish</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20180518</creationdate><title>Closed Walk Sampler: An Efficient Method for Estimating Eigenvalues of Large Graphs</title><author>Han, Guyue ; Sethu, Harish</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a678-ef4cd8300f6cf216e4e49b6041c21ba40380eb96271f45ffd13348143ebe90a03</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Computer Science - Social and Information Networks</topic><toplevel>online_resources</toplevel><creatorcontrib>Han, 
Guyue</creatorcontrib><creatorcontrib>Sethu, Harish</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Han, Guyue</au><au>Sethu, Harish</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Closed Walk Sampler: An Efficient Method for Estimating Eigenvalues of Large Graphs</atitle><date>2018-05-18</date><risdate>2018</risdate><abstract>Eigenvalues of a graph are of high interest in graph analytics for Big Data due to their relevance to many important properties of the graph including network resilience, community detection and the speed of viral propagation. Accurate computation of eigenvalues of extremely large graphs is usually not feasible due to the prohibitive computational and storage costs and also because full access to many social network graphs is often restricted to most researchers. In this paper, we present a series of new sampling algorithms which solve both of the above-mentioned problems and estimate the two largest eigenvalues of a large graph efficiently and with high accuracy. Unlike previous methods which try to extract a subgraph with the most influential nodes, our algorithms sample only a small portion of the large graph via a simple random walk, and arrive at estimates of the two largest eigenvalues by estimating the number of closed walks of a certain length. Our experimental results using real graphs show that our algorithms are substantially faster while also achieving significantly better accuracy on most graphs than the current state-of-the-art algorithms.</abstract><doi>10.48550/arxiv.1805.07448</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.1805.07448
ispartof
issn
language eng
recordid cdi_arxiv_primary_1805_07448
source arXiv.org
subjects Computer Science - Social and Information Networks
title Closed Walk Sampler: An Efficient Method for Estimating Eigenvalues of Large Graphs
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-06T07%3A51%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Closed%20Walk%20Sampler:%20An%20Efficient%20Method%20for%20Estimating%20Eigenvalues%20of%20Large%20Graphs&rft.au=Han,%20Guyue&rft.date=2018-05-18&rft_id=info:doi/10.48550/arxiv.1805.07448&rft_dat=%3Carxiv_GOX%3E1805_07448%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true