RoleSim: Scaling axiomatic role-based similarity ranking on large graphs

RoleSim and SimRank are among the popular graph-theoretic similarity measures with many applications in, e.g., web search, collaborative filtering, and sociometry. While RoleSim addresses the automorphic (role) equivalence of pairwise similarity which SimRank lacks, it ignores the neighboring simila...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:World wide web (Bussum) 2022-03, Vol.25 (2), p.785-829
Hauptverfasser: Yu, Weiren, Iranmanesh, Sima, Haldar, Aparajita, Zhang, Maoyin, Ferhatosmanoglu, Hakan
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 829
container_issue 2
container_start_page 785
container_title World wide web (Bussum)
container_volume 25
creator Yu, Weiren
Iranmanesh, Sima
Haldar, Aparajita
Zhang, Maoyin
Ferhatosmanoglu, Hakan
description RoleSim and SimRank are among the popular graph-theoretic similarity measures with many applications in, e.g., web search, collaborative filtering, and sociometry. While RoleSim addresses the automorphic (role) equivalence of pairwise similarity which SimRank lacks, it ignores the neighboring similarity information out of the automorphically equivalent set. Consequently, two pairs of nodes, which are not automorphically equivalent by nature, cannot be well distinguished by RoleSim if the averages of their neighboring similarities over the automorphically equivalent set are the same. To alleviate this problem: 1) We propose a novel similarity model, namely RoleSim*, which accurately evaluates pairwise role similarities in a more comprehensive manner. RoleSim* not only guarantees the automorphic equivalence that SimRank lacks, but also takes into account the neighboring similarity information outside the automorphically equivalent sets that are overlooked by RoleSim. 2) We prove the existence and uniqueness of the RoleSim* solution, and show its three axiomatic properties ( i.e., symmetry, boundedness, and non-increasing monotonicity). 3) We provide a concise bound for iteratively computing RoleSim* formula, and estimate the number of iterations required to attain a desired accuracy. 4) We induce a distance metric based on RoleSim* similarity, and show that the RoleSim* metric fulfills the triangular inequality, which implies the sum-transitivity of its similarity scores. 5) We present a threshold-based RoleSim* model that reduces the computational time further with provable accuracy guarantee. 6) We propose a single-source RoleSim* model, which scales well for sizable graphs. 7) We also devise methods to scale RoleSim* based search by incorporating its triangular inequality property with partitioning techniques. Our experimental results on real datasets demonstrate that RoleSim* achieves higher accuracy than its competitors while scaling well on sizable graphs with billions of edges.
doi_str_mv 10.1007/s11280-021-00925-z
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2634667529</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2634667529</sourcerecordid><originalsourceid>FETCH-LOGICAL-c363t-4831e2b9848f1bf163e7d8604533270d7e3988523013ca33037e5f5d49c918153</originalsourceid><addsrcrecordid>eNp9kE9LAzEQxYMoWKtfwNOC52gms8lmvUlRKxQEq-AtpLvZNXX_1GQF209v6grePM1j5r038CPkHNglMJZdBQCuGGUcKGM5F3R3QCYgMqSQAh5GjUpGLV6PyUkIa8aYxBwmZP7UN3bp2utkWZjGdXVivlzfmsEViY8nujLBlklwrWuMd8M28aZ73_v6Lomb2ia1N5u3cEqOKtMEe_Y7p-Tl7vZ5NqeLx_uH2c2CFihxoKlCsHyVq1RVsKpAos1KJVkqEHnGysxirpTgyAALg8gws6ISZZoXOSgQOCUXY-_G9x-fNgx63X_6Lr7UXGIqZSZ4Hl18dBW-D8HbSm-8a43famB6T0yPxHQkpn-I6V0M4RgK0dzV1v9V_5P6BisNbRA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2634667529</pqid></control><display><type>article</type><title>RoleSim: Scaling axiomatic role-based similarity ranking on large graphs</title><source>SpringerLink Journals - AutoHoldings</source><creator>Yu, Weiren ; Iranmanesh, Sima ; Haldar, Aparajita ; Zhang, Maoyin ; Ferhatosmanoglu, Hakan</creator><creatorcontrib>Yu, Weiren ; Iranmanesh, Sima ; Haldar, Aparajita ; Zhang, Maoyin ; Ferhatosmanoglu, Hakan</creatorcontrib><description>RoleSim and SimRank are among the popular graph-theoretic similarity measures with many applications in, e.g., web search, collaborative filtering, and sociometry. While RoleSim addresses the automorphic (role) equivalence of pairwise similarity which SimRank lacks, it ignores the neighboring similarity information out of the automorphically equivalent set. Consequently, two pairs of nodes, which are not automorphically equivalent by nature, cannot be well distinguished by RoleSim if the averages of their neighboring similarities over the automorphically equivalent set are the same. To alleviate this problem: 1) We propose a novel similarity model, namely RoleSim*, which accurately evaluates pairwise role similarities in a more comprehensive manner. RoleSim* not only guarantees the automorphic equivalence that SimRank lacks, but also takes into account the neighboring similarity information outside the automorphically equivalent sets that are overlooked by RoleSim. 2) We prove the existence and uniqueness of the RoleSim* solution, and show its three axiomatic properties ( i.e., symmetry, boundedness, and non-increasing monotonicity). 3) We provide a concise bound for iteratively computing RoleSim* formula, and estimate the number of iterations required to attain a desired accuracy. 4) We induce a distance metric based on RoleSim* similarity, and show that the RoleSim* metric fulfills the triangular inequality, which implies the sum-transitivity of its similarity scores. 5) We present a threshold-based RoleSim* model that reduces the computational time further with provable accuracy guarantee. 6) We propose a single-source RoleSim* model, which scales well for sizable graphs. 7) We also devise methods to scale RoleSim* based search by incorporating its triangular inequality property with partitioning techniques. Our experimental results on real datasets demonstrate that RoleSim* achieves higher accuracy than its competitors while scaling well on sizable graphs with billions of edges.</description><identifier>ISSN: 1386-145X</identifier><identifier>EISSN: 1573-1413</identifier><identifier>DOI: 10.1007/s11280-021-00925-z</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Accuracy ; Computer Science ; Computing time ; Database Management ; Equivalence ; Graph theory ; Graphs ; Information Systems Applications (incl.Internet) ; Operating Systems ; Similarity ; Special Issue on Large Scale Graph Data Analytics</subject><ispartof>World wide web (Bussum), 2022-03, Vol.25 (2), p.785-829</ispartof><rights>The Author(s) 2021</rights><rights>The Author(s) 2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c363t-4831e2b9848f1bf163e7d8604533270d7e3988523013ca33037e5f5d49c918153</citedby><cites>FETCH-LOGICAL-c363t-4831e2b9848f1bf163e7d8604533270d7e3988523013ca33037e5f5d49c918153</cites><orcidid>0000-0002-1082-9475</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11280-021-00925-z$$EPDF$$P50$$Gspringer$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11280-021-00925-z$$EHTML$$P50$$Gspringer$$Hfree_for_read</linktohtml><link.rule.ids>314,778,782,27907,27908,41471,42540,51302</link.rule.ids></links><search><creatorcontrib>Yu, Weiren</creatorcontrib><creatorcontrib>Iranmanesh, Sima</creatorcontrib><creatorcontrib>Haldar, Aparajita</creatorcontrib><creatorcontrib>Zhang, Maoyin</creatorcontrib><creatorcontrib>Ferhatosmanoglu, Hakan</creatorcontrib><title>RoleSim: Scaling axiomatic role-based similarity ranking on large graphs</title><title>World wide web (Bussum)</title><addtitle>World Wide Web</addtitle><description>RoleSim and SimRank are among the popular graph-theoretic similarity measures with many applications in, e.g., web search, collaborative filtering, and sociometry. While RoleSim addresses the automorphic (role) equivalence of pairwise similarity which SimRank lacks, it ignores the neighboring similarity information out of the automorphically equivalent set. Consequently, two pairs of nodes, which are not automorphically equivalent by nature, cannot be well distinguished by RoleSim if the averages of their neighboring similarities over the automorphically equivalent set are the same. To alleviate this problem: 1) We propose a novel similarity model, namely RoleSim*, which accurately evaluates pairwise role similarities in a more comprehensive manner. RoleSim* not only guarantees the automorphic equivalence that SimRank lacks, but also takes into account the neighboring similarity information outside the automorphically equivalent sets that are overlooked by RoleSim. 2) We prove the existence and uniqueness of the RoleSim* solution, and show its three axiomatic properties ( i.e., symmetry, boundedness, and non-increasing monotonicity). 3) We provide a concise bound for iteratively computing RoleSim* formula, and estimate the number of iterations required to attain a desired accuracy. 4) We induce a distance metric based on RoleSim* similarity, and show that the RoleSim* metric fulfills the triangular inequality, which implies the sum-transitivity of its similarity scores. 5) We present a threshold-based RoleSim* model that reduces the computational time further with provable accuracy guarantee. 6) We propose a single-source RoleSim* model, which scales well for sizable graphs. 7) We also devise methods to scale RoleSim* based search by incorporating its triangular inequality property with partitioning techniques. Our experimental results on real datasets demonstrate that RoleSim* achieves higher accuracy than its competitors while scaling well on sizable graphs with billions of edges.</description><subject>Accuracy</subject><subject>Computer Science</subject><subject>Computing time</subject><subject>Database Management</subject><subject>Equivalence</subject><subject>Graph theory</subject><subject>Graphs</subject><subject>Information Systems Applications (incl.Internet)</subject><subject>Operating Systems</subject><subject>Similarity</subject><subject>Special Issue on Large Scale Graph Data Analytics</subject><issn>1386-145X</issn><issn>1573-1413</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>C6C</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNp9kE9LAzEQxYMoWKtfwNOC52gms8lmvUlRKxQEq-AtpLvZNXX_1GQF209v6grePM1j5r038CPkHNglMJZdBQCuGGUcKGM5F3R3QCYgMqSQAh5GjUpGLV6PyUkIa8aYxBwmZP7UN3bp2utkWZjGdXVivlzfmsEViY8nujLBlklwrWuMd8M28aZ73_v6Lomb2ia1N5u3cEqOKtMEe_Y7p-Tl7vZ5NqeLx_uH2c2CFihxoKlCsHyVq1RVsKpAos1KJVkqEHnGysxirpTgyAALg8gws6ISZZoXOSgQOCUXY-_G9x-fNgx63X_6Lr7UXGIqZSZ4Hl18dBW-D8HbSm-8a43famB6T0yPxHQkpn-I6V0M4RgK0dzV1v9V_5P6BisNbRA</recordid><startdate>20220301</startdate><enddate>20220301</enddate><creator>Yu, Weiren</creator><creator>Iranmanesh, Sima</creator><creator>Haldar, Aparajita</creator><creator>Zhang, Maoyin</creator><creator>Ferhatosmanoglu, Hakan</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>C6C</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7XB</scope><scope>8AL</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0N</scope><scope>P5Z</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope><orcidid>https://orcid.org/0000-0002-1082-9475</orcidid></search><sort><creationdate>20220301</creationdate><title>RoleSim: Scaling axiomatic role-based similarity ranking on large graphs</title><author>Yu, Weiren ; Iranmanesh, Sima ; Haldar, Aparajita ; Zhang, Maoyin ; Ferhatosmanoglu, Hakan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c363t-4831e2b9848f1bf163e7d8604533270d7e3988523013ca33037e5f5d49c918153</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Accuracy</topic><topic>Computer Science</topic><topic>Computing time</topic><topic>Database Management</topic><topic>Equivalence</topic><topic>Graph theory</topic><topic>Graphs</topic><topic>Information Systems Applications (incl.Internet)</topic><topic>Operating Systems</topic><topic>Similarity</topic><topic>Special Issue on Large Scale Graph Data Analytics</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Yu, Weiren</creatorcontrib><creatorcontrib>Iranmanesh, Sima</creatorcontrib><creatorcontrib>Haldar, Aparajita</creatorcontrib><creatorcontrib>Zhang, Maoyin</creatorcontrib><creatorcontrib>Ferhatosmanoglu, Hakan</creatorcontrib><collection>Springer Nature OA Free Journals</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Computing Database (Alumni Edition)</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Computing Database</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><jtitle>World wide web (Bussum)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Yu, Weiren</au><au>Iranmanesh, Sima</au><au>Haldar, Aparajita</au><au>Zhang, Maoyin</au><au>Ferhatosmanoglu, Hakan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>RoleSim: Scaling axiomatic role-based similarity ranking on large graphs</atitle><jtitle>World wide web (Bussum)</jtitle><stitle>World Wide Web</stitle><date>2022-03-01</date><risdate>2022</risdate><volume>25</volume><issue>2</issue><spage>785</spage><epage>829</epage><pages>785-829</pages><issn>1386-145X</issn><eissn>1573-1413</eissn><abstract>RoleSim and SimRank are among the popular graph-theoretic similarity measures with many applications in, e.g., web search, collaborative filtering, and sociometry. While RoleSim addresses the automorphic (role) equivalence of pairwise similarity which SimRank lacks, it ignores the neighboring similarity information out of the automorphically equivalent set. Consequently, two pairs of nodes, which are not automorphically equivalent by nature, cannot be well distinguished by RoleSim if the averages of their neighboring similarities over the automorphically equivalent set are the same. To alleviate this problem: 1) We propose a novel similarity model, namely RoleSim*, which accurately evaluates pairwise role similarities in a more comprehensive manner. RoleSim* not only guarantees the automorphic equivalence that SimRank lacks, but also takes into account the neighboring similarity information outside the automorphically equivalent sets that are overlooked by RoleSim. 2) We prove the existence and uniqueness of the RoleSim* solution, and show its three axiomatic properties ( i.e., symmetry, boundedness, and non-increasing monotonicity). 3) We provide a concise bound for iteratively computing RoleSim* formula, and estimate the number of iterations required to attain a desired accuracy. 4) We induce a distance metric based on RoleSim* similarity, and show that the RoleSim* metric fulfills the triangular inequality, which implies the sum-transitivity of its similarity scores. 5) We present a threshold-based RoleSim* model that reduces the computational time further with provable accuracy guarantee. 6) We propose a single-source RoleSim* model, which scales well for sizable graphs. 7) We also devise methods to scale RoleSim* based search by incorporating its triangular inequality property with partitioning techniques. Our experimental results on real datasets demonstrate that RoleSim* achieves higher accuracy than its competitors while scaling well on sizable graphs with billions of edges.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11280-021-00925-z</doi><tpages>45</tpages><orcidid>https://orcid.org/0000-0002-1082-9475</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1386-145X
ispartof World wide web (Bussum), 2022-03, Vol.25 (2), p.785-829
issn 1386-145X
1573-1413
language eng
recordid cdi_proquest_journals_2634667529
source SpringerLink Journals - AutoHoldings
subjects Accuracy
Computer Science
Computing time
Database Management
Equivalence
Graph theory
Graphs
Information Systems Applications (incl.Internet)
Operating Systems
Similarity
Special Issue on Large Scale Graph Data Analytics
title RoleSim: Scaling axiomatic role-based similarity ranking on large graphs
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-16T22%3A14%3A20IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=RoleSim:%20Scaling%20axiomatic%20role-based%20similarity%20ranking%20on%20large%20graphs&rft.jtitle=World%20wide%20web%20(Bussum)&rft.au=Yu,%20Weiren&rft.date=2022-03-01&rft.volume=25&rft.issue=2&rft.spage=785&rft.epage=829&rft.pages=785-829&rft.issn=1386-145X&rft.eissn=1573-1413&rft_id=info:doi/10.1007/s11280-021-00925-z&rft_dat=%3Cproquest_cross%3E2634667529%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2634667529&rft_id=info:pmid/&rfr_iscdi=true