RoleSim: Scaling axiomatic role-based similarity ranking on large graphs

RoleSim and SimRank are among the popular graph-theoretic similarity measures with many applications in, e.g., web search, collaborative filtering, and sociometry. While RoleSim addresses the automorphic (role) equivalence of pairwise similarity which SimRank lacks, it ignores the neighboring simila...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	World wide web (Bussum) 2022-03, Vol.25 (2), p.785-829
Hauptverfasser:	Yu, Weiren, Iranmanesh, Sima, Haldar, Aparajita, Zhang, Maoyin, Ferhatosmanoglu, Hakan
Format:	Artikel
Sprache:	eng
Schlagworte:	Accuracy Computer Science Computing time Database Management Equivalence Graph theory Graphs Information Systems Applications (incl.Internet) Operating Systems Similarity Special Issue on Large Scale Graph Data Analytics
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	829
container_issue	2
container_start_page	785
container_title	World wide web (Bussum)
container_volume	25
creator	Yu, Weiren Iranmanesh, Sima Haldar, Aparajita Zhang, Maoyin Ferhatosmanoglu, Hakan
description	RoleSim and SimRank are among the popular graph-theoretic similarity measures with many applications in, e.g., web search, collaborative filtering, and sociometry. While RoleSim addresses the automorphic (role) equivalence of pairwise similarity which SimRank lacks, it ignores the neighboring similarity information out of the automorphically equivalent set. Consequently, two pairs of nodes, which are not automorphically equivalent by nature, cannot be well distinguished by RoleSim if the averages of their neighboring similarities over the automorphically equivalent set are the same. To alleviate this problem: 1) We propose a novel similarity model, namely RoleSim, which accurately evaluates pairwise role similarities in a more comprehensive manner. RoleSim not only guarantees the automorphic equivalence that SimRank lacks, but also takes into account the neighboring similarity information outside the automorphically equivalent sets that are overlooked by RoleSim. 2) We prove the existence and uniqueness of the RoleSim* solution, and show its three axiomatic properties ( i.e., symmetry, boundedness, and non-increasing monotonicity). 3) We provide a concise bound for iteratively computing RoleSim* formula, and estimate the number of iterations required to attain a desired accuracy. 4) We induce a distance metric based on RoleSim* similarity, and show that the RoleSim* metric fulfills the triangular inequality, which implies the sum-transitivity of its similarity scores. 5) We present a threshold-based RoleSim* model that reduces the computational time further with provable accuracy guarantee. 6) We propose a single-source RoleSim* model, which scales well for sizable graphs. 7) We also devise methods to scale RoleSim* based search by incorporating its triangular inequality property with partitioning techniques. Our experimental results on real datasets demonstrate that RoleSim* achieves higher accuracy than its competitors while scaling well on sizable graphs with billions of edges.
doi_str_mv	10.1007/s11280-021-00925-z
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2634667529</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2634667529</sourcerecordid><originalsourceid>FETCH-LOGICAL-c363t-4831e2b9848f1bf163e7d8604533270d7e3988523013ca33037e5f5d49c918153</originalsourceid><addsrcrecordid>eNp9kE9LAzEQxYMoWKtfwNOC52gms8lmvUlRKxQEq-AtpLvZNXX_1GQF209v6grePM1j5r038CPkHNglMJZdBQCuGGUcKGM5F3R3QCYgMqSQAh5GjUpGLV6PyUkIa8aYxBwmZP7UN3bp2utkWZjGdXVivlzfmsEViY8nujLBlklwrWuMd8M28aZ73_v6Lomb2ia1N5u3cEqOKtMEe_Y7p-Tl7vZ5NqeLx_uH2c2CFihxoKlCsHyVq1RVsKpAos1KJVkqEHnGysxirpTgyAALg8gws6ISZZoXOSgQOCUXY-_G9x-fNgx63X_6Lr7UXGIqZSZ4Hl18dBW-D8HbSm-8a43famB6T0yPxHQkpn-I6V0M4RgK0dzV1v9V_5P6BisNbRA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2634667529</pqid></control><display><type>article</type><title>RoleSim: Scaling axiomatic role-based similarity ranking on large graphs</title><source>SpringerLink Journals - AutoHoldings</source><creator>Yu, Weiren ; Iranmanesh, Sima ; Haldar, Aparajita ; Zhang, Maoyin ; Ferhatosmanoglu, Hakan</creator><creatorcontrib>Yu, Weiren ; Iranmanesh, Sima ; Haldar, Aparajita ; Zhang, Maoyin ; Ferhatosmanoglu, Hakan</creatorcontrib><description>RoleSim and SimRank are among the popular graph-theoretic similarity measures with many applications in, e.g., web search, collaborative filtering, and sociometry. While RoleSim addresses the automorphic (role) equivalence of pairwise similarity which SimRank lacks, it ignores the neighboring similarity information out of the automorphically equivalent set. Consequently, two pairs of nodes, which are not automorphically equivalent by nature, cannot be well distinguished by RoleSim if the averages of their neighboring similarities over the automorphically equivalent set are the same. To alleviate this problem: 1) We propose a novel similarity model, namely RoleSim, which accurately evaluates pairwise role similarities in a more comprehensive manner. RoleSim not only guarantees the automorphic equivalence that SimRank lacks, but also takes into account the neighboring similarity information outside the automorphically equivalent sets that are overlooked by RoleSim. 2) We prove the existence and uniqueness of the RoleSim* solution, and show its three axiomatic properties ( i.e., symmetry, boundedness, and non-increasing monotonicity). 3) We provide a concise bound for iteratively computing RoleSim* formula, and estimate the number of iterations required to attain a desired accuracy. 4) We induce a distance metric based on RoleSim* similarity, and show that the RoleSim* metric fulfills the triangular inequality, which implies the sum-transitivity of its similarity scores. 5) We present a threshold-based RoleSim* model that reduces the computational time further with provable accuracy guarantee. 6) We propose a single-source RoleSim* model, which scales well for sizable graphs. 7) We also devise methods to scale RoleSim* based search by incorporating its triangular inequality property with partitioning techniques. Our experimental results on real datasets demonstrate that RoleSim* achieves higher accuracy than its competitors while scaling well on sizable graphs with billions of edges.</description><identifier>ISSN: 1386-145X</identifier><identifier>EISSN: 1573-1413</identifier><identifier>DOI: 10.1007/s11280-021-00925-z</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Accuracy ; Computer Science ; Computing time ; Database Management ; Equivalence ; Graph theory ; Graphs ; Information Systems Applications (incl.Internet) ; Operating Systems ; Similarity ; Special Issue on Large Scale Graph Data Analytics</subject><ispartof>World wide web (Bussum), 2022-03, Vol.25 (2), p.785-829</ispartof><rights>The Author(s) 2021</rights><rights>The Author(s) 2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c363t-4831e2b9848f1bf163e7d8604533270d7e3988523013ca33037e5f5d49c918153</citedby><cites>FETCH-LOGICAL-c363t-4831e2b9848f1bf163e7d8604533270d7e3988523013ca33037e5f5d49c918153</cites><orcidid>0000-0002-1082-9475</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11280-021-00925-z$$EPDF$$P50$$Gspringer$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11280-021-00925-z$$EHTML$$P50$$Gspringer$$Hfree_for_read</linktohtml><link.rule.ids>314,778,782,27907,27908,41471,42540,51302</link.rule.ids></links><search><creatorcontrib>Yu, Weiren</creatorcontrib><creatorcontrib>Iranmanesh, Sima</creatorcontrib><creatorcontrib>Haldar, Aparajita</creatorcontrib><creatorcontrib>Zhang, Maoyin</creatorcontrib><creatorcontrib>Ferhatosmanoglu, Hakan</creatorcontrib><title>RoleSim: Scaling axiomatic role-based similarity ranking on large graphs</title><title>World wide web (Bussum)</title><addtitle>World Wide Web</addtitle><description>RoleSim and SimRank are among the popular graph-theoretic similarity measures with many applications in, e.g., web search, collaborative filtering, and sociometry. While RoleSim addresses the automorphic (role) equivalence of pairwise similarity which SimRank lacks, it ignores the neighboring similarity information out of the automorphically equivalent set. Consequently, two pairs of nodes, which are not automorphically equivalent by nature, cannot be well distinguished by RoleSim if the averages of their neighboring similarities over the automorphically equivalent set are the same. To alleviate this problem: 1) We propose a novel similarity model, namely RoleSim, which accurately evaluates pairwise role similarities in a more comprehensive manner. RoleSim not only guarantees the automorphic equivalence that SimRank lacks, but also takes into account the neighboring similarity information outside the automorphically equivalent sets that are overlooked by RoleSim. 2) We prove the existence and uniqueness of the RoleSim* solution, and show its three axiomatic properties ( i.e., symmetry, boundedness, and non-increasing monotonicity). 3) We provide a concise bound for iteratively computing RoleSim* formula, and estimate the number of iterations required to attain a desired accuracy. 4) We induce a distance metric based on RoleSim* similarity, and show that the RoleSim* metric fulfills the triangular inequality, which implies the sum-transitivity of its similarity scores. 5) We present a threshold-based RoleSim* model that reduces the computational time further with provable accuracy guarantee. 6) We propose a single-source RoleSim* model, which scales well for sizable graphs. 7) We also devise methods to scale RoleSim* based search by incorporating its triangular inequality property with partitioning techniques. Our experimental results on real datasets demonstrate that RoleSim* achieves higher accuracy than its competitors while scaling well on sizable graphs with billions of edges.</description><subject>Accuracy</subject><subject>Computer Science</subject><subject>Computing time</subject><subject>Database Management</subject><subject>Equivalence</subject><subject>Graph theory</subject><subject>Graphs</subject><subject>Information Systems Applications (incl.Internet)</subject><subject>Operating Systems</subject><subject>Similarity</subject><subject>Special Issue on Large Scale Graph Data Analytics</subject><issn>1386-145X</issn><issn>1573-1413</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>C6C</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNp9kE9LAzEQxYMoWKtfwNOC52gms8lmvUlRKxQEq-AtpLvZNXX_1GQF209v6grePM1j5r038CPkHNglMJZdBQCuGGUcKGM5F3R3QCYgMqSQAh5GjUpGLV6PyUkIa8aYxBwmZP7UN3bp2utkWZjGdXVivlzfmsEViY8nujLBlklwrWuMd8M28aZ73_v6Lomb2ia1N5u3cEqOKtMEe_Y7p-Tl7vZ5NqeLx_uH2c2CFihxoKlCsHyVq1RVsKpAos1KJVkqEHnGysxirpTgyAALg8gws6ISZZoXOSgQOCUXY-_G9x-fNgx63X_6Lr7UXGIqZSZ4Hl18dBW-D8HbSm-8a43famB6T0yPxHQkpn-I6V0M4RgK0dzV1v9V_5P6BisNbRA</recordid><startdate>20220301</startdate><enddate>20220301</enddate><creator>Yu, Weiren</creator><creator>Iranmanesh, Sima</creator><creator>Haldar, Aparajita</creator><creator>Zhang, Maoyin</creator><creator>Ferhatosmanoglu, Hakan</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>C6C</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7XB</scope><scope>8AL</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0N</scope><scope>P5Z</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope><orcidid>https://orcid.org/0000-0002-1082-9475</orcidid></search><sort><creationdate>20220301</creationdate><title>RoleSim: Scaling axiomatic role-based similarity ranking on large graphs</title><author>Yu, Weiren ; Iranmanesh, Sima ; Haldar, Aparajita ; Zhang, Maoyin ; Ferhatosmanoglu, Hakan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c363t-4831e2b9848f1bf163e7d8604533270d7e3988523013ca33037e5f5d49c918153</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Accuracy</topic><topic>Computer Science</topic><topic>Computing time</topic><topic>Database Management</topic><topic>Equivalence</topic><topic>Graph theory</topic><topic>Graphs</topic><topic>Information Systems Applications (incl.Internet)</topic><topic>Operating Systems</topic><topic>Similarity</topic><topic>Special Issue on Large Scale Graph Data Analytics</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Yu, Weiren</creatorcontrib><creatorcontrib>Iranmanesh, Sima</creatorcontrib><creatorcontrib>Haldar, Aparajita</creatorcontrib><creatorcontrib>Zhang, Maoyin</creatorcontrib><creatorcontrib>Ferhatosmanoglu, Hakan</creatorcontrib><collection>Springer Nature OA Free Journals</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Computing Database (Alumni Edition)</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Computing Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><jtitle>World wide web (Bussum)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Yu, Weiren</au><au>Iranmanesh, Sima</au><au>Haldar, Aparajita</au><au>Zhang, Maoyin</au><au>Ferhatosmanoglu, Hakan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>RoleSim: Scaling axiomatic role-based similarity ranking on large graphs</atitle><jtitle>World wide web (Bussum)</jtitle><stitle>World Wide Web</stitle><date>2022-03-01</date><risdate>2022</risdate><volume>25</volume><issue>2</issue><spage>785</spage><epage>829</epage><pages>785-829</pages><issn>1386-145X</issn><eissn>1573-1413</eissn><abstract>RoleSim and SimRank are among the popular graph-theoretic similarity measures with many applications in, e.g., web search, collaborative filtering, and sociometry. While RoleSim addresses the automorphic (role) equivalence of pairwise similarity which SimRank lacks, it ignores the neighboring similarity information out of the automorphically equivalent set. Consequently, two pairs of nodes, which are not automorphically equivalent by nature, cannot be well distinguished by RoleSim if the averages of their neighboring similarities over the automorphically equivalent set are the same. To alleviate this problem: 1) We propose a novel similarity model, namely RoleSim, which accurately evaluates pairwise role similarities in a more comprehensive manner. RoleSim not only guarantees the automorphic equivalence that SimRank lacks, but also takes into account the neighboring similarity information outside the automorphically equivalent sets that are overlooked by RoleSim. 2) We prove the existence and uniqueness of the RoleSim* solution, and show its three axiomatic properties ( i.e., symmetry, boundedness, and non-increasing monotonicity). 3) We provide a concise bound for iteratively computing RoleSim* formula, and estimate the number of iterations required to attain a desired accuracy. 4) We induce a distance metric based on RoleSim* similarity, and show that the RoleSim* metric fulfills the triangular inequality, which implies the sum-transitivity of its similarity scores. 5) We present a threshold-based RoleSim* model that reduces the computational time further with provable accuracy guarantee. 6) We propose a single-source RoleSim* model, which scales well for sizable graphs. 7) We also devise methods to scale RoleSim* based search by incorporating its triangular inequality property with partitioning techniques. Our experimental results on real datasets demonstrate that RoleSim* achieves higher accuracy than its competitors while scaling well on sizable graphs with billions of edges.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11280-021-00925-z</doi><tpages>45</tpages><orcidid>https://orcid.org/0000-0002-1082-9475</orcidid><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 1386-145X
ispartof	World wide web (Bussum), 2022-03, Vol.25 (2), p.785-829
issn	1386-145X 1573-1413
language	eng
recordid	cdi_proquest_journals_2634667529
source	SpringerLink Journals - AutoHoldings
subjects	Accuracy Computer Science Computing time Database Management Equivalence Graph theory Graphs Information Systems Applications (incl.Internet) Operating Systems Similarity Special Issue on Large Scale Graph Data Analytics
title	RoleSim: Scaling axiomatic role-based similarity ranking on large graphs
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-16T22%3A14%3A20IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=RoleSim:%20Scaling%20axiomatic%20role-based%20similarity%20ranking%20on%20large%20graphs&rft.jtitle=World%20wide%20web%20(Bussum)&rft.au=Yu,%20Weiren&rft.date=2022-03-01&rft.volume=25&rft.issue=2&rft.spage=785&rft.epage=829&rft.pages=785-829&rft.issn=1386-145X&rft.eissn=1573-1413&rft_id=info:doi/10.1007/s11280-021-00925-z&rft_dat=%3Cproquest_cross%3E2634667529%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2634667529&rft_id=info:pmid/&rfr_iscdi=true