RoleSim: Scaling axiomatic role-based similarity ranking on large graphs
RoleSim and SimRank are among the popular graph-theoretic similarity measures with many applications in, e.g., web search, collaborative filtering, and sociometry. While RoleSim addresses the automorphic (role) equivalence of pairwise similarity which SimRank lacks, it ignores the neighboring simila...
Gespeichert in:
Veröffentlicht in: | World wide web (Bussum) 2022-03, Vol.25 (2), p.785-829 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 829 |
---|---|
container_issue | 2 |
container_start_page | 785 |
container_title | World wide web (Bussum) |
container_volume | 25 |
creator | Yu, Weiren Iranmanesh, Sima Haldar, Aparajita Zhang, Maoyin Ferhatosmanoglu, Hakan |
description | RoleSim and SimRank are among the popular graph-theoretic similarity measures with many applications in,
e.g.,
web search, collaborative filtering, and sociometry. While RoleSim addresses the automorphic (role) equivalence of pairwise similarity which SimRank lacks, it ignores the neighboring similarity information out of the automorphically equivalent set. Consequently, two pairs of nodes, which are not automorphically equivalent by nature, cannot be well distinguished by RoleSim if the averages of their neighboring similarities over the automorphically equivalent set are the same. To alleviate this problem: 1) We propose a novel similarity model, namely RoleSim*, which accurately evaluates pairwise role similarities in a more comprehensive manner. RoleSim* not only guarantees the automorphic equivalence that SimRank lacks, but also takes into account the neighboring similarity information outside the automorphically equivalent sets that are overlooked by RoleSim. 2) We prove the existence and uniqueness of the RoleSim* solution, and show its three axiomatic properties (
i.e.,
symmetry, boundedness, and non-increasing monotonicity). 3) We provide a concise bound for iteratively computing RoleSim* formula, and estimate the number of iterations required to attain a desired accuracy. 4) We induce a distance metric based on RoleSim* similarity, and show that the RoleSim* metric fulfills the triangular inequality, which implies the sum-transitivity of its similarity scores. 5) We present a threshold-based RoleSim* model that reduces the computational time further with provable accuracy guarantee. 6) We propose a single-source RoleSim* model, which scales well for sizable graphs. 7) We also devise methods to scale RoleSim* based search by incorporating its triangular inequality property with partitioning techniques. Our experimental results on real datasets demonstrate that RoleSim* achieves higher accuracy than its competitors while scaling well on sizable graphs with billions of edges. |
doi_str_mv | 10.1007/s11280-021-00925-z |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2634667529</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2634667529</sourcerecordid><originalsourceid>FETCH-LOGICAL-c363t-4831e2b9848f1bf163e7d8604533270d7e3988523013ca33037e5f5d49c918153</originalsourceid><addsrcrecordid>eNp9kE9LAzEQxYMoWKtfwNOC52gms8lmvUlRKxQEq-AtpLvZNXX_1GQF209v6grePM1j5r038CPkHNglMJZdBQCuGGUcKGM5F3R3QCYgMqSQAh5GjUpGLV6PyUkIa8aYxBwmZP7UN3bp2utkWZjGdXVivlzfmsEViY8nujLBlklwrWuMd8M28aZ73_v6Lomb2ia1N5u3cEqOKtMEe_Y7p-Tl7vZ5NqeLx_uH2c2CFihxoKlCsHyVq1RVsKpAos1KJVkqEHnGysxirpTgyAALg8gws6ISZZoXOSgQOCUXY-_G9x-fNgx63X_6Lr7UXGIqZSZ4Hl18dBW-D8HbSm-8a43famB6T0yPxHQkpn-I6V0M4RgK0dzV1v9V_5P6BisNbRA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2634667529</pqid></control><display><type>article</type><title>RoleSim: Scaling axiomatic role-based similarity ranking on large graphs</title><source>SpringerLink Journals - AutoHoldings</source><creator>Yu, Weiren ; Iranmanesh, Sima ; Haldar, Aparajita ; Zhang, Maoyin ; Ferhatosmanoglu, Hakan</creator><creatorcontrib>Yu, Weiren ; Iranmanesh, Sima ; Haldar, Aparajita ; Zhang, Maoyin ; Ferhatosmanoglu, Hakan</creatorcontrib><description>RoleSim and SimRank are among the popular graph-theoretic similarity measures with many applications in,
e.g.,
web search, collaborative filtering, and sociometry. While RoleSim addresses the automorphic (role) equivalence of pairwise similarity which SimRank lacks, it ignores the neighboring similarity information out of the automorphically equivalent set. Consequently, two pairs of nodes, which are not automorphically equivalent by nature, cannot be well distinguished by RoleSim if the averages of their neighboring similarities over the automorphically equivalent set are the same. To alleviate this problem: 1) We propose a novel similarity model, namely RoleSim*, which accurately evaluates pairwise role similarities in a more comprehensive manner. RoleSim* not only guarantees the automorphic equivalence that SimRank lacks, but also takes into account the neighboring similarity information outside the automorphically equivalent sets that are overlooked by RoleSim. 2) We prove the existence and uniqueness of the RoleSim* solution, and show its three axiomatic properties (
i.e.,
symmetry, boundedness, and non-increasing monotonicity). 3) We provide a concise bound for iteratively computing RoleSim* formula, and estimate the number of iterations required to attain a desired accuracy. 4) We induce a distance metric based on RoleSim* similarity, and show that the RoleSim* metric fulfills the triangular inequality, which implies the sum-transitivity of its similarity scores. 5) We present a threshold-based RoleSim* model that reduces the computational time further with provable accuracy guarantee. 6) We propose a single-source RoleSim* model, which scales well for sizable graphs. 7) We also devise methods to scale RoleSim* based search by incorporating its triangular inequality property with partitioning techniques. Our experimental results on real datasets demonstrate that RoleSim* achieves higher accuracy than its competitors while scaling well on sizable graphs with billions of edges.</description><identifier>ISSN: 1386-145X</identifier><identifier>EISSN: 1573-1413</identifier><identifier>DOI: 10.1007/s11280-021-00925-z</identifier><language>eng</language><publisher>New York: Springer US</publisher><subject>Accuracy ; Computer Science ; Computing time ; Database Management ; Equivalence ; Graph theory ; Graphs ; Information Systems Applications (incl.Internet) ; Operating Systems ; Similarity ; Special Issue on Large Scale Graph Data Analytics</subject><ispartof>World wide web (Bussum), 2022-03, Vol.25 (2), p.785-829</ispartof><rights>The Author(s) 2021</rights><rights>The Author(s) 2021. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c363t-4831e2b9848f1bf163e7d8604533270d7e3988523013ca33037e5f5d49c918153</citedby><cites>FETCH-LOGICAL-c363t-4831e2b9848f1bf163e7d8604533270d7e3988523013ca33037e5f5d49c918153</cites><orcidid>0000-0002-1082-9475</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s11280-021-00925-z$$EPDF$$P50$$Gspringer$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s11280-021-00925-z$$EHTML$$P50$$Gspringer$$Hfree_for_read</linktohtml><link.rule.ids>314,778,782,27907,27908,41471,42540,51302</link.rule.ids></links><search><creatorcontrib>Yu, Weiren</creatorcontrib><creatorcontrib>Iranmanesh, Sima</creatorcontrib><creatorcontrib>Haldar, Aparajita</creatorcontrib><creatorcontrib>Zhang, Maoyin</creatorcontrib><creatorcontrib>Ferhatosmanoglu, Hakan</creatorcontrib><title>RoleSim: Scaling axiomatic role-based similarity ranking on large graphs</title><title>World wide web (Bussum)</title><addtitle>World Wide Web</addtitle><description>RoleSim and SimRank are among the popular graph-theoretic similarity measures with many applications in,
e.g.,
web search, collaborative filtering, and sociometry. While RoleSim addresses the automorphic (role) equivalence of pairwise similarity which SimRank lacks, it ignores the neighboring similarity information out of the automorphically equivalent set. Consequently, two pairs of nodes, which are not automorphically equivalent by nature, cannot be well distinguished by RoleSim if the averages of their neighboring similarities over the automorphically equivalent set are the same. To alleviate this problem: 1) We propose a novel similarity model, namely RoleSim*, which accurately evaluates pairwise role similarities in a more comprehensive manner. RoleSim* not only guarantees the automorphic equivalence that SimRank lacks, but also takes into account the neighboring similarity information outside the automorphically equivalent sets that are overlooked by RoleSim. 2) We prove the existence and uniqueness of the RoleSim* solution, and show its three axiomatic properties (
i.e.,
symmetry, boundedness, and non-increasing monotonicity). 3) We provide a concise bound for iteratively computing RoleSim* formula, and estimate the number of iterations required to attain a desired accuracy. 4) We induce a distance metric based on RoleSim* similarity, and show that the RoleSim* metric fulfills the triangular inequality, which implies the sum-transitivity of its similarity scores. 5) We present a threshold-based RoleSim* model that reduces the computational time further with provable accuracy guarantee. 6) We propose a single-source RoleSim* model, which scales well for sizable graphs. 7) We also devise methods to scale RoleSim* based search by incorporating its triangular inequality property with partitioning techniques. Our experimental results on real datasets demonstrate that RoleSim* achieves higher accuracy than its competitors while scaling well on sizable graphs with billions of edges.</description><subject>Accuracy</subject><subject>Computer Science</subject><subject>Computing time</subject><subject>Database Management</subject><subject>Equivalence</subject><subject>Graph theory</subject><subject>Graphs</subject><subject>Information Systems Applications (incl.Internet)</subject><subject>Operating Systems</subject><subject>Similarity</subject><subject>Special Issue on Large Scale Graph Data Analytics</subject><issn>1386-145X</issn><issn>1573-1413</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>C6C</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNp9kE9LAzEQxYMoWKtfwNOC52gms8lmvUlRKxQEq-AtpLvZNXX_1GQF209v6grePM1j5r038CPkHNglMJZdBQCuGGUcKGM5F3R3QCYgMqSQAh5GjUpGLV6PyUkIa8aYxBwmZP7UN3bp2utkWZjGdXVivlzfmsEViY8nujLBlklwrWuMd8M28aZ73_v6Lomb2ia1N5u3cEqOKtMEe_Y7p-Tl7vZ5NqeLx_uH2c2CFihxoKlCsHyVq1RVsKpAos1KJVkqEHnGysxirpTgyAALg8gws6ISZZoXOSgQOCUXY-_G9x-fNgx63X_6Lr7UXGIqZSZ4Hl18dBW-D8HbSm-8a43famB6T0yPxHQkpn-I6V0M4RgK0dzV1v9V_5P6BisNbRA</recordid><startdate>20220301</startdate><enddate>20220301</enddate><creator>Yu, Weiren</creator><creator>Iranmanesh, Sima</creator><creator>Haldar, Aparajita</creator><creator>Zhang, Maoyin</creator><creator>Ferhatosmanoglu, Hakan</creator><general>Springer US</general><general>Springer Nature B.V</general><scope>C6C</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7SC</scope><scope>7XB</scope><scope>8AL</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><scope>M0N</scope><scope>P5Z</scope><scope>P62</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope><orcidid>https://orcid.org/0000-0002-1082-9475</orcidid></search><sort><creationdate>20220301</creationdate><title>RoleSim: Scaling axiomatic role-based similarity ranking on large graphs</title><author>Yu, Weiren ; Iranmanesh, Sima ; Haldar, Aparajita ; Zhang, Maoyin ; Ferhatosmanoglu, Hakan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c363t-4831e2b9848f1bf163e7d8604533270d7e3988523013ca33037e5f5d49c918153</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Accuracy</topic><topic>Computer Science</topic><topic>Computing time</topic><topic>Database Management</topic><topic>Equivalence</topic><topic>Graph theory</topic><topic>Graphs</topic><topic>Information Systems Applications (incl.Internet)</topic><topic>Operating Systems</topic><topic>Similarity</topic><topic>Special Issue on Large Scale Graph Data Analytics</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Yu, Weiren</creatorcontrib><creatorcontrib>Iranmanesh, Sima</creatorcontrib><creatorcontrib>Haldar, Aparajita</creatorcontrib><creatorcontrib>Zhang, Maoyin</creatorcontrib><creatorcontrib>Ferhatosmanoglu, Hakan</creatorcontrib><collection>Springer Nature OA Free Journals</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Computer and Information Systems Abstracts</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Computing Database (Alumni Edition)</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><collection>Computing Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><jtitle>World wide web (Bussum)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Yu, Weiren</au><au>Iranmanesh, Sima</au><au>Haldar, Aparajita</au><au>Zhang, Maoyin</au><au>Ferhatosmanoglu, Hakan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>RoleSim: Scaling axiomatic role-based similarity ranking on large graphs</atitle><jtitle>World wide web (Bussum)</jtitle><stitle>World Wide Web</stitle><date>2022-03-01</date><risdate>2022</risdate><volume>25</volume><issue>2</issue><spage>785</spage><epage>829</epage><pages>785-829</pages><issn>1386-145X</issn><eissn>1573-1413</eissn><abstract>RoleSim and SimRank are among the popular graph-theoretic similarity measures with many applications in,
e.g.,
web search, collaborative filtering, and sociometry. While RoleSim addresses the automorphic (role) equivalence of pairwise similarity which SimRank lacks, it ignores the neighboring similarity information out of the automorphically equivalent set. Consequently, two pairs of nodes, which are not automorphically equivalent by nature, cannot be well distinguished by RoleSim if the averages of their neighboring similarities over the automorphically equivalent set are the same. To alleviate this problem: 1) We propose a novel similarity model, namely RoleSim*, which accurately evaluates pairwise role similarities in a more comprehensive manner. RoleSim* not only guarantees the automorphic equivalence that SimRank lacks, but also takes into account the neighboring similarity information outside the automorphically equivalent sets that are overlooked by RoleSim. 2) We prove the existence and uniqueness of the RoleSim* solution, and show its three axiomatic properties (
i.e.,
symmetry, boundedness, and non-increasing monotonicity). 3) We provide a concise bound for iteratively computing RoleSim* formula, and estimate the number of iterations required to attain a desired accuracy. 4) We induce a distance metric based on RoleSim* similarity, and show that the RoleSim* metric fulfills the triangular inequality, which implies the sum-transitivity of its similarity scores. 5) We present a threshold-based RoleSim* model that reduces the computational time further with provable accuracy guarantee. 6) We propose a single-source RoleSim* model, which scales well for sizable graphs. 7) We also devise methods to scale RoleSim* based search by incorporating its triangular inequality property with partitioning techniques. Our experimental results on real datasets demonstrate that RoleSim* achieves higher accuracy than its competitors while scaling well on sizable graphs with billions of edges.</abstract><cop>New York</cop><pub>Springer US</pub><doi>10.1007/s11280-021-00925-z</doi><tpages>45</tpages><orcidid>https://orcid.org/0000-0002-1082-9475</orcidid><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1386-145X |
ispartof | World wide web (Bussum), 2022-03, Vol.25 (2), p.785-829 |
issn | 1386-145X 1573-1413 |
language | eng |
recordid | cdi_proquest_journals_2634667529 |
source | SpringerLink Journals - AutoHoldings |
subjects | Accuracy Computer Science Computing time Database Management Equivalence Graph theory Graphs Information Systems Applications (incl.Internet) Operating Systems Similarity Special Issue on Large Scale Graph Data Analytics |
title | RoleSim: Scaling axiomatic role-based similarity ranking on large graphs |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-16T22%3A14%3A20IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=RoleSim:%20Scaling%20axiomatic%20role-based%20similarity%20ranking%20on%20large%20graphs&rft.jtitle=World%20wide%20web%20(Bussum)&rft.au=Yu,%20Weiren&rft.date=2022-03-01&rft.volume=25&rft.issue=2&rft.spage=785&rft.epage=829&rft.pages=785-829&rft.issn=1386-145X&rft.eissn=1573-1413&rft_id=info:doi/10.1007/s11280-021-00925-z&rft_dat=%3Cproquest_cross%3E2634667529%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2634667529&rft_id=info:pmid/&rfr_iscdi=true |