Graph Ranking and the Cost of Sybil Defense

Ranking functions such as PageRank assign numeric values (ranks) to nodes of graphs, most notably the web graph. Node rankings are an integral part of Internet search algorithms, since they can be used to order the results of queries. However, these ranking functions are famously subject to attacks...

Detailed description

Bibliographic details
Published in: arXiv.org, 2023-06
Main authors: Farach-Colton, Gwendolyn; Farach-Colton, Martin; Goldberg, Leslie Ann; Komlos, Hanna; Lapinskas, John; Levi, Reut; Medina, Moti; Mosteiro, Miguel A
Format: Article
Language: eng
Subjects: Graph theory; Hubs; Mathematical models; Operators (mathematics); Random walk theory; Ranking; Search engines; Spamming
Online access: Full text
container_title arXiv.org
creator Farach-Colton, Gwendolyn
Farach-Colton, Martin
Goldberg, Leslie Ann
Komlos, Hanna
Lapinskas, John
Levi, Reut
Medina, Moti
Mosteiro, Miguel A
description Ranking functions such as PageRank assign numeric values (ranks) to nodes of graphs, most notably the web graph. Node rankings are an integral part of Internet search algorithms, since they can be used to order the results of queries. However, these ranking functions are famously subject to attacks by spammers, who modify the web graph in order to give their own pages more rank. We characterize the interplay between rankers and spammers as a game. We define the two critical features of this game, spam resistance and distortion, based on how spammers spam and how rankers protect against spam. We observe that all the ranking functions that are well-studied in the literature, including the original formulation of PageRank, have poor spam resistance, poor distortion, or both. Finally, we study Min-PPR, the form of PageRank used at Google itself, but which has received no (theoretical or empirical) treatment in the literature. We prove that Min-PPR has low distortion and high spam resistance. A secondary benefit is that Min-PPR comes with an explicit cost function on nodes that shows how important they are to the spammer; thus a ranker can focus their spam-detection capacity on these vulnerable nodes. Both Min-PPR and its associated cost function are straightforward to compute.
format Article
publisher Ithaca: Cornell University Library, arXiv.org
creationdate 2023-06-01
rights 2023. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.
oa free_for_read
fulltext fulltext
identifier EISSN: 2331-8422
ispartof arXiv.org, 2023-06
issn 2331-8422
language eng
recordid cdi_proquest_journals_2071784842
source Free E- Journals
subjects Graph theory
Hubs
Mathematical models
Operators (mathematics)
Random walk theory
Ranking
Search engines
Spamming
title Graph Ranking and the Cost of Sybil Defense
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-03T22%3A37%3A46IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=Graph%20Ranking%20and%20the%20Cost%20of%20Sybil%20Defense&rft.jtitle=arXiv.org&rft.au=Farach-Colton,%20Gwendolyn&rft.date=2023-06-01&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2071784842%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2071784842&rft_id=info:pmid/&rfr_iscdi=true
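
The description above says that ranking functions such as PageRank assign ranks to nodes and that spammers modify the web graph to give their own pages more rank. The following minimal sketch illustrates that attack on a toy graph. It uses the standard textbook PageRank with power iteration, not Min-PPR, which this record does not define. The graph, the node names, the damping factor of 0.85, and the helpers `pagerank` and `add_link_farm` are all assumptions made for illustration and do not come from the paper.

```python
def pagerank(out_links, damping=0.85, iterations=100):
    """Textbook power-iteration PageRank on a dict {node: [successors]}."""
    nodes = sorted(set(out_links) | {v for vs in out_links.values() for v in vs})
    n = len(nodes)
    rank = {u: 1.0 / n for u in nodes}
    for _ in range(iterations):
        # Rank held by dangling nodes (no out-links) is spread uniformly.
        dangling = sum(rank[u] for u in nodes if not out_links.get(u))
        new_rank = {u: (1.0 - damping) / n + damping * dangling / n for u in nodes}
        for u in nodes:
            targets = out_links.get(u, [])
            for v in targets:
                # Each node splits its rank evenly among its out-links.
                new_rank[v] += damping * rank[u] / len(targets)
        rank = new_rank
    return rank


def add_link_farm(out_links, target, size):
    """Hypothetical attack: copy the graph and add `size` fake pages linking to `target`."""
    farmed = {u: list(vs) for u, vs in out_links.items()}
    for i in range(size):
        farmed[f"sybil{i}"] = [target]
    return farmed


# Toy web graph (assumed for illustration): no page links to "spam".
honest = {"a": ["b", "c"], "b": ["c"], "c": ["a"], "spam": ["a"]}
before = pagerank(honest)
after = pagerank(add_link_farm(honest, "spam", 20))
print(f"rank of 'spam' before: {before['spam']:.4f}, after: {after['spam']:.4f}")
```

In this toy setting the page `spam` starts with only the baseline rank every node receives, and the 20 fake pages raise its share of the total rank even though no honest page links to it. This is the kind of manipulation that the spam-resistance property in the description is meant to limit.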