Tell Me Who Your Friends Are: Using Content Sharing Behavior for News Source Veracity Detection

Stopping the malicious spread and production of false and misleading news has become a top priority for researchers. Due to this prevalence, many automated methods for detecting low quality information have been introduced. The majority of these methods have used article-level features, such as thei...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Gruppi, Maurício, Horne, Benjamin D, Adalı, Sibel
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator Gruppi, Maurício
Horne, Benjamin D
Adalı, Sibel
description Stopping the malicious spread and production of false and misleading news has become a top priority for researchers. Due to this prevalence, many automated methods for detecting low quality information have been introduced. The majority of these methods have used article-level features, such as their writing style, to detect veracity. While writing style models have been shown to work well in lab-settings, there are concerns of generalizability and robustness. In this paper, we begin to address these concerns by proposing a novel and robust news veracity detection model that uses the content sharing behavior of news sources formulated as a network. We represent these content sharing networks (CSN) using a deep walk based method for embedding graphs that accounts for similarity in both the network space and the article text space. We show that state of the art writing style and CSN features make diverse mistakes when predicting, meaning that they both play different roles in the classification task. Moreover, we show that the addition of CSN features increases the accuracy of writing style models, boosting accuracy as much as 14\% when using Random Forests. Similarly, we show that the combination of hand-crafted article-level features and CSN features is robust to concept drift, performing consistently well over a 10-month time frame.
doi_str_mv 10.48550/arxiv.2101.10973
format Article
fullrecord <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2101_10973</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2101_10973</sourcerecordid><originalsourceid>FETCH-LOGICAL-a673-7d5f18d6c166dd2ac5aea0f2796de80ba7771fc08c41c5275f35fe1b2853f98d3</originalsourceid><addsrcrecordid>eNotj7FOwzAURb0woMIHMPX9QIId17HDVgIFpFKGBhBT9Go_E0shQU4o9O9pC8PV1R3OlQ5jF4KnM6MUv8T4E7ZpJrhIBS-0PGV1RW0LjwSvTQ9v_VeERQzUuQHmka7geQjdO5R9N1I3wrrBeNjX1OA29BH8Piv6HmC9Jy3BC0W0YdzBDY1kx9B3Z-zEYzvQ-X9PWLW4rcr7ZPl091DOlwnmWibaKS-My63Ic-cytAoJuc90kTsyfINaa-EtN3YmrMq08lJ5EpvMKOkL4-SETf9uj4b1ZwwfGHf1wbQ-mspfs_BOWw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Tell Me Who Your Friends Are: Using Content Sharing Behavior for News Source Veracity Detection</title><source>arXiv.org</source><creator>Gruppi, Maurício ; Horne, Benjamin D ; Adalı, Sibel</creator><creatorcontrib>Gruppi, Maurício ; Horne, Benjamin D ; Adalı, Sibel</creatorcontrib><description>Stopping the malicious spread and production of false and misleading news has become a top priority for researchers. Due to this prevalence, many automated methods for detecting low quality information have been introduced. The majority of these methods have used article-level features, such as their writing style, to detect veracity. While writing style models have been shown to work well in lab-settings, there are concerns of generalizability and robustness. In this paper, we begin to address these concerns by proposing a novel and robust news veracity detection model that uses the content sharing behavior of news sources formulated as a network. We represent these content sharing networks (CSN) using a deep walk based method for embedding graphs that accounts for similarity in both the network space and the article text space. We show that state of the art writing style and CSN features make diverse mistakes when predicting, meaning that they both play different roles in the classification task. Moreover, we show that the addition of CSN features increases the accuracy of writing style models, boosting accuracy as much as 14\% when using Random Forests. Similarly, we show that the combination of hand-crafted article-level features and CSN features is robust to concept drift, performing consistently well over a 10-month time frame.</description><identifier>DOI: 10.48550/arxiv.2101.10973</identifier><language>eng</language><subject>Computer Science - Computers and Society ; Computer Science - Learning ; Computer Science - Social and Information Networks</subject><creationdate>2021-01</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,777,882</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2101.10973$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2101.10973$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Gruppi, Maurício</creatorcontrib><creatorcontrib>Horne, Benjamin D</creatorcontrib><creatorcontrib>Adalı, Sibel</creatorcontrib><title>Tell Me Who Your Friends Are: Using Content Sharing Behavior for News Source Veracity Detection</title><description>Stopping the malicious spread and production of false and misleading news has become a top priority for researchers. Due to this prevalence, many automated methods for detecting low quality information have been introduced. The majority of these methods have used article-level features, such as their writing style, to detect veracity. While writing style models have been shown to work well in lab-settings, there are concerns of generalizability and robustness. In this paper, we begin to address these concerns by proposing a novel and robust news veracity detection model that uses the content sharing behavior of news sources formulated as a network. We represent these content sharing networks (CSN) using a deep walk based method for embedding graphs that accounts for similarity in both the network space and the article text space. We show that state of the art writing style and CSN features make diverse mistakes when predicting, meaning that they both play different roles in the classification task. Moreover, we show that the addition of CSN features increases the accuracy of writing style models, boosting accuracy as much as 14\% when using Random Forests. Similarly, we show that the combination of hand-crafted article-level features and CSN features is robust to concept drift, performing consistently well over a 10-month time frame.</description><subject>Computer Science - Computers and Society</subject><subject>Computer Science - Learning</subject><subject>Computer Science - Social and Information Networks</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj7FOwzAURb0woMIHMPX9QIId17HDVgIFpFKGBhBT9Go_E0shQU4o9O9pC8PV1R3OlQ5jF4KnM6MUv8T4E7ZpJrhIBS-0PGV1RW0LjwSvTQ9v_VeERQzUuQHmka7geQjdO5R9N1I3wrrBeNjX1OA29BH8Piv6HmC9Jy3BC0W0YdzBDY1kx9B3Z-zEYzvQ-X9PWLW4rcr7ZPl091DOlwnmWibaKS-My63Ic-cytAoJuc90kTsyfINaa-EtN3YmrMq08lJ5EpvMKOkL4-SETf9uj4b1ZwwfGHf1wbQ-mspfs_BOWw</recordid><startdate>20210115</startdate><enddate>20210115</enddate><creator>Gruppi, Maurício</creator><creator>Horne, Benjamin D</creator><creator>Adalı, Sibel</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20210115</creationdate><title>Tell Me Who Your Friends Are: Using Content Sharing Behavior for News Source Veracity Detection</title><author>Gruppi, Maurício ; Horne, Benjamin D ; Adalı, Sibel</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a673-7d5f18d6c166dd2ac5aea0f2796de80ba7771fc08c41c5275f35fe1b2853f98d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Computer Science - Computers and Society</topic><topic>Computer Science - Learning</topic><topic>Computer Science - Social and Information Networks</topic><toplevel>online_resources</toplevel><creatorcontrib>Gruppi, Maurício</creatorcontrib><creatorcontrib>Horne, Benjamin D</creatorcontrib><creatorcontrib>Adalı, Sibel</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Gruppi, Maurício</au><au>Horne, Benjamin D</au><au>Adalı, Sibel</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Tell Me Who Your Friends Are: Using Content Sharing Behavior for News Source Veracity Detection</atitle><date>2021-01-15</date><risdate>2021</risdate><abstract>Stopping the malicious spread and production of false and misleading news has become a top priority for researchers. Due to this prevalence, many automated methods for detecting low quality information have been introduced. The majority of these methods have used article-level features, such as their writing style, to detect veracity. While writing style models have been shown to work well in lab-settings, there are concerns of generalizability and robustness. In this paper, we begin to address these concerns by proposing a novel and robust news veracity detection model that uses the content sharing behavior of news sources formulated as a network. We represent these content sharing networks (CSN) using a deep walk based method for embedding graphs that accounts for similarity in both the network space and the article text space. We show that state of the art writing style and CSN features make diverse mistakes when predicting, meaning that they both play different roles in the classification task. Moreover, we show that the addition of CSN features increases the accuracy of writing style models, boosting accuracy as much as 14\% when using Random Forests. Similarly, we show that the combination of hand-crafted article-level features and CSN features is robust to concept drift, performing consistently well over a 10-month time frame.</abstract><doi>10.48550/arxiv.2101.10973</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier DOI: 10.48550/arxiv.2101.10973
ispartof
issn
language eng
recordid cdi_arxiv_primary_2101_10973
source arXiv.org
subjects Computer Science - Computers and Society
Computer Science - Learning
Computer Science - Social and Information Networks
title Tell Me Who Your Friends Are: Using Content Sharing Behavior for News Source Veracity Detection
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-17T16%3A22%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Tell%20Me%20Who%20Your%20Friends%20Are:%20Using%20Content%20Sharing%20Behavior%20for%20News%20Source%20Veracity%20Detection&rft.au=Gruppi,%20Maur%C3%ADcio&rft.date=2021-01-15&rft_id=info:doi/10.48550/arxiv.2101.10973&rft_dat=%3Carxiv_GOX%3E2101_10973%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true