Tell Me Who Your Friends Are: Using Content Sharing Behavior for News Source Veracity Detection

Stopping the malicious spread and production of false and misleading news has become a top priority for researchers. Due to this prevalence, many automated methods for detecting low quality information have been introduced. The majority of these methods have used article-level features, such as thei...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Gruppi, Maurício, Horne, Benjamin D, Adalı, Sibel
Format:	Artikel
Sprache:	eng
Schlagworte:	Computer Science - Computers and Society Computer Science - Learning Computer Science - Social and Information Networks
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	Gruppi, Maurício Horne, Benjamin D Adalı, Sibel
description	Stopping the malicious spread and production of false and misleading news has become a top priority for researchers. Due to this prevalence, many automated methods for detecting low quality information have been introduced. The majority of these methods have used article-level features, such as their writing style, to detect veracity. While writing style models have been shown to work well in lab-settings, there are concerns of generalizability and robustness. In this paper, we begin to address these concerns by proposing a novel and robust news veracity detection model that uses the content sharing behavior of news sources formulated as a network. We represent these content sharing networks (CSN) using a deep walk based method for embedding graphs that accounts for similarity in both the network space and the article text space. We show that state of the art writing style and CSN features make diverse mistakes when predicting, meaning that they both play different roles in the classification task. Moreover, we show that the addition of CSN features increases the accuracy of writing style models, boosting accuracy as much as 14\% when using Random Forests. Similarly, we show that the combination of hand-crafted article-level features and CSN features is robust to concept drift, performing consistently well over a 10-month time frame.
doi_str_mv	10.48550/arxiv.2101.10973
format	Article
fullrecord	<record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2101_10973</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2101_10973</sourcerecordid><originalsourceid>FETCH-LOGICAL-a673-7d5f18d6c166dd2ac5aea0f2796de80ba7771fc08c41c5275f35fe1b2853f98d3</originalsourceid><addsrcrecordid>eNotj7FOwzAURb0woMIHMPX9QIId17HDVgIFpFKGBhBT9Go_E0shQU4o9O9pC8PV1R3OlQ5jF4KnM6MUv8T4E7ZpJrhIBS-0PGV1RW0LjwSvTQ9v_VeERQzUuQHmka7geQjdO5R9N1I3wrrBeNjX1OA29BH8Piv6HmC9Jy3BC0W0YdzBDY1kx9B3Z-zEYzvQ-X9PWLW4rcr7ZPl091DOlwnmWibaKS-My63Ic-cytAoJuc90kTsyfINaa-EtN3YmrMq08lJ5EpvMKOkL4-SETf9uj4b1ZwwfGHf1wbQ-mspfs_BOWw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Tell Me Who Your Friends Are: Using Content Sharing Behavior for News Source Veracity Detection</title><source>arXiv.org</source><creator>Gruppi, Maurício ; Horne, Benjamin D ; Adalı, Sibel</creator><creatorcontrib>Gruppi, Maurício ; Horne, Benjamin D ; Adalı, Sibel</creatorcontrib><description>Stopping the malicious spread and production of false and misleading news has become a top priority for researchers. Due to this prevalence, many automated methods for detecting low quality information have been introduced. The majority of these methods have used article-level features, such as their writing style, to detect veracity. While writing style models have been shown to work well in lab-settings, there are concerns of generalizability and robustness. In this paper, we begin to address these concerns by proposing a novel and robust news veracity detection model that uses the content sharing behavior of news sources formulated as a network. We represent these content sharing networks (CSN) using a deep walk based method for embedding graphs that accounts for similarity in both the network space and the article text space. We show that state of the art writing style and CSN features make diverse mistakes when predicting, meaning that they both play different roles in the classification task. Moreover, we show that the addition of CSN features increases the accuracy of writing style models, boosting accuracy as much as 14\% when using Random Forests. Similarly, we show that the combination of hand-crafted article-level features and CSN features is robust to concept drift, performing consistently well over a 10-month time frame.</description><identifier>DOI: 10.48550/arxiv.2101.10973</identifier><language>eng</language><subject>Computer Science - Computers and Society ; Computer Science - Learning ; Computer Science - Social and Information Networks</subject><creationdate>2021-01</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,777,882</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2101.10973$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2101.10973$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Gruppi, Maurício</creatorcontrib><creatorcontrib>Horne, Benjamin D</creatorcontrib><creatorcontrib>Adalı, Sibel</creatorcontrib><title>Tell Me Who Your Friends Are: Using Content Sharing Behavior for News Source Veracity Detection</title><description>Stopping the malicious spread and production of false and misleading news has become a top priority for researchers. Due to this prevalence, many automated methods for detecting low quality information have been introduced. The majority of these methods have used article-level features, such as their writing style, to detect veracity. While writing style models have been shown to work well in lab-settings, there are concerns of generalizability and robustness. In this paper, we begin to address these concerns by proposing a novel and robust news veracity detection model that uses the content sharing behavior of news sources formulated as a network. We represent these content sharing networks (CSN) using a deep walk based method for embedding graphs that accounts for similarity in both the network space and the article text space. We show that state of the art writing style and CSN features make diverse mistakes when predicting, meaning that they both play different roles in the classification task. Moreover, we show that the addition of CSN features increases the accuracy of writing style models, boosting accuracy as much as 14\% when using Random Forests. Similarly, we show that the combination of hand-crafted article-level features and CSN features is robust to concept drift, performing consistently well over a 10-month time frame.</description><subject>Computer Science - Computers and Society</subject><subject>Computer Science - Learning</subject><subject>Computer Science - Social and Information Networks</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj7FOwzAURb0woMIHMPX9QIId17HDVgIFpFKGBhBT9Go_E0shQU4o9O9pC8PV1R3OlQ5jF4KnM6MUv8T4E7ZpJrhIBS-0PGV1RW0LjwSvTQ9v_VeERQzUuQHmka7geQjdO5R9N1I3wrrBeNjX1OA29BH8Piv6HmC9Jy3BC0W0YdzBDY1kx9B3Z-zEYzvQ-X9PWLW4rcr7ZPl091DOlwnmWibaKS-My63Ic-cytAoJuc90kTsyfINaa-EtN3YmrMq08lJ5EpvMKOkL4-SETf9uj4b1ZwwfGHf1wbQ-mspfs_BOWw</recordid><startdate>20210115</startdate><enddate>20210115</enddate><creator>Gruppi, Maurício</creator><creator>Horne, Benjamin D</creator><creator>Adalı, Sibel</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20210115</creationdate><title>Tell Me Who Your Friends Are: Using Content Sharing Behavior for News Source Veracity Detection</title><author>Gruppi, Maurício ; Horne, Benjamin D ; Adalı, Sibel</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a673-7d5f18d6c166dd2ac5aea0f2796de80ba7771fc08c41c5275f35fe1b2853f98d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Computer Science - Computers and Society</topic><topic>Computer Science - Learning</topic><topic>Computer Science - Social and Information Networks</topic><toplevel>online_resources</toplevel><creatorcontrib>Gruppi, Maurício</creatorcontrib><creatorcontrib>Horne, Benjamin D</creatorcontrib><creatorcontrib>Adalı, Sibel</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Gruppi, Maurício</au><au>Horne, Benjamin D</au><au>Adalı, Sibel</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Tell Me Who Your Friends Are: Using Content Sharing Behavior for News Source Veracity Detection</atitle><date>2021-01-15</date><risdate>2021</risdate><abstract>Stopping the malicious spread and production of false and misleading news has become a top priority for researchers. Due to this prevalence, many automated methods for detecting low quality information have been introduced. The majority of these methods have used article-level features, such as their writing style, to detect veracity. While writing style models have been shown to work well in lab-settings, there are concerns of generalizability and robustness. In this paper, we begin to address these concerns by proposing a novel and robust news veracity detection model that uses the content sharing behavior of news sources formulated as a network. We represent these content sharing networks (CSN) using a deep walk based method for embedding graphs that accounts for similarity in both the network space and the article text space. We show that state of the art writing style and CSN features make diverse mistakes when predicting, meaning that they both play different roles in the classification task. Moreover, we show that the addition of CSN features increases the accuracy of writing style models, boosting accuracy as much as 14\% when using Random Forests. Similarly, we show that the combination of hand-crafted article-level features and CSN features is robust to concept drift, performing consistently well over a 10-month time frame.</abstract><doi>10.48550/arxiv.2101.10973</doi><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier	DOI: 10.48550/arxiv.2101.10973
ispartof
issn
language	eng
recordid	cdi_arxiv_primary_2101_10973
source	arXiv.org
subjects	Computer Science - Computers and Society Computer Science - Learning Computer Science - Social and Information Networks
title	Tell Me Who Your Friends Are: Using Content Sharing Behavior for News Source Veracity Detection
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-17T16%3A22%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Tell%20Me%20Who%20Your%20Friends%20Are:%20Using%20Content%20Sharing%20Behavior%20for%20News%20Source%20Veracity%20Detection&rft.au=Gruppi,%20Maur%C3%ADcio&rft.date=2021-01-15&rft_id=info:doi/10.48550/arxiv.2101.10973&rft_dat=%3Carxiv_GOX%3E2101_10973%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true