Tell Me Who Your Friends Are: Using Content Sharing Behavior for News Source Veracity Detection
Stopping the malicious spread and production of false and misleading news has become a top priority for researchers. Due to this prevalence, many automated methods for detecting low quality information have been introduced. The majority of these methods have used article-level features, such as thei...
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | Gruppi, Maurício Horne, Benjamin D Adalı, Sibel |
description | Stopping the malicious spread and production of false and misleading news has
become a top priority for researchers. Due to this prevalence, many automated
methods for detecting low quality information have been introduced. The
majority of these methods have used article-level features, such as their
writing style, to detect veracity. While writing style models have been shown
to work well in lab-settings, there are concerns of generalizability and
robustness. In this paper, we begin to address these concerns by proposing a
novel and robust news veracity detection model that uses the content sharing
behavior of news sources formulated as a network. We represent these content
sharing networks (CSN) using a deep walk based method for embedding graphs that
accounts for similarity in both the network space and the article text space.
We show that state of the art writing style and CSN features make diverse
mistakes when predicting, meaning that they both play different roles in the
classification task. Moreover, we show that the addition of CSN features
increases the accuracy of writing style models, boosting accuracy as much as
14\% when using Random Forests. Similarly, we show that the combination of
hand-crafted article-level features and CSN features is robust to concept
drift, performing consistently well over a 10-month time frame. |
doi_str_mv | 10.48550/arxiv.2101.10973 |
format | Article |
fullrecord | <record><control><sourceid>arxiv_GOX</sourceid><recordid>TN_cdi_arxiv_primary_2101_10973</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2101_10973</sourcerecordid><originalsourceid>FETCH-LOGICAL-a673-7d5f18d6c166dd2ac5aea0f2796de80ba7771fc08c41c5275f35fe1b2853f98d3</originalsourceid><addsrcrecordid>eNotj7FOwzAURb0woMIHMPX9QIId17HDVgIFpFKGBhBT9Go_E0shQU4o9O9pC8PV1R3OlQ5jF4KnM6MUv8T4E7ZpJrhIBS-0PGV1RW0LjwSvTQ9v_VeERQzUuQHmka7geQjdO5R9N1I3wrrBeNjX1OA29BH8Piv6HmC9Jy3BC0W0YdzBDY1kx9B3Z-zEYzvQ-X9PWLW4rcr7ZPl091DOlwnmWibaKS-My63Ic-cytAoJuc90kTsyfINaa-EtN3YmrMq08lJ5EpvMKOkL4-SETf9uj4b1ZwwfGHf1wbQ-mspfs_BOWw</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype></control><display><type>article</type><title>Tell Me Who Your Friends Are: Using Content Sharing Behavior for News Source Veracity Detection</title><source>arXiv.org</source><creator>Gruppi, Maurício ; Horne, Benjamin D ; Adalı, Sibel</creator><creatorcontrib>Gruppi, Maurício ; Horne, Benjamin D ; Adalı, Sibel</creatorcontrib><description>Stopping the malicious spread and production of false and misleading news has
become a top priority for researchers. Due to this prevalence, many automated
methods for detecting low quality information have been introduced. The
majority of these methods have used article-level features, such as their
writing style, to detect veracity. While writing style models have been shown
to work well in lab-settings, there are concerns of generalizability and
robustness. In this paper, we begin to address these concerns by proposing a
novel and robust news veracity detection model that uses the content sharing
behavior of news sources formulated as a network. We represent these content
sharing networks (CSN) using a deep walk based method for embedding graphs that
accounts for similarity in both the network space and the article text space.
We show that state of the art writing style and CSN features make diverse
mistakes when predicting, meaning that they both play different roles in the
classification task. Moreover, we show that the addition of CSN features
increases the accuracy of writing style models, boosting accuracy as much as
14\% when using Random Forests. Similarly, we show that the combination of
hand-crafted article-level features and CSN features is robust to concept
drift, performing consistently well over a 10-month time frame.</description><identifier>DOI: 10.48550/arxiv.2101.10973</identifier><language>eng</language><subject>Computer Science - Computers and Society ; Computer Science - Learning ; Computer Science - Social and Information Networks</subject><creationdate>2021-01</creationdate><rights>http://arxiv.org/licenses/nonexclusive-distrib/1.0</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>228,230,777,882</link.rule.ids><linktorsrc>$$Uhttps://arxiv.org/abs/2101.10973$$EView_record_in_Cornell_University$$FView_record_in_$$GCornell_University$$Hfree_for_read</linktorsrc><backlink>$$Uhttps://doi.org/10.48550/arXiv.2101.10973$$DView paper in arXiv$$Hfree_for_read</backlink></links><search><creatorcontrib>Gruppi, Maurício</creatorcontrib><creatorcontrib>Horne, Benjamin D</creatorcontrib><creatorcontrib>Adalı, Sibel</creatorcontrib><title>Tell Me Who Your Friends Are: Using Content Sharing Behavior for News Source Veracity Detection</title><description>Stopping the malicious spread and production of false and misleading news has
become a top priority for researchers. Due to this prevalence, many automated
methods for detecting low quality information have been introduced. The
majority of these methods have used article-level features, such as their
writing style, to detect veracity. While writing style models have been shown
to work well in lab-settings, there are concerns of generalizability and
robustness. In this paper, we begin to address these concerns by proposing a
novel and robust news veracity detection model that uses the content sharing
behavior of news sources formulated as a network. We represent these content
sharing networks (CSN) using a deep walk based method for embedding graphs that
accounts for similarity in both the network space and the article text space.
We show that state of the art writing style and CSN features make diverse
mistakes when predicting, meaning that they both play different roles in the
classification task. Moreover, we show that the addition of CSN features
increases the accuracy of writing style models, boosting accuracy as much as
14\% when using Random Forests. Similarly, we show that the combination of
hand-crafted article-level features and CSN features is robust to concept
drift, performing consistently well over a 10-month time frame.</description><subject>Computer Science - Computers and Society</subject><subject>Computer Science - Learning</subject><subject>Computer Science - Social and Information Networks</subject><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>GOX</sourceid><recordid>eNotj7FOwzAURb0woMIHMPX9QIId17HDVgIFpFKGBhBT9Go_E0shQU4o9O9pC8PV1R3OlQ5jF4KnM6MUv8T4E7ZpJrhIBS-0PGV1RW0LjwSvTQ9v_VeERQzUuQHmka7geQjdO5R9N1I3wrrBeNjX1OA29BH8Piv6HmC9Jy3BC0W0YdzBDY1kx9B3Z-zEYzvQ-X9PWLW4rcr7ZPl091DOlwnmWibaKS-My63Ic-cytAoJuc90kTsyfINaa-EtN3YmrMq08lJ5EpvMKOkL4-SETf9uj4b1ZwwfGHf1wbQ-mspfs_BOWw</recordid><startdate>20210115</startdate><enddate>20210115</enddate><creator>Gruppi, Maurício</creator><creator>Horne, Benjamin D</creator><creator>Adalı, Sibel</creator><scope>AKY</scope><scope>GOX</scope></search><sort><creationdate>20210115</creationdate><title>Tell Me Who Your Friends Are: Using Content Sharing Behavior for News Source Veracity Detection</title><author>Gruppi, Maurício ; Horne, Benjamin D ; Adalı, Sibel</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-a673-7d5f18d6c166dd2ac5aea0f2796de80ba7771fc08c41c5275f35fe1b2853f98d3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Computer Science - Computers and Society</topic><topic>Computer Science - Learning</topic><topic>Computer Science - Social and Information Networks</topic><toplevel>online_resources</toplevel><creatorcontrib>Gruppi, Maurício</creatorcontrib><creatorcontrib>Horne, Benjamin D</creatorcontrib><creatorcontrib>Adalı, Sibel</creatorcontrib><collection>arXiv Computer Science</collection><collection>arXiv.org</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Gruppi, Maurício</au><au>Horne, Benjamin D</au><au>Adalı, Sibel</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Tell Me Who Your Friends Are: Using Content Sharing Behavior for News Source Veracity Detection</atitle><date>2021-01-15</date><risdate>2021</risdate><abstract>Stopping the malicious spread and production of false and misleading news has
become a top priority for researchers. Due to this prevalence, many automated
methods for detecting low quality information have been introduced. The
majority of these methods have used article-level features, such as their
writing style, to detect veracity. While writing style models have been shown
to work well in lab-settings, there are concerns of generalizability and
robustness. In this paper, we begin to address these concerns by proposing a
novel and robust news veracity detection model that uses the content sharing
behavior of news sources formulated as a network. We represent these content
sharing networks (CSN) using a deep walk based method for embedding graphs that
accounts for similarity in both the network space and the article text space.
We show that state of the art writing style and CSN features make diverse
mistakes when predicting, meaning that they both play different roles in the
classification task. Moreover, we show that the addition of CSN features
increases the accuracy of writing style models, boosting accuracy as much as
14\% when using Random Forests. Similarly, we show that the combination of
hand-crafted article-level features and CSN features is robust to concept
drift, performing consistently well over a 10-month time frame.</abstract><doi>10.48550/arxiv.2101.10973</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | DOI: 10.48550/arxiv.2101.10973 |
ispartof | |
issn | |
language | eng |
recordid | cdi_arxiv_primary_2101_10973 |
source | arXiv.org |
subjects | Computer Science - Computers and Society Computer Science - Learning Computer Science - Social and Information Networks |
title | Tell Me Who Your Friends Are: Using Content Sharing Behavior for News Source Veracity Detection |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-17T16%3A22%3A37IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-arxiv_GOX&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Tell%20Me%20Who%20Your%20Friends%20Are:%20Using%20Content%20Sharing%20Behavior%20for%20News%20Source%20Veracity%20Detection&rft.au=Gruppi,%20Maur%C3%ADcio&rft.date=2021-01-15&rft_id=info:doi/10.48550/arxiv.2101.10973&rft_dat=%3Carxiv_GOX%3E2101_10973%3C/arxiv_GOX%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |