An Exploration of Unreliable News Classification in Brazil and The U.S

The propagation of unreliable information is on the rise in many places around the world. This expansion is facilitated by the rapid spread of information and anonymity granted by the Internet. The spread of unreliable information is a wellstudied issue and it is associated with negative social impa...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	arXiv.org 2018-06
Hauptverfasser:	Gruppi, Mauricio, Horne, Benjamin D, Adali, Sibel
Format:	Artikel
Sprache:	eng
Schlagworte:	Datasets Information dissemination Machine learning News Privacy
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title	arXiv.org
container_volume
creator	Gruppi, Mauricio Horne, Benjamin D Adali, Sibel
description	The propagation of unreliable information is on the rise in many places around the world. This expansion is facilitated by the rapid spread of information and anonymity granted by the Internet. The spread of unreliable information is a wellstudied issue and it is associated with negative social impacts. In a previous work, we have identified significant differences in the structure of news articles from reliable and unreliable sources in the US media. Our goal in this work was to explore such differences in the Brazilian media. We found significant features in two data sets: one with Brazilian news in Portuguese and another one with US news in English. Our results show that features related to the writing style were prominent in both data sets and, despite the language difference, some features have a universal behavior, being significant to both US and Brazilian news articles. Finally, we combined both data sets and used the universal features to build a machine learning classifier to predict the source type of a news article as reliable or unreliable.
format	Article
fullrecord	<record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2073833917</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2073833917</sourcerecordid><originalsourceid>FETCH-proquest_journals_20738339173</originalsourceid><addsrcrecordid>eNqNykELgjAYgOERBEn5Hz7obMx9mXYsUTp1Sc-yatJkbLZPKfr1BfUDOr2H95mwQCDGUbYWYsZCoo5zLjapSBIMWLmzUDx747wctLPgWqitV0bLs1FwVA-C3Egi3erLV2gLey9f2oC0V6huCurVacGmrTSkwl_nbFkWVX6Ieu_uo6Kh6dzo7Wc1gqeYIW7jFP9Tb0dIOis</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2073833917</pqid></control><display><type>article</type><title>An Exploration of Unreliable News Classification in Brazil and The U.S</title><source>Free E- Journals</source><creator>Gruppi, Mauricio ; Horne, Benjamin D ; Adali, Sibel</creator><creatorcontrib>Gruppi, Mauricio ; Horne, Benjamin D ; Adali, Sibel</creatorcontrib><description>The propagation of unreliable information is on the rise in many places around the world. This expansion is facilitated by the rapid spread of information and anonymity granted by the Internet. The spread of unreliable information is a wellstudied issue and it is associated with negative social impacts. In a previous work, we have identified significant differences in the structure of news articles from reliable and unreliable sources in the US media. Our goal in this work was to explore such differences in the Brazilian media. We found significant features in two data sets: one with Brazilian news in Portuguese and another one with US news in English. Our results show that features related to the writing style were prominent in both data sets and, despite the language difference, some features have a universal behavior, being significant to both US and Brazilian news articles. Finally, we combined both data sets and used the universal features to build a machine learning classifier to predict the source type of a news article as reliable or unreliable.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Datasets ; Information dissemination ; Machine learning ; News ; Privacy</subject><ispartof>arXiv.org, 2018-06</ispartof><rights>2018. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>777,781</link.rule.ids></links><search><creatorcontrib>Gruppi, Mauricio</creatorcontrib><creatorcontrib>Horne, Benjamin D</creatorcontrib><creatorcontrib>Adali, Sibel</creatorcontrib><title>An Exploration of Unreliable News Classification in Brazil and The U.S</title><title>arXiv.org</title><description>The propagation of unreliable information is on the rise in many places around the world. This expansion is facilitated by the rapid spread of information and anonymity granted by the Internet. The spread of unreliable information is a wellstudied issue and it is associated with negative social impacts. In a previous work, we have identified significant differences in the structure of news articles from reliable and unreliable sources in the US media. Our goal in this work was to explore such differences in the Brazilian media. We found significant features in two data sets: one with Brazilian news in Portuguese and another one with US news in English. Our results show that features related to the writing style were prominent in both data sets and, despite the language difference, some features have a universal behavior, being significant to both US and Brazilian news articles. Finally, we combined both data sets and used the universal features to build a machine learning classifier to predict the source type of a news article as reliable or unreliable.</description><subject>Datasets</subject><subject>Information dissemination</subject><subject>Machine learning</subject><subject>News</subject><subject>Privacy</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNqNykELgjAYgOERBEn5Hz7obMx9mXYsUTp1Sc-yatJkbLZPKfr1BfUDOr2H95mwQCDGUbYWYsZCoo5zLjapSBIMWLmzUDx747wctLPgWqitV0bLs1FwVA-C3Egi3erLV2gLey9f2oC0V6huCurVacGmrTSkwl_nbFkWVX6Ieu_uo6Kh6dzo7Wc1gqeYIW7jFP9Tb0dIOis</recordid><startdate>20180607</startdate><enddate>20180607</enddate><creator>Gruppi, Mauricio</creator><creator>Horne, Benjamin D</creator><creator>Adali, Sibel</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20180607</creationdate><title>An Exploration of Unreliable News Classification in Brazil and The U.S</title><author>Gruppi, Mauricio ; Horne, Benjamin D ; Adali, Sibel</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_20738339173</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Datasets</topic><topic>Information dissemination</topic><topic>Machine learning</topic><topic>News</topic><topic>Privacy</topic><toplevel>online_resources</toplevel><creatorcontrib>Gruppi, Mauricio</creatorcontrib><creatorcontrib>Horne, Benjamin D</creatorcontrib><creatorcontrib>Adali, Sibel</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Gruppi, Mauricio</au><au>Horne, Benjamin D</au><au>Adali, Sibel</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>An Exploration of Unreliable News Classification in Brazil and The U.S</atitle><jtitle>arXiv.org</jtitle><date>2018-06-07</date><risdate>2018</risdate><eissn>2331-8422</eissn><abstract>The propagation of unreliable information is on the rise in many places around the world. This expansion is facilitated by the rapid spread of information and anonymity granted by the Internet. The spread of unreliable information is a wellstudied issue and it is associated with negative social impacts. In a previous work, we have identified significant differences in the structure of news articles from reliable and unreliable sources in the US media. Our goal in this work was to explore such differences in the Brazilian media. We found significant features in two data sets: one with Brazilian news in Portuguese and another one with US news in English. Our results show that features related to the writing style were prominent in both data sets and, despite the language difference, some features have a universal behavior, being significant to both US and Brazilian news articles. Finally, we combined both data sets and used the universal features to build a machine learning classifier to predict the source type of a news article as reliable or unreliable.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	EISSN: 2331-8422
ispartof	arXiv.org, 2018-06
issn	2331-8422
language	eng
recordid	cdi_proquest_journals_2073833917
source	Free E- Journals
subjects	Datasets Information dissemination Machine learning News Privacy
title	An Exploration of Unreliable News Classification in Brazil and The U.S
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-18T00%3A38%3A33IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=An%20Exploration%20of%20Unreliable%20News%20Classification%20in%20Brazil%20and%20The%20U.S&rft.jtitle=arXiv.org&rft.au=Gruppi,%20Mauricio&rft.date=2018-06-07&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2073833917%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2073833917&rft_id=info:pmid/&rfr_iscdi=true