An Exploration of Unreliable News Classification in Brazil and The U.S
The propagation of unreliable information is on the rise in many places around the world. This expansion is facilitated by the rapid spread of information and anonymity granted by the Internet. The spread of unreliable information is a wellstudied issue and it is associated with negative social impa...
Gespeichert in:
Veröffentlicht in: | arXiv.org 2018-06 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | arXiv.org |
container_volume | |
creator | Gruppi, Mauricio Horne, Benjamin D Adali, Sibel |
description | The propagation of unreliable information is on the rise in many places around the world. This expansion is facilitated by the rapid spread of information and anonymity granted by the Internet. The spread of unreliable information is a wellstudied issue and it is associated with negative social impacts. In a previous work, we have identified significant differences in the structure of news articles from reliable and unreliable sources in the US media. Our goal in this work was to explore such differences in the Brazilian media. We found significant features in two data sets: one with Brazilian news in Portuguese and another one with US news in English. Our results show that features related to the writing style were prominent in both data sets and, despite the language difference, some features have a universal behavior, being significant to both US and Brazilian news articles. Finally, we combined both data sets and used the universal features to build a machine learning classifier to predict the source type of a news article as reliable or unreliable. |
format | Article |
fullrecord | <record><control><sourceid>proquest</sourceid><recordid>TN_cdi_proquest_journals_2073833917</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2073833917</sourcerecordid><originalsourceid>FETCH-proquest_journals_20738339173</originalsourceid><addsrcrecordid>eNqNykELgjAYgOERBEn5Hz7obMx9mXYsUTp1Sc-yatJkbLZPKfr1BfUDOr2H95mwQCDGUbYWYsZCoo5zLjapSBIMWLmzUDx747wctLPgWqitV0bLs1FwVA-C3Egi3erLV2gLey9f2oC0V6huCurVacGmrTSkwl_nbFkWVX6Ieu_uo6Kh6dzo7Wc1gqeYIW7jFP9Tb0dIOis</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2073833917</pqid></control><display><type>article</type><title>An Exploration of Unreliable News Classification in Brazil and The U.S</title><source>Free E- Journals</source><creator>Gruppi, Mauricio ; Horne, Benjamin D ; Adali, Sibel</creator><creatorcontrib>Gruppi, Mauricio ; Horne, Benjamin D ; Adali, Sibel</creatorcontrib><description>The propagation of unreliable information is on the rise in many places around the world. This expansion is facilitated by the rapid spread of information and anonymity granted by the Internet. The spread of unreliable information is a wellstudied issue and it is associated with negative social impacts. In a previous work, we have identified significant differences in the structure of news articles from reliable and unreliable sources in the US media. Our goal in this work was to explore such differences in the Brazilian media. We found significant features in two data sets: one with Brazilian news in Portuguese and another one with US news in English. Our results show that features related to the writing style were prominent in both data sets and, despite the language difference, some features have a universal behavior, being significant to both US and Brazilian news articles. Finally, we combined both data sets and used the universal features to build a machine learning classifier to predict the source type of a news article as reliable or unreliable.</description><identifier>EISSN: 2331-8422</identifier><language>eng</language><publisher>Ithaca: Cornell University Library, arXiv.org</publisher><subject>Datasets ; Information dissemination ; Machine learning ; News ; Privacy</subject><ispartof>arXiv.org, 2018-06</ispartof><rights>2018. This work is published under http://arxiv.org/licenses/nonexclusive-distrib/1.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>777,781</link.rule.ids></links><search><creatorcontrib>Gruppi, Mauricio</creatorcontrib><creatorcontrib>Horne, Benjamin D</creatorcontrib><creatorcontrib>Adali, Sibel</creatorcontrib><title>An Exploration of Unreliable News Classification in Brazil and The U.S</title><title>arXiv.org</title><description>The propagation of unreliable information is on the rise in many places around the world. This expansion is facilitated by the rapid spread of information and anonymity granted by the Internet. The spread of unreliable information is a wellstudied issue and it is associated with negative social impacts. In a previous work, we have identified significant differences in the structure of news articles from reliable and unreliable sources in the US media. Our goal in this work was to explore such differences in the Brazilian media. We found significant features in two data sets: one with Brazilian news in Portuguese and another one with US news in English. Our results show that features related to the writing style were prominent in both data sets and, despite the language difference, some features have a universal behavior, being significant to both US and Brazilian news articles. Finally, we combined both data sets and used the universal features to build a machine learning classifier to predict the source type of a news article as reliable or unreliable.</description><subject>Datasets</subject><subject>Information dissemination</subject><subject>Machine learning</subject><subject>News</subject><subject>Privacy</subject><issn>2331-8422</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2018</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><recordid>eNqNykELgjAYgOERBEn5Hz7obMx9mXYsUTp1Sc-yatJkbLZPKfr1BfUDOr2H95mwQCDGUbYWYsZCoo5zLjapSBIMWLmzUDx747wctLPgWqitV0bLs1FwVA-C3Egi3erLV2gLey9f2oC0V6huCurVacGmrTSkwl_nbFkWVX6Ieu_uo6Kh6dzo7Wc1gqeYIW7jFP9Tb0dIOis</recordid><startdate>20180607</startdate><enddate>20180607</enddate><creator>Gruppi, Mauricio</creator><creator>Horne, Benjamin D</creator><creator>Adali, Sibel</creator><general>Cornell University Library, arXiv.org</general><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>HCIFZ</scope><scope>L6V</scope><scope>M7S</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope></search><sort><creationdate>20180607</creationdate><title>An Exploration of Unreliable News Classification in Brazil and The U.S</title><author>Gruppi, Mauricio ; Horne, Benjamin D ; Adali, Sibel</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-proquest_journals_20738339173</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2018</creationdate><topic>Datasets</topic><topic>Information dissemination</topic><topic>Machine learning</topic><topic>News</topic><topic>Privacy</topic><toplevel>online_resources</toplevel><creatorcontrib>Gruppi, Mauricio</creatorcontrib><creatorcontrib>Horne, Benjamin D</creatorcontrib><creatorcontrib>Adali, Sibel</creatorcontrib><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science & Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Gruppi, Mauricio</au><au>Horne, Benjamin D</au><au>Adali, Sibel</au><format>book</format><genre>document</genre><ristype>GEN</ristype><atitle>An Exploration of Unreliable News Classification in Brazil and The U.S</atitle><jtitle>arXiv.org</jtitle><date>2018-06-07</date><risdate>2018</risdate><eissn>2331-8422</eissn><abstract>The propagation of unreliable information is on the rise in many places around the world. This expansion is facilitated by the rapid spread of information and anonymity granted by the Internet. The spread of unreliable information is a wellstudied issue and it is associated with negative social impacts. In a previous work, we have identified significant differences in the structure of news articles from reliable and unreliable sources in the US media. Our goal in this work was to explore such differences in the Brazilian media. We found significant features in two data sets: one with Brazilian news in Portuguese and another one with US news in English. Our results show that features related to the writing style were prominent in both data sets and, despite the language difference, some features have a universal behavior, being significant to both US and Brazilian news articles. Finally, we combined both data sets and used the universal features to build a machine learning classifier to predict the source type of a news article as reliable or unreliable.</abstract><cop>Ithaca</cop><pub>Cornell University Library, arXiv.org</pub><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | EISSN: 2331-8422 |
ispartof | arXiv.org, 2018-06 |
issn | 2331-8422 |
language | eng |
recordid | cdi_proquest_journals_2073833917 |
source | Free E- Journals |
subjects | Datasets Information dissemination Machine learning News Privacy |
title | An Exploration of Unreliable News Classification in Brazil and The U.S |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-18T00%3A38%3A33IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=document&rft.atitle=An%20Exploration%20of%20Unreliable%20News%20Classification%20in%20Brazil%20and%20The%20U.S&rft.jtitle=arXiv.org&rft.au=Gruppi,%20Mauricio&rft.date=2018-06-07&rft.eissn=2331-8422&rft_id=info:doi/&rft_dat=%3Cproquest%3E2073833917%3C/proquest%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2073833917&rft_id=info:pmid/&rfr_iscdi=true |