Cross-Language Fake News Detection
With increasing globalization, news from different countries, and even in different languages, has become readily available and has become a way for many people to learn about other cultures. As people around the world become more reliant on social media, the impact of fake news on public society al...
Gespeichert in:
Veröffentlicht in: | Data and information management 2021-01, Vol.5 (1), p.100-109 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 109 |
---|---|
container_issue | 1 |
container_start_page | 100 |
container_title | Data and information management |
container_volume | 5 |
creator | Chu, Samuel Kai Wah Xie, Runbin Wang, Yanshu |
description | With increasing globalization, news from different countries, and even in different languages, has become readily available and has become a way for many people to learn about other cultures. As people around the world become more reliant on social media, the impact of fake news on public society also increases. However, most of the fake news detection research focuses only on English. In this work, we compared the difference between textual features of different languages (Chinese and English) and their effect on detecting fake news. We also explored the cross-language transmissibility of fake news detection models. We found that Chinese textual features in fake news are more complex compared with English textual features. Our results also illustrated that the bidirectional encoder representations from transformers (BERT) model outperformed other algorithms for within-language data sets. As for detection in cross-language data sets, our findings demonstrated that fake news monitoring across languages is potentially feasible, while models trained with data from a more inclusive language would perform better in cross-language detection. |
doi_str_mv | 10.2478/dim-2020-0025 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2467357946</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2467357946</sourcerecordid><originalsourceid>FETCH-LOGICAL-c2700-f3ef6cb32f836dd114d8f99e28a119d9772a7fb0204403bcf719095c7776fab43</originalsourceid><addsrcrecordid>eNptkD1PwzAQhi0EElXpyB7BbDh_xTEbChSQIlhgtpzEjlLapNiJqv57HAUJBqZ7h-feOz0IXRK4oVxmt3W7wxQoYAAqTtCCCs6wooKc_snnaBXCBiICCpgiC3SV-z4EXJiuGU1jk7X5tMmrPYTkwQ62Gtq-u0BnzmyDXf3MJfpYP77nz7h4e3rJ7wtcUQmAHbMurUpGXcbSuiaE15lTytLMEKJqJSU10pXxNOfAyspJokCJSkqZOlNytkTXc-_e91-jDYPe9KPv4klNeSqZkIqnkcIzVU2Pe-v03rc744-agJ5M6GhCTyb0ZCLydzN_MNvB-to2fjzG8Fv-754gBIB9A3l_YSo</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2467357946</pqid></control><display><type>article</type><title>Cross-Language Fake News Detection</title><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>ProQuest Central UK/Ireland</source><source>Alma/SFX Local Collection</source><creator>Chu, Samuel Kai Wah ; Xie, Runbin ; Wang, Yanshu</creator><creatorcontrib>Chu, Samuel Kai Wah ; Xie, Runbin ; Wang, Yanshu</creatorcontrib><description>With increasing globalization, news from different countries, and even in different languages, has become readily available and has become a way for many people to learn about other cultures. As people around the world become more reliant on social media, the impact of fake news on public society also increases. However, most of the fake news detection research focuses only on English. In this work, we compared the difference between textual features of different languages (Chinese and English) and their effect on detecting fake news. We also explored the cross-language transmissibility of fake news detection models. We found that Chinese textual features in fake news are more complex compared with English textual features. Our results also illustrated that the bidirectional encoder representations from transformers (BERT) model outperformed other algorithms for within-language data sets. As for detection in cross-language data sets, our findings demonstrated that fake news monitoring across languages is potentially feasible, while models trained with data from a more inclusive language would perform better in cross-language detection.</description><identifier>ISSN: 2543-9251</identifier><identifier>EISSN: 2543-9251</identifier><identifier>DOI: 10.2478/dim-2020-0025</identifier><language>eng</language><publisher>Warsaw: Sciendo</publisher><subject>Algorithms ; Coders ; cross-language study ; Datasets ; Digital media ; fake news detection ; Globalization ; information detection ; Language ; Languages ; News</subject><ispartof>Data and information management, 2021-01, Vol.5 (1), p.100-109</ispartof><rights>2021. This work is published under http://creativecommons.org/licenses/by-nc-nd/3.0 (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c2700-f3ef6cb32f836dd114d8f99e28a119d9772a7fb0204403bcf719095c7776fab43</citedby><cites>FETCH-LOGICAL-c2700-f3ef6cb32f836dd114d8f99e28a119d9772a7fb0204403bcf719095c7776fab43</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.proquest.com/docview/2467357946?pq-origsite=primo$$EHTML$$P50$$Gproquest$$Hfree_for_read</linktohtml><link.rule.ids>315,781,785,865,27929,27930,64390,64394,72474</link.rule.ids></links><search><creatorcontrib>Chu, Samuel Kai Wah</creatorcontrib><creatorcontrib>Xie, Runbin</creatorcontrib><creatorcontrib>Wang, Yanshu</creatorcontrib><title>Cross-Language Fake News Detection</title><title>Data and information management</title><description>With increasing globalization, news from different countries, and even in different languages, has become readily available and has become a way for many people to learn about other cultures. As people around the world become more reliant on social media, the impact of fake news on public society also increases. However, most of the fake news detection research focuses only on English. In this work, we compared the difference between textual features of different languages (Chinese and English) and their effect on detecting fake news. We also explored the cross-language transmissibility of fake news detection models. We found that Chinese textual features in fake news are more complex compared with English textual features. Our results also illustrated that the bidirectional encoder representations from transformers (BERT) model outperformed other algorithms for within-language data sets. As for detection in cross-language data sets, our findings demonstrated that fake news monitoring across languages is potentially feasible, while models trained with data from a more inclusive language would perform better in cross-language detection.</description><subject>Algorithms</subject><subject>Coders</subject><subject>cross-language study</subject><subject>Datasets</subject><subject>Digital media</subject><subject>fake news detection</subject><subject>Globalization</subject><subject>information detection</subject><subject>Language</subject><subject>Languages</subject><subject>News</subject><issn>2543-9251</issn><issn>2543-9251</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNptkD1PwzAQhi0EElXpyB7BbDh_xTEbChSQIlhgtpzEjlLapNiJqv57HAUJBqZ7h-feOz0IXRK4oVxmt3W7wxQoYAAqTtCCCs6wooKc_snnaBXCBiICCpgiC3SV-z4EXJiuGU1jk7X5tMmrPYTkwQ62Gtq-u0BnzmyDXf3MJfpYP77nz7h4e3rJ7wtcUQmAHbMurUpGXcbSuiaE15lTytLMEKJqJSU10pXxNOfAyspJokCJSkqZOlNytkTXc-_e91-jDYPe9KPv4klNeSqZkIqnkcIzVU2Pe-v03rc744-agJ5M6GhCTyb0ZCLydzN_MNvB-to2fjzG8Fv-754gBIB9A3l_YSo</recordid><startdate>20210101</startdate><enddate>20210101</enddate><creator>Chu, Samuel Kai Wah</creator><creator>Xie, Runbin</creator><creator>Wang, Yanshu</creator><general>Sciendo</general><general>Elsevier Limited</general><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7XB</scope><scope>8AL</scope><scope>8FE</scope><scope>8FG</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ALSLI</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>CNYFK</scope><scope>DWQXO</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>M0N</scope><scope>M1O</scope><scope>P5Z</scope><scope>P62</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope></search><sort><creationdate>20210101</creationdate><title>Cross-Language Fake News Detection</title><author>Chu, Samuel Kai Wah ; Xie, Runbin ; Wang, Yanshu</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c2700-f3ef6cb32f836dd114d8f99e28a119d9772a7fb0204403bcf719095c7776fab43</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Algorithms</topic><topic>Coders</topic><topic>cross-language study</topic><topic>Datasets</topic><topic>Digital media</topic><topic>fake news detection</topic><topic>Globalization</topic><topic>information detection</topic><topic>Language</topic><topic>Languages</topic><topic>News</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Chu, Samuel Kai Wah</creatorcontrib><creatorcontrib>Xie, Runbin</creatorcontrib><creatorcontrib>Wang, Yanshu</creatorcontrib><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Computing Database (Alumni Edition)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Social Science Premium Collection</collection><collection>Advanced Technologies & Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection (ProQuest)</collection><collection>ProQuest One Community College</collection><collection>Library & Information Science Collection</collection><collection>ProQuest Central Korea</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Computing Database</collection><collection>Library Science Database</collection><collection>Advanced Technologies & Aerospace Database</collection><collection>ProQuest Advanced Technologies & Aerospace Collection</collection><collection>Access via ProQuest (Open Access)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><jtitle>Data and information management</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Chu, Samuel Kai Wah</au><au>Xie, Runbin</au><au>Wang, Yanshu</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Cross-Language Fake News Detection</atitle><jtitle>Data and information management</jtitle><date>2021-01-01</date><risdate>2021</risdate><volume>5</volume><issue>1</issue><spage>100</spage><epage>109</epage><pages>100-109</pages><issn>2543-9251</issn><eissn>2543-9251</eissn><abstract>With increasing globalization, news from different countries, and even in different languages, has become readily available and has become a way for many people to learn about other cultures. As people around the world become more reliant on social media, the impact of fake news on public society also increases. However, most of the fake news detection research focuses only on English. In this work, we compared the difference between textual features of different languages (Chinese and English) and their effect on detecting fake news. We also explored the cross-language transmissibility of fake news detection models. We found that Chinese textual features in fake news are more complex compared with English textual features. Our results also illustrated that the bidirectional encoder representations from transformers (BERT) model outperformed other algorithms for within-language data sets. As for detection in cross-language data sets, our findings demonstrated that fake news monitoring across languages is potentially feasible, while models trained with data from a more inclusive language would perform better in cross-language detection.</abstract><cop>Warsaw</cop><pub>Sciendo</pub><doi>10.2478/dim-2020-0025</doi><tpages>10</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2543-9251 |
ispartof | Data and information management, 2021-01, Vol.5 (1), p.100-109 |
issn | 2543-9251 2543-9251 |
language | eng |
recordid | cdi_proquest_journals_2467357946 |
source | DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; ProQuest Central UK/Ireland; Alma/SFX Local Collection |
subjects | Algorithms Coders cross-language study Datasets Digital media fake news detection Globalization information detection Language Languages News |
title | Cross-Language Fake News Detection |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-13T03%3A05%3A48IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Cross-Language%20Fake%20News%20Detection&rft.jtitle=Data%20and%20information%20management&rft.au=Chu,%20Samuel%20Kai%20Wah&rft.date=2021-01-01&rft.volume=5&rft.issue=1&rft.spage=100&rft.epage=109&rft.pages=100-109&rft.issn=2543-9251&rft.eissn=2543-9251&rft_id=info:doi/10.2478/dim-2020-0025&rft_dat=%3Cproquest_cross%3E2467357946%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2467357946&rft_id=info:pmid/&rfr_iscdi=true |