Health Misinformation Detection in the Social Web: An Overview and a Data Science Approach

The increasing availability of online content these days raises several questions about effective access to information. In particular, the possibility for almost everyone to generate content with no traditional intermediary, if on the one hand led to a process of "information democratization&q...

Detailed description

Bibliographic details
Published in:	International journal of environmental research and public health, 2022-02, Vol.19 (4), p.2173
Main authors:	Di Sotto, Stefano; Viviani, Marco
Format:	Article
Language:	eng
Subjects:	Access to information; Automation; Communication; COVID-19; Credibility; Data Science; False information; Health Literacy; Humans; Information dissemination; Information processing; Learning algorithms; Machine Learning; Social Media; Social networks; User generated content; Websites
Online access:	Full text

container_end_page
container_issue	4
container_start_page	2173
container_title	International journal of environmental research and public health
container_volume	19
creator	Di Sotto, Stefano; Viviani, Marco
description	The increasing availability of online content these days raises several questions about effective access to information. In particular, the possibility for almost everyone to generate content with no traditional intermediary, if on the one hand led to a process of "information democratization", on the other hand, has negatively affected the genuineness of the information disseminated. This issue is particularly relevant when accessing health information, which impacts both the individual and societal level. Often, laypersons do not have sufficient health literacy when faced with the decision to rely or not rely on this information, and expert users cannot cope with such a large amount of content. For these reasons, there is a need to develop automated solutions that can assist both experts and non-experts in discerning between genuine and non-genuine health information. To make a contribution in this area, in this paper we proceed to the study and analysis of distinct groups of features and machine learning techniques that can be effective to assess misinformation in online health-related content, whether in the form of Web pages or social media content. To this aim, and for evaluation purposes, we consider several publicly available datasets that have only recently been generated for the assessment of health misinformation under different perspectives.
doi_str_mv	10.3390/ijerph19042173
format	Article
fullrecord	<record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_8872515</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2633896153</sourcerecordid><originalsourceid>FETCH-LOGICAL-c418t-332b1cc49bbb759489e992442718ec849e657ce8ff94724b4e0044c244a500bb3</originalsourceid><addsrcrecordid>eNpdkc1PGzEQxa0KVL567RFZ4tJLqL_Wa3NAikIplYI40KpSL5ZtZllHG3uxN6n637MhFAGnedL85mmeHkKfKTnlXJOvYQG5b6kmgtGaf0D7VEoyEZLQnVd6Dx2UsiCEKyH1R7THK0Ykr_Q--nMFthtafB1KiE3KSzuEFPEFDOCfVIh4aAHfJh9sh3-DO8PTiG_WkNcB_mIb77DFF3aw-NYHiB7wtO9zsr49QruN7Qp8ep6H6Nflt5-zq8n85vuP2XQ-8YKqYcI5c9R7oZ1zdaWF0qA1E4LVVIFXQoOsag-qabSomXACCBHCj4StCHGOH6LzrW-_cku48xCHbDvT57C0-Z9JNpi3mxhac5_WRqmaVbQaDb48G-T0sIIymGUoHrrORkirYpjkXGk5kiN68g5dpFWOY7wNxWqharmhTreUz6mUDM3LM5SYTW3mbW3jwfHrCC_4_574Ixmik0g</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2632748763</pqid></control><display><type>article</type><title>Health Misinformation Detection in the Social Web: An Overview and a Data Science Approach</title><source>MDPI - Multidisciplinary Digital Publishing Institute</source><source>MEDLINE</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>PubMed Central</source><source>Free Full-Text Journals in Chemistry</source><source>PubMed Central Open Access</source><creator>Di Sotto, Stefano ; Viviani, Marco</creator><creatorcontrib>Di Sotto, Stefano ; Viviani, Marco</creatorcontrib><description>The increasing availability of online content these days raises several questions about effective access to information. In particular, the possibility for almost everyone to generate content with no traditional intermediary, if on the one hand led to a process of "information democratization", on the other hand, has negatively affected the genuineness of the information disseminated. 
This issue is particularly relevant when accessing health information, which impacts both the individual and societal level. Often, laypersons do not have sufficient health literacy when faced with the decision to rely or not rely on this information, and expert users cannot cope with such a large amount of content. For these reasons, there is a need to develop automated solutions that can assist both experts and non-experts in discerning between genuine and non-genuine health information. To make a contribution in this area, in this paper we proceed to the study and analysis of distinct groups of features and machine learning techniques that can be effective to assess misinformation in online health-related content, whether in the form of Web pages or social media content. To this aim, and for evaluation purposes, we consider several publicly available datasets that have only recently been generated for the assessment of health misinformation under different perspectives.</description><identifier>ISSN: 1660-4601</identifier><identifier>ISSN: 1661-7827</identifier><identifier>EISSN: 1660-4601</identifier><identifier>DOI: 10.3390/ijerph19042173</identifier><identifier>PMID: 35206359</identifier><language>eng</language><publisher>Switzerland: MDPI AG</publisher><subject>Access to information ; Automation ; Communication ; COVID-19 ; Credibility ; Data Science ; False information ; Health Literacy ; Humans ; Information dissemination ; Information processing ; Learning algorithms ; Machine Learning ; Social Media ; Social networks ; User generated content ; Websites</subject><ispartof>International journal of environmental research and public health, 2022-02, Vol.19 (4), p.2173</ispartof><rights>2022 by the authors. Licensee MDPI, Basel, Switzerland. This article is an open access article distributed under the terms and conditions of the Creative Commons Attribution (CC BY) license (https://creativecommons.org/licenses/by/4.0/). 
Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>2022 by the authors. 2022</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c418t-332b1cc49bbb759489e992442718ec849e657ce8ff94724b4e0044c244a500bb3</citedby><cites>FETCH-LOGICAL-c418t-332b1cc49bbb759489e992442718ec849e657ce8ff94724b4e0044c244a500bb3</cites><orcidid>0000-0002-2274-9050</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC8872515/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC8872515/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,881,27901,27902,53766,53768</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/35206359$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Di Sotto, Stefano</creatorcontrib><creatorcontrib>Viviani, Marco</creatorcontrib><title>Health Misinformation Detection in the Social Web: An Overview and a Data Science Approach</title><title>International journal of environmental research and public health</title><addtitle>Int J Environ Res Public Health</addtitle><description>The increasing availability of online content these days raises several questions about effective access to information. In particular, the possibility for almost everyone to generate content with no traditional intermediary, if on the one hand led to a process of "information democratization", on the other hand, has negatively affected the genuineness of the information disseminated. 
This issue is particularly relevant when accessing health information, which impacts both the individual and societal level. Often, laypersons do not have sufficient health literacy when faced with the decision to rely or not rely on this information, and expert users cannot cope with such a large amount of content. For these reasons, there is a need to develop automated solutions that can assist both experts and non-experts in discerning between genuine and non-genuine health information. To make a contribution in this area, in this paper we proceed to the study and analysis of distinct groups of features and machine learning techniques that can be effective to assess misinformation in online health-related content, whether in the form of Web pages or social media content. To this aim, and for evaluation purposes, we consider several publicly available datasets that have only recently been generated for the assessment of health misinformation under different perspectives.</description><subject>Access to information</subject><subject>Automation</subject><subject>Communication</subject><subject>COVID-19</subject><subject>Credibility</subject><subject>Data Science</subject><subject>False information</subject><subject>Health Literacy</subject><subject>Humans</subject><subject>Information dissemination</subject><subject>Information processing</subject><subject>Learning algorithms</subject><subject>Machine Learning</subject><subject>Social Media</subject><subject>Social networks</subject><subject>User generated 
content</subject><subject>Websites</subject><issn>1660-4601</issn><issn>1661-7827</issn><issn>1660-4601</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>BENPR</sourceid><recordid>eNpdkc1PGzEQxa0KVL567RFZ4tJLqL_Wa3NAikIplYI40KpSL5ZtZllHG3uxN6n637MhFAGnedL85mmeHkKfKTnlXJOvYQG5b6kmgtGaf0D7VEoyEZLQnVd6Dx2UsiCEKyH1R7THK0Ykr_Q--nMFthtafB1KiE3KSzuEFPEFDOCfVIh4aAHfJh9sh3-DO8PTiG_WkNcB_mIb77DFF3aw-NYHiB7wtO9zsr49QruN7Qp8ep6H6Nflt5-zq8n85vuP2XQ-8YKqYcI5c9R7oZ1zdaWF0qA1E4LVVIFXQoOsag-qabSomXACCBHCj4StCHGOH6LzrW-_cku48xCHbDvT57C0-Z9JNpi3mxhac5_WRqmaVbQaDb48G-T0sIIymGUoHrrORkirYpjkXGk5kiN68g5dpFWOY7wNxWqharmhTreUz6mUDM3LM5SYTW3mbW3jwfHrCC_4_574Ixmik0g</recordid><startdate>20220215</startdate><enddate>20220215</enddate><creator>Di Sotto, Stefano</creator><creator>Viviani, Marco</creator><general>MDPI AG</general><general>MDPI</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8C1</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>CCPQU</scope><scope>COVID</scope><scope>DWQXO</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>K9.</scope><scope>M0S</scope><scope>M1P</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0002-2274-9050</orcidid></search><sort><creationdate>20220215</creationdate><title>Health Misinformation Detection in the Social Web: An Overview and a Data Science Approach</title><author>Di Sotto, Stefano ; Viviani, 
Marco</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c418t-332b1cc49bbb759489e992442718ec849e657ce8ff94724b4e0044c244a500bb3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Access to information</topic><topic>Automation</topic><topic>Communication</topic><topic>COVID-19</topic><topic>Credibility</topic><topic>Data Science</topic><topic>False information</topic><topic>Health Literacy</topic><topic>Humans</topic><topic>Information dissemination</topic><topic>Information processing</topic><topic>Learning algorithms</topic><topic>Machine Learning</topic><topic>Social Media</topic><topic>Social networks</topic><topic>User generated content</topic><topic>Websites</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Di Sotto, Stefano</creatorcontrib><creatorcontrib>Viviani, Marco</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Health & Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Public Health Database</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>ProQuest One Community College</collection><collection>Coronavirus Research Database</collection><collection>ProQuest 
Central Korea</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>International journal of environmental research and public health</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Di Sotto, Stefano</au><au>Viviani, Marco</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Health Misinformation Detection in the Social Web: An Overview and a Data Science Approach</atitle><jtitle>International journal of environmental research and public health</jtitle><addtitle>Int J Environ Res Public Health</addtitle><date>2022-02-15</date><risdate>2022</risdate><volume>19</volume><issue>4</issue><spage>2173</spage><pages>2173-</pages><issn>1660-4601</issn><issn>1661-7827</issn><eissn>1660-4601</eissn><abstract>The increasing availability of online content these days raises several questions about effective access to information. In particular, the possibility for almost everyone to generate content with no traditional intermediary, if on the one hand led to a process of "information democratization", on the other hand, has negatively affected the genuineness of the information disseminated. This issue is particularly relevant when accessing health information, which impacts both the individual and societal level. 
Often, laypersons do not have sufficient health literacy when faced with the decision to rely or not rely on this information, and expert users cannot cope with such a large amount of content. For these reasons, there is a need to develop automated solutions that can assist both experts and non-experts in discerning between genuine and non-genuine health information. To make a contribution in this area, in this paper we proceed to the study and analysis of distinct groups of features and machine learning techniques that can be effective to assess misinformation in online health-related content, whether in the form of Web pages or social media content. To this aim, and for evaluation purposes, we consider several publicly available datasets that have only recently been generated for the assessment of health misinformation under different perspectives.</abstract><cop>Switzerland</cop><pub>MDPI AG</pub><pmid>35206359</pmid><doi>10.3390/ijerph19042173</doi><orcidid>https://orcid.org/0000-0002-2274-9050</orcidid><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 1660-4601
ispartof	International journal of environmental research and public health, 2022-02, Vol.19 (4), p.2173
issn	1660-4601 1661-7827 1660-4601
language	eng
recordid	cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_8872515
source	MDPI - Multidisciplinary Digital Publishing Institute; MEDLINE; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; PubMed Central; Free Full-Text Journals in Chemistry; PubMed Central Open Access
subjects	Access to information; Automation; Communication; COVID-19; Credibility; Data Science; False information; Health Literacy; Humans; Information dissemination; Information processing; Learning algorithms; Machine Learning; Social Media; Social networks; User generated content; Websites
title	Health Misinformation Detection in the Social Web: An Overview and a Data Science Approach
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-11T02%3A16%3A10IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Health%20Misinformation%20Detection%20in%20the%20Social%20Web:%20An%20Overview%20and%20a%20Data%20Science%20Approach&rft.jtitle=International%20journal%20of%20environmental%20research%20and%20public%20health&rft.au=Di%20Sotto,%20Stefano&rft.date=2022-02-15&rft.volume=19&rft.issue=4&rft.spage=2173&rft.pages=2173-&rft.issn=1660-4601&rft.eissn=1660-4601&rft_id=info:doi/10.3390/ijerph19042173&rft_dat=%3Cproquest_pubme%3E2633896153%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2632748763&rft_id=info:pmid/35206359&rfr_iscdi=true
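
The url field above is an OpenURL link (its ctx_ver parameter reads Z39.88-2004) that carries the citation as percent-encoded key/value pairs such as rft.au, rft.date and rft_id. As a minimal sketch of how those pairs could be read back out, the following uses only Python's standard library. The function name parse_openurl and the shortened example link are illustrative assumptions, not part of the catalog record or of any resolver's API.

```python
from urllib.parse import urlsplit, parse_qs


def parse_openurl(url: str) -> dict:
    """Return the query parameters of an OpenURL as a dict of value lists.

    Hypothetical helper for illustration only.
    """
    query = urlsplit(url).query
    # parse_qs percent-decodes each value and groups repeated keys
    # (for example rft_id) into one list, preserving their order.
    return parse_qs(query)


# Shortened copy of the link above, keeping only a few of its parameters.
example = (
    "https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004"
    "&rft.genre=article"
    "&rft.au=Di%20Sotto,%20Stefano"
    "&rft.date=2022-02-15&rft.volume=19&rft.issue=4&rft.spage=2173"
    "&rft_id=info:doi/10.3390/ijerph19042173"
    "&rft_id=info:pmid/35206359"
)

fields = parse_openurl(example)
print(fields["rft.au"])   # author, percent-decoded
print(fields["rft_id"])   # DOI and PMID identifiers, in link order
```

Every value comes back as a list because a key may legitimately repeat, as rft_id does here for the DOI and the PMID.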