A literature survey on multimodal and multilingual automatic hate speech identification

Social media is a more common and powerful platform for communication to share views about any topic or article, which consequently leads to unstructured toxic, and hateful conversations. Curbing hate speeches has emerged as a critical challenge globally. In this regard, Social media platforms are u...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Multimedia systems 2023-06, Vol.29 (3), p.1203-1230
Hauptverfasser: Chhabra, Anusha, Vishwakarma, Dinesh Kumar
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 1230
container_issue 3
container_start_page 1203
container_title Multimedia systems
container_volume 29
creator Chhabra, Anusha
Vishwakarma, Dinesh Kumar
description Social media is a more common and powerful platform for communication to share views about any topic or article, which consequently leads to unstructured toxic, and hateful conversations. Curbing hate speeches has emerged as a critical challenge globally. In this regard, Social media platforms are using modern statistical tools of AI technologies to process and eliminate toxic data to minimize hate crimes globally. Demanding the dire need, machine and deep learning-based techniques are getting more attention in analyzing these kinds of data. This survey presents a comprehensive analysis of hate speech definitions along with the motivation for detection and standard textual analysis methods that play a crucial role in identifying hate speech. State-of-the-art hate speech identification methods are also discussed, highlighting handcrafted feature-based and deep learning-based algorithms by considering multimodal and multilingual inputs and stating the pros and cons of each. Survey also presents popular benchmark datasets of hate speech/offensive language detection specifying their challenges, the methods for achieving top classification scores, and dataset characteristics such as the number of samples, modalities, language(s), number of classes, etc. Additionally, performance metrics are described, and classification scores of popular hate speech methods are mentioned. The conclusion and future research directions are presented at the end of the survey. Compared with earlier surveys, this paper gives a better presentation of multimodal and multilingual hate speech detection through well-organized comparisons, challenges, and the latest evaluation techniques, along with their best performances.
doi_str_mv 10.1007/s00530-023-01051-8
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2821009397</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2821009397</sourcerecordid><originalsourceid>FETCH-LOGICAL-c319t-fd89aa6b8bcd45ad9183e98107a7270babe163ffd8bc0fe62867fb89f20256843</originalsourceid><addsrcrecordid>eNp9kE1LAzEQhoMoWKt_wFPAc3SS7G6SYyl-QcGL4jFkd5M2ZT9qkhX6742u4M3TMDPPOwMPQtcUbimAuIsAJQcCjBOgUFIiT9CCFpwRKiU7RQtQBSOFqtg5uohxD0BFxWGB3le488kGk6ZgcZzCpz3iccD91CXfj63psBnaue38sJ2-B1Mae5N8g3cm5dDB2maHfWuH5J1v8mYcLtGZM120V791id4e7l_XT2Tz8vi8Xm1Iw6lKxLVSGVPVsm7aojStopJbJSkII5iA2tSWVtxlrG7A2YrJSrhaKseAlZUs-BLdzHcPYfyYbEx6P05hyC81kyy7UVyJTLGZasIYY7BOH4LvTThqCvpboJ4F6ixQ_wjUMof4HIoZHrY2_J3-J_UFE-F0sg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2821009397</pqid></control><display><type>article</type><title>A literature survey on multimodal and multilingual automatic hate speech identification</title><source>Springer Nature - Complete Springer Journals</source><creator>Chhabra, Anusha ; Vishwakarma, Dinesh Kumar</creator><creatorcontrib>Chhabra, Anusha ; Vishwakarma, Dinesh Kumar</creatorcontrib><description>Social media is a more common and powerful platform for communication to share views about any topic or article, which consequently leads to unstructured toxic, and hateful conversations. Curbing hate speeches has emerged as a critical challenge globally. In this regard, Social media platforms are using modern statistical tools of AI technologies to process and eliminate toxic data to minimize hate crimes globally. Demanding the dire need, machine and deep learning-based techniques are getting more attention in analyzing these kinds of data. This survey presents a comprehensive analysis of hate speech definitions along with the motivation for detection and standard textual analysis methods that play a crucial role in identifying hate speech. State-of-the-art hate speech identification methods are also discussed, highlighting handcrafted feature-based and deep learning-based algorithms by considering multimodal and multilingual inputs and stating the pros and cons of each. Survey also presents popular benchmark datasets of hate speech/offensive language detection specifying their challenges, the methods for achieving top classification scores, and dataset characteristics such as the number of samples, modalities, language(s), number of classes, etc. Additionally, performance metrics are described, and classification scores of popular hate speech methods are mentioned. The conclusion and future research directions are presented at the end of the survey. Compared with earlier surveys, this paper gives a better presentation of multimodal and multilingual hate speech detection through well-organized comparisons, challenges, and the latest evaluation techniques, along with their best performances.</description><identifier>ISSN: 0942-4962</identifier><identifier>EISSN: 1432-1882</identifier><identifier>DOI: 10.1007/s00530-023-01051-8</identifier><language>eng</language><publisher>Berlin/Heidelberg: Springer Berlin Heidelberg</publisher><subject>Algorithms ; Classification ; Computer Communication Networks ; Computer Graphics ; Computer Science ; Cryptology ; Data Storage Representation ; Datasets ; Deep learning ; Digital media ; Hate speech ; Identification methods ; Literature reviews ; Machine learning ; Multilingualism ; Multimedia Information Systems ; Operating Systems ; Performance measurement ; Regular Paper ; Social networks</subject><ispartof>Multimedia systems, 2023-06, Vol.29 (3), p.1203-1230</ispartof><rights>The Author(s), under exclusive licence to Springer-Verlag GmbH Germany, part of Springer Nature 2023. Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c319t-fd89aa6b8bcd45ad9183e98107a7270babe163ffd8bc0fe62867fb89f20256843</citedby><cites>FETCH-LOGICAL-c319t-fd89aa6b8bcd45ad9183e98107a7270babe163ffd8bc0fe62867fb89f20256843</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://link.springer.com/content/pdf/10.1007/s00530-023-01051-8$$EPDF$$P50$$Gspringer$$H</linktopdf><linktohtml>$$Uhttps://link.springer.com/10.1007/s00530-023-01051-8$$EHTML$$P50$$Gspringer$$H</linktohtml><link.rule.ids>314,776,780,27901,27902,41464,42533,51294</link.rule.ids></links><search><creatorcontrib>Chhabra, Anusha</creatorcontrib><creatorcontrib>Vishwakarma, Dinesh Kumar</creatorcontrib><title>A literature survey on multimodal and multilingual automatic hate speech identification</title><title>Multimedia systems</title><addtitle>Multimedia Systems</addtitle><description>Social media is a more common and powerful platform for communication to share views about any topic or article, which consequently leads to unstructured toxic, and hateful conversations. Curbing hate speeches has emerged as a critical challenge globally. In this regard, Social media platforms are using modern statistical tools of AI technologies to process and eliminate toxic data to minimize hate crimes globally. Demanding the dire need, machine and deep learning-based techniques are getting more attention in analyzing these kinds of data. This survey presents a comprehensive analysis of hate speech definitions along with the motivation for detection and standard textual analysis methods that play a crucial role in identifying hate speech. State-of-the-art hate speech identification methods are also discussed, highlighting handcrafted feature-based and deep learning-based algorithms by considering multimodal and multilingual inputs and stating the pros and cons of each. Survey also presents popular benchmark datasets of hate speech/offensive language detection specifying their challenges, the methods for achieving top classification scores, and dataset characteristics such as the number of samples, modalities, language(s), number of classes, etc. Additionally, performance metrics are described, and classification scores of popular hate speech methods are mentioned. The conclusion and future research directions are presented at the end of the survey. Compared with earlier surveys, this paper gives a better presentation of multimodal and multilingual hate speech detection through well-organized comparisons, challenges, and the latest evaluation techniques, along with their best performances.</description><subject>Algorithms</subject><subject>Classification</subject><subject>Computer Communication Networks</subject><subject>Computer Graphics</subject><subject>Computer Science</subject><subject>Cryptology</subject><subject>Data Storage Representation</subject><subject>Datasets</subject><subject>Deep learning</subject><subject>Digital media</subject><subject>Hate speech</subject><subject>Identification methods</subject><subject>Literature reviews</subject><subject>Machine learning</subject><subject>Multilingualism</subject><subject>Multimedia Information Systems</subject><subject>Operating Systems</subject><subject>Performance measurement</subject><subject>Regular Paper</subject><subject>Social networks</subject><issn>0942-4962</issn><issn>1432-1882</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNp9kE1LAzEQhoMoWKt_wFPAc3SS7G6SYyl-QcGL4jFkd5M2ZT9qkhX6742u4M3TMDPPOwMPQtcUbimAuIsAJQcCjBOgUFIiT9CCFpwRKiU7RQtQBSOFqtg5uohxD0BFxWGB3le488kGk6ZgcZzCpz3iccD91CXfj63psBnaue38sJ2-B1Mae5N8g3cm5dDB2maHfWuH5J1v8mYcLtGZM120V791id4e7l_XT2Tz8vi8Xm1Iw6lKxLVSGVPVsm7aojStopJbJSkII5iA2tSWVtxlrG7A2YrJSrhaKseAlZUs-BLdzHcPYfyYbEx6P05hyC81kyy7UVyJTLGZasIYY7BOH4LvTThqCvpboJ4F6ixQ_wjUMof4HIoZHrY2_J3-J_UFE-F0sg</recordid><startdate>20230601</startdate><enddate>20230601</enddate><creator>Chhabra, Anusha</creator><creator>Vishwakarma, Dinesh Kumar</creator><general>Springer Berlin Heidelberg</general><general>Springer Nature B.V</general><scope>AAYXX</scope><scope>CITATION</scope></search><sort><creationdate>20230601</creationdate><title>A literature survey on multimodal and multilingual automatic hate speech identification</title><author>Chhabra, Anusha ; Vishwakarma, Dinesh Kumar</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c319t-fd89aa6b8bcd45ad9183e98107a7270babe163ffd8bc0fe62867fb89f20256843</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>Algorithms</topic><topic>Classification</topic><topic>Computer Communication Networks</topic><topic>Computer Graphics</topic><topic>Computer Science</topic><topic>Cryptology</topic><topic>Data Storage Representation</topic><topic>Datasets</topic><topic>Deep learning</topic><topic>Digital media</topic><topic>Hate speech</topic><topic>Identification methods</topic><topic>Literature reviews</topic><topic>Machine learning</topic><topic>Multilingualism</topic><topic>Multimedia Information Systems</topic><topic>Operating Systems</topic><topic>Performance measurement</topic><topic>Regular Paper</topic><topic>Social networks</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Chhabra, Anusha</creatorcontrib><creatorcontrib>Vishwakarma, Dinesh Kumar</creatorcontrib><collection>CrossRef</collection><jtitle>Multimedia systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Chhabra, Anusha</au><au>Vishwakarma, Dinesh Kumar</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A literature survey on multimodal and multilingual automatic hate speech identification</atitle><jtitle>Multimedia systems</jtitle><stitle>Multimedia Systems</stitle><date>2023-06-01</date><risdate>2023</risdate><volume>29</volume><issue>3</issue><spage>1203</spage><epage>1230</epage><pages>1203-1230</pages><issn>0942-4962</issn><eissn>1432-1882</eissn><abstract>Social media is a more common and powerful platform for communication to share views about any topic or article, which consequently leads to unstructured toxic, and hateful conversations. Curbing hate speeches has emerged as a critical challenge globally. In this regard, Social media platforms are using modern statistical tools of AI technologies to process and eliminate toxic data to minimize hate crimes globally. Demanding the dire need, machine and deep learning-based techniques are getting more attention in analyzing these kinds of data. This survey presents a comprehensive analysis of hate speech definitions along with the motivation for detection and standard textual analysis methods that play a crucial role in identifying hate speech. State-of-the-art hate speech identification methods are also discussed, highlighting handcrafted feature-based and deep learning-based algorithms by considering multimodal and multilingual inputs and stating the pros and cons of each. Survey also presents popular benchmark datasets of hate speech/offensive language detection specifying their challenges, the methods for achieving top classification scores, and dataset characteristics such as the number of samples, modalities, language(s), number of classes, etc. Additionally, performance metrics are described, and classification scores of popular hate speech methods are mentioned. The conclusion and future research directions are presented at the end of the survey. Compared with earlier surveys, this paper gives a better presentation of multimodal and multilingual hate speech detection through well-organized comparisons, challenges, and the latest evaluation techniques, along with their best performances.</abstract><cop>Berlin/Heidelberg</cop><pub>Springer Berlin Heidelberg</pub><doi>10.1007/s00530-023-01051-8</doi><tpages>28</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0942-4962
ispartof Multimedia systems, 2023-06, Vol.29 (3), p.1203-1230
issn 0942-4962
1432-1882
language eng
recordid cdi_proquest_journals_2821009397
source Springer Nature - Complete Springer Journals
subjects Algorithms
Classification
Computer Communication Networks
Computer Graphics
Computer Science
Cryptology
Data Storage Representation
Datasets
Deep learning
Digital media
Hate speech
Identification methods
Literature reviews
Machine learning
Multilingualism
Multimedia Information Systems
Operating Systems
Performance measurement
Regular Paper
Social networks
title A literature survey on multimodal and multilingual automatic hate speech identification
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-18T23%3A20%3A54IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20literature%20survey%20on%20multimodal%20and%20multilingual%20automatic%20hate%20speech%20identification&rft.jtitle=Multimedia%20systems&rft.au=Chhabra,%20Anusha&rft.date=2023-06-01&rft.volume=29&rft.issue=3&rft.spage=1203&rft.epage=1230&rft.pages=1203-1230&rft.issn=0942-4962&rft.eissn=1432-1882&rft_id=info:doi/10.1007/s00530-023-01051-8&rft_dat=%3Cproquest_cross%3E2821009397%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2821009397&rft_id=info:pmid/&rfr_iscdi=true