Syntax and prejudice: ethically-charged biases of a syntax-based hate speech recognizer unveiled

Hate speech recognizers (HSRs) can be the panacea for containing hate in social media or can result in the biggest form of prejudice-based censorship hindering people to express their true selves. In this paper, we hypothesized how massive use of syntax can reduce the prejudice effect in HSRs. To ex...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	PeerJ. Computer science 2022-02, Vol.8, p.e859-e859, Article e859
Hauptverfasser:	Mastromattei, Michele, Ranaldi, Leonardo, Fallucchi, Francesca, Zanzotto, Fabio Massimo
Format:	Artikel
Sprache:	eng
Schlagworte:	Artificial Intelligence Bias Censorship Data Mining and Machine Learning Datasets Ethics Explainability Hate speech Natural Language and Speech Neural networks Social networks Syntax
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Hate speech recognizers (HSRs) can be the panacea for containing hate in social media or can result in the biggest form of prejudice-based censorship hindering people to express their true selves. In this paper, we hypothesized how massive use of syntax can reduce the prejudice effect in HSRs. To explore this hypothesis, we propose Unintended-bias Visualizer based on Kermit modeling ( ): a syntax-based HSR, which is endowed with syntax heat parse trees used as a post-hoc explanation of classifications. KERM-HATE significantly outperforms BERT-based, RoBERTa-based and XLNet-based HSR on standard datasets. Surprisingly this result is not sufficient. In fact, the post-hoc analysis on novel datasets on recent divisive topics shows that even KERM-HATE carries the prejudice distilled from the initial corpus. Therefore, although tests on standard datasets may show higher performance, syntax alone cannot drive the "attention" of HSRs to ethically-unbiased features.
ISSN:	2376-5992 2376-5992
DOI:	10.7717/peerj-cs.859