SGHateCheck: Functional Tests for Detecting Hate Speech in Low-Resource Languages of Singapore
To address the limitations of current hate speech detection models, we introduce \textsf{SGHateCheck}, a novel framework designed for the linguistic and cultural context of Singapore and Southeast Asia. It extends the functional testing approach of HateCheck and MHC, employing large language models...
Gespeichert in:
Hauptverfasser: | , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | To address the limitations of current hate speech detection models, we
introduce \textsf{SGHateCheck}, a novel framework designed for the linguistic
and cultural context of Singapore and Southeast Asia. It extends the functional
testing approach of HateCheck and MHC, employing large language models for
translation and paraphrasing into Singapore's main languages, and refining
these with native annotators. \textsf{SGHateCheck} reveals critical flaws in
state-of-the-art models, highlighting their inadequacy in sensitive content
moderation. This work aims to foster the development of more effective hate
speech detection tools for diverse linguistic environments, particularly for
Singapore and Southeast Asia contexts. |
---|---|
DOI: | 10.48550/arxiv.2405.01842 |