Duluth at SemEval-2019 Task 6: Lexical Approaches to Identify and Categorize Offensive Tweets
This paper describes the Duluth systems that participated in SemEval--2019 Task 6, Identifying and Categorizing Offensive Language in Social Media (OffensEval). For the most part these systems took traditional Machine Learning approaches that built classifiers from lexical features found in manually...
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | This paper describes the Duluth systems that participated in SemEval--2019
Task 6, Identifying and Categorizing Offensive Language in Social Media
(OffensEval). For the most part these systems took traditional Machine Learning
approaches that built classifiers from lexical features found in manually
labeled training data. However, our most successful system for classifying a
tweet as offensive (or not) was a rule-based black--list approach, and we also
experimented with combining the training data from two different but related
SemEval tasks. Our best systems in each of the three OffensEval tasks placed in
the middle of the comparative evaluation, ranking 57th of 103 in task A, 39th
of 75 in task B, and 44th of 65 in task C. |
---|---|
DOI: | 10.48550/arxiv.2007.12949 |