Uchaguzi-2022: A Dataset of Citizen Reports on the 2022 Kenyan Election
Online reporting platforms have enabled citizens around the world to collectively share their opinions and report in real time on events impacting their local communities. Systematically organizing (e.g., categorizing by attributes) and geotagging large amounts of crowdsourced information is crucial...
Gespeichert in:
Hauptverfasser: | , , , , , , , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Online reporting platforms have enabled citizens around the world to
collectively share their opinions and report in real time on events impacting
their local communities. Systematically organizing (e.g., categorizing by
attributes) and geotagging large amounts of crowdsourced information is crucial
to ensuring that accurate and meaningful insights can be drawn from this data
and used by policy makers to bring about positive change. These tasks, however,
typically require extensive manual annotation efforts. In this paper we present
Uchaguzi-2022, a dataset of 14k categorized and geotagged citizen reports
related to the 2022 Kenyan General Election containing mentions of
election-related issues such as official misconduct, vote count irregularities,
and acts of violence. We use this dataset to investigate whether language
models can assist in scalably categorizing and geotagging reports, thus
highlighting its potential application in the AI for Social Good space. |
---|---|
DOI: | 10.48550/arxiv.2412.13098 |