Indonesian Biodiversity-related Tweets Including Health, Food Security, and Environmental Management Issues for Sentiment Analysis
The dataset was gathered using Twitter API services for around 30 particular biodiversity-related keywords with dates ranging from January 2020 to March 2023. This data was then refined by filtering out irrelevant information, including non-Indonesian language content, non-Biodiversity data, spam, a...
Gespeichert in:
1. Verfasser: | |
---|---|
Format: | Dataset |
Sprache: | eng |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The dataset was gathered using Twitter API services for around 30 particular biodiversity-related keywords with dates ranging from January 2020 to March 2023. This data was then refined by filtering out irrelevant information, including non-Indonesian language content, non-Biodiversity data, spam, and duplicate entries. Independent analysts undertook the task of manually assigning sentiment labels to the dataset. These eighteen individuals consisted of twelve researchers and engineers specializing in natural language processing, of which two held Ph.D. degrees, nine had MSc degrees, and one had a BSc degree. Additionally, four lecturers and two experts in natural language processing, each with a Ph.D. or MSc degree, contributed to the labeling process. The sentiments were divided into three classes, and the principle of majority voting determined the final class label. |
---|---|
DOI: | 10.17632/xtk9wsxjjr.2 |