A Dataset of State-Censored Tweets
ICWSM, 2021, Vol. 15, p. 1009
Main authors: | , , |
---|---|
Format: | Article |
Language: | eng |
Subjects: | |
Online access: | Order full text |
Summary: | Many governments impose traditional censorship methods on social media
platforms. Instead of removing censored content completely, many social media
companies, including Twitter, only withhold it from the requesting country.
Such content therefore remains accessible outside the censored region, which
provides an excellent setting in which to study government censorship on
social media. We mine such content using the Internet Archive's Twitter Stream
Grab. We release a dataset of 583,437 tweets by 155,715 users that were
censored between 2012 and July 2020. We also release 4,301 accounts that were
censored in their entirety. Additionally, we release a set of 22,083,759
supplemental tweets made up of all tweets by users with at least one censored
tweet, as well as instances of other users retweeting the censored user. We
provide an exploratory analysis of this dataset. Our dataset will not only aid
in the study of government censorship but will also aid in studying hate
speech detection and the effect of censorship on social media users. The
dataset is publicly available at https://doi.org/10.5281/zenodo.4439509 |
---|---|
DOI: | 10.48550/arxiv.2101.05919 |
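
The summary says that censored content was mined from the Internet Archive's Twitter Stream Grab but does not describe the procedure. The following is a minimal sketch of one way such filtering could work, not the authors' pipeline. It assumes line-delimited JSON in the Twitter API v1.1 tweet format, where withheld tweets and accounts carry a `withheld_in_countries` list; the function name `find_withheld`, the output keys, and the sample records are hypothetical.

```python
import json
from typing import Iterable, Iterator


def find_withheld(lines: Iterable[str]) -> Iterator[dict]:
    """Yield a small record for each tweet withheld in at least one country.

    Assumption: each line is one JSON object in the Twitter API v1.1 format,
    with an optional "withheld_in_countries" list on the tweet and on its
    embedded "user" object.
    """
    for line in lines:
        line = line.strip()
        if not line:
            continue
        try:
            obj = json.loads(line)
        except json.JSONDecodeError:
            continue  # skip malformed lines
        if not isinstance(obj, dict):
            continue

        # Countries in which this single tweet is withheld.
        tweet_countries = obj.get("withheld_in_countries") or []

        # Countries in which the whole account is withheld.
        user = obj.get("user")
        user_countries = []
        if isinstance(user, dict):
            user_countries = user.get("withheld_in_countries") or []

        if tweet_countries or user_countries:
            yield {
                "id": obj.get("id"),
                "tweet_countries": list(tweet_countries),
                "user_countries": list(user_countries),
            }


# Hypothetical sample records, for illustration only.
sample = [
    '{"id": 1, "text": "hello", "user": {"id": 10}}',
    '{"id": 2, "text": "a", "withheld_in_countries": ["TR"], "user": {"id": 11}}',
    '{"id": 3, "text": "b", "user": {"id": 12, "withheld_in_countries": ["DE", "FR"]}}',
    '{"delete": {"status": {"id": 4}}}',
    "not json",
]
hits = list(find_withheld(sample))
for hit in hits:
    print(hit)
```

In this sketch, tweet-level and account-level withholding are reported separately, mirroring the summary's distinction between individually censored tweets and accounts censored in their entirety.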