Less is More: Semi-Supervised Causal Inference for Detecting Pathogenic Users in Social Media
Recent years have witnessed a surge of manipulation of public opinion and political events by malicious social media actors. These users are referred to as "Pathogenic Social Media (PSM)" accounts. PSMs are key users in spreading misinformation in social media to viral proportions. These a...
Gespeichert in:
Hauptverfasser: | , , , , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Recent years have witnessed a surge of manipulation of public opinion and
political events by malicious social media actors. These users are referred to
as "Pathogenic Social Media (PSM)" accounts. PSMs are key users in spreading
misinformation in social media to viral proportions. These accounts can be
either controlled by real users or automated bots. Identification of PSMs is
thus of utmost importance for social media authorities. The burden usually
falls to automatic approaches that can identify these accounts and protect
social media reputation. However, lack of sufficient labeled examples for
devising and training sophisticated approaches to combat these accounts is
still one of the foremost challenges facing social media firms. In contrast,
unlabeled data is abundant and cheap to obtain thanks to massive user-generated
data. In this paper, we propose a semi-supervised causal inference PSM
detection framework, SemiPsm, to compensate for the lack of labeled data. In
particular, the proposed method leverages unlabeled data in the form of
manifold regularization and only relies on cascade information. This is in
contrast to the existing approaches that use exhaustive feature engineering
(e.g., profile information, network structure, etc.). Evidence from empirical
experiments on a real-world ISIS-related dataset from Twitter suggests
promising results of utilizing unlabeled instances for detecting PSMs. |
---|---|
DOI: | 10.48550/arxiv.1903.01693 |