PUMAD: PU Metric learning for anomaly detection

Anomaly detection task, which identifies abnormal patterns in data, has been widely applied to various domains. Most recent work on anomaly detection have focused on an accurate modeling of the normal data based on unsupervised methods. To get a satisfactory anomaly detection accuracy, they need pur...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Information sciences 2020-06, Vol.523, p.167-183
Hauptverfasser:	Ju, Hyunjun, Lee, Dongha, Hwang, Junyoung, Namkung, Junghyun, Yu, Hwanjo
Format:	Artikel
Sprache:	eng
Schlagworte:	Anomaly detection Computer Science Computer Science, Information Systems Metric learning PU Learning Science & Technology Technology
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Anomaly detection task, which identifies abnormal patterns in data, has been widely applied to various domains. Most recent work on anomaly detection have focused on an accurate modeling of the normal data based on unsupervised methods. To get a satisfactory anomaly detection accuracy, they need pure normal data without abnormal data. This scenario requires many labels to get pure normal data. In many real-world scenarios, there exist abundant unlabeled data and a limited number of partially labeled anomalies. This paper proposes a novel anomaly detection method, PUMAD, which uses a Positive and Unlabeled (PU) learning approach to learn from abundant unlabeled data and a small number of partially labeled anomalies (i.e., positives). PUMAD successfully works on the anomaly detection scenario by exploiting deep metric learning with a hashing-based filtering method. Extensive experimental results on real-world benchmark datasets demonstrate that our approach based on PU learning is effective to detect anomalies. PUMAD achieves a much higher accuracy of up to 24% than state-of-the-art competitors.
ISSN:	0020-0255 1872-6291
DOI:	10.1016/j.ins.2020.03.021