PUMAD: PU Metric learning for anomaly detection
Anomaly detection task, which identifies abnormal patterns in data, has been widely applied to various domains. Most recent work on anomaly detection have focused on an accurate modeling of the normal data based on unsupervised methods. To get a satisfactory anomaly detection accuracy, they need pur...
Gespeichert in:
Veröffentlicht in: | Information sciences 2020-06, Vol.523, p.167-183 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Anomaly detection task, which identifies abnormal patterns in data, has been widely applied to various domains. Most recent work on anomaly detection have focused on an accurate modeling of the normal data based on unsupervised methods. To get a satisfactory anomaly detection accuracy, they need pure normal data without abnormal data. This scenario requires many labels to get pure normal data. In many real-world scenarios, there exist abundant unlabeled data and a limited number of partially labeled anomalies. This paper proposes a novel anomaly detection method, PUMAD, which uses a Positive and Unlabeled (PU) learning approach to learn from abundant unlabeled data and a small number of partially labeled anomalies (i.e., positives). PUMAD successfully works on the anomaly detection scenario by exploiting deep metric learning with a hashing-based filtering method. Extensive experimental results on real-world benchmark datasets demonstrate that our approach based on PU learning is effective to detect anomalies. PUMAD achieves a much higher accuracy of up to 24% than state-of-the-art competitors. |
---|---|
ISSN: | 0020-0255 1872-6291 |
DOI: | 10.1016/j.ins.2020.03.021 |