Sampling Method for Fast Training of Support Vector Data Description

Bibliographic details
Published in:	arXiv.org 2016-09
Main authors:	Chaudhuri, Arin; Kakde, Deovrat; Jahja, Maria; Xiao, Wei; Jiang, Hansi; Kong, Seunghyun; Peredriy, Sergiy
Format:	Article
Language:	English
Subjects:	Computer Science - Learning; Data analysis; Data management; Iterative methods; Outliers (statistics); Sampling; Statistics - Applications; Statistics - Machine Learning; Training
Online access:	Full text

Description
Summary:	Support Vector Data Description (SVDD) is a popular outlier detection technique that constructs a flexible description of the input data. SVDD computation time is high for large training datasets, which limits its use in big-data process-monitoring applications. We propose a new iterative sampling-based method for SVDD training. The method incrementally learns the training data description at each iteration by computing SVDD on an independent random sample selected with replacement from the training data set. The experimental results indicate that the proposed method is extremely fast and provides a good data description.
ISSN:	2331-8422
DOI:	10.48550/arxiv.1606.05382
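
Illustration:	The sampling scheme outlined in the summary can be sketched in a few lines of Python. This is a minimal sketch under stated assumptions, not the authors' implementation. The record only says that each iteration computes SVDD on an independent random sample drawn with replacement; the merge step (re-fitting on the union of the current support vectors and the new sample's support vectors), the stopping rule on the threshold R^2, and every name and parameter below (`fit_svdd`, `sampling_svdd`, `sample_size`, `tol`, `patience`) are hypothetical choices made for illustration. The inner solver is a swapped-in simplification as well: a hard-margin Gaussian-kernel SVDD solved approximately with Frank-Wolfe steps, with no outlier-fraction parameter.

```python
import math
import random


def rbf(a, b, s):
    """Gaussian kernel with bandwidth s."""
    d2 = sum((x - y) ** 2 for x, y in zip(a, b))
    return math.exp(-d2 / (2.0 * s * s))


def dist2(model, z):
    """Squared feature-space distance from z to the SVDD centre."""
    sv, w, const, s = model
    return 1.0 - 2.0 * sum(wi * rbf(x, z, s) for wi, x in zip(w, sv)) + const


def fit_svdd(points, s, fw_iters=300, sv_tol=1e-3):
    """Approximate hard-margin SVDD (assumption: no outlier fraction).

    Minimises a^T K a over the simplex with Frank-Wolfe steps; each step
    moves weight toward the point currently farthest from the centre.
    """
    pts = list(points)
    n = len(pts)
    K = [[rbf(p, q, s) for q in pts] for p in pts]
    a = [1.0 / n] * n
    Ka = [sum(row) / n for row in K]  # K @ a, updated incrementally below
    for t in range(fw_iters):
        i = min(range(n), key=Ka.__getitem__)  # farthest point from centre
        step = 2.0 / (t + 2.0)
        a = [(1.0 - step) * v for v in a]
        a[i] += step
        Ka = [(1.0 - step) * g + step * K[i][j] for j, g in enumerate(Ka)]
    # Treat points with non-negligible weight as support vectors.
    keep = [i for i in range(n) if a[i] > sv_tol]
    total = sum(a[i] for i in keep)
    sv = [pts[i] for i in keep]
    w = [a[i] / total for i in keep]
    const = sum(wp * wq * K[p][q]
                for p, wp in zip(keep, w) for q, wq in zip(keep, w))
    model = (sv, w, const, s)
    r2 = max(dist2(model, p) for p in pts)  # threshold covers all fitted points
    return model, r2


def sampling_svdd(data, sample_size=8, s=1.5, tol=1e-2, patience=5,
                  max_iter=200, seed=0):
    """Iterative sampling loop (merge and stopping rules are assumptions)."""
    rng = random.Random(seed)

    def draw():
        # Sample with replacement, then drop duplicate points.
        return list(set(rng.choices(data, k=sample_size)))

    model, r2 = fit_svdd(draw(), s)
    stable = 0
    for _ in range(max_iter):
        sample_model, _ = fit_svdd(draw(), s)
        # Merge current support vectors with the new sample's support vectors.
        merged = list(set(model[0]) | set(sample_model[0]))
        new_model, new_r2 = fit_svdd(merged, s)
        changed = abs(new_r2 - r2) > tol * max(r2, 1e-12)
        stable = 0 if changed else stable + 1
        model, r2 = new_model, new_r2
        if stable >= patience:
            break
    return model, r2


# Usage: describe a 2-D point cloud, then score a new observation.
gen = random.Random(1)
data = [(gen.gauss(0, 1), gen.gauss(0, 1)) for _ in range(500)]
model, r2 = sampling_svdd(data)
is_outlier = dist2(model, (100.0, 100.0)) > r2
```

Each SVDD solve in this sketch touches only a handful of points (the small sample, or the merged support-vector set), which is the source of the speed-up the summary describes. How well the result matches a full SVDD fit depends on the bandwidth, the sample size, and the stopping rule, none of which are specified in this record.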