Outlier detection based on transitive closure

Outlier detection is an important task in data mining because outliers may bring either new knowledge or potential threats. Much of recent research has focused on measuring the local difference between an outlier and its nearest neighbors, some of which may be unsuitable reference objects. Thus, loc...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Intelligent data analysis 2015-01, Vol.19 (1), p.145-160
Hauptverfasser: Wan, Jiaqiang, Zhu, Qingsheng, Lei, Dajiang, Lu, Jiaxi
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Outlier detection is an important task in data mining because outliers may bring either new knowledge or potential threats. Much of recent research has focused on measuring the local difference between an outlier and its nearest neighbors, some of which may be unsuitable reference objects. Thus, local difference cannot represent true outlying-ness. On the basis of this conclusion, we propose a new outlying-ness measure that reflects the connectivity of any object to the main body of a data set. For any object p, the outlying-ness is denoted by the connectivity from the k-th most similar neighbor to p. The proposed measure is applicable to arbitrary-density and arbitrarily-shaped data. It is uninfluenced by unsuitable reference objects and effectively identifies outlying clusters without the need for clustering algorithms and additional parameters.
ISSN:1088-467X
1571-4128
DOI:10.3233/IDA-140701