Outliers influence to the point distance distribution normality within the data clusters

In order to verify the cluster analysis results, a normality test is being applied to the distribution of data point's distances from their cluster center. The presence of the outlier points within the input data can however influence this method in a negative way. Therefore, a normality test w...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Malkic, J., Sarajlic, N., Hadzic, D.
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:In order to verify the cluster analysis results, a normality test is being applied to the distribution of data point's distances from their cluster center. The presence of the outlier points within the input data can however influence this method in a negative way. Therefore, a normality test will show better results in recognizing and assessing the clusters if the outlier presence is reduced. This fact is being confirmed by empirically comparing the normality test results for the clusters produced by different cluster analyses methods on the same data set.
DOI:10.1109/TELFOR.2012.6419542