Outliers influence to the point distance distribution normality within the data clusters
In order to verify the cluster analysis results, a normality test is being applied to the distribution of data point's distances from their cluster center. The presence of the outlier points within the input data can however influence this method in a negative way. Therefore, a normality test w...
Gespeichert in:
Hauptverfasser: | , , |
---|---|
Format: | Tagungsbericht |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | In order to verify the cluster analysis results, a normality test is being applied to the distribution of data point's distances from their cluster center. The presence of the outlier points within the input data can however influence this method in a negative way. Therefore, a normality test will show better results in recognizing and assessing the clusters if the outlier presence is reduced. This fact is being confirmed by empirically comparing the normality test results for the clusters produced by different cluster analyses methods on the same data set. |
---|---|
DOI: | 10.1109/TELFOR.2012.6419542 |