Density-based clustering with non-continuous data

Bibliographic details
Published in:	Computational Statistics 2016-06, Vol.31 (2), p.771-798
Main authors:	Azzalini, Adelchi; Menardi, Giovanna
Format:	Article
Language:	English
Subjects:	Cluster analysis; Clustering; Computer simulation; Economic Theory/Quantitative Economics/Mathematical Methods; Illustrations; Mathematical models; Mathematics and Statistics; Original Paper; Probability; Probability and Statistics in Computer Science; Probability distribution; Probability Theory and Stochastic Processes; Recommendations; Samples; Statistical analysis; Statistics; Studies; Variables

Description
Abstract:	Density-based clustering relies on the idea of associating groups with regions of the sample space characterized by high density of the probability distribution underlying the observations. While this approach to cluster analysis exhibits some desirable properties, its use is necessarily limited to continuous data only. The present contribution proposes a simple but working way to circumvent this problem, based on the identification of continuous components underlying the non-continuous variables. The basic idea is explored in a number of variants applied to simulated data, confirming the practical effectiveness of the technique and leading to recommendations for its practical usage. Some illustrations using real data are also presented.
ISSN:	0943-4062, 1613-9658
DOI:	10.1007/s00180-016-0644-8