Chameleon algorithm based on improved natural neighbor graph generating sub-clusters

Chameleon algorithm is a hierarchical clustering based on dynamic modeling. It can find high-quality clusters with different shapes, sizes and densities. However, Chameleon algorithm requires user-specified k when constructing sparse graph, which directly influences the clustering performance. In ad...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Applied intelligence (Dordrecht, Netherlands) Netherlands), 2021-11, Vol.51 (11), p.8399-8415
Hauptverfasser: Zhang, Yuru, Ding, Shifei, Wang, Yanru, Hou, Haiwei
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Chameleon algorithm is a hierarchical clustering based on dynamic modeling. It can find high-quality clusters with different shapes, sizes and densities. However, Chameleon algorithm requires user-specified k when constructing sparse graph, which directly influences the clustering performance. In addition, the graph-partitioning technology used in the original algorithm, hMetis algorithm, is hard to build operation environment, and the number of partitions needs to be specified. These problems are arduous to determine without prior knowledge. In order to overcome the first problem, this paper introduces an improved natural neighbor method to construct a sparse graph, which can reflect the initial sparseness of the data. To address the second problems, this paper proposes a new method of generating sub-clusters in sparse graphs, which is simple and objective. In summary, this paper proposes Chameleon Algorithm Based on Improved Natural Neighbor Graph Generating Sub-clusters (INNGS-Chameleon). This algorithm is tested on 8 synthetic data sets and 10 UCI data sets. The results are compared with the Chameleon algorithm, its improved algorithm and several classic algorithms. The experimental results show that the INNGS-Chameleon algorithm is feasible and effective.
ISSN:0924-669X
1573-7497
DOI:10.1007/s10489-021-02389-0