Self-adaptive two-stage density clustering method with fuzzy connectivity

Bibliographic details
Published in: Applied soft computing 2024-03, Vol. 154, p. 111355, Article 111355
Main authors: Qiao, Kaikai; Chen, Jiawei; Duan, Shukai
Format: Article
Language: English
Description
Summary: Density Peak Clustering (DPC) was proposed in the journal Science in 2014 and has been widely applied in many fields because of its simplicity and effectiveness. However, few studies have examined how well the DPC algorithm and its variants perform on non-clean data sets. Inspired by the way DPC combines density and distance when determining cluster centers, this paper designs a two-stage density clustering method with fuzzy connectivity (TS-DCM). The method can distinguish different cluster partitions and further separate noise points from sample points. The paper also introduces a new clustering index, fuzzy connectivity, which can both guide the selection of the DPC cutoff distance and serve as a reference for adaptively adjusting the TS-DCM parameters, greatly improving the operating efficiency of the clustering algorithm. In addition, a self-adaptive two-stage density clustering method (STS-DCM) is proposed that adjusts parameter selection according to feedback from the clustering results. Finally, comparisons with other traditional and popular clustering algorithms verify that the proposed algorithm has significant advantages in speed and accuracy. Moreover, the algorithm is robust and effective on non-clean data sets.
• Fuzzy connectivity is proposed as an important index for density clustering.
• A two-stage density clustering algorithm (TS-DCM) is proposed.
• A novel STS-DCM algorithm is proposed to determine the parameter selection.
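For orientation, the baseline DPC idea that the summary builds on (combining local density with distance to denser points) can be sketched as follows. This is only a minimal sketch of the original 2014 DPC scores, not the TS-DCM or STS-DCM method of this article, and it does not implement fuzzy connectivity. The function name `dpc_scores`, the hard cutoff kernel, and the toy data are assumptions made for illustration.

```python
import numpy as np

def dpc_scores(points, cutoff):
    """Return (rho, delta) for baseline DPC with a hard cutoff kernel.

    rho[i]   : number of other points closer to point i than `cutoff`
    delta[i] : distance from point i to the nearest point of higher density
               (for points with no denser point: the largest distance from i)
    """
    pts = np.asarray(points, dtype=float)
    # Pairwise Euclidean distances, shape (n, n).
    dist = np.linalg.norm(pts[:, None, :] - pts[None, :, :], axis=-1)
    # Subtract 1 so a point does not count itself (its self-distance is 0).
    rho = (dist < cutoff).sum(axis=1) - 1
    delta = np.empty(len(pts))
    for i in range(len(pts)):
        higher = rho > rho[i]
        if higher.any():
            delta[i] = dist[i, higher].min()
        else:
            # No denser point exists: use the maximum distance by convention.
            delta[i] = dist[i].max()
    return rho, delta

# Hypothetical toy data: two small, well-separated groups.
toy = [[0, 0], [0, 1], [1, 0], [0.4, 0.4],
       [10, 10], [10, 11], [10.4, 10.4]]
rho, delta = dpc_scores(toy, cutoff=0.8)
# Candidate centers are the points where both rho and delta are large;
# ranking by the product rho * delta is one common heuristic.
centers = np.argsort(rho * delta)[-2:]
print(sorted(centers.tolist()))
```

The result depends strongly on `cutoff`, which is the parameter the summary says fuzzy connectivity is meant to help select.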
ISSN: 1568-4946, 1872-9681
DOI: 10.1016/j.asoc.2024.111355