Selection of Cluster Hierarchy Depth in Hierarchical Clustering Using K-Means Algorithm

Many papers have shown that the hierarchical clustering method takes good-performance, but is limited because of its quadratic time complexity. In contrast, with a large number of variables, K-means has a time complexity that is linear in the number of documents, but is thought to produce inferior c...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Shinwon Lee, Wonhee Lee, Sungjong Chung, Dongun An, Ingeun Bok, Hongjin Ryu
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Clustering algorithms Clustering methods Information analysis Information retrieval Information technology Merging Metasearch Natural languages Partitioning algorithms Speech
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Many papers have shown that the hierarchical clustering method takes good-performance, but is limited because of its quadratic time complexity. In contrast, with a large number of variables, K-means has a time complexity that is linear in the number of documents, but is thought to produce inferior clusters. Think of the factor of simplify, high-quality and high-efficiency, we combine the two approaches providing a new system named CONDOR system with hierarchical structure based on document clustering using K-means algorithm. Evaluated the performance on different hierarchy depth and initial uncertain centroid number based on variational relative document amount correspond to given queries. Comparing with regular method that the initial centroids have been established in advance, our method performance has been improved a lot.
DOI:	10.1109/ISITC.2007.5