Cluster Analysis

Bibliographic details
Main authors:	Balusamy, Balamurugan; Abirami R, Nandhini; Kadry, Seifedine; Gandomi, Amir H
Format:	Book chapter
Language:	English
Subjects:	General References; hard clustering; hierarchical clustering; K‐means algorithm; outlier detection techniques; representative‐based clustering partitions; soft clustering; Software Engineering
Online access:	Full text

Description
Summary:	Clustering is an unsupervised machine learning technique that groups data points by the similarity of their characteristics. It is used when the specific target or expected output is not known to the data analyst. Hierarchical clustering produces a series of partitions, either by iteratively merging smaller clusters until a single cluster remains or, conversely, by iteratively dividing a single large cluster into smaller clusters. These two approaches are called agglomerative clustering and divisive clustering, respectively. Clustering methods are also classified as hard clustering or soft clustering. The K‐means algorithm performs hard clustering: each data point is assigned to exactly one cluster, the one whose centroid is nearest. Representative‐based clustering partitions a given data set of n data points in an N‐dimensional space. Outlier detection techniques fall into three types, depending on the availability of a training data set: supervised, semi‐supervised, and unsupervised outlier detection.
DOI:	10.1002/9781119701859.ch9