Robust mixture model-based clustering with genetic algorithm approach

In this paper, we address the robustness issue of maximum likelihood based methods in data clustering. Probabilistic mixture model has been a well known approach to cluster analysis. However, as they rely on maximum likelihood estimation (MLE), the algorithms are often very sensitive to noise and ou...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Intelligent data analysis 2011-01, Vol.15 (3), p.357-373
Hauptverfasser: Thang, Nguyen Duc, Chen, Lihui, Chan, Chee Keong
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:In this paper, we address the robustness issue of maximum likelihood based methods in data clustering. Probabilistic mixture model has been a well known approach to cluster analysis. However, as they rely on maximum likelihood estimation (MLE), the algorithms are often very sensitive to noise and outliers. In this work, we implement a variant of the classical mixture model-based clustering (M2C) following a proposed general framework for handling outliers. Genetic Algorithm (GA) is incorporated into the framework to produce a novel algorithm called GA-based Partial M2C (GA-PM2C). Analytical and experimental studies show that GA-PM2C can overcome the negative impact of outliers in data clustering, hence provides highly accurate and reliable clustering results. It also exhibits excellent consistency in performance and low sensitivity to initializations.
ISSN:1088-467X
1571-4128
DOI:10.3233/IDA-2010-0472