Robust mixture model-based clustering with genetic algorithm approach
In this paper, we address the robustness issue of maximum likelihood based methods in data clustering. Probabilistic mixture model has been a well known approach to cluster analysis. However, as they rely on maximum likelihood estimation (MLE), the algorithms are often very sensitive to noise and ou...
Gespeichert in:
Veröffentlicht in: | Intelligent data analysis 2011-01, Vol.15 (3), p.357-373 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | In this paper, we address the robustness issue of maximum likelihood based methods in data clustering. Probabilistic mixture model has been a well known approach to cluster analysis. However, as they rely on maximum likelihood estimation (MLE), the algorithms are often very sensitive to noise and outliers. In this work, we implement a variant of the classical mixture model-based clustering (M2C) following a proposed general framework for handling outliers. Genetic Algorithm (GA) is incorporated into the framework to produce a novel algorithm called GA-based Partial M2C (GA-PM2C). Analytical and experimental studies show that GA-PM2C can overcome the negative impact of outliers in data clustering, hence provides highly accurate and reliable clustering results. It also exhibits excellent consistency in performance and low sensitivity to initializations. |
---|---|
ISSN: | 1088-467X 1571-4128 |
DOI: | 10.3233/IDA-2010-0472 |