How to Avoid Spurious Cluster Validation? A Methodological Investigation on Simulated and fMRI Data

This paper presents an evaluation of a common approach that has been considered as a promising option for exploratory fMRI data analyses. The approach includes two stages: creating from the data a sequence of partitions with increasing number of subsets (clustering) and selecting the one partition i...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:NeuroImage (Orlando, Fla.) Fla.), 2002-09, Vol.17 (1), p.431-446
Hauptverfasser: Möller, Ulrich, Ligges, Marc, Georgiewa, Petra, Grünling, Carolin, Kaiser, Werner A., Witte, Herbert, Blanz, Bernhard
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:This paper presents an evaluation of a common approach that has been considered as a promising option for exploratory fMRI data analyses. The approach includes two stages: creating from the data a sequence of partitions with increasing number of subsets (clustering) and selecting the one partition in this sequence that exhibits the clearest indications of an existing structure (cluster validation). In order to achieve that the selected partition is actually the best characterization of the data structure, previous studies were directed to find the most appropriate validity function(s). In our analysis protocol, we first optimize the sequence of partitions according to the given objective function. Our study showed that an insufficient optimization of the partition, for one or more numbers of clusters, can easily yield a spurious validation result which, in turn, may lead the analyst to a misleading interpretation of the fMRI experiment. However, a sufficient optimization, for each included number of clusters, provided the basis for a reliable, adequate characterization of the data. Furthermore, it enabled an adequate evaluation of the validity functions. These findings were obtained independently for three clustering algorithms (representing the hard and fuzzy clustering variant) and three up-to-date cluster validity functions. The findings were derived from analyses of Gaussian clusters, simulated data sets that mimic typical fMRI response signals, and real fMRI data. Based on our results we propose a number of options of how to configure improved clustering tools.
ISSN:1053-8119
1095-9572
DOI:10.1006/nimg.2002.1166