Classification of Dive Profiles: A Comparison of Statistical Clustering Techniques and Unsupervised Artifical Neural Networks

Recent advances in technology for sampling diving behavior have enabled enormous datasets to be collected on a variety of diving animals. Methods used to analyze these data vary considerably across studies, complicating interspecific comparisons. The primary problem is that methods for analyzing lar...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of agricultural, biological, and environmental statistics biological, and environmental statistics, 1998-12, Vol.3 (4), p.383-404
Hauptverfasser: Schreer, Jason F., R. J. O'Hara Hines, Kovacs, Kit M.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Recent advances in technology for sampling diving behavior have enabled enormous datasets to be collected on a variety of diving animals. Methods used to analyze these data vary considerably across studies, complicating interspecific comparisons. The primary problem is that methods for analyzing large datasets of dive profiles have not been clearly defined. This study examines various algorithms for analyzing multivariate observations and assesses their suitability for classifying diving data. These include k-means and fuzzy c-means clustering techniques from the field of statistics, and Kohonen self-organizing map (SOM) and fuzzy adaptive resonance theory (ART) from the field of artificial neural networks. A Monte Carlo simulation was performed on artificially generated data, with known solutions, to test clustering performance under various conditions (i.e., well defined or overlapping groups, varying numbers of variables, varying numbers of groups, and autocorrelated or independent variables). Performance was also tested on real datasets from Adélie penguins (Pygoscelis adeliae), southern elephant seals (Mirounga leonina), and Weddell seals (Leptonychotes weddellii). K-means, fuzzy c-means, and SOM all performed equally well on the artificially generated data while fuzzy ART had misclassification rates that were twice as high. All techniques showed decreasing performance with increasing overlap among groups and increasing numbers of groups, but increasing performance with increasing numbers of variables. Fuzzy ART was the most sensitive to the varying simulation parameters. When clustering real data, both c-means and SOM classified observations into clusters that were closer together (relative to k-means) and hence had less distinct boundaries separating the clusters. K-means performed as well as c-means and SOM, but its classification of real data was more logical when compared to the actual dive profiles. K-means is also the most readily available technique on statistical software packages. considering all of these factors, k-means appears to be the best method among those examined for grouping dive profiles.
ISSN:1085-7117
DOI:10.2307/1400572