Principal points of a multivariate mixture distribution

A set of n -principal points of a distribution is defined as a set of n points that optimally represent the distribution in terms of mean squared distance. It provides an optimal n -point-approximation of the distribution. However, it is in general difficult to find a set of principal points of a mu...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of multivariate analysis 2011-02, Vol.102 (2), p.213-224
Hauptverfasser: Matsuura, Shun, Kurata, Hiroshi
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:A set of n -principal points of a distribution is defined as a set of n points that optimally represent the distribution in terms of mean squared distance. It provides an optimal n -point-approximation of the distribution. However, it is in general difficult to find a set of principal points of a multivariate distribution. Tarpey et al. [T. Tarpey, L. Li, B. Flury, Principal points and self-consistent points of elliptical distributions, Ann. Statist. 23 (1995) 103–112] established a theorem which states that any set of n -principal points of an elliptically symmetric distribution is in the linear subspace spanned by some principal eigenvectors of the covariance matrix. This theorem, called a “principal subspace theorem”, is a strong tool for the calculation of principal points. In practice, we often come across distributions consisting of several subgroups. Hence it is of interest to know whether the principal subspace theorem remains valid even under such complex distributions. In this paper, we define a multivariate location mixture model. A theorem is established that clarifies a linear subspace in which n -principal points exist.
ISSN:0047-259X
1095-7243
DOI:10.1016/j.jmva.2010.08.009