ADJUSTING SYSTEMATIC BIAS IN HIGH DIMENSIONAL PRINCIPAL COMPONENT SCORES

Principal component analysis continues to be a powerful tool for the dimension reduction of high-dimensional data. We assume a variance-diverging model and use the high-dimension low-sample-size asymptotics to show that even though the principal component directions are not consistent, the sample an...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Statistica Sinica 2022-04, Vol.32 (2), p.939-959
1. Verfasser: Jung, Sungkyu
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Principal component analysis continues to be a powerful tool for the dimension reduction of high-dimensional data. We assume a variance-diverging model and use the high-dimension low-sample-size asymptotics to show that even though the principal component directions are not consistent, the sample and prediction principal component scores can be useful in revealing the population structure. We further show that these scores are biased, and that the bias is asymptotically decomposed into rotation and scaling parts. We propose bias-adjustment methods that are shown to be consistent and work well in high-dimensional situations with small sample sizes. The potential advantage of the bias adjustment is demonstrated in a classification setting.
ISSN:1017-0405
1996-8507
DOI:10.5705/ss.202019.0400