ADJUSTING SYSTEMATIC BIAS IN HIGH DIMENSIONAL PRINCIPAL COMPONENT SCORES
Principal component analysis continues to be a powerful tool for the dimension reduction of high-dimensional data. We assume a variance-diverging model and use the high-dimension low-sample-size asymptotics to show that even though the principal component directions are not consistent, the sample an...
Gespeichert in:
Veröffentlicht in: | Statistica Sinica 2022-04, Vol.32 (2), p.939-959 |
---|---|
1. Verfasser: | |
Format: | Artikel |
Sprache: | eng |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Principal component analysis continues to be a powerful tool for the dimension reduction of high-dimensional data. We assume a variance-diverging model and use the high-dimension low-sample-size asymptotics to show that even though the principal component directions are not consistent, the sample and prediction principal component scores can be useful in revealing the population structure. We further show that these scores are biased, and that the bias is asymptotically decomposed into rotation and scaling parts. We propose bias-adjustment methods that are shown to be consistent and work well in high-dimensional situations with small sample sizes. The potential advantage of the bias adjustment is demonstrated in a classification setting. |
---|---|
ISSN: | 1017-0405 1996-8507 |
DOI: | 10.5705/ss.202019.0400 |