Data Reduction Using a Discrete Wavelet Transform in Discriminant Analysis of Very High Dimensionality Data

Bibliographic details
Published in: Biometrics 2003-03, Vol. 59 (1), p. 143-151
Main authors: Qu, Yinsheng; Adam, Bao‐ling; Thornquist, Mark; Potter, John D; Thompson, Mary Lou; Yasui, Yutaka; Davis, John; Schellhammer, Paul F; Cazares, Lisa; Clements, MaryAnn; Wright, George L., Jr; Feng, Ziding
Format: Article
Language: English
Description
Abstract: We present a method of data reduction using a wavelet transform in discriminant analysis when the number of variables is much greater than the number of observations. The method is illustrated with a prostate cancer study, where the sample size is 248, and the number of variables is 48,538 (generated using the ProteinChip technology). Using a discrete wavelet transform, the 48,538 data points are represented by 1271 wavelet coefficients. Information criteria identified 11 of the 1271 wavelet coefficients with the highest discriminatory power. The linear classifier with the 11 wavelet coefficients detected prostate cancer in a separate test set with a sensitivity of 97% and specificity of 100%.
ISSN: 0006-341X, 1541-0420
DOI: 10.1111/1541-0420.00017
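
The data reduction idea summarized in the abstract (replace a long spectrum with a much smaller set of wavelet coefficients, then keep the few coefficients that best separate the two groups) can be sketched roughly as follows. This is only an illustration under stated assumptions, not the authors' procedure: the Haar wavelet, the number of decomposition levels, the synthetic data, and the helper names `haar_approximation` and `top_discriminating` are all hypothetical choices. The ranking step uses a simple two-sample t-like score as a stand-in for the information criteria mentioned in the abstract, and the final linear classifier is not shown.

```python
import numpy as np


def haar_approximation(signal, levels):
    """Return Haar approximation coefficients after `levels` DWT steps.

    Assumption: Haar wavelet; the abstract does not name the wavelet used.
    """
    coeffs = np.asarray(signal, dtype=float)
    for _ in range(levels):
        if coeffs.size % 2:
            # Pad odd-length input by repeating the last value.
            coeffs = np.append(coeffs, coeffs[-1])
        # Orthonormal Haar low-pass step: scaled sum of adjacent pairs.
        coeffs = (coeffs[0::2] + coeffs[1::2]) / np.sqrt(2.0)
    return coeffs


def top_discriminating(features, labels, k):
    """Return indices of the k features with the largest t-like score.

    Assumption: a t-like score replaces the paper's information criteria.
    """
    a = features[labels == 0]
    b = features[labels == 1]
    se = np.sqrt(a.var(axis=0, ddof=1) / len(a)
                 + b.var(axis=0, ddof=1) / len(b) + 1e-12)
    score = np.abs(a.mean(axis=0) - b.mean(axis=0)) / se
    return np.argsort(score)[::-1][:k]


# Synthetic stand-in for spectra: 40 samples, 1024 points each.
rng = np.random.default_rng(0)
n_per_class, n_points = 20, 1024
spectra = rng.normal(size=(2 * n_per_class, n_points))
labels = np.repeat([0, 1], n_per_class)
spectra[labels == 1, 300:332] += 1.0  # artificial class difference

# Reduce each spectrum from 1024 points to 32 coefficients (5 levels).
reduced = np.array([haar_approximation(s, levels=5) for s in spectra])
selected = top_discriminating(reduced, labels, k=3)
print(reduced.shape, selected)
```

In this sketch each level halves the number of coefficients, so the reduction factor is a power of two; the 48,538-to-1271 reduction reported in the abstract evidently follows a different scheme, which the record does not describe.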