A general framework of nonparametric feature selection in high‐dimensional data

Bibliographic details
Published in:	Biometrics 2023-06, Vol.79 (2), p.951-963
Main authors:	Yu, Hang; Wang, Yuanjia; Zeng, Donglin
Format:	Article
Language:	English
Subjects:	Algorithms; Computational geometry; Computer Simulation; Convexity; Data analysis; Feature selection; Fisher consistency; Hilbert space; Kernel functions; Machine Learning; Mathematical models; Multivariate analysis; Nonparametric statistics; Optimization; oracle property; Parameters; reproducing kernel Hilbert space; Statistical analysis; tensor product kernel; Tensors; variable selection
Online access:	Full text

Description
Abstract:	Nonparametric feature selection for high‐dimensional data is an important and challenging problem in the fields of statistics and machine learning. Most of the existing methods for feature selection focus on parametric or additive models which may suffer from model misspecification. In this paper, we propose a new framework to perform nonparametric feature selection for both regression and classification problems. Under this framework, we learn prediction functions through empirical risk minimization over a reproducing kernel Hilbert space. The space is generated by a novel tensor product kernel, which depends on a set of parameters that determines the importance of the features. Computationally, we minimize the empirical risk with a penalty to estimate the prediction and kernel parameters simultaneously. The solution can be obtained by iteratively solving convex optimization problems. We study the theoretical property of the kernel feature space and prove the oracle selection property and Fisher consistency of our proposed method. Finally, we demonstrate the superior performance of our approach compared to existing methods via extensive simulation studies and applications to two real studies.
ISSN:	0006-341X, 1541-0420
DOI:	10.1111/biom.13664
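
The abstract describes a kernel whose parameters encode feature importance, fitted by alternating convex problems. The sketch below illustrates only the general idea and is not the authors' method: the kernel form (a product of per-feature Gaussian kernels with nonnegative weights `w`), the function names, and the use of squared-error loss with a ridge penalty are all assumptions made for illustration. It shows one convex step, fitting the prediction function with the weights held fixed; the step that updates the weights is not shown.

```python
import numpy as np


def weighted_product_kernel(X, Z, w):
    """Hypothetical product kernel: prod_j exp(-w[j] * (x_j - z_j)**2).

    A weight w[j] == 0 makes the j-th factor equal to 1, so feature j
    has no influence on the kernel (it is effectively deselected).
    """
    X = np.asarray(X, dtype=float)
    Z = np.asarray(Z, dtype=float)
    w = np.asarray(w, dtype=float)
    # Squared differences per feature, shape (n_X, n_Z, n_features).
    diff2 = (X[:, None, :] - Z[None, :, :]) ** 2
    # Product of exponentials equals the exponential of the weighted sum.
    return np.exp(-(diff2 * w).sum(axis=2))


def kernel_ridge_fit(K, y, lam):
    """Convex step for fixed weights: minimize
    (1/n) * ||y - K a||^2 + lam * a^T K a, solved in closed form."""
    n = K.shape[0]
    return np.linalg.solve(K + n * lam * np.eye(n), np.asarray(y, dtype=float))


def kernel_ridge_predict(K_new, alpha):
    """Predict with the fitted coefficients; K_new has shape (n_new, n_train)."""
    return K_new @ alpha
```

With `w = [1.0, 0.0]`, for instance, changing the second column of `X` leaves the kernel matrix unchanged, which is the sense in which a zero weight removes a feature in this toy version.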