Distributed Conditional Feature Screening via Pearson Partial Correlation with FDR Control
This paper studies the distributed conditional feature screening for massive data with ultrahigh-dimensional features. Specifically, three distributed partial correlation feature screening methods (SAPS, ACPS and JDPS methods) are firstly proposed based on Pearson partial correlation. The correspond...
Gespeichert in:
Hauptverfasser: | , |
---|---|
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | This paper studies the distributed conditional feature screening for massive
data with ultrahigh-dimensional features. Specifically, three distributed
partial correlation feature screening methods (SAPS, ACPS and JDPS methods) are
firstly proposed based on Pearson partial correlation. The corresponding
consistency of distributed estimation and the sure screening property of
feature screening methods are established. Secondly, because using a hard
threshold in feature screening will lead to a high false discovery rate (FDR),
this paper develops a two-step distributed feature screening method based on
knockoff technique to control the FDR. It is shown that the proposed method can
control the FDR in the finite sample, and also enjoys the sure screening
property under some conditions. Different from the existing screening methods,
this paper not only considers the influence of a conditional variable on both
the response variable and feature variables in variable screening, but also
studies the FDR control issue. Finally, the effectiveness of the proposed
methods is confirmed by numerical simulations and a real data analysis. |
---|---|
DOI: | 10.48550/arxiv.2403.05792 |