Distributed Conditional Feature Screening via Pearson Partial Correlation with FDR Control

This paper studies the distributed conditional feature screening for massive data with ultrahigh-dimensional features. Specifically, three distributed partial correlation feature screening methods (SAPS, ACPS and JDPS methods) are firstly proposed based on Pearson partial correlation. The correspond...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Pang, Naiwen, Xia, Xiaochao
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:This paper studies the distributed conditional feature screening for massive data with ultrahigh-dimensional features. Specifically, three distributed partial correlation feature screening methods (SAPS, ACPS and JDPS methods) are firstly proposed based on Pearson partial correlation. The corresponding consistency of distributed estimation and the sure screening property of feature screening methods are established. Secondly, because using a hard threshold in feature screening will lead to a high false discovery rate (FDR), this paper develops a two-step distributed feature screening method based on knockoff technique to control the FDR. It is shown that the proposed method can control the FDR in the finite sample, and also enjoys the sure screening property under some conditions. Different from the existing screening methods, this paper not only considers the influence of a conditional variable on both the response variable and feature variables in variable screening, but also studies the FDR control issue. Finally, the effectiveness of the proposed methods is confirmed by numerical simulations and a real data analysis.
DOI:10.48550/arxiv.2403.05792