FDR control for linear log-contrast models with high-dimensional compositional covariates
Linear log-contrast models have been widely used to describe the relationship between the response of interest and the compositional covariates, in which one central task is to identify the significant compositional covariates while controlling the false discovery rate (FDR) at a nominal level. To a...
Gespeichert in:
Veröffentlicht in: | Computational statistics & data analysis 2024-09, Vol.197, p.107973, Article 107973 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Linear log-contrast models have been widely used to describe the relationship between the response of interest and the compositional covariates, in which one central task is to identify the significant compositional covariates while controlling the false discovery rate (FDR) at a nominal level. To achieve this goal, a new FDR control method is proposed for linear log-contrast models with high-dimensional compositional covariates. An appealing feature of the proposed method is that it completely bypasses the traditional p-values and utilizes only the symmetry property of the test statistic for the unimportant compositional covariates to give an upper bound of the FDR. Under some regularity conditions, the FDR can be asymptotically controlled at the nominal level for the proposed method in theory, and the theoretical power is also proven to approach one as the sample size tends to infinity. The finite-sample performance of the proposed method is evaluated through extensive simulation studies, and applications to microbiome compositional datasets are also provided. |
---|---|
ISSN: | 0167-9473 1872-7352 |
DOI: | 10.1016/j.csda.2024.107973 |