MANOCCA: a robust and computationally efficient test of covariance in high-dimension multivariate omics data

Abstract Multivariate analysis is becoming central in studies investigating high-throughput molecular data, yet, some important features of these data are seldom explored. Here, we present MANOCCA (Multivariate Analysis of Conditional CovAriance), a powerful method to test for the effect of a predic...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Briefings in bioinformatics 2024-05, Vol.25 (4)
Hauptverfasser: Boetto, Christophe, Frouin, Arthur, Henches, Léo, Auvergne, Antoine, Suzuki, Yuka, Patin, Etienne, Bredon, Marius, Chiu, Alec, Consortium, Milieu Interieur, Sankararaman, Sriram, Zaitlen, Noah, Kennedy, Sean P, Quintana-Murci, Lluis, Duffy, Darragh, Sokol, Harry, Aschard, Hugues
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Abstract Multivariate analysis is becoming central in studies investigating high-throughput molecular data, yet, some important features of these data are seldom explored. Here, we present MANOCCA (Multivariate Analysis of Conditional CovAriance), a powerful method to test for the effect of a predictor on the covariance matrix of a multivariate outcome. The proposed test is by construction orthogonal to tests based on the mean and variance and is able to capture effects that are missed by both approaches. We first compare the performances of MANOCCA with existing correlation-based methods and show that MANOCCA is the only test correctly calibrated in simulation mimicking omics data. We then investigate the impact of reducing the dimensionality of the data using principal component analysis when the sample size is smaller than the number of pairwise covariance terms analysed. We show that, in many realistic scenarios, the maximum power can be achieved with a limited number of components. Finally, we apply MANOCCA to 1000 healthy individuals from the Milieu Interieur cohort, to assess the effect of health, lifestyle and genetic factors on the covariance of two sets of phenotypes, blood biomarkers and flow cytometry–based immune phenotypes. Our analyses identify significant associations between multiple factors and the covariance of both omics data.
ISSN:1467-5463
1477-4054
DOI:10.1093/bib/bbae272