A Mixed-Effects Model for Powerful Association Tests in Integrative Functional Genomics

Genome-wide association studies (GWASs) have successfully identified thousands of genetic variants for many complex diseases; however, these variants explain only a small fraction of the heritability. Recently, genetic association studies that leverage external transcriptome data have received much...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:American journal of human genetics 2018-05, Vol.102 (5), p.904-919
Hauptverfasser: Su, Yu-Ru, Di, Chongzhi, Bien, Stephanie, Huang, Licai, Dong, Xinyuan, Abecasis, Goncalo, Berndt, Sonja, Bezieau, Stephane, Brenner, Hermann, Caan, Bette, Casey, Graham, Chang-Claude, Jenny, Chanock, Stephen, Chen, Sai, Connolly, Charles, Curtis, Keith, Figueiredo, Jane, Gala, Manish, Gallinger, Steven, Harrison, Tabitha, Hoffmeister, Michael, Hopper, John, Huyghe, Jeroen R., Jenkins, Mark, Joshi, Amit, Le Marchand, Loic, Newcomb, Polly, Nickerson, Deborah, Potter, John, Schoen, Robert, Slattery, Martha, White, Emily, Zanke, Brent, Peters, Ulrike, Hsu, Li
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Genome-wide association studies (GWASs) have successfully identified thousands of genetic variants for many complex diseases; however, these variants explain only a small fraction of the heritability. Recently, genetic association studies that leverage external transcriptome data have received much attention and shown promise for discovering novel variants. One such approach, PrediXcan, is to use predicted gene expression through genetic regulation. However, there are limitations in this approach. The predicted gene expression may be biased, resulting from regularized regression applied to moderately sample-sized reference studies. Further, some variants can individually influence disease risk through alternative functional mechanisms besides expression. Thus, testing only the association of predicted gene expression as proposed in PrediXcan will potentially lose power. To tackle these challenges, we consider a unified mixed effects model that formulates the association of intermediate phenotypes such as imputed gene expression through fixed effects, while allowing residual effects of individual variants to be random. We consider a set-based score testing framework, MiST (mixed effects score test), and propose two data-driven combination approaches to jointly test for the fixed and random effects. We establish the asymptotic distributions, which enable rapid calculation of p values for genome-wide analyses, and provide p values for fixed and random effects separately to enhance interpretability over GWASs. Extensive simulations demonstrate that our approaches are more powerful than existing ones. We apply our approach to a large-scale GWAS of colorectal cancer and identify two genes, POU5F1B and ATF1, which would have otherwise been missed by PrediXcan, after adjusting for all known loci.
ISSN:0002-9297
1537-6605
DOI:10.1016/j.ajhg.2018.03.019