Using genetic prediction from known complex disease Loci to guide the design of next-generation sequencing experiments

A central focus of complex disease genetics after genome-wide association studies (GWAS) is to identify low frequency and rare risk variants, which may account for an important fraction of disease heritability unexplained by GWAS. A profusion of studies using next-generation sequencing are seeking s...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	PloS one 2013-10, Vol.8 (10), p.e76328-e76328
Hauptverfasser:	Jostins, Luke, Levine, Adam P, Barrett, Jeffrey C
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Alleles Computer simulation Design Diabetes Epidemiology Gene Frequency Gene sequencing Genetic aspects Genetic Predisposition to Disease Genetics Genome-wide association studies Genome-Wide Association Study Genomes Genomics Genotype Health risk assessment Health risks Heritability High-Throughput Nucleotide Sequencing Humans Inflammatory bowel disease Loci Medical research Models, Genetic Penetrance Phenotype Population genetics Quantitative Trait Loci Risk Risk analysis Risk factors Studies Type 2 diabetes Voice recognition
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	A central focus of complex disease genetics after genome-wide association studies (GWAS) is to identify low frequency and rare risk variants, which may account for an important fraction of disease heritability unexplained by GWAS. A profusion of studies using next-generation sequencing are seeking such risk alleles. We describe how already-known complex trait loci (largely from GWAS) can be used to guide the design of these new studies by selecting cases, controls, or families who are most likely to harbor undiscovered risk alleles. We show that genetic risk prediction can select unrelated cases from large cohorts who are enriched for unknown risk factors, or multiply-affected families that are more likely to harbor high-penetrance risk alleles. We derive the frequency of an undiscovered risk allele in selected cases and controls, and show how this relates to the variance explained by the risk score, the disease prevalence and the population frequency of the risk allele. We also describe a new method for informing the design of sequencing studies using genetic risk prediction in large partially-genotyped families using an extension of the Inside-Outside algorithm for inference on trees. We explore several study design scenarios using both simulated and real data, and show that in many cases genetic risk prediction can provide significant increases in power to detect low-frequency and rare risk alleles. The same approach can also be used to aid discovery of non-genetic risk factors, suggesting possible future utility of genetic risk prediction in conventional epidemiology. Software implementing the methods in this paper is available in the R package Mangrove.
ISSN:	1932-6203 1932-6203
DOI:	10.1371/journal.pone.0076328