A critical evaluation of genomic control methods for genetic association studies
Population stratification is an important potential confounder of genetic case‐control association studies. For replication studies, limited availability of samples may lead to imbalanced sampling from heterogeneous populations. Genomic control (GC) can be used to correct χ2 test statistics which ar...
Gespeichert in:
Veröffentlicht in: | Genetic epidemiology 2009-05, Vol.33 (4), p.290-298 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Population stratification is an important potential confounder of genetic case‐control association studies. For replication studies, limited availability of samples may lead to imbalanced sampling from heterogeneous populations. Genomic control (GC) can be used to correct χ2 test statistics which are presumed to be inflated by a factor λ; this may be estimated by a summary χ2 value (λmedian or λmean) from a set of unlinked markers. Many studies applying GC methods have used fewer than 50 unlinked markers and an important question is whether this can adequately correct for population stratification. We assess the behavior of GC methods in imbalanced case‐control studies using simulation. SNPs are sampled from two subpopulations with intra‐continental levels of FST (≤0.005) and sampling schemata ranging from balanced to completely imbalanced between subpopulations. The sampling properties of λmedian and λmean are explored using 6–1,600 unlinked markers to estimate Type 1 error and power empirically. GC corrections based on the χ2‐distribution (GCmedian or GCmean) can be anti‐conservative even when more than 100 single nucleotide polymorphisms (SNPs) are genotyped and realistic levels of population stratification exist. The GCF procedure performs well over a wider range of conditions, only becoming anti‐conservative at low levels of α and with fewer than 25 SNPs genotyped. A substantial loss of power can arise when population stratification is present, but this is largely independent of the number of SNPs used. A literature survey shows that most studies applying GC have used GCmedian or GCmean, rather than GCF, which is the most appropriate GC correction method. Genet. Epidemiol. 2009. © 2008 Wiley Liss, Inc. |
---|---|
ISSN: | 0741-0395 1098-2272 |
DOI: | 10.1002/gepi.20379 |