Simple estimates of haplotype relative risks in case-control data

Methods of varying complexity have been proposed to efficiently estimate haplotype relative risks in case‐control data. Our goal was to compare methods that estimate associations between disease conditions and common haplotypes in large case‐control studies such that haplotype imputation is done onc...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Genetic epidemiology 2006-09, Vol.30 (6), p.485-494
Hauptverfasser:	French, Benjamin, Lumley, Thomas, Monks, Stephanie A., Rice, Kenneth M., Hindorff, Lucia A., Reiner, Alexander P., Psaty, Bruce M.
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Biological and medical sciences Case-Control Studies Classical genetics, quantitative genetics, hybrids Computer Simulation Fundamental and applied biological sciences. Psychology gene-environment interaction Genetic Predisposition to Disease Genetics of eukaryotes. Biological and molecular evolution Genotype Haplotypes - genetics Human Humans imputation Incidence Likelihood Functions linkage disequilibrium Models, Genetic phase ambiguity Polymorphism, Single Nucleotide - genetics Renin-Angiotensin System - genetics Risk Factors unphased genotypes weighted logistic regression
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Methods of varying complexity have been proposed to efficiently estimate haplotype relative risks in case‐control data. Our goal was to compare methods that estimate associations between disease conditions and common haplotypes in large case‐control studies such that haplotype imputation is done once as a simple data‐processing step. We performed a simulation study based on haplotype frequencies for two renin‐angiotensin system genes. The iterative and noniterative methods we compared involved fitting a weighted logistic regression, but differed in how the probability weights were specified. We also quantified the amount of ambiguity in the simulated genes. For one gene, there was essentially no uncertainty in the imputed diplotypes and every method performed well. For the other, ∼60% of individuals had an unambiguous diplotype, and ∼90% had a highest posterior probability greater than 0.75. For this gene, all methods performed well under no genetic effects, moderate effects, and strong effects tagged by a single nucleotide polymorphism (SNP). Noniterative methods produced biased estimates under strong effects not tagged by an SNP. For the most likely diplotype, median bias of the log‐relative risks ranged between −0.49 and 0.22 over all haplotypes. For all possible diplotypes, median bias ranged between −0.73 and 0.08. Results were similar under interaction with a binary covariate. Noniterative weighted logistic regression provides valid tests for genetic associations and reliable estimates of modest effects of common haplotypes, and can be implemented in standard software. The potential for phase ambiguity does not necessarily imply uncertainty in imputed diplotypes, especially in large studies of common haplotypes. Genet. Epidemiol. 2006. © 2006 Wiley‐Liss, Inc.
ISSN:	0741-0395 1098-2272
DOI:	10.1002/gepi.20161