Bayesian method to predict individual SNP genotypes from gene expression data

Eric Schadt and colleagues report a Bayesian method to predict individual SNP genotypes based on RNA expression data. Using simulations and empirical data sets, they show that it is possible to infer a genotypic barcode specific to an individual, although the identification of an individual as a par...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Nature genetics 2012-05, Vol.44 (5), p.603-608
Hauptverfasser: Schadt, Eric E, Woo, Sangsoon, Hao, Ke
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Eric Schadt and colleagues report a Bayesian method to predict individual SNP genotypes based on RNA expression data. Using simulations and empirical data sets, they show that it is possible to infer a genotypic barcode specific to an individual, although the identification of an individual as a participant in a study is limited by factors such as the availability of large-scale expression quantitative trait loci (eQTLs) and expression data sets. RNA profiling can be used to capture the expression patterns of many genes that are associated with expression quantitative trait loci (eQTLs). Employing published putative cis eQTLs, we developed a Bayesian approach to predict SNP genotypes that is based only on RNA expression data. We show that predicted genotypes can accurately and uniquely identify individuals in large populations. When inferring genotypes from an expression data set using eQTLs of the same tissue type (but from an independent cohort), we were able to resolve 99% of the identities of individuals in the cohort at P adjusted ≤ 1 × 10 −5 . When eQTLs derived from one tissue were used to predict genotypes using expression data from a different tissue, the identities of 90% of the study subjects could be resolved at P adjusted ≤ 1 × 10 −5 . We discuss the implications of deriving genotypic information from RNA data deposited in the public domain.
ISSN:1061-4036
1546-1718
DOI:10.1038/ng.2248