Sources of gene expression variation in a globally diverse human cohort
Genetic variation that influences gene expression and splicing is a key source of phenotypic diversity 1 – 5 . Although invaluable, studies investigating these links in humans have been strongly biased towards participants of European ancestries, which constrains generalizability and hinders evoluti...
Gespeichert in:
Veröffentlicht in: | Nature (London) 2024-08, Vol.632 (8023), p.122-130 |
---|---|
Hauptverfasser: | , , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Genetic variation that influences gene expression and splicing is a key source of phenotypic diversity
1
–
5
. Although invaluable, studies investigating these links in humans have been strongly biased towards participants of European ancestries, which constrains generalizability and hinders evolutionary research. Here to address these limitations, we developed MAGE, an open-access RNA sequencing dataset of lymphoblastoid cell lines from 731 individuals from the 1000 Genomes Project
6
, spread across 5 continental groups and 26 populations. Most variation in gene expression (92%) and splicing (95%) was distributed within versus between populations, which mirrored the variation in DNA sequence. We mapped associations between genetic variants and expression and splicing of nearby genes (
cis
-expression quantitative trait loci (eQTLs) and
cis
-splicing QTLs (sQTLs), respectively). We identified more than 15,000 putatively causal eQTLs and more than 16,000 putatively causal sQTLs that are enriched for relevant epigenomic signatures. These include 1,310 eQTLs and 1,657 sQTLs that are largely private to underrepresented populations. Our data further indicate that the magnitude and direction of causal eQTL effects are highly consistent across populations. Moreover, the apparent ‘population-specific’ effects observed in previous studies were largely driven by low resolution or additional independent eQTLs of the same genes that were not detected. Together, our study expands our understanding of human gene expression diversity and provides an inclusive resource for studying the evolution and function of human genomes.
A new open-access RNA sequencing dataset, MAGE, of 731 individuals across geographically diverse human populations provides a valuable resource to study genetic diversity and evolution and expands the capacity to identify new genetic associations. |
---|---|
ISSN: | 0028-0836 1476-4687 1476-4687 |
DOI: | 10.1038/s41586-024-07708-2 |