Rare coding variant analysis for human diseases across biobanks and ancestries
Large-scale sequencing has enabled unparalleled opportunities to investigate the role of rare coding variation in human phenotypic variability. Here, we present a pan-ancestry analysis of sequencing data from three large biobanks, including the All of Us research program. Using mixed-effects models,...
Gespeichert in:
Veröffentlicht in: | Nature genetics 2024-09, Vol.56 (9), p.1811-1820 |
---|---|
Hauptverfasser: | , , , , , , , , , , , , , , , , , , , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Large-scale sequencing has enabled unparalleled opportunities to investigate the role of rare coding variation in human phenotypic variability. Here, we present a pan-ancestry analysis of sequencing data from three large biobanks, including the All of Us research program. Using mixed-effects models, we performed gene-based rare variant testing for 601 diseases across 748,879 individuals, including 155,236 with ancestry dissimilar to European. We identified 363 significant associations, which highlighted core genes for the human disease phenome and identified potential novel associations, including
UBR3
for cardiometabolic disease and
YLPM1
for psychiatric disease. Pan-ancestry burden testing represented an inclusive and useful approach for discovery in diverse datasets, although we also highlight the importance of ancestry-specific sensitivity analyses in this setting. Finally, we found that effect sizes for rare protein-disrupting variants were concordant between samples similar to European ancestry and other genetic ancestries (
β
Deming
= 0.7–1.0). Our results have implications for multi-ancestry and cross-biobank approaches in sequencing association studies for human disease.
Gene-based rare variant analyses for 601 diseases across 748,879 individuals from three biobanks identify 363 significant associations and highlight important considerations for multi-ancestry and cross-biobank sequencing studies. |
---|---|
ISSN: | 1061-4036 1546-1718 1546-1718 |
DOI: | 10.1038/s41588-024-01894-5 |