Rare variants discovery by extensive whole-genome sequencing of the Han Chinese population in Taiwan: Applications to cardiovascular medicine

[Display omitted] A population-specific genomic reference is important for research and clinical practice, yet it remains unavailable for Han Chinese (HC) in Taiwan. We report the first whole genome sequencing (WGS) database of HC (1000 Taiwanese genome (1KTW-WGS)) and demonstrate several applicatio...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of advanced research 2021-05, Vol.30, p.147-158
Hauptverfasser: Juang, Jyh-Ming Jimmy, Lu, Tzu-Pin, Su, Ming-Wei, Lin, Chien-Wei, Yang, Jenn-Hwai, Chu, Hou-Wei, Chen, Chien-Hsiun, Hsiao, Yi-Wen, Lee, Chien-Yueh, Chiang, Li-Mei, Yu, Qi-You, Hsiao, Chuhsing Kate, Chen, Ching-Yu Julius, Wu, Pei-Ei, Pai, Chien-Hua, Chuang, Eric Y., Shen, Chen-Yang
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:[Display omitted] A population-specific genomic reference is important for research and clinical practice, yet it remains unavailable for Han Chinese (HC) in Taiwan. We report the first whole genome sequencing (WGS) database of HC (1000 Taiwanese genome (1KTW-WGS)) and demonstrate several applications to cardiovascular medicine. Whole genomes of 997 HC were sequenced to at least 30X depth. A total of 20,117 relatively healthy HC individuals were genotyped using a customized Axiom GWAS array. We performed a genome-wide genotype imputation technique using IMPUTE2. We identified 26.7 million single-nucleotide variants (SNVs) and 4.2 million insertions-deletions. Of the SNVs, 16.1% were novel relative to dbSNP (build 152), and 34.2% were novel relative to gnomAD. A total of 18,450 healthy HC individuals were genotyped using a customized Genome-Wide Association Study (GWAS) array. We identified hypertension-associated variants and developed a hypertension prediction model based on the correlation between the WGS data and GWAS data (combined clinical and genetic models, AUC 0.887), and also identified 3 novel hyperlipidemia-associated variants. Each individual carried an average of 16.42 (SD = 3.72) disease-causing variants. Additionally, we established an online SCN5A (an important cardiac gene) database that can be used to explore racial differences. Finally, pharmacogenetics studies identified HC population-specific SNVs in genes (CYP2C9 and VKORC1) involved in drug metabolism and blood clotting. This research demonstrates the benefits of constructing a population-specific genomic reference database for precision medicine.
ISSN:2090-1232
2090-1224
DOI:10.1016/j.jare.2020.12.003