Genetic architecture of complex traits and disease risk predictors

Genomic prediction of complex human traits (e.g., height, cognitive ability, bone density) and disease risks (e.g., breast cancer, diabetes, heart disease, atrial fibrillation) has advanced considerably in recent years. Using data from the UK Biobank, predictors have been constructed using penalized...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Scientific reports 2020-07, Vol.10 (1), p.12055-12055, Article 12055
Hauptverfasser: Yong, Soke Yuen, Raben, Timothy G., Lello, Louis, Hsu, Stephen D. H.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 12055
container_issue 1
container_start_page 12055
container_title Scientific reports
container_volume 10
creator Yong, Soke Yuen
Raben, Timothy G.
Lello, Louis
Hsu, Stephen D. H.
description Genomic prediction of complex human traits (e.g., height, cognitive ability, bone density) and disease risks (e.g., breast cancer, diabetes, heart disease, atrial fibrillation) has advanced considerably in recent years. Using data from the UK Biobank, predictors have been constructed using penalized algorithms that favor sparsity: i.e., which use as few genetic variants as possible. We analyze the specific genetic variants (SNPs) utilized in these predictors, which can vary from dozens to as many as thirty thousand. We find that the fraction of SNPs in or near genic regions varies widely by phenotype. For the majority of disease conditions studied, a large amount of the variance is accounted for by SNPs outside of coding regions. The state of these SNPs cannot be determined from exome-sequencing data. This suggests that exome data alone will miss much of the heritability for these traits—i.e., existing PRS cannot be computed from exome data alone. We also study the fraction of SNPs and of variance that is in common between pairs of predictors. The DNA regions used in disease risk predictors so far constructed seem to be largely disjoint (with a few interesting exceptions), suggesting that individual genetic disease risks are largely uncorrelated. It seems possible in theory for an individual to be a low-risk outlier in all conditions simultaneously.
doi_str_mv 10.1038/s41598-020-68881-8
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_7374622</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2426177589</sourcerecordid><originalsourceid>FETCH-LOGICAL-c522t-c61b2750cdbe60ad74107a7376f8a9aca65b89e8a7302086f20d6fb431b943fd3</originalsourceid><addsrcrecordid>eNp9kUtP3TAQha2KqiDKH2BRReqGTVp7_N4gFVRoJaRuytpynAkYcuOLnaD239dwKa8F3tjyfHNmjg4h-4x-YZSbr0UwaU1LgbbKGMNa847sABWyBQ6w9ey9TfZKuaL1SLCC2Q9km4OyQmrYIUenOOEcQ-NzuIwzhnnJ2KShCWm1HvFPM2cf59L4qW_6WNAXbHIs1806Yx_DnHL5SN4Pfiy493DvkvOT77-Pf7Rnv05_Hn87a4MEmNugWAda0tB3qKjvtWBUe821Goy3PnglO2PR1K9qyqgBaK-GTnDWWcGHnu-Sw43ueulW2Aec6m6jW-e48vmvSz66l5UpXrqLdOvqDKEAqsDBg0BONwuW2a1iCTiOfsK0FAcCFNNaGlvRz6_Qq7Tkqdq7o6RmlhtZKdhQIadSMg6PyzDq7lJym5RcNeTuU3KmNn16buOx5X8mFeAboNTSdIH5afYbsv8ADuadhA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2425719385</pqid></control><display><type>article</type><title>Genetic architecture of complex traits and disease risk predictors</title><source>MEDLINE</source><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>Nature Free</source><source>PubMed Central</source><source>Springer Nature OA/Free Journals</source><source>Free Full-Text Journals in Chemistry</source><creator>Yong, Soke Yuen ; Raben, Timothy G. ; Lello, Louis ; Hsu, Stephen D. H.</creator><creatorcontrib>Yong, Soke Yuen ; Raben, Timothy G. ; Lello, Louis ; Hsu, Stephen D. H.</creatorcontrib><description>Genomic prediction of complex human traits (e.g., height, cognitive ability, bone density) and disease risks (e.g., breast cancer, diabetes, heart disease, atrial fibrillation) has advanced considerably in recent years. Using data from the UK Biobank, predictors have been constructed using penalized algorithms that favor sparsity: i.e., which use as few genetic variants as possible. We analyze the specific genetic variants (SNPs) utilized in these predictors, which can vary from dozens to as many as thirty thousand. We find that the fraction of SNPs in or near genic regions varies widely by phenotype. For the majority of disease conditions studied, a large amount of the variance is accounted for by SNPs outside of coding regions. The state of these SNPs cannot be determined from exome-sequencing data. This suggests that exome data alone will miss much of the heritability for these traits—i.e., existing PRS cannot be computed from exome data alone. We also study the fraction of SNPs and of variance that is in common between pairs of predictors. The DNA regions used in disease risk predictors so far constructed seem to be largely disjoint (with a few interesting exceptions), suggesting that individual genetic disease risks are largely uncorrelated. It seems possible in theory for an individual to be a low-risk outlier in all conditions simultaneously.</description><identifier>ISSN: 2045-2322</identifier><identifier>EISSN: 2045-2322</identifier><identifier>DOI: 10.1038/s41598-020-68881-8</identifier><identifier>PMID: 32694572</identifier><language>eng</language><publisher>London: Nature Publishing Group UK</publisher><subject>631/114 ; 631/208/212 ; 692/308 ; Adults ; Algorithms ; Biobanks ; Blood pressure ; Body height ; Bone density ; Breast cancer ; Cardiac arrhythmia ; Cardiovascular disease ; Cardiovascular diseases ; Cluster Analysis ; Cognitive ability ; Coronary artery disease ; Coronary vessels ; Datasets ; Diabetes ; Diabetes mellitus ; Fibrillation ; Genetic Association Studies ; Genetic diversity ; Genetic Predisposition to Disease ; Genetic variance ; Genomes ; Health risk assessment ; Health risks ; Heart diseases ; Heritability ; Humanities and Social Sciences ; Humans ; Mammography ; Models, Genetic ; multidisciplinary ; Multifactorial Inheritance ; Phenotypes ; Polymorphism, Single Nucleotide ; Quantitative Trait, Heritable ; Science ; Science (multidisciplinary) ; Single-nucleotide polymorphism ; Whole Exome Sequencing ; Womens health</subject><ispartof>Scientific reports, 2020-07, Vol.10 (1), p.12055-12055, Article 12055</ispartof><rights>The Author(s) 2020</rights><rights>The Author(s) 2020. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c522t-c61b2750cdbe60ad74107a7376f8a9aca65b89e8a7302086f20d6fb431b943fd3</citedby><cites>FETCH-LOGICAL-c522t-c61b2750cdbe60ad74107a7376f8a9aca65b89e8a7302086f20d6fb431b943fd3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC7374622/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC7374622/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,864,885,27924,27925,41120,42189,51576,53791,53793</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/32694572$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Yong, Soke Yuen</creatorcontrib><creatorcontrib>Raben, Timothy G.</creatorcontrib><creatorcontrib>Lello, Louis</creatorcontrib><creatorcontrib>Hsu, Stephen D. H.</creatorcontrib><title>Genetic architecture of complex traits and disease risk predictors</title><title>Scientific reports</title><addtitle>Sci Rep</addtitle><addtitle>Sci Rep</addtitle><description>Genomic prediction of complex human traits (e.g., height, cognitive ability, bone density) and disease risks (e.g., breast cancer, diabetes, heart disease, atrial fibrillation) has advanced considerably in recent years. Using data from the UK Biobank, predictors have been constructed using penalized algorithms that favor sparsity: i.e., which use as few genetic variants as possible. We analyze the specific genetic variants (SNPs) utilized in these predictors, which can vary from dozens to as many as thirty thousand. We find that the fraction of SNPs in or near genic regions varies widely by phenotype. For the majority of disease conditions studied, a large amount of the variance is accounted for by SNPs outside of coding regions. The state of these SNPs cannot be determined from exome-sequencing data. This suggests that exome data alone will miss much of the heritability for these traits—i.e., existing PRS cannot be computed from exome data alone. We also study the fraction of SNPs and of variance that is in common between pairs of predictors. The DNA regions used in disease risk predictors so far constructed seem to be largely disjoint (with a few interesting exceptions), suggesting that individual genetic disease risks are largely uncorrelated. It seems possible in theory for an individual to be a low-risk outlier in all conditions simultaneously.</description><subject>631/114</subject><subject>631/208/212</subject><subject>692/308</subject><subject>Adults</subject><subject>Algorithms</subject><subject>Biobanks</subject><subject>Blood pressure</subject><subject>Body height</subject><subject>Bone density</subject><subject>Breast cancer</subject><subject>Cardiac arrhythmia</subject><subject>Cardiovascular disease</subject><subject>Cardiovascular diseases</subject><subject>Cluster Analysis</subject><subject>Cognitive ability</subject><subject>Coronary artery disease</subject><subject>Coronary vessels</subject><subject>Datasets</subject><subject>Diabetes</subject><subject>Diabetes mellitus</subject><subject>Fibrillation</subject><subject>Genetic Association Studies</subject><subject>Genetic diversity</subject><subject>Genetic Predisposition to Disease</subject><subject>Genetic variance</subject><subject>Genomes</subject><subject>Health risk assessment</subject><subject>Health risks</subject><subject>Heart diseases</subject><subject>Heritability</subject><subject>Humanities and Social Sciences</subject><subject>Humans</subject><subject>Mammography</subject><subject>Models, Genetic</subject><subject>multidisciplinary</subject><subject>Multifactorial Inheritance</subject><subject>Phenotypes</subject><subject>Polymorphism, Single Nucleotide</subject><subject>Quantitative Trait, Heritable</subject><subject>Science</subject><subject>Science (multidisciplinary)</subject><subject>Single-nucleotide polymorphism</subject><subject>Whole Exome Sequencing</subject><subject>Womens health</subject><issn>2045-2322</issn><issn>2045-2322</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>C6C</sourceid><sourceid>EIF</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNp9kUtP3TAQha2KqiDKH2BRReqGTVp7_N4gFVRoJaRuytpynAkYcuOLnaD239dwKa8F3tjyfHNmjg4h-4x-YZSbr0UwaU1LgbbKGMNa847sABWyBQ6w9ey9TfZKuaL1SLCC2Q9km4OyQmrYIUenOOEcQ-NzuIwzhnnJ2KShCWm1HvFPM2cf59L4qW_6WNAXbHIs1806Yx_DnHL5SN4Pfiy493DvkvOT77-Pf7Rnv05_Hn87a4MEmNugWAda0tB3qKjvtWBUe821Goy3PnglO2PR1K9qyqgBaK-GTnDWWcGHnu-Sw43ueulW2Aec6m6jW-e48vmvSz66l5UpXrqLdOvqDKEAqsDBg0BONwuW2a1iCTiOfsK0FAcCFNNaGlvRz6_Qq7Tkqdq7o6RmlhtZKdhQIadSMg6PyzDq7lJym5RcNeTuU3KmNn16buOx5X8mFeAboNTSdIH5afYbsv8ADuadhA</recordid><startdate>20200721</startdate><enddate>20200721</enddate><creator>Yong, Soke Yuen</creator><creator>Raben, Timothy G.</creator><creator>Lello, Louis</creator><creator>Hsu, Stephen D. H.</creator><general>Nature Publishing Group UK</general><general>Nature Publishing Group</general><scope>C6C</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7X7</scope><scope>7XB</scope><scope>88A</scope><scope>88E</scope><scope>88I</scope><scope>8FE</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AEUYN</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>LK8</scope><scope>M0S</scope><scope>M1P</scope><scope>M2P</scope><scope>M7P</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20200721</creationdate><title>Genetic architecture of complex traits and disease risk predictors</title><author>Yong, Soke Yuen ; Raben, Timothy G. ; Lello, Louis ; Hsu, Stephen D. H.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c522t-c61b2750cdbe60ad74107a7376f8a9aca65b89e8a7302086f20d6fb431b943fd3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>631/114</topic><topic>631/208/212</topic><topic>692/308</topic><topic>Adults</topic><topic>Algorithms</topic><topic>Biobanks</topic><topic>Blood pressure</topic><topic>Body height</topic><topic>Bone density</topic><topic>Breast cancer</topic><topic>Cardiac arrhythmia</topic><topic>Cardiovascular disease</topic><topic>Cardiovascular diseases</topic><topic>Cluster Analysis</topic><topic>Cognitive ability</topic><topic>Coronary artery disease</topic><topic>Coronary vessels</topic><topic>Datasets</topic><topic>Diabetes</topic><topic>Diabetes mellitus</topic><topic>Fibrillation</topic><topic>Genetic Association Studies</topic><topic>Genetic diversity</topic><topic>Genetic Predisposition to Disease</topic><topic>Genetic variance</topic><topic>Genomes</topic><topic>Health risk assessment</topic><topic>Health risks</topic><topic>Heart diseases</topic><topic>Heritability</topic><topic>Humanities and Social Sciences</topic><topic>Humans</topic><topic>Mammography</topic><topic>Models, Genetic</topic><topic>multidisciplinary</topic><topic>Multifactorial Inheritance</topic><topic>Phenotypes</topic><topic>Polymorphism, Single Nucleotide</topic><topic>Quantitative Trait, Heritable</topic><topic>Science</topic><topic>Science (multidisciplinary)</topic><topic>Single-nucleotide polymorphism</topic><topic>Whole Exome Sequencing</topic><topic>Womens health</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Yong, Soke Yuen</creatorcontrib><creatorcontrib>Raben, Timothy G.</creatorcontrib><creatorcontrib>Lello, Louis</creatorcontrib><creatorcontrib>Hsu, Stephen D. H.</creatorcontrib><collection>Springer Nature OA/Free Journals</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Health &amp; Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Biology Database (Alumni Edition)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Science Database (Alumni Edition)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest One Sustainability</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>ProQuest Biological Science Collection</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Science Database</collection><collection>Biological Science Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Scientific reports</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Yong, Soke Yuen</au><au>Raben, Timothy G.</au><au>Lello, Louis</au><au>Hsu, Stephen D. H.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Genetic architecture of complex traits and disease risk predictors</atitle><jtitle>Scientific reports</jtitle><stitle>Sci Rep</stitle><addtitle>Sci Rep</addtitle><date>2020-07-21</date><risdate>2020</risdate><volume>10</volume><issue>1</issue><spage>12055</spage><epage>12055</epage><pages>12055-12055</pages><artnum>12055</artnum><issn>2045-2322</issn><eissn>2045-2322</eissn><abstract>Genomic prediction of complex human traits (e.g., height, cognitive ability, bone density) and disease risks (e.g., breast cancer, diabetes, heart disease, atrial fibrillation) has advanced considerably in recent years. Using data from the UK Biobank, predictors have been constructed using penalized algorithms that favor sparsity: i.e., which use as few genetic variants as possible. We analyze the specific genetic variants (SNPs) utilized in these predictors, which can vary from dozens to as many as thirty thousand. We find that the fraction of SNPs in or near genic regions varies widely by phenotype. For the majority of disease conditions studied, a large amount of the variance is accounted for by SNPs outside of coding regions. The state of these SNPs cannot be determined from exome-sequencing data. This suggests that exome data alone will miss much of the heritability for these traits—i.e., existing PRS cannot be computed from exome data alone. We also study the fraction of SNPs and of variance that is in common between pairs of predictors. The DNA regions used in disease risk predictors so far constructed seem to be largely disjoint (with a few interesting exceptions), suggesting that individual genetic disease risks are largely uncorrelated. It seems possible in theory for an individual to be a low-risk outlier in all conditions simultaneously.</abstract><cop>London</cop><pub>Nature Publishing Group UK</pub><pmid>32694572</pmid><doi>10.1038/s41598-020-68881-8</doi><tpages>1</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 2045-2322
ispartof Scientific reports, 2020-07, Vol.10 (1), p.12055-12055, Article 12055
issn 2045-2322
2045-2322
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_7374622
source MEDLINE; DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; Nature Free; PubMed Central; Springer Nature OA/Free Journals; Free Full-Text Journals in Chemistry
subjects 631/114
631/208/212
692/308
Adults
Algorithms
Biobanks
Blood pressure
Body height
Bone density
Breast cancer
Cardiac arrhythmia
Cardiovascular disease
Cardiovascular diseases
Cluster Analysis
Cognitive ability
Coronary artery disease
Coronary vessels
Datasets
Diabetes
Diabetes mellitus
Fibrillation
Genetic Association Studies
Genetic diversity
Genetic Predisposition to Disease
Genetic variance
Genomes
Health risk assessment
Health risks
Heart diseases
Heritability
Humanities and Social Sciences
Humans
Mammography
Models, Genetic
multidisciplinary
Multifactorial Inheritance
Phenotypes
Polymorphism, Single Nucleotide
Quantitative Trait, Heritable
Science
Science (multidisciplinary)
Single-nucleotide polymorphism
Whole Exome Sequencing
Womens health
title Genetic architecture of complex traits and disease risk predictors
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T04%3A09%3A55IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Genetic%20architecture%20of%20complex%20traits%20and%20disease%20risk%20predictors&rft.jtitle=Scientific%20reports&rft.au=Yong,%20Soke%20Yuen&rft.date=2020-07-21&rft.volume=10&rft.issue=1&rft.spage=12055&rft.epage=12055&rft.pages=12055-12055&rft.artnum=12055&rft.issn=2045-2322&rft.eissn=2045-2322&rft_id=info:doi/10.1038/s41598-020-68881-8&rft_dat=%3Cproquest_pubme%3E2426177589%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2425719385&rft_id=info:pmid/32694572&rfr_iscdi=true