Genetic architecture of complex traits and disease risk predictors
Genomic prediction of complex human traits (e.g., height, cognitive ability, bone density) and disease risks (e.g., breast cancer, diabetes, heart disease, atrial fibrillation) has advanced considerably in recent years. Using data from the UK Biobank, predictors have been constructed using penalized...
Gespeichert in:
Veröffentlicht in: | Scientific reports 2020-07, Vol.10 (1), p.12055-12055, Article 12055 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 12055 |
---|---|
container_issue | 1 |
container_start_page | 12055 |
container_title | Scientific reports |
container_volume | 10 |
creator | Yong, Soke Yuen Raben, Timothy G. Lello, Louis Hsu, Stephen D. H. |
description | Genomic prediction of complex human traits (e.g., height, cognitive ability, bone density) and disease risks (e.g., breast cancer, diabetes, heart disease, atrial fibrillation) has advanced considerably in recent years. Using data from the UK Biobank, predictors have been constructed using penalized algorithms that favor sparsity: i.e., which use as few genetic variants as possible. We analyze the specific genetic variants (SNPs) utilized in these predictors, which can vary from dozens to as many as thirty thousand. We find that the fraction of SNPs in or near genic regions varies widely by phenotype. For the majority of disease conditions studied,
a large amount
of the variance is accounted for by SNPs outside of coding regions. The state of these SNPs cannot be determined from exome-sequencing data. This suggests that exome data alone will miss much of the heritability for these traits—i.e., existing PRS cannot be computed from exome data alone. We also study the fraction of SNPs and of variance that is in common between pairs of predictors. The DNA regions used in disease risk predictors so far constructed seem to be largely disjoint (with a few interesting exceptions), suggesting that individual genetic disease risks are largely uncorrelated. It seems possible in theory for an individual to be a low-risk outlier in all conditions simultaneously. |
doi_str_mv | 10.1038/s41598-020-68881-8 |
format | Article |
fullrecord | <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_7374622</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2426177589</sourcerecordid><originalsourceid>FETCH-LOGICAL-c522t-c61b2750cdbe60ad74107a7376f8a9aca65b89e8a7302086f20d6fb431b943fd3</originalsourceid><addsrcrecordid>eNp9kUtP3TAQha2KqiDKH2BRReqGTVp7_N4gFVRoJaRuytpynAkYcuOLnaD239dwKa8F3tjyfHNmjg4h-4x-YZSbr0UwaU1LgbbKGMNa847sABWyBQ6w9ey9TfZKuaL1SLCC2Q9km4OyQmrYIUenOOEcQ-NzuIwzhnnJ2KShCWm1HvFPM2cf59L4qW_6WNAXbHIs1806Yx_DnHL5SN4Pfiy493DvkvOT77-Pf7Rnv05_Hn87a4MEmNugWAda0tB3qKjvtWBUe821Goy3PnglO2PR1K9qyqgBaK-GTnDWWcGHnu-Sw43ueulW2Aec6m6jW-e48vmvSz66l5UpXrqLdOvqDKEAqsDBg0BONwuW2a1iCTiOfsK0FAcCFNNaGlvRz6_Qq7Tkqdq7o6RmlhtZKdhQIadSMg6PyzDq7lJym5RcNeTuU3KmNn16buOx5X8mFeAboNTSdIH5afYbsv8ADuadhA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2425719385</pqid></control><display><type>article</type><title>Genetic architecture of complex traits and disease risk predictors</title><source>MEDLINE</source><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>Nature Free</source><source>PubMed Central</source><source>Springer Nature OA/Free Journals</source><source>Free Full-Text Journals in Chemistry</source><creator>Yong, Soke Yuen ; Raben, Timothy G. ; Lello, Louis ; Hsu, Stephen D. H.</creator><creatorcontrib>Yong, Soke Yuen ; Raben, Timothy G. ; Lello, Louis ; Hsu, Stephen D. H.</creatorcontrib><description>Genomic prediction of complex human traits (e.g., height, cognitive ability, bone density) and disease risks (e.g., breast cancer, diabetes, heart disease, atrial fibrillation) has advanced considerably in recent years. Using data from the UK Biobank, predictors have been constructed using penalized algorithms that favor sparsity: i.e., which use as few genetic variants as possible. We analyze the specific genetic variants (SNPs) utilized in these predictors, which can vary from dozens to as many as thirty thousand. We find that the fraction of SNPs in or near genic regions varies widely by phenotype. For the majority of disease conditions studied,
a large amount
of the variance is accounted for by SNPs outside of coding regions. The state of these SNPs cannot be determined from exome-sequencing data. This suggests that exome data alone will miss much of the heritability for these traits—i.e., existing PRS cannot be computed from exome data alone. We also study the fraction of SNPs and of variance that is in common between pairs of predictors. The DNA regions used in disease risk predictors so far constructed seem to be largely disjoint (with a few interesting exceptions), suggesting that individual genetic disease risks are largely uncorrelated. It seems possible in theory for an individual to be a low-risk outlier in all conditions simultaneously.</description><identifier>ISSN: 2045-2322</identifier><identifier>EISSN: 2045-2322</identifier><identifier>DOI: 10.1038/s41598-020-68881-8</identifier><identifier>PMID: 32694572</identifier><language>eng</language><publisher>London: Nature Publishing Group UK</publisher><subject>631/114 ; 631/208/212 ; 692/308 ; Adults ; Algorithms ; Biobanks ; Blood pressure ; Body height ; Bone density ; Breast cancer ; Cardiac arrhythmia ; Cardiovascular disease ; Cardiovascular diseases ; Cluster Analysis ; Cognitive ability ; Coronary artery disease ; Coronary vessels ; Datasets ; Diabetes ; Diabetes mellitus ; Fibrillation ; Genetic Association Studies ; Genetic diversity ; Genetic Predisposition to Disease ; Genetic variance ; Genomes ; Health risk assessment ; Health risks ; Heart diseases ; Heritability ; Humanities and Social Sciences ; Humans ; Mammography ; Models, Genetic ; multidisciplinary ; Multifactorial Inheritance ; Phenotypes ; Polymorphism, Single Nucleotide ; Quantitative Trait, Heritable ; Science ; Science (multidisciplinary) ; Single-nucleotide polymorphism ; Whole Exome Sequencing ; Womens health</subject><ispartof>Scientific reports, 2020-07, Vol.10 (1), p.12055-12055, Article 12055</ispartof><rights>The Author(s) 2020</rights><rights>The Author(s) 2020. This work is published under http://creativecommons.org/licenses/by/4.0/ (the “License”). Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c522t-c61b2750cdbe60ad74107a7376f8a9aca65b89e8a7302086f20d6fb431b943fd3</citedby><cites>FETCH-LOGICAL-c522t-c61b2750cdbe60ad74107a7376f8a9aca65b89e8a7302086f20d6fb431b943fd3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC7374622/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC7374622/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,727,780,784,864,885,27924,27925,41120,42189,51576,53791,53793</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/32694572$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Yong, Soke Yuen</creatorcontrib><creatorcontrib>Raben, Timothy G.</creatorcontrib><creatorcontrib>Lello, Louis</creatorcontrib><creatorcontrib>Hsu, Stephen D. H.</creatorcontrib><title>Genetic architecture of complex traits and disease risk predictors</title><title>Scientific reports</title><addtitle>Sci Rep</addtitle><addtitle>Sci Rep</addtitle><description>Genomic prediction of complex human traits (e.g., height, cognitive ability, bone density) and disease risks (e.g., breast cancer, diabetes, heart disease, atrial fibrillation) has advanced considerably in recent years. Using data from the UK Biobank, predictors have been constructed using penalized algorithms that favor sparsity: i.e., which use as few genetic variants as possible. We analyze the specific genetic variants (SNPs) utilized in these predictors, which can vary from dozens to as many as thirty thousand. We find that the fraction of SNPs in or near genic regions varies widely by phenotype. For the majority of disease conditions studied,
a large amount
of the variance is accounted for by SNPs outside of coding regions. The state of these SNPs cannot be determined from exome-sequencing data. This suggests that exome data alone will miss much of the heritability for these traits—i.e., existing PRS cannot be computed from exome data alone. We also study the fraction of SNPs and of variance that is in common between pairs of predictors. The DNA regions used in disease risk predictors so far constructed seem to be largely disjoint (with a few interesting exceptions), suggesting that individual genetic disease risks are largely uncorrelated. It seems possible in theory for an individual to be a low-risk outlier in all conditions simultaneously.</description><subject>631/114</subject><subject>631/208/212</subject><subject>692/308</subject><subject>Adults</subject><subject>Algorithms</subject><subject>Biobanks</subject><subject>Blood pressure</subject><subject>Body height</subject><subject>Bone density</subject><subject>Breast cancer</subject><subject>Cardiac arrhythmia</subject><subject>Cardiovascular disease</subject><subject>Cardiovascular diseases</subject><subject>Cluster Analysis</subject><subject>Cognitive ability</subject><subject>Coronary artery disease</subject><subject>Coronary vessels</subject><subject>Datasets</subject><subject>Diabetes</subject><subject>Diabetes mellitus</subject><subject>Fibrillation</subject><subject>Genetic Association Studies</subject><subject>Genetic diversity</subject><subject>Genetic Predisposition to Disease</subject><subject>Genetic variance</subject><subject>Genomes</subject><subject>Health risk assessment</subject><subject>Health risks</subject><subject>Heart diseases</subject><subject>Heritability</subject><subject>Humanities and Social Sciences</subject><subject>Humans</subject><subject>Mammography</subject><subject>Models, Genetic</subject><subject>multidisciplinary</subject><subject>Multifactorial Inheritance</subject><subject>Phenotypes</subject><subject>Polymorphism, Single Nucleotide</subject><subject>Quantitative Trait, Heritable</subject><subject>Science</subject><subject>Science (multidisciplinary)</subject><subject>Single-nucleotide polymorphism</subject><subject>Whole Exome Sequencing</subject><subject>Womens health</subject><issn>2045-2322</issn><issn>2045-2322</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2020</creationdate><recordtype>article</recordtype><sourceid>C6C</sourceid><sourceid>EIF</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNp9kUtP3TAQha2KqiDKH2BRReqGTVp7_N4gFVRoJaRuytpynAkYcuOLnaD239dwKa8F3tjyfHNmjg4h-4x-YZSbr0UwaU1LgbbKGMNa847sABWyBQ6w9ey9TfZKuaL1SLCC2Q9km4OyQmrYIUenOOEcQ-NzuIwzhnnJ2KShCWm1HvFPM2cf59L4qW_6WNAXbHIs1806Yx_DnHL5SN4Pfiy493DvkvOT77-Pf7Rnv05_Hn87a4MEmNugWAda0tB3qKjvtWBUe821Goy3PnglO2PR1K9qyqgBaK-GTnDWWcGHnu-Sw43ueulW2Aec6m6jW-e48vmvSz66l5UpXrqLdOvqDKEAqsDBg0BONwuW2a1iCTiOfsK0FAcCFNNaGlvRz6_Qq7Tkqdq7o6RmlhtZKdhQIadSMg6PyzDq7lJym5RcNeTuU3KmNn16buOx5X8mFeAboNTSdIH5afYbsv8ADuadhA</recordid><startdate>20200721</startdate><enddate>20200721</enddate><creator>Yong, Soke Yuen</creator><creator>Raben, Timothy G.</creator><creator>Lello, Louis</creator><creator>Hsu, Stephen D. H.</creator><general>Nature Publishing Group UK</general><general>Nature Publishing Group</general><scope>C6C</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7X7</scope><scope>7XB</scope><scope>88A</scope><scope>88E</scope><scope>88I</scope><scope>8FE</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AEUYN</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>LK8</scope><scope>M0S</scope><scope>M1P</scope><scope>M2P</scope><scope>M7P</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>Q9U</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20200721</creationdate><title>Genetic architecture of complex traits and disease risk predictors</title><author>Yong, Soke Yuen ; Raben, Timothy G. ; Lello, Louis ; Hsu, Stephen D. H.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c522t-c61b2750cdbe60ad74107a7376f8a9aca65b89e8a7302086f20d6fb431b943fd3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2020</creationdate><topic>631/114</topic><topic>631/208/212</topic><topic>692/308</topic><topic>Adults</topic><topic>Algorithms</topic><topic>Biobanks</topic><topic>Blood pressure</topic><topic>Body height</topic><topic>Bone density</topic><topic>Breast cancer</topic><topic>Cardiac arrhythmia</topic><topic>Cardiovascular disease</topic><topic>Cardiovascular diseases</topic><topic>Cluster Analysis</topic><topic>Cognitive ability</topic><topic>Coronary artery disease</topic><topic>Coronary vessels</topic><topic>Datasets</topic><topic>Diabetes</topic><topic>Diabetes mellitus</topic><topic>Fibrillation</topic><topic>Genetic Association Studies</topic><topic>Genetic diversity</topic><topic>Genetic Predisposition to Disease</topic><topic>Genetic variance</topic><topic>Genomes</topic><topic>Health risk assessment</topic><topic>Health risks</topic><topic>Heart diseases</topic><topic>Heritability</topic><topic>Humanities and Social Sciences</topic><topic>Humans</topic><topic>Mammography</topic><topic>Models, Genetic</topic><topic>multidisciplinary</topic><topic>Multifactorial Inheritance</topic><topic>Phenotypes</topic><topic>Polymorphism, Single Nucleotide</topic><topic>Quantitative Trait, Heritable</topic><topic>Science</topic><topic>Science (multidisciplinary)</topic><topic>Single-nucleotide polymorphism</topic><topic>Whole Exome Sequencing</topic><topic>Womens health</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Yong, Soke Yuen</creatorcontrib><creatorcontrib>Raben, Timothy G.</creatorcontrib><creatorcontrib>Lello, Louis</creatorcontrib><creatorcontrib>Hsu, Stephen D. H.</creatorcontrib><collection>Springer Nature OA/Free Journals</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Health & Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Biology Database (Alumni Edition)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Science Database (Alumni Edition)</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest One Sustainability</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Natural Science Collection</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>ProQuest Biological Science Collection</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Science Database</collection><collection>Biological Science Database</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central Basic</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Scientific reports</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Yong, Soke Yuen</au><au>Raben, Timothy G.</au><au>Lello, Louis</au><au>Hsu, Stephen D. H.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Genetic architecture of complex traits and disease risk predictors</atitle><jtitle>Scientific reports</jtitle><stitle>Sci Rep</stitle><addtitle>Sci Rep</addtitle><date>2020-07-21</date><risdate>2020</risdate><volume>10</volume><issue>1</issue><spage>12055</spage><epage>12055</epage><pages>12055-12055</pages><artnum>12055</artnum><issn>2045-2322</issn><eissn>2045-2322</eissn><abstract>Genomic prediction of complex human traits (e.g., height, cognitive ability, bone density) and disease risks (e.g., breast cancer, diabetes, heart disease, atrial fibrillation) has advanced considerably in recent years. Using data from the UK Biobank, predictors have been constructed using penalized algorithms that favor sparsity: i.e., which use as few genetic variants as possible. We analyze the specific genetic variants (SNPs) utilized in these predictors, which can vary from dozens to as many as thirty thousand. We find that the fraction of SNPs in or near genic regions varies widely by phenotype. For the majority of disease conditions studied,
a large amount
of the variance is accounted for by SNPs outside of coding regions. The state of these SNPs cannot be determined from exome-sequencing data. This suggests that exome data alone will miss much of the heritability for these traits—i.e., existing PRS cannot be computed from exome data alone. We also study the fraction of SNPs and of variance that is in common between pairs of predictors. The DNA regions used in disease risk predictors so far constructed seem to be largely disjoint (with a few interesting exceptions), suggesting that individual genetic disease risks are largely uncorrelated. It seems possible in theory for an individual to be a low-risk outlier in all conditions simultaneously.</abstract><cop>London</cop><pub>Nature Publishing Group UK</pub><pmid>32694572</pmid><doi>10.1038/s41598-020-68881-8</doi><tpages>1</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 2045-2322 |
ispartof | Scientific reports, 2020-07, Vol.10 (1), p.12055-12055, Article 12055 |
issn | 2045-2322 2045-2322 |
language | eng |
recordid | cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_7374622 |
source | MEDLINE; DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; Nature Free; PubMed Central; Springer Nature OA/Free Journals; Free Full-Text Journals in Chemistry |
subjects | 631/114 631/208/212 692/308 Adults Algorithms Biobanks Blood pressure Body height Bone density Breast cancer Cardiac arrhythmia Cardiovascular disease Cardiovascular diseases Cluster Analysis Cognitive ability Coronary artery disease Coronary vessels Datasets Diabetes Diabetes mellitus Fibrillation Genetic Association Studies Genetic diversity Genetic Predisposition to Disease Genetic variance Genomes Health risk assessment Health risks Heart diseases Heritability Humanities and Social Sciences Humans Mammography Models, Genetic multidisciplinary Multifactorial Inheritance Phenotypes Polymorphism, Single Nucleotide Quantitative Trait, Heritable Science Science (multidisciplinary) Single-nucleotide polymorphism Whole Exome Sequencing Womens health |
title | Genetic architecture of complex traits and disease risk predictors |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-08T04%3A09%3A55IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Genetic%20architecture%20of%20complex%20traits%20and%20disease%20risk%20predictors&rft.jtitle=Scientific%20reports&rft.au=Yong,%20Soke%20Yuen&rft.date=2020-07-21&rft.volume=10&rft.issue=1&rft.spage=12055&rft.epage=12055&rft.pages=12055-12055&rft.artnum=12055&rft.issn=2045-2322&rft.eissn=2045-2322&rft_id=info:doi/10.1038/s41598-020-68881-8&rft_dat=%3Cproquest_pubme%3E2426177589%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2425719385&rft_id=info:pmid/32694572&rfr_iscdi=true |