Statistical estimation of correlated genome associations to a quantitative trait network
Many complex disease syndromes, such as asthma, consist of a large number of highly related, rather than independent, clinical or molecular phenotypes. This raises a new technical challenge in identifying genetic variations associated simultaneously with correlated traits. In this study, we propose...
Gespeichert in:
Veröffentlicht in: | PLoS genetics 2009-08, Vol.5 (8), p.e1000587-e1000587 |
---|---|
Hauptverfasser: | , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | e1000587 |
---|---|
container_issue | 8 |
container_start_page | e1000587 |
container_title | PLoS genetics |
container_volume | 5 |
creator | Kim, Seyoung Xing, Eric P |
description | Many complex disease syndromes, such as asthma, consist of a large number of highly related, rather than independent, clinical or molecular phenotypes. This raises a new technical challenge in identifying genetic variations associated simultaneously with correlated traits. In this study, we propose a new statistical framework called graph-guided fused lasso (GFlasso) to directly and effectively incorporate the correlation structure of multiple quantitative traits such as clinical metrics and gene expressions in association analysis. Our approach represents correlation information explicitly among the quantitative traits as a quantitative trait network (QTN) and then leverages this network to encode structured regularization functions in a multivariate regression model over the genotypes and traits. The result is that the genetic markers that jointly influence subgroups of highly correlated traits can be detected jointly with high sensitivity and specificity. While most of the traditional methods examined each phenotype independently and combined the results afterwards, our approach analyzes all of the traits jointly in a single statistical framework. This allows our method to borrow information across correlated phenotypes to discover the genetic markers that perturb a subset of the correlated traits synergistically. Using simulated datasets based on the HapMap consortium and an asthma dataset, we compared the performance of our method with other methods based on single-marker analysis and regression-based methods that do not use any of the relational information in the traits. We found that our method showed an increased power in detecting causal variants affecting correlated traits. Our results showed that, when correlation patterns among traits in a QTN are considered explicitly and directly during a structured multivariate genome association analysis using our proposed methods, the power of detecting true causal SNPs with possibly pleiotropic effects increased significantly without compromising performance on non-pleiotropic SNPs. |
doi_str_mv | 10.1371/journal.pgen.1000587 |
format | Article |
fullrecord | <record><control><sourceid>gale_plos_</sourceid><recordid>TN_cdi_plos_journals_1313495030</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A207644443</galeid><doaj_id>oai_doaj_org_article_1c472cc7a25c4097891cc48d39c081ac</doaj_id><sourcerecordid>A207644443</sourcerecordid><originalsourceid>FETCH-LOGICAL-c765t-b36ebe7ae6c0f7cb676fe81adf8a425d3606a1576884dc619bb00d9e6451b8e63</originalsourceid><addsrcrecordid>eNqVk02P0zAQhiMEYpeFf4Agp5U4tNjxZy5IqxUflVasxALiZjnOpHVx467tLPDvcdsAzQlIDnacZ97xvPYUxVOM5pgI_HLth9BrN98uoZ9jhBCT4l5xihkjM0ERvX80PykexbhGiDBZi4fFCa65RIzI0-LLTdLJxmSNdiXkcZM_fV_6rjQ-BHA6QVvmFH4DpY7RG7sHYpl8qcvbQffJ7iTuoExB21T2kL758PVx8aDTLsKTcTwrPr15_fHy3ezq-u3i8uJqZgRnadYQDg0IDdygTpiGC96BxLrtpKYVawlHXGMmuJS0NRzXTYNQWwOnDDcSODkrnh90t85HNZoSFSaY0JohgjKxOBCt12u1DbnE8EN5bdV-wYel0iEb4EBhQ0VljNAVMxTVQtbYGCpbUhuUN2Wy1qsx29BsoDXQ56LdRHT6p7crtfR3qhK4RnK33fNRIPjbIRuuNjYacE734IeouGCykkz8FawwEjQ7lMH5AVzqXIHtO58Tm_y2sLHG99DZvH5RIcFpfkgOeDEJyEyC72mphxjV4ubDf7Dv_529_jxlz4_YFWiXVtG7YX-1piA9gCb4GAN0v73GSO364NeRq10fqLEPctiz43P6EzRefPITRocEvw</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>21074425</pqid></control><display><type>article</type><title>Statistical estimation of correlated genome associations to a quantitative trait network</title><source>MEDLINE</source><source>DOAJ Directory of Open Access Journals</source><source>Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals</source><source>Public Library of Science (PLoS)</source><source>PubMed Central</source><creator>Kim, Seyoung ; Xing, Eric P</creator><contributor>Storey, John D.</contributor><creatorcontrib>Kim, Seyoung ; Xing, Eric P ; Storey, John D.</creatorcontrib><description>Many complex disease syndromes, such as asthma, consist of a large number of highly related, rather than independent, clinical or molecular phenotypes. This raises a new technical challenge in identifying genetic variations associated simultaneously with correlated traits. In this study, we propose a new statistical framework called graph-guided fused lasso (GFlasso) to directly and effectively incorporate the correlation structure of multiple quantitative traits such as clinical metrics and gene expressions in association analysis. Our approach represents correlation information explicitly among the quantitative traits as a quantitative trait network (QTN) and then leverages this network to encode structured regularization functions in a multivariate regression model over the genotypes and traits. The result is that the genetic markers that jointly influence subgroups of highly correlated traits can be detected jointly with high sensitivity and specificity. While most of the traditional methods examined each phenotype independently and combined the results afterwards, our approach analyzes all of the traits jointly in a single statistical framework. This allows our method to borrow information across correlated phenotypes to discover the genetic markers that perturb a subset of the correlated traits synergistically. Using simulated datasets based on the HapMap consortium and an asthma dataset, we compared the performance of our method with other methods based on single-marker analysis and regression-based methods that do not use any of the relational information in the traits. We found that our method showed an increased power in detecting causal variants affecting correlated traits. Our results showed that, when correlation patterns among traits in a QTN are considered explicitly and directly during a structured multivariate genome association analysis using our proposed methods, the power of detecting true causal SNPs with possibly pleiotropic effects increased significantly without compromising performance on non-pleiotropic SNPs.</description><identifier>ISSN: 1553-7404</identifier><identifier>ISSN: 1553-7390</identifier><identifier>EISSN: 1553-7404</identifier><identifier>DOI: 10.1371/journal.pgen.1000587</identifier><identifier>PMID: 19680538</identifier><language>eng</language><publisher>United States: Public Library of Science</publisher><subject>Asthma ; Asthma - genetics ; Computational Biology/Genomics ; Computational Biology/Population Genetics ; Computer Simulation ; Data Interpretation, Statistical ; Data mining ; Disease ; Gene expression ; Genes ; Genetic variation ; Genetics ; Genetics and Genomics/Bioinformatics ; Genetics and Genomics/Population Genetics ; Genome ; Genome, Human ; Genomes ; Humans ; Hypothesis testing ; Methods ; Models, Genetic ; Models, Statistical ; Quantitative trait loci ; Quantitative Trait, Heritable ; Studies</subject><ispartof>PLoS genetics, 2009-08, Vol.5 (8), p.e1000587-e1000587</ispartof><rights>COPYRIGHT 2009 Public Library of Science</rights><rights>Kim, Xing. 2009</rights><rights>2009 Kim, Xing. This is an open-access article distributed under the terms of the Creative Commons Attribution License, which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited: Kim S, Xing EP (2009) Statistical Estimation of Correlated Genome Associations to a Quantitative Trait Network. PLoS Genet 5(8): e1000587. doi:10.1371/journal.pgen.1000587</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c765t-b36ebe7ae6c0f7cb676fe81adf8a425d3606a1576884dc619bb00d9e6451b8e63</citedby><cites>FETCH-LOGICAL-c765t-b36ebe7ae6c0f7cb676fe81adf8a425d3606a1576884dc619bb00d9e6451b8e63</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC2719086/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC2719086/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,723,776,780,860,881,2095,2914,23846,27903,27904,53769,53771,79346,79347</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/19680538$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Storey, John D.</contributor><creatorcontrib>Kim, Seyoung</creatorcontrib><creatorcontrib>Xing, Eric P</creatorcontrib><title>Statistical estimation of correlated genome associations to a quantitative trait network</title><title>PLoS genetics</title><addtitle>PLoS Genet</addtitle><description>Many complex disease syndromes, such as asthma, consist of a large number of highly related, rather than independent, clinical or molecular phenotypes. This raises a new technical challenge in identifying genetic variations associated simultaneously with correlated traits. In this study, we propose a new statistical framework called graph-guided fused lasso (GFlasso) to directly and effectively incorporate the correlation structure of multiple quantitative traits such as clinical metrics and gene expressions in association analysis. Our approach represents correlation information explicitly among the quantitative traits as a quantitative trait network (QTN) and then leverages this network to encode structured regularization functions in a multivariate regression model over the genotypes and traits. The result is that the genetic markers that jointly influence subgroups of highly correlated traits can be detected jointly with high sensitivity and specificity. While most of the traditional methods examined each phenotype independently and combined the results afterwards, our approach analyzes all of the traits jointly in a single statistical framework. This allows our method to borrow information across correlated phenotypes to discover the genetic markers that perturb a subset of the correlated traits synergistically. Using simulated datasets based on the HapMap consortium and an asthma dataset, we compared the performance of our method with other methods based on single-marker analysis and regression-based methods that do not use any of the relational information in the traits. We found that our method showed an increased power in detecting causal variants affecting correlated traits. Our results showed that, when correlation patterns among traits in a QTN are considered explicitly and directly during a structured multivariate genome association analysis using our proposed methods, the power of detecting true causal SNPs with possibly pleiotropic effects increased significantly without compromising performance on non-pleiotropic SNPs.</description><subject>Asthma</subject><subject>Asthma - genetics</subject><subject>Computational Biology/Genomics</subject><subject>Computational Biology/Population Genetics</subject><subject>Computer Simulation</subject><subject>Data Interpretation, Statistical</subject><subject>Data mining</subject><subject>Disease</subject><subject>Gene expression</subject><subject>Genes</subject><subject>Genetic variation</subject><subject>Genetics</subject><subject>Genetics and Genomics/Bioinformatics</subject><subject>Genetics and Genomics/Population Genetics</subject><subject>Genome</subject><subject>Genome, Human</subject><subject>Genomes</subject><subject>Humans</subject><subject>Hypothesis testing</subject><subject>Methods</subject><subject>Models, Genetic</subject><subject>Models, Statistical</subject><subject>Quantitative trait loci</subject><subject>Quantitative Trait, Heritable</subject><subject>Studies</subject><issn>1553-7404</issn><issn>1553-7390</issn><issn>1553-7404</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2009</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>DOA</sourceid><recordid>eNqVk02P0zAQhiMEYpeFf4Agp5U4tNjxZy5IqxUflVasxALiZjnOpHVx467tLPDvcdsAzQlIDnacZ97xvPYUxVOM5pgI_HLth9BrN98uoZ9jhBCT4l5xihkjM0ERvX80PykexbhGiDBZi4fFCa65RIzI0-LLTdLJxmSNdiXkcZM_fV_6rjQ-BHA6QVvmFH4DpY7RG7sHYpl8qcvbQffJ7iTuoExB21T2kL758PVx8aDTLsKTcTwrPr15_fHy3ezq-u3i8uJqZgRnadYQDg0IDdygTpiGC96BxLrtpKYVawlHXGMmuJS0NRzXTYNQWwOnDDcSODkrnh90t85HNZoSFSaY0JohgjKxOBCt12u1DbnE8EN5bdV-wYel0iEb4EBhQ0VljNAVMxTVQtbYGCpbUhuUN2Wy1qsx29BsoDXQ56LdRHT6p7crtfR3qhK4RnK33fNRIPjbIRuuNjYacE734IeouGCykkz8FawwEjQ7lMH5AVzqXIHtO58Tm_y2sLHG99DZvH5RIcFpfkgOeDEJyEyC72mphxjV4ubDf7Dv_529_jxlz4_YFWiXVtG7YX-1piA9gCb4GAN0v73GSO364NeRq10fqLEPctiz43P6EzRefPITRocEvw</recordid><startdate>20090801</startdate><enddate>20090801</enddate><creator>Kim, Seyoung</creator><creator>Xing, Eric P</creator><general>Public Library of Science</general><general>Public Library of Science (PLoS)</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>IOV</scope><scope>ISN</scope><scope>ISR</scope><scope>7TM</scope><scope>8FD</scope><scope>FR3</scope><scope>P64</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope></search><sort><creationdate>20090801</creationdate><title>Statistical estimation of correlated genome associations to a quantitative trait network</title><author>Kim, Seyoung ; Xing, Eric P</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c765t-b36ebe7ae6c0f7cb676fe81adf8a425d3606a1576884dc619bb00d9e6451b8e63</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2009</creationdate><topic>Asthma</topic><topic>Asthma - genetics</topic><topic>Computational Biology/Genomics</topic><topic>Computational Biology/Population Genetics</topic><topic>Computer Simulation</topic><topic>Data Interpretation, Statistical</topic><topic>Data mining</topic><topic>Disease</topic><topic>Gene expression</topic><topic>Genes</topic><topic>Genetic variation</topic><topic>Genetics</topic><topic>Genetics and Genomics/Bioinformatics</topic><topic>Genetics and Genomics/Population Genetics</topic><topic>Genome</topic><topic>Genome, Human</topic><topic>Genomes</topic><topic>Humans</topic><topic>Hypothesis testing</topic><topic>Methods</topic><topic>Models, Genetic</topic><topic>Models, Statistical</topic><topic>Quantitative trait loci</topic><topic>Quantitative Trait, Heritable</topic><topic>Studies</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Kim, Seyoung</creatorcontrib><creatorcontrib>Xing, Eric P</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Gale In Context: Opposing Viewpoints</collection><collection>Gale In Context: Canada</collection><collection>Gale In Context: Science</collection><collection>Nucleic Acids Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>PLoS genetics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Kim, Seyoung</au><au>Xing, Eric P</au><au>Storey, John D.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Statistical estimation of correlated genome associations to a quantitative trait network</atitle><jtitle>PLoS genetics</jtitle><addtitle>PLoS Genet</addtitle><date>2009-08-01</date><risdate>2009</risdate><volume>5</volume><issue>8</issue><spage>e1000587</spage><epage>e1000587</epage><pages>e1000587-e1000587</pages><issn>1553-7404</issn><issn>1553-7390</issn><eissn>1553-7404</eissn><abstract>Many complex disease syndromes, such as asthma, consist of a large number of highly related, rather than independent, clinical or molecular phenotypes. This raises a new technical challenge in identifying genetic variations associated simultaneously with correlated traits. In this study, we propose a new statistical framework called graph-guided fused lasso (GFlasso) to directly and effectively incorporate the correlation structure of multiple quantitative traits such as clinical metrics and gene expressions in association analysis. Our approach represents correlation information explicitly among the quantitative traits as a quantitative trait network (QTN) and then leverages this network to encode structured regularization functions in a multivariate regression model over the genotypes and traits. The result is that the genetic markers that jointly influence subgroups of highly correlated traits can be detected jointly with high sensitivity and specificity. While most of the traditional methods examined each phenotype independently and combined the results afterwards, our approach analyzes all of the traits jointly in a single statistical framework. This allows our method to borrow information across correlated phenotypes to discover the genetic markers that perturb a subset of the correlated traits synergistically. Using simulated datasets based on the HapMap consortium and an asthma dataset, we compared the performance of our method with other methods based on single-marker analysis and regression-based methods that do not use any of the relational information in the traits. We found that our method showed an increased power in detecting causal variants affecting correlated traits. Our results showed that, when correlation patterns among traits in a QTN are considered explicitly and directly during a structured multivariate genome association analysis using our proposed methods, the power of detecting true causal SNPs with possibly pleiotropic effects increased significantly without compromising performance on non-pleiotropic SNPs.</abstract><cop>United States</cop><pub>Public Library of Science</pub><pmid>19680538</pmid><doi>10.1371/journal.pgen.1000587</doi><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 1553-7404 |
ispartof | PLoS genetics, 2009-08, Vol.5 (8), p.e1000587-e1000587 |
issn | 1553-7404 1553-7390 1553-7404 |
language | eng |
recordid | cdi_plos_journals_1313495030 |
source | MEDLINE; DOAJ Directory of Open Access Journals; Elektronische Zeitschriftenbibliothek - Frei zugängliche E-Journals; Public Library of Science (PLoS); PubMed Central |
subjects | Asthma Asthma - genetics Computational Biology/Genomics Computational Biology/Population Genetics Computer Simulation Data Interpretation, Statistical Data mining Disease Gene expression Genes Genetic variation Genetics Genetics and Genomics/Bioinformatics Genetics and Genomics/Population Genetics Genome Genome, Human Genomes Humans Hypothesis testing Methods Models, Genetic Models, Statistical Quantitative trait loci Quantitative Trait, Heritable Studies |
title | Statistical estimation of correlated genome associations to a quantitative trait network |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-26T06%3A21%3A04IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_plos_&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Statistical%20estimation%20of%20correlated%20genome%20associations%20to%20a%20quantitative%20trait%20network&rft.jtitle=PLoS%20genetics&rft.au=Kim,%20Seyoung&rft.date=2009-08-01&rft.volume=5&rft.issue=8&rft.spage=e1000587&rft.epage=e1000587&rft.pages=e1000587-e1000587&rft.issn=1553-7404&rft.eissn=1553-7404&rft_id=info:doi/10.1371/journal.pgen.1000587&rft_dat=%3Cgale_plos_%3EA207644443%3C/gale_plos_%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=21074425&rft_id=info:pmid/19680538&rft_galeid=A207644443&rft_doaj_id=oai_doaj_org_article_1c472cc7a25c4097891cc48d39c081ac&rfr_iscdi=true |