Intrinsically Multivariate Predictive Genes

Canalizing genes possess such broad regulatory power, and their action sweeps across a such a wide swath of processes that the full set of affected genes are not highly correlated under normal conditions. When not active, the controlling gene will not be predictable to any significant degree by its...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE journal of selected topics in signal processing 2008-06, Vol.2 (3), p.424-439
Hauptverfasser: Martins, D.C., Braga-Neto, U.M., Hashimoto, R.F., Bittner, M.L., Dougherty, E.R.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 439
container_issue 3
container_start_page 424
container_title IEEE journal of selected topics in signal processing
container_volume 2
creator Martins, D.C.
Braga-Neto, U.M.
Hashimoto, R.F.
Bittner, M.L.
Dougherty, E.R.
description Canalizing genes possess such broad regulatory power, and their action sweeps across a such a wide swath of processes that the full set of affected genes are not highly correlated under normal conditions. When not active, the controlling gene will not be predictable to any significant degree by its subject genes, either alone or in groups, since their behavior will be highly varied relative to the inactive controlling gene. When the controlling gene is active, its behavior is not well predicted by any one of its targets, but can be very well predicted by groups of genes under its control. To investigate this question, we introduce in this paper the concept of intrinsically multivariate predictive (IMP) genes, and present a mathematical study of IMP in the context of binary genes with respect to the coefficient of determination (CoD), which measures the predictive power of a set of genes with respect to a target gene. A set of predictor genes is said to be IMP for a target gene if all properly contained subsets of the predictor set are bad predictors of the target but the full predictor set predicts the target with great accuracy. We show that logic of prediction, predictive power, covariance between predictors, and the entropy of the joint probability distribution of the predictors jointly affect the appearance of IMP genes. In particular, we show that high-predictive power, small covariance among predictors, a large entropy of the joint probability distribution of predictors, and certain logics, such as XOR in the 2-predictor case, are factors that favor the appearance of IMP. The IMP concept is applied to characterize the behavior of the gene DUSP1, which exhibits control over a central, process-integrating signaling pathway, thereby providing preliminary evidence that IMP can be used as a criterion for discovery of canalizing genes.
doi_str_mv 10.1109/JSTSP.2008.923841
format Article
fullrecord <record><control><sourceid>proquest_RIE</sourceid><recordid>TN_cdi_proquest_miscellaneous_34505905</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>4550558</ieee_id><sourcerecordid>875091340</sourcerecordid><originalsourceid>FETCH-LOGICAL-c386t-1763015df00604700fe3ec776270db382ce64845ccc471d252839258a092a2163</originalsourceid><addsrcrecordid>eNqFkU1LAzEQhoMoWKs_QLwUD3qQrZPv5ChFa6ViofUcYjYLKdttTXaF_nt3XfHgQU8zA887zPAgdI5hjDHo26flarkYEwA11oQqhg_QAGuGM2CKHXY9JRnjnB6jk5TWAFwKzAboZlbVMVQpOFuW-9FzU9bhw8Zgaz9aRJ8H185-NPWVT6foqLBl8mffdYheH-5Xk8ds_jKdTe7mmaNK1BmWggLmeQEggEmAwlPvpBREQv5GFXFetEdx5xyTOCecKKoJVxY0sQQLOkTX_d5d3L43PtVmE5LzZWkrv22SUZKDxpRBS179SVLGgWvg_4IEhBBYdxsvf4HrbROr9l2jBFEtoroDcQ-5uE0p-sLsYtjYuDcYTKfDfOkwnQ7T62gzF30meO9_-NYIcK7oJ7I_gvw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>862819086</pqid></control><display><type>article</type><title>Intrinsically Multivariate Predictive Genes</title><source>IEEE Electronic Library (IEL)</source><creator>Martins, D.C. ; Braga-Neto, U.M. ; Hashimoto, R.F. ; Bittner, M.L. ; Dougherty, E.R.</creator><creatorcontrib>Martins, D.C. ; Braga-Neto, U.M. ; Hashimoto, R.F. ; Bittner, M.L. ; Dougherty, E.R.</creatorcontrib><description>Canalizing genes possess such broad regulatory power, and their action sweeps across a such a wide swath of processes that the full set of affected genes are not highly correlated under normal conditions. When not active, the controlling gene will not be predictable to any significant degree by its subject genes, either alone or in groups, since their behavior will be highly varied relative to the inactive controlling gene. When the controlling gene is active, its behavior is not well predicted by any one of its targets, but can be very well predicted by groups of genes under its control. To investigate this question, we introduce in this paper the concept of intrinsically multivariate predictive (IMP) genes, and present a mathematical study of IMP in the context of binary genes with respect to the coefficient of determination (CoD), which measures the predictive power of a set of genes with respect to a target gene. A set of predictor genes is said to be IMP for a target gene if all properly contained subsets of the predictor set are bad predictors of the target but the full predictor set predicts the target with great accuracy. We show that logic of prediction, predictive power, covariance between predictors, and the entropy of the joint probability distribution of the predictors jointly affect the appearance of IMP genes. In particular, we show that high-predictive power, small covariance among predictors, a large entropy of the joint probability distribution of predictors, and certain logics, such as XOR in the 2-predictor case, are factors that favor the appearance of IMP. The IMP concept is applied to characterize the behavior of the gene DUSP1, which exhibits control over a central, process-integrating signaling pathway, thereby providing preliminary evidence that IMP can be used as a criterion for discovery of canalizing genes.</description><identifier>ISSN: 1932-4553</identifier><identifier>EISSN: 1941-0484</identifier><identifier>DOI: 10.1109/JSTSP.2008.923841</identifier><identifier>CODEN: IJSTGY</identifier><language>eng</language><publisher>New York: IEEE</publisher><subject>Active control ; Bioinformatics ; Biological networks ; canalization ; Cancer ; Computer science ; Covariance ; Criteria ; Entropy ; Genes ; Genomics ; IMP ; intrinsically multivariate prediction ; Irrigation ; Logic ; Mathematical analysis ; melanoma ; microarray ; Power measurement ; Probability distribution ; Stress ; Studies ; transcriptional regulation</subject><ispartof>IEEE journal of selected topics in signal processing, 2008-06, Vol.2 (3), p.424-439</ispartof><rights>Copyright The Institute of Electrical and Electronics Engineers, Inc. (IEEE) 2008</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c386t-1763015df00604700fe3ec776270db382ce64845ccc471d252839258a092a2163</citedby><cites>FETCH-LOGICAL-c386t-1763015df00604700fe3ec776270db382ce64845ccc471d252839258a092a2163</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/4550558$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>314,780,784,796,27923,27924,54757</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/4550558$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Martins, D.C.</creatorcontrib><creatorcontrib>Braga-Neto, U.M.</creatorcontrib><creatorcontrib>Hashimoto, R.F.</creatorcontrib><creatorcontrib>Bittner, M.L.</creatorcontrib><creatorcontrib>Dougherty, E.R.</creatorcontrib><title>Intrinsically Multivariate Predictive Genes</title><title>IEEE journal of selected topics in signal processing</title><addtitle>JSTSP</addtitle><description>Canalizing genes possess such broad regulatory power, and their action sweeps across a such a wide swath of processes that the full set of affected genes are not highly correlated under normal conditions. When not active, the controlling gene will not be predictable to any significant degree by its subject genes, either alone or in groups, since their behavior will be highly varied relative to the inactive controlling gene. When the controlling gene is active, its behavior is not well predicted by any one of its targets, but can be very well predicted by groups of genes under its control. To investigate this question, we introduce in this paper the concept of intrinsically multivariate predictive (IMP) genes, and present a mathematical study of IMP in the context of binary genes with respect to the coefficient of determination (CoD), which measures the predictive power of a set of genes with respect to a target gene. A set of predictor genes is said to be IMP for a target gene if all properly contained subsets of the predictor set are bad predictors of the target but the full predictor set predicts the target with great accuracy. We show that logic of prediction, predictive power, covariance between predictors, and the entropy of the joint probability distribution of the predictors jointly affect the appearance of IMP genes. In particular, we show that high-predictive power, small covariance among predictors, a large entropy of the joint probability distribution of predictors, and certain logics, such as XOR in the 2-predictor case, are factors that favor the appearance of IMP. The IMP concept is applied to characterize the behavior of the gene DUSP1, which exhibits control over a central, process-integrating signaling pathway, thereby providing preliminary evidence that IMP can be used as a criterion for discovery of canalizing genes.</description><subject>Active control</subject><subject>Bioinformatics</subject><subject>Biological networks</subject><subject>canalization</subject><subject>Cancer</subject><subject>Computer science</subject><subject>Covariance</subject><subject>Criteria</subject><subject>Entropy</subject><subject>Genes</subject><subject>Genomics</subject><subject>IMP</subject><subject>intrinsically multivariate prediction</subject><subject>Irrigation</subject><subject>Logic</subject><subject>Mathematical analysis</subject><subject>melanoma</subject><subject>microarray</subject><subject>Power measurement</subject><subject>Probability distribution</subject><subject>Stress</subject><subject>Studies</subject><subject>transcriptional regulation</subject><issn>1932-4553</issn><issn>1941-0484</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2008</creationdate><recordtype>article</recordtype><sourceid>RIE</sourceid><recordid>eNqFkU1LAzEQhoMoWKs_QLwUD3qQrZPv5ChFa6ViofUcYjYLKdttTXaF_nt3XfHgQU8zA887zPAgdI5hjDHo26flarkYEwA11oQqhg_QAGuGM2CKHXY9JRnjnB6jk5TWAFwKzAboZlbVMVQpOFuW-9FzU9bhw8Zgaz9aRJ8H185-NPWVT6foqLBl8mffdYheH-5Xk8ds_jKdTe7mmaNK1BmWggLmeQEggEmAwlPvpBREQv5GFXFetEdx5xyTOCecKKoJVxY0sQQLOkTX_d5d3L43PtVmE5LzZWkrv22SUZKDxpRBS179SVLGgWvg_4IEhBBYdxsvf4HrbROr9l2jBFEtoroDcQ-5uE0p-sLsYtjYuDcYTKfDfOkwnQ7T62gzF30meO9_-NYIcK7oJ7I_gvw</recordid><startdate>20080601</startdate><enddate>20080601</enddate><creator>Martins, D.C.</creator><creator>Braga-Neto, U.M.</creator><creator>Hashimoto, R.F.</creator><creator>Bittner, M.L.</creator><creator>Dougherty, E.R.</creator><general>IEEE</general><general>The Institute of Electrical and Electronics Engineers, Inc. (IEEE)</general><scope>97E</scope><scope>RIA</scope><scope>RIE</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SP</scope><scope>8FD</scope><scope>H8D</scope><scope>L7M</scope><scope>FR3</scope><scope>P64</scope><scope>RC3</scope></search><sort><creationdate>20080601</creationdate><title>Intrinsically Multivariate Predictive Genes</title><author>Martins, D.C. ; Braga-Neto, U.M. ; Hashimoto, R.F. ; Bittner, M.L. ; Dougherty, E.R.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c386t-1763015df00604700fe3ec776270db382ce64845ccc471d252839258a092a2163</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2008</creationdate><topic>Active control</topic><topic>Bioinformatics</topic><topic>Biological networks</topic><topic>canalization</topic><topic>Cancer</topic><topic>Computer science</topic><topic>Covariance</topic><topic>Criteria</topic><topic>Entropy</topic><topic>Genes</topic><topic>Genomics</topic><topic>IMP</topic><topic>intrinsically multivariate prediction</topic><topic>Irrigation</topic><topic>Logic</topic><topic>Mathematical analysis</topic><topic>melanoma</topic><topic>microarray</topic><topic>Power measurement</topic><topic>Probability distribution</topic><topic>Stress</topic><topic>Studies</topic><topic>transcriptional regulation</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Martins, D.C.</creatorcontrib><creatorcontrib>Braga-Neto, U.M.</creatorcontrib><creatorcontrib>Hashimoto, R.F.</creatorcontrib><creatorcontrib>Bittner, M.L.</creatorcontrib><creatorcontrib>Dougherty, E.R.</creatorcontrib><collection>IEEE All-Society Periodicals Package (ASPP) 2005-present</collection><collection>IEEE All-Society Periodicals Package (ASPP) 1998-Present</collection><collection>IEEE Electronic Library (IEL)</collection><collection>CrossRef</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>Aerospace Database</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Engineering Research Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Genetics Abstracts</collection><jtitle>IEEE journal of selected topics in signal processing</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Martins, D.C.</au><au>Braga-Neto, U.M.</au><au>Hashimoto, R.F.</au><au>Bittner, M.L.</au><au>Dougherty, E.R.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Intrinsically Multivariate Predictive Genes</atitle><jtitle>IEEE journal of selected topics in signal processing</jtitle><stitle>JSTSP</stitle><date>2008-06-01</date><risdate>2008</risdate><volume>2</volume><issue>3</issue><spage>424</spage><epage>439</epage><pages>424-439</pages><issn>1932-4553</issn><eissn>1941-0484</eissn><coden>IJSTGY</coden><abstract>Canalizing genes possess such broad regulatory power, and their action sweeps across a such a wide swath of processes that the full set of affected genes are not highly correlated under normal conditions. When not active, the controlling gene will not be predictable to any significant degree by its subject genes, either alone or in groups, since their behavior will be highly varied relative to the inactive controlling gene. When the controlling gene is active, its behavior is not well predicted by any one of its targets, but can be very well predicted by groups of genes under its control. To investigate this question, we introduce in this paper the concept of intrinsically multivariate predictive (IMP) genes, and present a mathematical study of IMP in the context of binary genes with respect to the coefficient of determination (CoD), which measures the predictive power of a set of genes with respect to a target gene. A set of predictor genes is said to be IMP for a target gene if all properly contained subsets of the predictor set are bad predictors of the target but the full predictor set predicts the target with great accuracy. We show that logic of prediction, predictive power, covariance between predictors, and the entropy of the joint probability distribution of the predictors jointly affect the appearance of IMP genes. In particular, we show that high-predictive power, small covariance among predictors, a large entropy of the joint probability distribution of predictors, and certain logics, such as XOR in the 2-predictor case, are factors that favor the appearance of IMP. The IMP concept is applied to characterize the behavior of the gene DUSP1, which exhibits control over a central, process-integrating signaling pathway, thereby providing preliminary evidence that IMP can be used as a criterion for discovery of canalizing genes.</abstract><cop>New York</cop><pub>IEEE</pub><doi>10.1109/JSTSP.2008.923841</doi><tpages>16</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISSN: 1932-4553
ispartof IEEE journal of selected topics in signal processing, 2008-06, Vol.2 (3), p.424-439
issn 1932-4553
1941-0484
language eng
recordid cdi_proquest_miscellaneous_34505905
source IEEE Electronic Library (IEL)
subjects Active control
Bioinformatics
Biological networks
canalization
Cancer
Computer science
Covariance
Criteria
Entropy
Genes
Genomics
IMP
intrinsically multivariate prediction
Irrigation
Logic
Mathematical analysis
melanoma
microarray
Power measurement
Probability distribution
Stress
Studies
transcriptional regulation
title Intrinsically Multivariate Predictive Genes
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-12T16%3A35%3A00IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_RIE&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Intrinsically%20Multivariate%20Predictive%20Genes&rft.jtitle=IEEE%20journal%20of%20selected%20topics%20in%20signal%20processing&rft.au=Martins,%20D.C.&rft.date=2008-06-01&rft.volume=2&rft.issue=3&rft.spage=424&rft.epage=439&rft.pages=424-439&rft.issn=1932-4553&rft.eissn=1941-0484&rft.coden=IJSTGY&rft_id=info:doi/10.1109/JSTSP.2008.923841&rft_dat=%3Cproquest_RIE%3E875091340%3C/proquest_RIE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=862819086&rft_id=info:pmid/&rft_ieee_id=4550558&rfr_iscdi=true