On the necessity and biological significance of threshold-free regulon prediction outputs

The in silico prediction of cis -acting elements in a genome is an efficient way to quickly obtain an overview of the biological processes controlled by a trans -acting factor, and connections between regulatory networks. Several regulon prediction web tools are available, designed to identify DNA m...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Molecular bioSystems 2015-02, Vol.11 (2), p.333-337
Hauptverfasser: Rigali, Sébastien, Nivelle, Renaud, Tocquin, Pierre
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 337
container_issue 2
container_start_page 333
container_title Molecular bioSystems
container_volume 11
creator Rigali, Sébastien
Nivelle, Renaud
Tocquin, Pierre
description The in silico prediction of cis -acting elements in a genome is an efficient way to quickly obtain an overview of the biological processes controlled by a trans -acting factor, and connections between regulatory networks. Several regulon prediction web tools are available, designed to identify DNA motifs predicted to be bound by transcription factors using position weight matrix-based algorithms. In this paper we expose and discuss the conflicting objectives of software creators (bioinformaticians) and software users (biologists), who aim for reliable and exhaustive prediction outputs, respectively. Software makers, concerned with providing tools that minimise the number of false positive hits, often impose a stringent threshold score for a sequence to be included in the list of the putative cis -acting sites. This rigidity eventually results in the identification of strongly reliable but largely straightforward sites, i.e. those associated with genes already anticipated to be targeted by the studied transcription factor. Importantly, this biased identification of strongly bound sequences contrasts with the biological reality where, in many circumstances, a weak DNA-protein interaction is required for the appropriate gene's expression. We show here a series of transcriptionally controlled systems involving weakly bound cis -acting elements that could never have been discovered because of the policy of preventing software users from modifying the screening parameters. Proposing only trustworthy prediction outputs thus prevents biologists from fully utilising their knowledge background and deciding to analyse statistically irrelevant hits that could nonetheless be potentially involved in subtle, unexpected, though essential cis - trans relationships. Regulon prediction tools are trendy in system biology but software developers, concerned for providing reliable outputs, often impose rigid screening parameters. We discuss and illustrate why biologists should utilize tools that allow lowering threshold scores set by default in order to discover unexpected but essential cis - trans relationships.
doi_str_mv 10.1039/c4mb00485j
format Article
fullrecord <record><control><sourceid>proquest_rsc_p</sourceid><recordid>TN_cdi_rsc_primary_c4mb00485j</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1652409579</sourcerecordid><originalsourceid>FETCH-LOGICAL-c415t-f885fb5cb641d0271a3cd983dfb3252451d286cd240b9a747200308bdf2927cc3</originalsourceid><addsrcrecordid>eNp9kUtv1TAQhS0Eog_YsAeFXYWU4mfiLOGKR1FRNyDByvJjnLryjYOdIPXf15Byu2M1RzrfzGjmIPSC4HOC2fDW8r3BmEtx8wgdk57TlmJBHh909-MInZRygzGTnOCn6IgKJntByTH6eTU1yzU0E1goJSy3jZ5cY0KKaQxWx6aEcQq-yslCk3yFM5TrFF3rM0CTYVxjmpo5gwt2CVWmdZnXpTxDT7yOBZ7f11P0_eOHb7vP7eXVp4vdu8vWciKW1kspvBHWdJw4THuimXWDZM4bRgXlgjgqO-sox2bQPe9pPQNL4zwdaG8tO0VsmxsDjKBSNkH9pirpsOk1jkpbZUBR2klVf4I7WbvOtq45p18rlEXtQ7EQo54grUWRru7Gg-iHir7ZUJtTKRm8mnPY63yrCFZ_AlA7_vX93wC-VPjV_dzV7MEd0H8fr8DLDcjFHtyHBKv_-n--mp1nd96llfs</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1652409579</pqid></control><display><type>article</type><title>On the necessity and biological significance of threshold-free regulon prediction outputs</title><source>MEDLINE</source><source>Royal Society Of Chemistry Journals 2008-</source><source>Alma/SFX Local Collection</source><creator>Rigali, Sébastien ; Nivelle, Renaud ; Tocquin, Pierre</creator><creatorcontrib>Rigali, Sébastien ; Nivelle, Renaud ; Tocquin, Pierre</creatorcontrib><description>The in silico prediction of cis -acting elements in a genome is an efficient way to quickly obtain an overview of the biological processes controlled by a trans -acting factor, and connections between regulatory networks. Several regulon prediction web tools are available, designed to identify DNA motifs predicted to be bound by transcription factors using position weight matrix-based algorithms. In this paper we expose and discuss the conflicting objectives of software creators (bioinformaticians) and software users (biologists), who aim for reliable and exhaustive prediction outputs, respectively. Software makers, concerned with providing tools that minimise the number of false positive hits, often impose a stringent threshold score for a sequence to be included in the list of the putative cis -acting sites. This rigidity eventually results in the identification of strongly reliable but largely straightforward sites, i.e. those associated with genes already anticipated to be targeted by the studied transcription factor. Importantly, this biased identification of strongly bound sequences contrasts with the biological reality where, in many circumstances, a weak DNA-protein interaction is required for the appropriate gene's expression. We show here a series of transcriptionally controlled systems involving weakly bound cis -acting elements that could never have been discovered because of the policy of preventing software users from modifying the screening parameters. Proposing only trustworthy prediction outputs thus prevents biologists from fully utilising their knowledge background and deciding to analyse statistically irrelevant hits that could nonetheless be potentially involved in subtle, unexpected, though essential cis - trans relationships. Regulon prediction tools are trendy in system biology but software developers, concerned for providing reliable outputs, often impose rigid screening parameters. We discuss and illustrate why biologists should utilize tools that allow lowering threshold scores set by default in order to discover unexpected but essential cis - trans relationships.</description><identifier>ISSN: 1742-206X</identifier><identifier>ISSN: 1742-2051</identifier><identifier>EISSN: 1742-2051</identifier><identifier>DOI: 10.1039/c4mb00485j</identifier><identifier>PMID: 25387521</identifier><language>eng</language><publisher>England: Royal Society of Chemistry</publisher><subject>Algorithms ; Base Sequence ; Biochemistry, biophysics &amp; molecular biology ; Biochimie, biophysique &amp; biologie moléculaire ; Computational Biology - methods ; DNA - genetics ; DNA - metabolism ; Internet ; Life sciences ; Microbiologie ; Microbiology ; Molecular Sequence Data ; Regulon - genetics ; Response Elements - genetics ; Sciences du vivant ; Software ; Transcription Factors - metabolism ; Transcription, Genetic</subject><ispartof>Molecular bioSystems, 2015-02, Vol.11 (2), p.333-337</ispartof><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c415t-f885fb5cb641d0271a3cd983dfb3252451d286cd240b9a747200308bdf2927cc3</citedby><cites>FETCH-LOGICAL-c415t-f885fb5cb641d0271a3cd983dfb3252451d286cd240b9a747200308bdf2927cc3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>230,314,776,780,881,27901,27902</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/25387521$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Rigali, Sébastien</creatorcontrib><creatorcontrib>Nivelle, Renaud</creatorcontrib><creatorcontrib>Tocquin, Pierre</creatorcontrib><title>On the necessity and biological significance of threshold-free regulon prediction outputs</title><title>Molecular bioSystems</title><addtitle>Mol Biosyst</addtitle><description>The in silico prediction of cis -acting elements in a genome is an efficient way to quickly obtain an overview of the biological processes controlled by a trans -acting factor, and connections between regulatory networks. Several regulon prediction web tools are available, designed to identify DNA motifs predicted to be bound by transcription factors using position weight matrix-based algorithms. In this paper we expose and discuss the conflicting objectives of software creators (bioinformaticians) and software users (biologists), who aim for reliable and exhaustive prediction outputs, respectively. Software makers, concerned with providing tools that minimise the number of false positive hits, often impose a stringent threshold score for a sequence to be included in the list of the putative cis -acting sites. This rigidity eventually results in the identification of strongly reliable but largely straightforward sites, i.e. those associated with genes already anticipated to be targeted by the studied transcription factor. Importantly, this biased identification of strongly bound sequences contrasts with the biological reality where, in many circumstances, a weak DNA-protein interaction is required for the appropriate gene's expression. We show here a series of transcriptionally controlled systems involving weakly bound cis -acting elements that could never have been discovered because of the policy of preventing software users from modifying the screening parameters. Proposing only trustworthy prediction outputs thus prevents biologists from fully utilising their knowledge background and deciding to analyse statistically irrelevant hits that could nonetheless be potentially involved in subtle, unexpected, though essential cis - trans relationships. Regulon prediction tools are trendy in system biology but software developers, concerned for providing reliable outputs, often impose rigid screening parameters. We discuss and illustrate why biologists should utilize tools that allow lowering threshold scores set by default in order to discover unexpected but essential cis - trans relationships.</description><subject>Algorithms</subject><subject>Base Sequence</subject><subject>Biochemistry, biophysics &amp; molecular biology</subject><subject>Biochimie, biophysique &amp; biologie moléculaire</subject><subject>Computational Biology - methods</subject><subject>DNA - genetics</subject><subject>DNA - metabolism</subject><subject>Internet</subject><subject>Life sciences</subject><subject>Microbiologie</subject><subject>Microbiology</subject><subject>Molecular Sequence Data</subject><subject>Regulon - genetics</subject><subject>Response Elements - genetics</subject><subject>Sciences du vivant</subject><subject>Software</subject><subject>Transcription Factors - metabolism</subject><subject>Transcription, Genetic</subject><issn>1742-206X</issn><issn>1742-2051</issn><issn>1742-2051</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2015</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNp9kUtv1TAQhS0Eog_YsAeFXYWU4mfiLOGKR1FRNyDByvJjnLryjYOdIPXf15Byu2M1RzrfzGjmIPSC4HOC2fDW8r3BmEtx8wgdk57TlmJBHh909-MInZRygzGTnOCn6IgKJntByTH6eTU1yzU0E1goJSy3jZ5cY0KKaQxWx6aEcQq-yslCk3yFM5TrFF3rM0CTYVxjmpo5gwt2CVWmdZnXpTxDT7yOBZ7f11P0_eOHb7vP7eXVp4vdu8vWciKW1kspvBHWdJw4THuimXWDZM4bRgXlgjgqO-sox2bQPe9pPQNL4zwdaG8tO0VsmxsDjKBSNkH9pirpsOk1jkpbZUBR2klVf4I7WbvOtq45p18rlEXtQ7EQo54grUWRru7Gg-iHir7ZUJtTKRm8mnPY63yrCFZ_AlA7_vX93wC-VPjV_dzV7MEd0H8fr8DLDcjFHtyHBKv_-n--mp1nd96llfs</recordid><startdate>20150201</startdate><enddate>20150201</enddate><creator>Rigali, Sébastien</creator><creator>Nivelle, Renaud</creator><creator>Tocquin, Pierre</creator><general>Royal Society of Chemistry</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>Q33</scope></search><sort><creationdate>20150201</creationdate><title>On the necessity and biological significance of threshold-free regulon prediction outputs</title><author>Rigali, Sébastien ; Nivelle, Renaud ; Tocquin, Pierre</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c415t-f885fb5cb641d0271a3cd983dfb3252451d286cd240b9a747200308bdf2927cc3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2015</creationdate><topic>Algorithms</topic><topic>Base Sequence</topic><topic>Biochemistry, biophysics &amp; molecular biology</topic><topic>Biochimie, biophysique &amp; biologie moléculaire</topic><topic>Computational Biology - methods</topic><topic>DNA - genetics</topic><topic>DNA - metabolism</topic><topic>Internet</topic><topic>Life sciences</topic><topic>Microbiologie</topic><topic>Microbiology</topic><topic>Molecular Sequence Data</topic><topic>Regulon - genetics</topic><topic>Response Elements - genetics</topic><topic>Sciences du vivant</topic><topic>Software</topic><topic>Transcription Factors - metabolism</topic><topic>Transcription, Genetic</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Rigali, Sébastien</creatorcontrib><creatorcontrib>Nivelle, Renaud</creatorcontrib><creatorcontrib>Tocquin, Pierre</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>Université de Liège - Open Repository and Bibliography (ORBI)</collection><jtitle>Molecular bioSystems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Rigali, Sébastien</au><au>Nivelle, Renaud</au><au>Tocquin, Pierre</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>On the necessity and biological significance of threshold-free regulon prediction outputs</atitle><jtitle>Molecular bioSystems</jtitle><addtitle>Mol Biosyst</addtitle><date>2015-02-01</date><risdate>2015</risdate><volume>11</volume><issue>2</issue><spage>333</spage><epage>337</epage><pages>333-337</pages><issn>1742-206X</issn><issn>1742-2051</issn><eissn>1742-2051</eissn><abstract>The in silico prediction of cis -acting elements in a genome is an efficient way to quickly obtain an overview of the biological processes controlled by a trans -acting factor, and connections between regulatory networks. Several regulon prediction web tools are available, designed to identify DNA motifs predicted to be bound by transcription factors using position weight matrix-based algorithms. In this paper we expose and discuss the conflicting objectives of software creators (bioinformaticians) and software users (biologists), who aim for reliable and exhaustive prediction outputs, respectively. Software makers, concerned with providing tools that minimise the number of false positive hits, often impose a stringent threshold score for a sequence to be included in the list of the putative cis -acting sites. This rigidity eventually results in the identification of strongly reliable but largely straightforward sites, i.e. those associated with genes already anticipated to be targeted by the studied transcription factor. Importantly, this biased identification of strongly bound sequences contrasts with the biological reality where, in many circumstances, a weak DNA-protein interaction is required for the appropriate gene's expression. We show here a series of transcriptionally controlled systems involving weakly bound cis -acting elements that could never have been discovered because of the policy of preventing software users from modifying the screening parameters. Proposing only trustworthy prediction outputs thus prevents biologists from fully utilising their knowledge background and deciding to analyse statistically irrelevant hits that could nonetheless be potentially involved in subtle, unexpected, though essential cis - trans relationships. Regulon prediction tools are trendy in system biology but software developers, concerned for providing reliable outputs, often impose rigid screening parameters. We discuss and illustrate why biologists should utilize tools that allow lowering threshold scores set by default in order to discover unexpected but essential cis - trans relationships.</abstract><cop>England</cop><pub>Royal Society of Chemistry</pub><pmid>25387521</pmid><doi>10.1039/c4mb00485j</doi><tpages>5</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1742-206X
ispartof Molecular bioSystems, 2015-02, Vol.11 (2), p.333-337
issn 1742-206X
1742-2051
1742-2051
language eng
recordid cdi_rsc_primary_c4mb00485j
source MEDLINE; Royal Society Of Chemistry Journals 2008-; Alma/SFX Local Collection
subjects Algorithms
Base Sequence
Biochemistry, biophysics & molecular biology
Biochimie, biophysique & biologie moléculaire
Computational Biology - methods
DNA - genetics
DNA - metabolism
Internet
Life sciences
Microbiologie
Microbiology
Molecular Sequence Data
Regulon - genetics
Response Elements - genetics
Sciences du vivant
Software
Transcription Factors - metabolism
Transcription, Genetic
title On the necessity and biological significance of threshold-free regulon prediction outputs
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-28T18%3A27%3A00IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_rsc_p&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=On%20the%20necessity%20and%20biological%20significance%20of%20threshold-free%20regulon%20prediction%20outputs&rft.jtitle=Molecular%20bioSystems&rft.au=Rigali,%20S%C3%A9bastien&rft.date=2015-02-01&rft.volume=11&rft.issue=2&rft.spage=333&rft.epage=337&rft.pages=333-337&rft.issn=1742-206X&rft.eissn=1742-2051&rft_id=info:doi/10.1039/c4mb00485j&rft_dat=%3Cproquest_rsc_p%3E1652409579%3C/proquest_rsc_p%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1652409579&rft_id=info:pmid/25387521&rfr_iscdi=true