Finding subtle motifs with variable gaps in unaligned DNA sequences
Biologists have determined that the control and regulation of gene expression is primarily determined by relatively short sequences in the region surrounding a gene. These sequences vary in length, position, redundancy, orientation, and bases. Finding these short sequences is a fundamental problem i...
Gespeichert in:
Veröffentlicht in: | Computer methods and programs in biomedicine 2003, Vol.70 (1), p.11-20 |
---|---|
1. Verfasser: | |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 20 |
---|---|
container_issue | 1 |
container_start_page | 11 |
container_title | Computer methods and programs in biomedicine |
container_volume | 70 |
creator | Hu, Yuh-Jyh |
description | Biologists have determined that the control and regulation of gene expression is primarily determined by relatively short sequences in the region surrounding a gene. These sequences vary in length, position, redundancy, orientation, and bases. Finding these short sequences is a fundamental problem in molecular biology with important applications. Though there exist many different approaches to signal (i.e. short sequence) finding, some new study shows that this problem still leaves plenty of room for improvement. In 2000, Pevzner and Sze proposed the Challenge Problem of motif detection. They reported that most current motif finding algorithms are incapable of detecting the target motifs in their Challenge Problem. In this paper, we show that using an iterative-restart design, our new algorithm can correctly find the target motifs. Furthermore, taking into account the fact that some transcription factors form a dimer or even more complex structures, and transcription process can sometimes involve multiple factors with variable spacers in between, we extend the original problem to an even more challenging one by addressing the issue of combinatorial signals with gaps of variable lengths. To demonstrate the effectiveness of our algorithm, we tested it on a series of the new challenge problem as well as real regulons, and compared it with some current representative motif-finding algorithms. |
doi_str_mv | 10.1016/S0169-2607(01)00198-5 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_72875349</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0169260701001985</els_id><sourcerecordid>72875349</sourcerecordid><originalsourceid>FETCH-LOGICAL-c469t-938d2623bcd06da091612b1ff596d5e354b1089be33db92c07ce656f0f8cb56f3</originalsourceid><addsrcrecordid>eNqFkMtOwzAQRS0EgvL4BFA2IFgExk7sxCtUFQpIFSyAteVXilGaFDsB8fe4bQRLNjPS6MzM1UHoGMMlBsyunmPhKWFQnAO-AMC8TOkWGuGyIGlBGd1Go19kD-2H8A4AhFK2i_YwyVmJSTZCk6lrjGvmSehVV9tk0XauCsmX696ST-mdVHE4l8uQuCbpG1m7eWNNcvM4ToL96G2jbThEO5Wsgz0a-gF6nd6-TO7T2dPdw2Q8S3XOeJfyrDSEkUxpA8xI4JhhonBVUc4MtRnNFYaSK5tlRnGiodCWUVZBVWoVe3aAzjZ3l76Nr0MnFi5oW9eysW0fREHKgmY5_xfEJct5zlkE6QbUvg3B20osvVtI_y0wiJVmsdYsVg4FYLHWLGjcOxke9Gphzd_W4DUCpwMgg5Z15WWjXfjj8hxihiJy1xvORm-fznoRtFtJNc5b3QnTun-i_ABYMphQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>18649496</pqid></control><display><type>article</type><title>Finding subtle motifs with variable gaps in unaligned DNA sequences</title><source>MEDLINE</source><source>Elsevier ScienceDirect Journals Complete</source><creator>Hu, Yuh-Jyh</creator><creatorcontrib>Hu, Yuh-Jyh</creatorcontrib><description>Biologists have determined that the control and regulation of gene expression is primarily determined by relatively short sequences in the region surrounding a gene. These sequences vary in length, position, redundancy, orientation, and bases. Finding these short sequences is a fundamental problem in molecular biology with important applications. Though there exist many different approaches to signal (i.e. short sequence) finding, some new study shows that this problem still leaves plenty of room for improvement. In 2000, Pevzner and Sze proposed the Challenge Problem of motif detection. They reported that most current motif finding algorithms are incapable of detecting the target motifs in their Challenge Problem. In this paper, we show that using an iterative-restart design, our new algorithm can correctly find the target motifs. Furthermore, taking into account the fact that some transcription factors form a dimer or even more complex structures, and transcription process can sometimes involve multiple factors with variable spacers in between, we extend the original problem to an even more challenging one by addressing the issue of combinatorial signals with gaps of variable lengths. To demonstrate the effectiveness of our algorithm, we tested it on a series of the new challenge problem as well as real regulons, and compared it with some current representative motif-finding algorithms.</description><identifier>ISSN: 0169-2607</identifier><identifier>EISSN: 1872-7565</identifier><identifier>DOI: 10.1016/S0169-2607(01)00198-5</identifier><identifier>PMID: 12468123</identifier><language>eng</language><publisher>Shannon: Elsevier Ireland Ltd</publisher><subject>Algorithms ; Biological and medical sciences ; DNA - chemistry ; Gaps ; Gene Expression Regulation ; Gene regulation ; Medical sciences ; Motif detection ; Sequence Analysis, DNA ; Subtle signals ; Transcription factors</subject><ispartof>Computer methods and programs in biomedicine, 2003, Vol.70 (1), p.11-20</ispartof><rights>2002 Elsevier Science Ireland Ltd</rights><rights>2003 INIST-CNRS</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c469t-938d2623bcd06da091612b1ff596d5e354b1089be33db92c07ce656f0f8cb56f3</citedby><cites>FETCH-LOGICAL-c469t-938d2623bcd06da091612b1ff596d5e354b1089be33db92c07ce656f0f8cb56f3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/S0169-2607(01)00198-5$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,780,784,3550,4024,27923,27924,27925,45995</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=14408647$$DView record in Pascal Francis$$Hfree_for_read</backlink><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/12468123$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Hu, Yuh-Jyh</creatorcontrib><title>Finding subtle motifs with variable gaps in unaligned DNA sequences</title><title>Computer methods and programs in biomedicine</title><addtitle>Comput Methods Programs Biomed</addtitle><description>Biologists have determined that the control and regulation of gene expression is primarily determined by relatively short sequences in the region surrounding a gene. These sequences vary in length, position, redundancy, orientation, and bases. Finding these short sequences is a fundamental problem in molecular biology with important applications. Though there exist many different approaches to signal (i.e. short sequence) finding, some new study shows that this problem still leaves plenty of room for improvement. In 2000, Pevzner and Sze proposed the Challenge Problem of motif detection. They reported that most current motif finding algorithms are incapable of detecting the target motifs in their Challenge Problem. In this paper, we show that using an iterative-restart design, our new algorithm can correctly find the target motifs. Furthermore, taking into account the fact that some transcription factors form a dimer or even more complex structures, and transcription process can sometimes involve multiple factors with variable spacers in between, we extend the original problem to an even more challenging one by addressing the issue of combinatorial signals with gaps of variable lengths. To demonstrate the effectiveness of our algorithm, we tested it on a series of the new challenge problem as well as real regulons, and compared it with some current representative motif-finding algorithms.</description><subject>Algorithms</subject><subject>Biological and medical sciences</subject><subject>DNA - chemistry</subject><subject>Gaps</subject><subject>Gene Expression Regulation</subject><subject>Gene regulation</subject><subject>Medical sciences</subject><subject>Motif detection</subject><subject>Sequence Analysis, DNA</subject><subject>Subtle signals</subject><subject>Transcription factors</subject><issn>0169-2607</issn><issn>1872-7565</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2003</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqFkMtOwzAQRS0EgvL4BFA2IFgExk7sxCtUFQpIFSyAteVXilGaFDsB8fe4bQRLNjPS6MzM1UHoGMMlBsyunmPhKWFQnAO-AMC8TOkWGuGyIGlBGd1Go19kD-2H8A4AhFK2i_YwyVmJSTZCk6lrjGvmSehVV9tk0XauCsmX696ST-mdVHE4l8uQuCbpG1m7eWNNcvM4ToL96G2jbThEO5Wsgz0a-gF6nd6-TO7T2dPdw2Q8S3XOeJfyrDSEkUxpA8xI4JhhonBVUc4MtRnNFYaSK5tlRnGiodCWUVZBVWoVe3aAzjZ3l76Nr0MnFi5oW9eysW0fREHKgmY5_xfEJct5zlkE6QbUvg3B20osvVtI_y0wiJVmsdYsVg4FYLHWLGjcOxke9Gphzd_W4DUCpwMgg5Z15WWjXfjj8hxihiJy1xvORm-fznoRtFtJNc5b3QnTun-i_ABYMphQ</recordid><startdate>2003</startdate><enddate>2003</enddate><creator>Hu, Yuh-Jyh</creator><general>Elsevier Ireland Ltd</general><general>Elsevier Science</general><scope>IQODW</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QO</scope><scope>8FD</scope><scope>FR3</scope><scope>P64</scope><scope>7X8</scope></search><sort><creationdate>2003</creationdate><title>Finding subtle motifs with variable gaps in unaligned DNA sequences</title><author>Hu, Yuh-Jyh</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c469t-938d2623bcd06da091612b1ff596d5e354b1089be33db92c07ce656f0f8cb56f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2003</creationdate><topic>Algorithms</topic><topic>Biological and medical sciences</topic><topic>DNA - chemistry</topic><topic>Gaps</topic><topic>Gene Expression Regulation</topic><topic>Gene regulation</topic><topic>Medical sciences</topic><topic>Motif detection</topic><topic>Sequence Analysis, DNA</topic><topic>Subtle signals</topic><topic>Transcription factors</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Hu, Yuh-Jyh</creatorcontrib><collection>Pascal-Francis</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Biotechnology Research Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>MEDLINE - Academic</collection><jtitle>Computer methods and programs in biomedicine</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Hu, Yuh-Jyh</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Finding subtle motifs with variable gaps in unaligned DNA sequences</atitle><jtitle>Computer methods and programs in biomedicine</jtitle><addtitle>Comput Methods Programs Biomed</addtitle><date>2003</date><risdate>2003</risdate><volume>70</volume><issue>1</issue><spage>11</spage><epage>20</epage><pages>11-20</pages><issn>0169-2607</issn><eissn>1872-7565</eissn><abstract>Biologists have determined that the control and regulation of gene expression is primarily determined by relatively short sequences in the region surrounding a gene. These sequences vary in length, position, redundancy, orientation, and bases. Finding these short sequences is a fundamental problem in molecular biology with important applications. Though there exist many different approaches to signal (i.e. short sequence) finding, some new study shows that this problem still leaves plenty of room for improvement. In 2000, Pevzner and Sze proposed the Challenge Problem of motif detection. They reported that most current motif finding algorithms are incapable of detecting the target motifs in their Challenge Problem. In this paper, we show that using an iterative-restart design, our new algorithm can correctly find the target motifs. Furthermore, taking into account the fact that some transcription factors form a dimer or even more complex structures, and transcription process can sometimes involve multiple factors with variable spacers in between, we extend the original problem to an even more challenging one by addressing the issue of combinatorial signals with gaps of variable lengths. To demonstrate the effectiveness of our algorithm, we tested it on a series of the new challenge problem as well as real regulons, and compared it with some current representative motif-finding algorithms.</abstract><cop>Shannon</cop><pub>Elsevier Ireland Ltd</pub><pmid>12468123</pmid><doi>10.1016/S0169-2607(01)00198-5</doi><tpages>10</tpages><oa>free_for_read</oa></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0169-2607 |
ispartof | Computer methods and programs in biomedicine, 2003, Vol.70 (1), p.11-20 |
issn | 0169-2607 1872-7565 |
language | eng |
recordid | cdi_proquest_miscellaneous_72875349 |
source | MEDLINE; Elsevier ScienceDirect Journals Complete |
subjects | Algorithms Biological and medical sciences DNA - chemistry Gaps Gene Expression Regulation Gene regulation Medical sciences Motif detection Sequence Analysis, DNA Subtle signals Transcription factors |
title | Finding subtle motifs with variable gaps in unaligned DNA sequences |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-02T05%3A17%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Finding%20subtle%20motifs%20with%20variable%20gaps%20in%20unaligned%20DNA%20sequences&rft.jtitle=Computer%20methods%20and%20programs%20in%20biomedicine&rft.au=Hu,%20Yuh-Jyh&rft.date=2003&rft.volume=70&rft.issue=1&rft.spage=11&rft.epage=20&rft.pages=11-20&rft.issn=0169-2607&rft.eissn=1872-7565&rft_id=info:doi/10.1016/S0169-2607(01)00198-5&rft_dat=%3Cproquest_cross%3E72875349%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=18649496&rft_id=info:pmid/12468123&rft_els_id=S0169260701001985&rfr_iscdi=true |