Finding subtle motifs with variable gaps in unaligned DNA sequences

Biologists have determined that the control and regulation of gene expression is primarily determined by relatively short sequences in the region surrounding a gene. These sequences vary in length, position, redundancy, orientation, and bases. Finding these short sequences is a fundamental problem i...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Computer methods and programs in biomedicine 2003, Vol.70 (1), p.11-20
1. Verfasser: Hu, Yuh-Jyh
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 20
container_issue 1
container_start_page 11
container_title Computer methods and programs in biomedicine
container_volume 70
creator Hu, Yuh-Jyh
description Biologists have determined that the control and regulation of gene expression is primarily determined by relatively short sequences in the region surrounding a gene. These sequences vary in length, position, redundancy, orientation, and bases. Finding these short sequences is a fundamental problem in molecular biology with important applications. Though there exist many different approaches to signal (i.e. short sequence) finding, some new study shows that this problem still leaves plenty of room for improvement. In 2000, Pevzner and Sze proposed the Challenge Problem of motif detection. They reported that most current motif finding algorithms are incapable of detecting the target motifs in their Challenge Problem. In this paper, we show that using an iterative-restart design, our new algorithm can correctly find the target motifs. Furthermore, taking into account the fact that some transcription factors form a dimer or even more complex structures, and transcription process can sometimes involve multiple factors with variable spacers in between, we extend the original problem to an even more challenging one by addressing the issue of combinatorial signals with gaps of variable lengths. To demonstrate the effectiveness of our algorithm, we tested it on a series of the new challenge problem as well as real regulons, and compared it with some current representative motif-finding algorithms.
doi_str_mv 10.1016/S0169-2607(01)00198-5
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_72875349</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0169260701001985</els_id><sourcerecordid>72875349</sourcerecordid><originalsourceid>FETCH-LOGICAL-c469t-938d2623bcd06da091612b1ff596d5e354b1089be33db92c07ce656f0f8cb56f3</originalsourceid><addsrcrecordid>eNqFkMtOwzAQRS0EgvL4BFA2IFgExk7sxCtUFQpIFSyAteVXilGaFDsB8fe4bQRLNjPS6MzM1UHoGMMlBsyunmPhKWFQnAO-AMC8TOkWGuGyIGlBGd1Go19kD-2H8A4AhFK2i_YwyVmJSTZCk6lrjGvmSehVV9tk0XauCsmX696ST-mdVHE4l8uQuCbpG1m7eWNNcvM4ToL96G2jbThEO5Wsgz0a-gF6nd6-TO7T2dPdw2Q8S3XOeJfyrDSEkUxpA8xI4JhhonBVUc4MtRnNFYaSK5tlRnGiodCWUVZBVWoVe3aAzjZ3l76Nr0MnFi5oW9eysW0fREHKgmY5_xfEJct5zlkE6QbUvg3B20osvVtI_y0wiJVmsdYsVg4FYLHWLGjcOxke9Gphzd_W4DUCpwMgg5Z15WWjXfjj8hxihiJy1xvORm-fznoRtFtJNc5b3QnTun-i_ABYMphQ</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>18649496</pqid></control><display><type>article</type><title>Finding subtle motifs with variable gaps in unaligned DNA sequences</title><source>MEDLINE</source><source>Elsevier ScienceDirect Journals Complete</source><creator>Hu, Yuh-Jyh</creator><creatorcontrib>Hu, Yuh-Jyh</creatorcontrib><description>Biologists have determined that the control and regulation of gene expression is primarily determined by relatively short sequences in the region surrounding a gene. These sequences vary in length, position, redundancy, orientation, and bases. Finding these short sequences is a fundamental problem in molecular biology with important applications. Though there exist many different approaches to signal (i.e. short sequence) finding, some new study shows that this problem still leaves plenty of room for improvement. In 2000, Pevzner and Sze proposed the Challenge Problem of motif detection. They reported that most current motif finding algorithms are incapable of detecting the target motifs in their Challenge Problem. In this paper, we show that using an iterative-restart design, our new algorithm can correctly find the target motifs. Furthermore, taking into account the fact that some transcription factors form a dimer or even more complex structures, and transcription process can sometimes involve multiple factors with variable spacers in between, we extend the original problem to an even more challenging one by addressing the issue of combinatorial signals with gaps of variable lengths. To demonstrate the effectiveness of our algorithm, we tested it on a series of the new challenge problem as well as real regulons, and compared it with some current representative motif-finding algorithms.</description><identifier>ISSN: 0169-2607</identifier><identifier>EISSN: 1872-7565</identifier><identifier>DOI: 10.1016/S0169-2607(01)00198-5</identifier><identifier>PMID: 12468123</identifier><language>eng</language><publisher>Shannon: Elsevier Ireland Ltd</publisher><subject>Algorithms ; Biological and medical sciences ; DNA - chemistry ; Gaps ; Gene Expression Regulation ; Gene regulation ; Medical sciences ; Motif detection ; Sequence Analysis, DNA ; Subtle signals ; Transcription factors</subject><ispartof>Computer methods and programs in biomedicine, 2003, Vol.70 (1), p.11-20</ispartof><rights>2002 Elsevier Science Ireland Ltd</rights><rights>2003 INIST-CNRS</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c469t-938d2623bcd06da091612b1ff596d5e354b1089be33db92c07ce656f0f8cb56f3</citedby><cites>FETCH-LOGICAL-c469t-938d2623bcd06da091612b1ff596d5e354b1089be33db92c07ce656f0f8cb56f3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/S0169-2607(01)00198-5$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,780,784,3550,4024,27923,27924,27925,45995</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=14408647$$DView record in Pascal Francis$$Hfree_for_read</backlink><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/12468123$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Hu, Yuh-Jyh</creatorcontrib><title>Finding subtle motifs with variable gaps in unaligned DNA sequences</title><title>Computer methods and programs in biomedicine</title><addtitle>Comput Methods Programs Biomed</addtitle><description>Biologists have determined that the control and regulation of gene expression is primarily determined by relatively short sequences in the region surrounding a gene. These sequences vary in length, position, redundancy, orientation, and bases. Finding these short sequences is a fundamental problem in molecular biology with important applications. Though there exist many different approaches to signal (i.e. short sequence) finding, some new study shows that this problem still leaves plenty of room for improvement. In 2000, Pevzner and Sze proposed the Challenge Problem of motif detection. They reported that most current motif finding algorithms are incapable of detecting the target motifs in their Challenge Problem. In this paper, we show that using an iterative-restart design, our new algorithm can correctly find the target motifs. Furthermore, taking into account the fact that some transcription factors form a dimer or even more complex structures, and transcription process can sometimes involve multiple factors with variable spacers in between, we extend the original problem to an even more challenging one by addressing the issue of combinatorial signals with gaps of variable lengths. To demonstrate the effectiveness of our algorithm, we tested it on a series of the new challenge problem as well as real regulons, and compared it with some current representative motif-finding algorithms.</description><subject>Algorithms</subject><subject>Biological and medical sciences</subject><subject>DNA - chemistry</subject><subject>Gaps</subject><subject>Gene Expression Regulation</subject><subject>Gene regulation</subject><subject>Medical sciences</subject><subject>Motif detection</subject><subject>Sequence Analysis, DNA</subject><subject>Subtle signals</subject><subject>Transcription factors</subject><issn>0169-2607</issn><issn>1872-7565</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2003</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNqFkMtOwzAQRS0EgvL4BFA2IFgExk7sxCtUFQpIFSyAteVXilGaFDsB8fe4bQRLNjPS6MzM1UHoGMMlBsyunmPhKWFQnAO-AMC8TOkWGuGyIGlBGd1Go19kD-2H8A4AhFK2i_YwyVmJSTZCk6lrjGvmSehVV9tk0XauCsmX696ST-mdVHE4l8uQuCbpG1m7eWNNcvM4ToL96G2jbThEO5Wsgz0a-gF6nd6-TO7T2dPdw2Q8S3XOeJfyrDSEkUxpA8xI4JhhonBVUc4MtRnNFYaSK5tlRnGiodCWUVZBVWoVe3aAzjZ3l76Nr0MnFi5oW9eysW0fREHKgmY5_xfEJct5zlkE6QbUvg3B20osvVtI_y0wiJVmsdYsVg4FYLHWLGjcOxke9Gphzd_W4DUCpwMgg5Z15WWjXfjj8hxihiJy1xvORm-fznoRtFtJNc5b3QnTun-i_ABYMphQ</recordid><startdate>2003</startdate><enddate>2003</enddate><creator>Hu, Yuh-Jyh</creator><general>Elsevier Ireland Ltd</general><general>Elsevier Science</general><scope>IQODW</scope><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7QO</scope><scope>8FD</scope><scope>FR3</scope><scope>P64</scope><scope>7X8</scope></search><sort><creationdate>2003</creationdate><title>Finding subtle motifs with variable gaps in unaligned DNA sequences</title><author>Hu, Yuh-Jyh</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c469t-938d2623bcd06da091612b1ff596d5e354b1089be33db92c07ce656f0f8cb56f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2003</creationdate><topic>Algorithms</topic><topic>Biological and medical sciences</topic><topic>DNA - chemistry</topic><topic>Gaps</topic><topic>Gene Expression Regulation</topic><topic>Gene regulation</topic><topic>Medical sciences</topic><topic>Motif detection</topic><topic>Sequence Analysis, DNA</topic><topic>Subtle signals</topic><topic>Transcription factors</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Hu, Yuh-Jyh</creatorcontrib><collection>Pascal-Francis</collection><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Biotechnology Research Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>MEDLINE - Academic</collection><jtitle>Computer methods and programs in biomedicine</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Hu, Yuh-Jyh</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Finding subtle motifs with variable gaps in unaligned DNA sequences</atitle><jtitle>Computer methods and programs in biomedicine</jtitle><addtitle>Comput Methods Programs Biomed</addtitle><date>2003</date><risdate>2003</risdate><volume>70</volume><issue>1</issue><spage>11</spage><epage>20</epage><pages>11-20</pages><issn>0169-2607</issn><eissn>1872-7565</eissn><abstract>Biologists have determined that the control and regulation of gene expression is primarily determined by relatively short sequences in the region surrounding a gene. These sequences vary in length, position, redundancy, orientation, and bases. Finding these short sequences is a fundamental problem in molecular biology with important applications. Though there exist many different approaches to signal (i.e. short sequence) finding, some new study shows that this problem still leaves plenty of room for improvement. In 2000, Pevzner and Sze proposed the Challenge Problem of motif detection. They reported that most current motif finding algorithms are incapable of detecting the target motifs in their Challenge Problem. In this paper, we show that using an iterative-restart design, our new algorithm can correctly find the target motifs. Furthermore, taking into account the fact that some transcription factors form a dimer or even more complex structures, and transcription process can sometimes involve multiple factors with variable spacers in between, we extend the original problem to an even more challenging one by addressing the issue of combinatorial signals with gaps of variable lengths. To demonstrate the effectiveness of our algorithm, we tested it on a series of the new challenge problem as well as real regulons, and compared it with some current representative motif-finding algorithms.</abstract><cop>Shannon</cop><pub>Elsevier Ireland Ltd</pub><pmid>12468123</pmid><doi>10.1016/S0169-2607(01)00198-5</doi><tpages>10</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0169-2607
ispartof Computer methods and programs in biomedicine, 2003, Vol.70 (1), p.11-20
issn 0169-2607
1872-7565
language eng
recordid cdi_proquest_miscellaneous_72875349
source MEDLINE; Elsevier ScienceDirect Journals Complete
subjects Algorithms
Biological and medical sciences
DNA - chemistry
Gaps
Gene Expression Regulation
Gene regulation
Medical sciences
Motif detection
Sequence Analysis, DNA
Subtle signals
Transcription factors
title Finding subtle motifs with variable gaps in unaligned DNA sequences
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-02T05%3A17%3A23IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Finding%20subtle%20motifs%20with%20variable%20gaps%20in%20unaligned%20DNA%20sequences&rft.jtitle=Computer%20methods%20and%20programs%20in%20biomedicine&rft.au=Hu,%20Yuh-Jyh&rft.date=2003&rft.volume=70&rft.issue=1&rft.spage=11&rft.epage=20&rft.pages=11-20&rft.issn=0169-2607&rft.eissn=1872-7565&rft_id=info:doi/10.1016/S0169-2607(01)00198-5&rft_dat=%3Cproquest_cross%3E72875349%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=18649496&rft_id=info:pmid/12468123&rft_els_id=S0169260701001985&rfr_iscdi=true