Discovering DNA Motifs with Nucleotide Dependency

The problem of finding motifs of binding sites is very important to the understanding of gene regulatory networks. Motifs are generally represented by matrices (PWM or PSSM) or strings. However, these representations cannot model biological binding sites well because they fail to capture nucleotide...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Leung, H.C.M., Chin, F.Y.L.
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 80
container_issue
container_start_page 70
container_title
container_volume
creator Leung, H.C.M.
Chin, F.Y.L.
description The problem of finding motifs of binding sites is very important to the understanding of gene regulatory networks. Motifs are generally represented by matrices (PWM or PSSM) or strings. However, these representations cannot model biological binding sites well because they fail to capture nucleotide interdependence. It has been pointed out by many researchers that the nucleotides of the DNA binding site cannot be treated independently, e.g. the binding of zinc finger in proteins. In this paper, a new representation called Scored Position Specific Pattern (SPSP), which is a generalization of the matrix and string representations, is introduced which takes into consideration the dependent occurrences of neighboring nucleotides. Even though the problem of finding the optimal motif in SPSP representation is proved to be NP-hard, we introduce a heuristic algorithm called SPSP-Finder, which can effectively find optimal motifs in most simulated cases and some real cases for which existing popular motif-finding software, such as MEME and AlignACE fail
doi_str_mv 10.1109/BIBE.2006.253318
format Conference Proceeding
fullrecord <record><control><sourceid>ieee_6IE</sourceid><recordid>TN_cdi_ieee_primary_4019643</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><ieee_id>4019643</ieee_id><sourcerecordid>4019643</sourcerecordid><originalsourceid>FETCH-LOGICAL-i206t-94c96d400d737794b39338c81db0b2e79845d5549ea3f62d88cab94cb2a008af3</originalsourceid><addsrcrecordid>eNotjktLw0AURgdEUGr2BTfzBxLvPDIzd9k2VQu1buy6zONGR2pakqj031vR1cdZnMPH2FRAJQTg3Xw1X1YSwFSyVkq4C1agdWAN1tJKa69YMQzvACDQoNBwzUSTh3j4oj53r7zZzPjTYcztwL_z-MY3n3FPZ07EGzpSl6iLpxt22fr9QMX_Ttj2fvmyeCzXzw-rxWxdZglmLFFHNEkDJKusRR0UKuWiEylAkGTR6TrVtUbyqjUyORd9OEtBegDnWzVht3_dTES7Y58_fH_a6d_rWqkfAf1Bug</addsrcrecordid><sourcetype>Publisher</sourcetype><iscdi>true</iscdi><recordtype>conference_proceeding</recordtype></control><display><type>conference_proceeding</type><title>Discovering DNA Motifs with Nucleotide Dependency</title><source>IEEE Electronic Library (IEL) Conference Proceedings</source><creator>Leung, H.C.M. ; Chin, F.Y.L.</creator><creatorcontrib>Leung, H.C.M. ; Chin, F.Y.L.</creatorcontrib><description>The problem of finding motifs of binding sites is very important to the understanding of gene regulatory networks. Motifs are generally represented by matrices (PWM or PSSM) or strings. However, these representations cannot model biological binding sites well because they fail to capture nucleotide interdependence. It has been pointed out by many researchers that the nucleotides of the DNA binding site cannot be treated independently, e.g. the binding of zinc finger in proteins. In this paper, a new representation called Scored Position Specific Pattern (SPSP), which is a generalization of the matrix and string representations, is introduced which takes into consideration the dependent occurrences of neighboring nucleotides. Even though the problem of finding the optimal motif in SPSP representation is proved to be NP-hard, we introduce a heuristic algorithm called SPSP-Finder, which can effectively find optimal motifs in most simulated cases and some real cases for which existing popular motif-finding software, such as MEME and AlignACE fail</description><identifier>ISBN: 9780769527277</identifier><identifier>ISBN: 0769527272</identifier><identifier>DOI: 10.1109/BIBE.2006.253318</identifier><language>eng</language><publisher>IEEE</publisher><subject>Biological system modeling ; Computer science ; DNA ; Fingers ; Heuristic algorithms ; Hidden Markov models ; Proteins ; Pulse width modulation ; Software algorithms ; Zinc</subject><ispartof>Sixth IEEE Symposium on BioInformatics and BioEngineering (BIBE'06), 2006, p.70-80</ispartof><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://ieeexplore.ieee.org/document/4019643$$EHTML$$P50$$Gieee$$H</linktohtml><link.rule.ids>309,310,780,784,789,790,2056,27924,54919</link.rule.ids><linktorsrc>$$Uhttps://ieeexplore.ieee.org/document/4019643$$EView_record_in_IEEE$$FView_record_in_$$GIEEE</linktorsrc></links><search><creatorcontrib>Leung, H.C.M.</creatorcontrib><creatorcontrib>Chin, F.Y.L.</creatorcontrib><title>Discovering DNA Motifs with Nucleotide Dependency</title><title>Sixth IEEE Symposium on BioInformatics and BioEngineering (BIBE'06)</title><addtitle>BIBE</addtitle><description>The problem of finding motifs of binding sites is very important to the understanding of gene regulatory networks. Motifs are generally represented by matrices (PWM or PSSM) or strings. However, these representations cannot model biological binding sites well because they fail to capture nucleotide interdependence. It has been pointed out by many researchers that the nucleotides of the DNA binding site cannot be treated independently, e.g. the binding of zinc finger in proteins. In this paper, a new representation called Scored Position Specific Pattern (SPSP), which is a generalization of the matrix and string representations, is introduced which takes into consideration the dependent occurrences of neighboring nucleotides. Even though the problem of finding the optimal motif in SPSP representation is proved to be NP-hard, we introduce a heuristic algorithm called SPSP-Finder, which can effectively find optimal motifs in most simulated cases and some real cases for which existing popular motif-finding software, such as MEME and AlignACE fail</description><subject>Biological system modeling</subject><subject>Computer science</subject><subject>DNA</subject><subject>Fingers</subject><subject>Heuristic algorithms</subject><subject>Hidden Markov models</subject><subject>Proteins</subject><subject>Pulse width modulation</subject><subject>Software algorithms</subject><subject>Zinc</subject><isbn>9780769527277</isbn><isbn>0769527272</isbn><fulltext>true</fulltext><rsrctype>conference_proceeding</rsrctype><creationdate>2006</creationdate><recordtype>conference_proceeding</recordtype><sourceid>6IE</sourceid><sourceid>RIE</sourceid><recordid>eNotjktLw0AURgdEUGr2BTfzBxLvPDIzd9k2VQu1buy6zONGR2pakqj031vR1cdZnMPH2FRAJQTg3Xw1X1YSwFSyVkq4C1agdWAN1tJKa69YMQzvACDQoNBwzUSTh3j4oj53r7zZzPjTYcztwL_z-MY3n3FPZ07EGzpSl6iLpxt22fr9QMX_Ttj2fvmyeCzXzw-rxWxdZglmLFFHNEkDJKusRR0UKuWiEylAkGTR6TrVtUbyqjUyORd9OEtBegDnWzVht3_dTES7Y58_fH_a6d_rWqkfAf1Bug</recordid><startdate>20060101</startdate><enddate>20060101</enddate><creator>Leung, H.C.M.</creator><creator>Chin, F.Y.L.</creator><general>IEEE</general><scope>6IE</scope><scope>6IL</scope><scope>CBEJK</scope><scope>RIE</scope><scope>RIL</scope></search><sort><creationdate>20060101</creationdate><title>Discovering DNA Motifs with Nucleotide Dependency</title><author>Leung, H.C.M. ; Chin, F.Y.L.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-i206t-94c96d400d737794b39338c81db0b2e79845d5549ea3f62d88cab94cb2a008af3</frbrgroupid><rsrctype>conference_proceedings</rsrctype><prefilter>conference_proceedings</prefilter><language>eng</language><creationdate>2006</creationdate><topic>Biological system modeling</topic><topic>Computer science</topic><topic>DNA</topic><topic>Fingers</topic><topic>Heuristic algorithms</topic><topic>Hidden Markov models</topic><topic>Proteins</topic><topic>Pulse width modulation</topic><topic>Software algorithms</topic><topic>Zinc</topic><toplevel>online_resources</toplevel><creatorcontrib>Leung, H.C.M.</creatorcontrib><creatorcontrib>Chin, F.Y.L.</creatorcontrib><collection>IEEE Electronic Library (IEL) Conference Proceedings</collection><collection>IEEE Proceedings Order Plan All Online (POP All Online) 1998-present by volume</collection><collection>IEEE Xplore All Conference Proceedings</collection><collection>IEEE Electronic Library (IEL)</collection><collection>IEEE Proceedings Order Plans (POP All) 1998-Present</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>Leung, H.C.M.</au><au>Chin, F.Y.L.</au><format>book</format><genre>proceeding</genre><ristype>CONF</ristype><atitle>Discovering DNA Motifs with Nucleotide Dependency</atitle><btitle>Sixth IEEE Symposium on BioInformatics and BioEngineering (BIBE'06)</btitle><stitle>BIBE</stitle><date>2006-01-01</date><risdate>2006</risdate><spage>70</spage><epage>80</epage><pages>70-80</pages><isbn>9780769527277</isbn><isbn>0769527272</isbn><abstract>The problem of finding motifs of binding sites is very important to the understanding of gene regulatory networks. Motifs are generally represented by matrices (PWM or PSSM) or strings. However, these representations cannot model biological binding sites well because they fail to capture nucleotide interdependence. It has been pointed out by many researchers that the nucleotides of the DNA binding site cannot be treated independently, e.g. the binding of zinc finger in proteins. In this paper, a new representation called Scored Position Specific Pattern (SPSP), which is a generalization of the matrix and string representations, is introduced which takes into consideration the dependent occurrences of neighboring nucleotides. Even though the problem of finding the optimal motif in SPSP representation is proved to be NP-hard, we introduce a heuristic algorithm called SPSP-Finder, which can effectively find optimal motifs in most simulated cases and some real cases for which existing popular motif-finding software, such as MEME and AlignACE fail</abstract><pub>IEEE</pub><doi>10.1109/BIBE.2006.253318</doi><tpages>11</tpages></addata></record>
fulltext fulltext_linktorsrc
identifier ISBN: 9780769527277
ispartof Sixth IEEE Symposium on BioInformatics and BioEngineering (BIBE'06), 2006, p.70-80
issn
language eng
recordid cdi_ieee_primary_4019643
source IEEE Electronic Library (IEL) Conference Proceedings
subjects Biological system modeling
Computer science
DNA
Fingers
Heuristic algorithms
Hidden Markov models
Proteins
Pulse width modulation
Software algorithms
Zinc
title Discovering DNA Motifs with Nucleotide Dependency
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-12T01%3A57%3A47IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-ieee_6IE&rft_val_fmt=info:ofi/fmt:kev:mtx:book&rft.genre=proceeding&rft.atitle=Discovering%20DNA%20Motifs%20with%20Nucleotide%20Dependency&rft.btitle=Sixth%20IEEE%20Symposium%20on%20BioInformatics%20and%20BioEngineering%20(BIBE'06)&rft.au=Leung,%20H.C.M.&rft.date=2006-01-01&rft.spage=70&rft.epage=80&rft.pages=70-80&rft.isbn=9780769527277&rft.isbn_list=0769527272&rft_id=info:doi/10.1109/BIBE.2006.253318&rft_dat=%3Cieee_6IE%3E4019643%3C/ieee_6IE%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rft_ieee_id=4019643&rfr_iscdi=true