A deep learning framework for modeling structural features of RNA-binding protein targets

RNA-binding proteins (RBPs) play important roles in the post-transcriptional control of RNAs. Identifying RBP binding sites and characterizing RBP binding preferences are key steps toward understanding the basic mechanisms of the post-transcriptional gene regulation. Though numerous computational me...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Nucleic acids research 2016-02, Vol.44 (4), p.e32-e32
Hauptverfasser: Zhang, Sai, Zhou, Jingtian, Hu, Hailin, Gong, Haipeng, Chen, Ligong, Cheng, Chao, Zeng, Jianyang
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page e32
container_issue 4
container_start_page e32
container_title Nucleic acids research
container_volume 44
creator Zhang, Sai
Zhou, Jingtian
Hu, Hailin
Gong, Haipeng
Chen, Ligong
Cheng, Chao
Zeng, Jianyang
description RNA-binding proteins (RBPs) play important roles in the post-transcriptional control of RNAs. Identifying RBP binding sites and characterizing RBP binding preferences are key steps toward understanding the basic mechanisms of the post-transcriptional gene regulation. Though numerous computational methods have been developed for modeling RBP binding preferences, discovering a complete structural representation of the RBP targets by integrating their available structural features in all three dimensions is still a challenging task. In this paper, we develop a general and flexible deep learning framework for modeling structural binding preferences and predicting binding sites of RBPs, which takes (predicted) RNA tertiary structural information into account for the first time. Our framework constructs a unified representation that characterizes the structural specificities of RBP targets in all three dimensions, which can be further used to predict novel candidate binding sites and discover potential binding motifs. Through testing on the real CLIP-seq datasets, we have demonstrated that our deep learning framework can automatically extract effective hidden structural features from the encoded raw sequence and structural profiles, and predict accurate RBP binding sites. In addition, we have conducted the first study to show that integrating the additional RNA tertiary structural features can improve the model performance in predicting RBP binding sites, especially for the polypyrimidine tract-binding protein (PTB), which also provides a new evidence to support the view that RBPs may own specific tertiary structural binding preferences. In particular, the tests on the internal ribosome entry site (IRES) segments yield satisfiable results with experimental support from the literature and further demonstrate the necessity of incorporating RNA tertiary structural information into the prediction model. The source code of our approach can be found in https://github.com/thucombio/deepnet-rbp.
doi_str_mv 10.1093/nar/gkv1025
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_4770198</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1769984206</sourcerecordid><originalsourceid>FETCH-LOGICAL-c381t-a686c8d5977a157f0f1f80e4154fde738959532088c79aee13abb7aae0b0f69b3</originalsourceid><addsrcrecordid>eNpVkctLxDAQxoMouj5O3iVHQaqTps3jIiziCxYF0YOnkLaTtdo2a9Iq_vd2cV30NMPMj28-5iPkkMEpA83POhvO5m8fDNJ8g0wYF2mSaZFukglwyBMGmdohuzG-ArCM5dk22UlFJmSmYEKep7RCXNAGbejqbk5dsC1--vBGnQ-09RU2y3Hsw1D2Q7ANdWjHBiP1jj7cTZOi7qolsgi-x7qjvQ1z7OM-2XK2iXiwqnvk6ery8eImmd1f315MZ0nJFesTK5QoVZVrKS3LpQPHnAJcGnUVSq50rnOeglKl1BaRcVsU0lqEApzQBd8j5z-6i6FosSqx60eXZhHq1oYv421t_m-6-sXM_YfJpASm1ShwvBII_n3A2Ju2jiU2je3QD9EwKbRWWQpiRE9-0DL4GAO69RkGZhmGGcMwqzBG-uivszX7-33-DUqoiIA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1769984206</pqid></control><display><type>article</type><title>A deep learning framework for modeling structural features of RNA-binding protein targets</title><source>MEDLINE</source><source>DOAJ Directory of Open Access Journals</source><source>Oxford Journals Open Access Collection</source><source>PubMed Central</source><source>Free Full-Text Journals in Chemistry</source><creator>Zhang, Sai ; Zhou, Jingtian ; Hu, Hailin ; Gong, Haipeng ; Chen, Ligong ; Cheng, Chao ; Zeng, Jianyang</creator><creatorcontrib>Zhang, Sai ; Zhou, Jingtian ; Hu, Hailin ; Gong, Haipeng ; Chen, Ligong ; Cheng, Chao ; Zeng, Jianyang</creatorcontrib><description>RNA-binding proteins (RBPs) play important roles in the post-transcriptional control of RNAs. Identifying RBP binding sites and characterizing RBP binding preferences are key steps toward understanding the basic mechanisms of the post-transcriptional gene regulation. Though numerous computational methods have been developed for modeling RBP binding preferences, discovering a complete structural representation of the RBP targets by integrating their available structural features in all three dimensions is still a challenging task. In this paper, we develop a general and flexible deep learning framework for modeling structural binding preferences and predicting binding sites of RBPs, which takes (predicted) RNA tertiary structural information into account for the first time. Our framework constructs a unified representation that characterizes the structural specificities of RBP targets in all three dimensions, which can be further used to predict novel candidate binding sites and discover potential binding motifs. Through testing on the real CLIP-seq datasets, we have demonstrated that our deep learning framework can automatically extract effective hidden structural features from the encoded raw sequence and structural profiles, and predict accurate RBP binding sites. In addition, we have conducted the first study to show that integrating the additional RNA tertiary structural features can improve the model performance in predicting RBP binding sites, especially for the polypyrimidine tract-binding protein (PTB), which also provides a new evidence to support the view that RBPs may own specific tertiary structural binding preferences. In particular, the tests on the internal ribosome entry site (IRES) segments yield satisfiable results with experimental support from the literature and further demonstrate the necessity of incorporating RNA tertiary structural information into the prediction model. The source code of our approach can be found in https://github.com/thucombio/deepnet-rbp.</description><identifier>ISSN: 0305-1048</identifier><identifier>EISSN: 1362-4962</identifier><identifier>DOI: 10.1093/nar/gkv1025</identifier><identifier>PMID: 26467480</identifier><language>eng</language><publisher>England: Oxford University Press</publisher><subject>Binding Sites ; Computational Biology ; Gene Expression Regulation ; Methods Online ; Nucleic Acid Conformation ; Polypyrimidine Tract-Binding Protein - chemistry ; Polypyrimidine Tract-Binding Protein - genetics ; Ribosomes - chemistry ; Ribosomes - genetics ; RNA Processing, Post-Transcriptional - genetics ; RNA, Messenger - chemistry ; RNA, Messenger - metabolism ; RNA-Binding Proteins - chemistry ; RNA-Binding Proteins - genetics</subject><ispartof>Nucleic acids research, 2016-02, Vol.44 (4), p.e32-e32</ispartof><rights>The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research.</rights><rights>The Author(s) 2015. Published by Oxford University Press on behalf of Nucleic Acids Research. 2016</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c381t-a686c8d5977a157f0f1f80e4154fde738959532088c79aee13abb7aae0b0f69b3</citedby><cites>FETCH-LOGICAL-c381t-a686c8d5977a157f0f1f80e4154fde738959532088c79aee13abb7aae0b0f69b3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC4770198/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC4770198/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,314,724,777,781,861,882,27905,27906,53772,53774</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/26467480$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Zhang, Sai</creatorcontrib><creatorcontrib>Zhou, Jingtian</creatorcontrib><creatorcontrib>Hu, Hailin</creatorcontrib><creatorcontrib>Gong, Haipeng</creatorcontrib><creatorcontrib>Chen, Ligong</creatorcontrib><creatorcontrib>Cheng, Chao</creatorcontrib><creatorcontrib>Zeng, Jianyang</creatorcontrib><title>A deep learning framework for modeling structural features of RNA-binding protein targets</title><title>Nucleic acids research</title><addtitle>Nucleic Acids Res</addtitle><description>RNA-binding proteins (RBPs) play important roles in the post-transcriptional control of RNAs. Identifying RBP binding sites and characterizing RBP binding preferences are key steps toward understanding the basic mechanisms of the post-transcriptional gene regulation. Though numerous computational methods have been developed for modeling RBP binding preferences, discovering a complete structural representation of the RBP targets by integrating their available structural features in all three dimensions is still a challenging task. In this paper, we develop a general and flexible deep learning framework for modeling structural binding preferences and predicting binding sites of RBPs, which takes (predicted) RNA tertiary structural information into account for the first time. Our framework constructs a unified representation that characterizes the structural specificities of RBP targets in all three dimensions, which can be further used to predict novel candidate binding sites and discover potential binding motifs. Through testing on the real CLIP-seq datasets, we have demonstrated that our deep learning framework can automatically extract effective hidden structural features from the encoded raw sequence and structural profiles, and predict accurate RBP binding sites. In addition, we have conducted the first study to show that integrating the additional RNA tertiary structural features can improve the model performance in predicting RBP binding sites, especially for the polypyrimidine tract-binding protein (PTB), which also provides a new evidence to support the view that RBPs may own specific tertiary structural binding preferences. In particular, the tests on the internal ribosome entry site (IRES) segments yield satisfiable results with experimental support from the literature and further demonstrate the necessity of incorporating RNA tertiary structural information into the prediction model. The source code of our approach can be found in https://github.com/thucombio/deepnet-rbp.</description><subject>Binding Sites</subject><subject>Computational Biology</subject><subject>Gene Expression Regulation</subject><subject>Methods Online</subject><subject>Nucleic Acid Conformation</subject><subject>Polypyrimidine Tract-Binding Protein - chemistry</subject><subject>Polypyrimidine Tract-Binding Protein - genetics</subject><subject>Ribosomes - chemistry</subject><subject>Ribosomes - genetics</subject><subject>RNA Processing, Post-Transcriptional - genetics</subject><subject>RNA, Messenger - chemistry</subject><subject>RNA, Messenger - metabolism</subject><subject>RNA-Binding Proteins - chemistry</subject><subject>RNA-Binding Proteins - genetics</subject><issn>0305-1048</issn><issn>1362-4962</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2016</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNpVkctLxDAQxoMouj5O3iVHQaqTps3jIiziCxYF0YOnkLaTtdo2a9Iq_vd2cV30NMPMj28-5iPkkMEpA83POhvO5m8fDNJ8g0wYF2mSaZFukglwyBMGmdohuzG-ArCM5dk22UlFJmSmYEKep7RCXNAGbejqbk5dsC1--vBGnQ-09RU2y3Hsw1D2Q7ANdWjHBiP1jj7cTZOi7qolsgi-x7qjvQ1z7OM-2XK2iXiwqnvk6ery8eImmd1f315MZ0nJFesTK5QoVZVrKS3LpQPHnAJcGnUVSq50rnOeglKl1BaRcVsU0lqEApzQBd8j5z-6i6FosSqx60eXZhHq1oYv421t_m-6-sXM_YfJpASm1ShwvBII_n3A2Ju2jiU2je3QD9EwKbRWWQpiRE9-0DL4GAO69RkGZhmGGcMwqzBG-uivszX7-33-DUqoiIA</recordid><startdate>20160229</startdate><enddate>20160229</enddate><creator>Zhang, Sai</creator><creator>Zhou, Jingtian</creator><creator>Hu, Hailin</creator><creator>Gong, Haipeng</creator><creator>Chen, Ligong</creator><creator>Cheng, Chao</creator><creator>Zeng, Jianyang</creator><general>Oxford University Press</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20160229</creationdate><title>A deep learning framework for modeling structural features of RNA-binding protein targets</title><author>Zhang, Sai ; Zhou, Jingtian ; Hu, Hailin ; Gong, Haipeng ; Chen, Ligong ; Cheng, Chao ; Zeng, Jianyang</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c381t-a686c8d5977a157f0f1f80e4154fde738959532088c79aee13abb7aae0b0f69b3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2016</creationdate><topic>Binding Sites</topic><topic>Computational Biology</topic><topic>Gene Expression Regulation</topic><topic>Methods Online</topic><topic>Nucleic Acid Conformation</topic><topic>Polypyrimidine Tract-Binding Protein - chemistry</topic><topic>Polypyrimidine Tract-Binding Protein - genetics</topic><topic>Ribosomes - chemistry</topic><topic>Ribosomes - genetics</topic><topic>RNA Processing, Post-Transcriptional - genetics</topic><topic>RNA, Messenger - chemistry</topic><topic>RNA, Messenger - metabolism</topic><topic>RNA-Binding Proteins - chemistry</topic><topic>RNA-Binding Proteins - genetics</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Zhang, Sai</creatorcontrib><creatorcontrib>Zhou, Jingtian</creatorcontrib><creatorcontrib>Hu, Hailin</creatorcontrib><creatorcontrib>Gong, Haipeng</creatorcontrib><creatorcontrib>Chen, Ligong</creatorcontrib><creatorcontrib>Cheng, Chao</creatorcontrib><creatorcontrib>Zeng, Jianyang</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Nucleic acids research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Zhang, Sai</au><au>Zhou, Jingtian</au><au>Hu, Hailin</au><au>Gong, Haipeng</au><au>Chen, Ligong</au><au>Cheng, Chao</au><au>Zeng, Jianyang</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A deep learning framework for modeling structural features of RNA-binding protein targets</atitle><jtitle>Nucleic acids research</jtitle><addtitle>Nucleic Acids Res</addtitle><date>2016-02-29</date><risdate>2016</risdate><volume>44</volume><issue>4</issue><spage>e32</spage><epage>e32</epage><pages>e32-e32</pages><issn>0305-1048</issn><eissn>1362-4962</eissn><abstract>RNA-binding proteins (RBPs) play important roles in the post-transcriptional control of RNAs. Identifying RBP binding sites and characterizing RBP binding preferences are key steps toward understanding the basic mechanisms of the post-transcriptional gene regulation. Though numerous computational methods have been developed for modeling RBP binding preferences, discovering a complete structural representation of the RBP targets by integrating their available structural features in all three dimensions is still a challenging task. In this paper, we develop a general and flexible deep learning framework for modeling structural binding preferences and predicting binding sites of RBPs, which takes (predicted) RNA tertiary structural information into account for the first time. Our framework constructs a unified representation that characterizes the structural specificities of RBP targets in all three dimensions, which can be further used to predict novel candidate binding sites and discover potential binding motifs. Through testing on the real CLIP-seq datasets, we have demonstrated that our deep learning framework can automatically extract effective hidden structural features from the encoded raw sequence and structural profiles, and predict accurate RBP binding sites. In addition, we have conducted the first study to show that integrating the additional RNA tertiary structural features can improve the model performance in predicting RBP binding sites, especially for the polypyrimidine tract-binding protein (PTB), which also provides a new evidence to support the view that RBPs may own specific tertiary structural binding preferences. In particular, the tests on the internal ribosome entry site (IRES) segments yield satisfiable results with experimental support from the literature and further demonstrate the necessity of incorporating RNA tertiary structural information into the prediction model. The source code of our approach can be found in https://github.com/thucombio/deepnet-rbp.</abstract><cop>England</cop><pub>Oxford University Press</pub><pmid>26467480</pmid><doi>10.1093/nar/gkv1025</doi><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0305-1048
ispartof Nucleic acids research, 2016-02, Vol.44 (4), p.e32-e32
issn 0305-1048
1362-4962
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_4770198
source MEDLINE; DOAJ Directory of Open Access Journals; Oxford Journals Open Access Collection; PubMed Central; Free Full-Text Journals in Chemistry
subjects Binding Sites
Computational Biology
Gene Expression Regulation
Methods Online
Nucleic Acid Conformation
Polypyrimidine Tract-Binding Protein - chemistry
Polypyrimidine Tract-Binding Protein - genetics
Ribosomes - chemistry
Ribosomes - genetics
RNA Processing, Post-Transcriptional - genetics
RNA, Messenger - chemistry
RNA, Messenger - metabolism
RNA-Binding Proteins - chemistry
RNA-Binding Proteins - genetics
title A deep learning framework for modeling structural features of RNA-binding protein targets
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-19T15%3A12%3A51IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20deep%20learning%20framework%20for%20modeling%20structural%20features%20of%20RNA-binding%20protein%20targets&rft.jtitle=Nucleic%20acids%20research&rft.au=Zhang,%20Sai&rft.date=2016-02-29&rft.volume=44&rft.issue=4&rft.spage=e32&rft.epage=e32&rft.pages=e32-e32&rft.issn=0305-1048&rft.eissn=1362-4962&rft_id=info:doi/10.1093/nar/gkv1025&rft_dat=%3Cproquest_pubme%3E1769984206%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1769984206&rft_id=info:pmid/26467480&rfr_iscdi=true