Full-length ribosome density prediction by a multi-input and multi-output model

Translation elongation is regulated by a series of complicated mechanisms in both prokaryotes and eukaryotes. Although recent advance in ribosome profiling techniques has enabled one to capture the genome-wide ribosome footprints along transcripts at codon resolution, the regulatory codes of elongat...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:PLoS computational biology 2021-03, Vol.17 (3), p.e1008842-e1008842, Article 1008842
Hauptverfasser: Tian, Tingzhong, Li, Shuya, Lang, Peng, Zhao, Dan, Zeng, Jianyang
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page e1008842
container_issue 3
container_start_page e1008842
container_title PLoS computational biology
container_volume 17
creator Tian, Tingzhong
Li, Shuya
Lang, Peng
Zhao, Dan
Zeng, Jianyang
description Translation elongation is regulated by a series of complicated mechanisms in both prokaryotes and eukaryotes. Although recent advance in ribosome profiling techniques has enabled one to capture the genome-wide ribosome footprints along transcripts at codon resolution, the regulatory codes of elongation dynamics are still not fully understood. Most of the existing computational approaches for modeling translation elongation from ribosome profiling data mainly focus on local contextual patterns, while ignoring the continuity of the elongation process and relations between ribosome densities of remote codons. Modeling the translation elongation process in full-length coding sequence (CDS) level has not been studied to the best of our knowledge. In this paper, we developed a deep learning based approach with a multi-input and multi-output framework, named RiboMIMO, for modeling the ribosome density distributions of full-length mRNA CDS regions. Through considering the underlying correlations in translation efficiency among neighboring and remote codons and extracting hidden features from the input full-length coding sequence, RiboMIMO can greatly outperform the state-of-the-art baseline approaches and accurately predict the ribosome density distributions along the whole mRNA CDS regions. In addition, RiboMIMO explores the contributions of individual input codons to the predictions of output ribosome densities, which thus can help reveal important biological factors influencing the translation elongation process. The analyses, based on our interpretable metric named codon impact score, not only identified several patterns consistent with the previously-published literatures, but also for the first time (to the best of our knowledge) revealed that the codons located at a long distance from the ribosomal A site may also have an association on the translation elongation rate. This finding of long-range impact on translation elongation velocity may shed new light on the regulatory mechanisms of protein synthesis. Overall, these results indicated that RiboMIMO can provide a useful tool for studying the regulation of translation elongation in the range of full-length CDS. Author summary Translation elongation is a process in which amino acids are linked into proteins by ribosomes in cells. Translation elongation rates along the mRNAs are not constant, and are regulated by a series of mechanisms, such as codon rarity and mRNA stability. In this study, we modeled the t
doi_str_mv 10.1371/journal.pcbi.1008842
format Article
fullrecord <record><control><sourceid>gale_proqu</sourceid><recordid>TN_cdi_proquest_miscellaneous_2506275200</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><galeid>A658712976</galeid><doaj_id>oai_doaj_org_article_99c0101d8448448d9b9aa3d81c36629a</doaj_id><sourcerecordid>A658712976</sourcerecordid><originalsourceid>FETCH-LOGICAL-c633t-41685cee57c1cf6007cd39e8ffaca04269290ae8810a50efbc1d46ba9f8a39a73</originalsourceid><addsrcrecordid>eNqVkl2L1DAUhoso7jr6D0QL3ijSMWnaNLkRlsHVgcUFP8C7kCansxnSZGxSdf69qdMdHPFCaaDJyXNOTt68WfYYoyUmDX619ePgpF3uVGuWGCHGqvJOdo7rmhQNqdnd47z6cpY9CGGLUApzej87I6RpEGqq8-z6crS2sOA28SYfTOuD7yHX4IKJ-3w3gDYqGu_ydp_LvB9tNIVxuzHm0ul57cc4BXqvwT7M7nXSBng0_xfZ58s3n1bviqvrt-vVxVWhKCGxqDBltQKoG4VVR1MvShMOrOukkqgqKS85ksAYRrJG0LUK64q2kndMEi4bssieHururA9i1iKIssaEMpK0SMT6QGgvt2I3mF4Oe-GlEb8CftgIOUSjLAjOFcIIa1ZV09C85VISzbAilJZcplqv59PGtgetwMVB2pOipzvO3IiN_yYYKikiUzPP5wKD_zpCiKI3QYG10oEfp74RLZu6TE-0yJ79gf79dssDtZHpAsZ1Pp2r0qehN8o76EyKX9CaNbjkDU0JL04SEhPhR9zIMQSx_vjhP9j3p2x1YNXgQxigO6qCkZiMetu-mIwqZqOmtCe_K3pMunVmAl4egO_Q-i4oA07BEUMIUVI1nCQ4aZZo9u_0ykQ5eXrlRxfJT6nOBZA</addsrcrecordid><sourcetype>Open Website</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2513683884</pqid></control><display><type>article</type><title>Full-length ribosome density prediction by a multi-input and multi-output model</title><source>DOAJ Directory of Open Access Journals</source><source>Public Library of Science (PLoS) Journals Open Access</source><source>Web of Science - Science Citation Index Expanded - 2021&lt;img src="https://exlibris-pub.s3.amazonaws.com/fromwos-v2.jpg" /&gt;</source><source>EZB-FREE-00999 freely available EZB journals</source><source>PubMed Central</source><creator>Tian, Tingzhong ; Li, Shuya ; Lang, Peng ; Zhao, Dan ; Zeng, Jianyang</creator><contributor>Roy, Sushmita</contributor><creatorcontrib>Tian, Tingzhong ; Li, Shuya ; Lang, Peng ; Zhao, Dan ; Zeng, Jianyang ; Roy, Sushmita</creatorcontrib><description>Translation elongation is regulated by a series of complicated mechanisms in both prokaryotes and eukaryotes. Although recent advance in ribosome profiling techniques has enabled one to capture the genome-wide ribosome footprints along transcripts at codon resolution, the regulatory codes of elongation dynamics are still not fully understood. Most of the existing computational approaches for modeling translation elongation from ribosome profiling data mainly focus on local contextual patterns, while ignoring the continuity of the elongation process and relations between ribosome densities of remote codons. Modeling the translation elongation process in full-length coding sequence (CDS) level has not been studied to the best of our knowledge. In this paper, we developed a deep learning based approach with a multi-input and multi-output framework, named RiboMIMO, for modeling the ribosome density distributions of full-length mRNA CDS regions. Through considering the underlying correlations in translation efficiency among neighboring and remote codons and extracting hidden features from the input full-length coding sequence, RiboMIMO can greatly outperform the state-of-the-art baseline approaches and accurately predict the ribosome density distributions along the whole mRNA CDS regions. In addition, RiboMIMO explores the contributions of individual input codons to the predictions of output ribosome densities, which thus can help reveal important biological factors influencing the translation elongation process. The analyses, based on our interpretable metric named codon impact score, not only identified several patterns consistent with the previously-published literatures, but also for the first time (to the best of our knowledge) revealed that the codons located at a long distance from the ribosomal A site may also have an association on the translation elongation rate. This finding of long-range impact on translation elongation velocity may shed new light on the regulatory mechanisms of protein synthesis. Overall, these results indicated that RiboMIMO can provide a useful tool for studying the regulation of translation elongation in the range of full-length CDS. Author summary Translation elongation is a process in which amino acids are linked into proteins by ribosomes in cells. Translation elongation rates along the mRNAs are not constant, and are regulated by a series of mechanisms, such as codon rarity and mRNA stability. In this study, we modeled the translation elongation process at a full-length coding sequence level and developed a deep learning based approach to predict the translation elongation rates from mRNA sequences, through extracting the regulatory codes of elongation rates from the contextual sequences. The analyses, based on our interpretable metric named codon impact score, for the first time (to the best of our knowledge), revealed that in addition to the neighboring codons of the ribosomal A sites, the remote codons may also have an important impact on the translation elongation rates. This new finding may stimulate additional experiments and shed light on the regulatory mechanisms of protein synthesis.</description><identifier>ISSN: 1553-734X</identifier><identifier>ISSN: 1553-7358</identifier><identifier>EISSN: 1553-7358</identifier><identifier>DOI: 10.1371/journal.pcbi.1008842</identifier><identifier>PMID: 33770074</identifier><language>eng</language><publisher>SAN FRANCISCO: Public Library Science</publisher><subject>Biochemical Research Methods ; Biochemistry &amp; Molecular Biology ; Biology and Life Sciences ; Codons ; Computational biology ; Computer and Information Sciences ; Datasets ; Deep learning ; Density ; Elongation ; Forecasts and trends ; Gene sequencing ; Genomes ; Life Sciences &amp; Biomedicine ; Mathematical &amp; Computational Biology ; Measurement ; Mechanical properties ; Methods ; Model testing ; Neural networks ; Physical Sciences ; Proteins ; Research and Analysis Methods ; Ribosomes ; RNA sequencing ; Science &amp; Technology ; Transcription ; Translation elongation</subject><ispartof>PLoS computational biology, 2021-03, Vol.17 (3), p.e1008842-e1008842, Article 1008842</ispartof><rights>COPYRIGHT 2021 Public Library of Science</rights><rights>2021 Tian et al. This is an open access article distributed under the terms of the Creative Commons Attribution License: http://creativecommons.org/licenses/by/4.0/ (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original author and source are credited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License.</rights><rights>2021 Tian et al 2021 Tian et al</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>true</woscitedreferencessubscribed><woscitedreferencescount>4</woscitedreferencescount><woscitedreferencesoriginalsourcerecordid>wos000634793700003</woscitedreferencesoriginalsourcerecordid><citedby>FETCH-LOGICAL-c633t-41685cee57c1cf6007cd39e8ffaca04269290ae8810a50efbc1d46ba9f8a39a73</citedby><cites>FETCH-LOGICAL-c633t-41685cee57c1cf6007cd39e8ffaca04269290ae8810a50efbc1d46ba9f8a39a73</cites><orcidid>0000-0003-4899-0632 ; 0000-0003-0195-6031</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC8026034/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC8026034/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,315,728,781,785,865,886,2103,2115,2929,23871,27929,27930,39263,53796,53798</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/33770074$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><contributor>Roy, Sushmita</contributor><creatorcontrib>Tian, Tingzhong</creatorcontrib><creatorcontrib>Li, Shuya</creatorcontrib><creatorcontrib>Lang, Peng</creatorcontrib><creatorcontrib>Zhao, Dan</creatorcontrib><creatorcontrib>Zeng, Jianyang</creatorcontrib><title>Full-length ribosome density prediction by a multi-input and multi-output model</title><title>PLoS computational biology</title><addtitle>PLOS COMPUT BIOL</addtitle><addtitle>PLoS Comput Biol</addtitle><description>Translation elongation is regulated by a series of complicated mechanisms in both prokaryotes and eukaryotes. Although recent advance in ribosome profiling techniques has enabled one to capture the genome-wide ribosome footprints along transcripts at codon resolution, the regulatory codes of elongation dynamics are still not fully understood. Most of the existing computational approaches for modeling translation elongation from ribosome profiling data mainly focus on local contextual patterns, while ignoring the continuity of the elongation process and relations between ribosome densities of remote codons. Modeling the translation elongation process in full-length coding sequence (CDS) level has not been studied to the best of our knowledge. In this paper, we developed a deep learning based approach with a multi-input and multi-output framework, named RiboMIMO, for modeling the ribosome density distributions of full-length mRNA CDS regions. Through considering the underlying correlations in translation efficiency among neighboring and remote codons and extracting hidden features from the input full-length coding sequence, RiboMIMO can greatly outperform the state-of-the-art baseline approaches and accurately predict the ribosome density distributions along the whole mRNA CDS regions. In addition, RiboMIMO explores the contributions of individual input codons to the predictions of output ribosome densities, which thus can help reveal important biological factors influencing the translation elongation process. The analyses, based on our interpretable metric named codon impact score, not only identified several patterns consistent with the previously-published literatures, but also for the first time (to the best of our knowledge) revealed that the codons located at a long distance from the ribosomal A site may also have an association on the translation elongation rate. This finding of long-range impact on translation elongation velocity may shed new light on the regulatory mechanisms of protein synthesis. Overall, these results indicated that RiboMIMO can provide a useful tool for studying the regulation of translation elongation in the range of full-length CDS. Author summary Translation elongation is a process in which amino acids are linked into proteins by ribosomes in cells. Translation elongation rates along the mRNAs are not constant, and are regulated by a series of mechanisms, such as codon rarity and mRNA stability. In this study, we modeled the translation elongation process at a full-length coding sequence level and developed a deep learning based approach to predict the translation elongation rates from mRNA sequences, through extracting the regulatory codes of elongation rates from the contextual sequences. The analyses, based on our interpretable metric named codon impact score, for the first time (to the best of our knowledge), revealed that in addition to the neighboring codons of the ribosomal A sites, the remote codons may also have an important impact on the translation elongation rates. This new finding may stimulate additional experiments and shed light on the regulatory mechanisms of protein synthesis.</description><subject>Biochemical Research Methods</subject><subject>Biochemistry &amp; Molecular Biology</subject><subject>Biology and Life Sciences</subject><subject>Codons</subject><subject>Computational biology</subject><subject>Computer and Information Sciences</subject><subject>Datasets</subject><subject>Deep learning</subject><subject>Density</subject><subject>Elongation</subject><subject>Forecasts and trends</subject><subject>Gene sequencing</subject><subject>Genomes</subject><subject>Life Sciences &amp; Biomedicine</subject><subject>Mathematical &amp; Computational Biology</subject><subject>Measurement</subject><subject>Mechanical properties</subject><subject>Methods</subject><subject>Model testing</subject><subject>Neural networks</subject><subject>Physical Sciences</subject><subject>Proteins</subject><subject>Research and Analysis Methods</subject><subject>Ribosomes</subject><subject>RNA sequencing</subject><subject>Science &amp; Technology</subject><subject>Transcription</subject><subject>Translation elongation</subject><issn>1553-734X</issn><issn>1553-7358</issn><issn>1553-7358</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>HGBXW</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><sourceid>DOA</sourceid><recordid>eNqVkl2L1DAUhoso7jr6D0QL3ijSMWnaNLkRlsHVgcUFP8C7kCansxnSZGxSdf69qdMdHPFCaaDJyXNOTt68WfYYoyUmDX619ePgpF3uVGuWGCHGqvJOdo7rmhQNqdnd47z6cpY9CGGLUApzej87I6RpEGqq8-z6crS2sOA28SYfTOuD7yHX4IKJ-3w3gDYqGu_ydp_LvB9tNIVxuzHm0ul57cc4BXqvwT7M7nXSBng0_xfZ58s3n1bviqvrt-vVxVWhKCGxqDBltQKoG4VVR1MvShMOrOukkqgqKS85ksAYRrJG0LUK64q2kndMEi4bssieHururA9i1iKIssaEMpK0SMT6QGgvt2I3mF4Oe-GlEb8CftgIOUSjLAjOFcIIa1ZV09C85VISzbAilJZcplqv59PGtgetwMVB2pOipzvO3IiN_yYYKikiUzPP5wKD_zpCiKI3QYG10oEfp74RLZu6TE-0yJ79gf79dssDtZHpAsZ1Pp2r0qehN8o76EyKX9CaNbjkDU0JL04SEhPhR9zIMQSx_vjhP9j3p2x1YNXgQxigO6qCkZiMetu-mIwqZqOmtCe_K3pMunVmAl4egO_Q-i4oA07BEUMIUVI1nCQ4aZZo9u_0ykQ5eXrlRxfJT6nOBZA</recordid><startdate>20210301</startdate><enddate>20210301</enddate><creator>Tian, Tingzhong</creator><creator>Li, Shuya</creator><creator>Lang, Peng</creator><creator>Zhao, Dan</creator><creator>Zeng, Jianyang</creator><general>Public Library Science</general><general>Public Library of Science</general><general>Public Library of Science (PLoS)</general><scope>BLEPL</scope><scope>DTL</scope><scope>HGBXW</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>ISN</scope><scope>ISR</scope><scope>3V.</scope><scope>7QO</scope><scope>7QP</scope><scope>7TK</scope><scope>7TM</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8AL</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>BHPHI</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>K9.</scope><scope>LK8</scope><scope>M0N</scope><scope>M0S</scope><scope>M1P</scope><scope>M7P</scope><scope>P5Z</scope><scope>P62</scope><scope>P64</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>Q9U</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope><scope>DOA</scope><orcidid>https://orcid.org/0000-0003-4899-0632</orcidid><orcidid>https://orcid.org/0000-0003-0195-6031</orcidid></search><sort><creationdate>20210301</creationdate><title>Full-length ribosome density prediction by a multi-input and multi-output model</title><author>Tian, Tingzhong ; Li, Shuya ; Lang, Peng ; Zhao, Dan ; Zeng, Jianyang</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c633t-41685cee57c1cf6007cd39e8ffaca04269290ae8810a50efbc1d46ba9f8a39a73</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Biochemical Research Methods</topic><topic>Biochemistry &amp; Molecular Biology</topic><topic>Biology and Life Sciences</topic><topic>Codons</topic><topic>Computational biology</topic><topic>Computer and Information Sciences</topic><topic>Datasets</topic><topic>Deep learning</topic><topic>Density</topic><topic>Elongation</topic><topic>Forecasts and trends</topic><topic>Gene sequencing</topic><topic>Genomes</topic><topic>Life Sciences &amp; Biomedicine</topic><topic>Mathematical &amp; Computational Biology</topic><topic>Measurement</topic><topic>Mechanical properties</topic><topic>Methods</topic><topic>Model testing</topic><topic>Neural networks</topic><topic>Physical Sciences</topic><topic>Proteins</topic><topic>Research and Analysis Methods</topic><topic>Ribosomes</topic><topic>RNA sequencing</topic><topic>Science &amp; Technology</topic><topic>Transcription</topic><topic>Translation elongation</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Tian, Tingzhong</creatorcontrib><creatorcontrib>Li, Shuya</creatorcontrib><creatorcontrib>Lang, Peng</creatorcontrib><creatorcontrib>Zhao, Dan</creatorcontrib><creatorcontrib>Zeng, Jianyang</creatorcontrib><collection>Web of Science Core Collection</collection><collection>Science Citation Index Expanded</collection><collection>Web of Science - Science Citation Index Expanded - 2021</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>Gale In Context: Canada</collection><collection>Gale In Context: Science</collection><collection>ProQuest Central (Corporate)</collection><collection>Biotechnology Research Abstracts</collection><collection>Calcium &amp; Calcified Tissue Abstracts</collection><collection>Neurosciences Abstracts</collection><collection>Nucleic Acids Abstracts</collection><collection>Health &amp; Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>Computing Database (Alumni Edition)</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>Natural Science Collection (ProQuest)</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>ProQuest Biological Science Collection</collection><collection>Computing Database</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Biological Science Database</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>ProQuest Central Basic</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><collection>DOAJ Directory of Open Access Journals</collection><jtitle>PLoS computational biology</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Tian, Tingzhong</au><au>Li, Shuya</au><au>Lang, Peng</au><au>Zhao, Dan</au><au>Zeng, Jianyang</au><au>Roy, Sushmita</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Full-length ribosome density prediction by a multi-input and multi-output model</atitle><jtitle>PLoS computational biology</jtitle><stitle>PLOS COMPUT BIOL</stitle><addtitle>PLoS Comput Biol</addtitle><date>2021-03-01</date><risdate>2021</risdate><volume>17</volume><issue>3</issue><spage>e1008842</spage><epage>e1008842</epage><pages>e1008842-e1008842</pages><artnum>1008842</artnum><issn>1553-734X</issn><issn>1553-7358</issn><eissn>1553-7358</eissn><abstract>Translation elongation is regulated by a series of complicated mechanisms in both prokaryotes and eukaryotes. Although recent advance in ribosome profiling techniques has enabled one to capture the genome-wide ribosome footprints along transcripts at codon resolution, the regulatory codes of elongation dynamics are still not fully understood. Most of the existing computational approaches for modeling translation elongation from ribosome profiling data mainly focus on local contextual patterns, while ignoring the continuity of the elongation process and relations between ribosome densities of remote codons. Modeling the translation elongation process in full-length coding sequence (CDS) level has not been studied to the best of our knowledge. In this paper, we developed a deep learning based approach with a multi-input and multi-output framework, named RiboMIMO, for modeling the ribosome density distributions of full-length mRNA CDS regions. Through considering the underlying correlations in translation efficiency among neighboring and remote codons and extracting hidden features from the input full-length coding sequence, RiboMIMO can greatly outperform the state-of-the-art baseline approaches and accurately predict the ribosome density distributions along the whole mRNA CDS regions. In addition, RiboMIMO explores the contributions of individual input codons to the predictions of output ribosome densities, which thus can help reveal important biological factors influencing the translation elongation process. The analyses, based on our interpretable metric named codon impact score, not only identified several patterns consistent with the previously-published literatures, but also for the first time (to the best of our knowledge) revealed that the codons located at a long distance from the ribosomal A site may also have an association on the translation elongation rate. This finding of long-range impact on translation elongation velocity may shed new light on the regulatory mechanisms of protein synthesis. Overall, these results indicated that RiboMIMO can provide a useful tool for studying the regulation of translation elongation in the range of full-length CDS. Author summary Translation elongation is a process in which amino acids are linked into proteins by ribosomes in cells. Translation elongation rates along the mRNAs are not constant, and are regulated by a series of mechanisms, such as codon rarity and mRNA stability. In this study, we modeled the translation elongation process at a full-length coding sequence level and developed a deep learning based approach to predict the translation elongation rates from mRNA sequences, through extracting the regulatory codes of elongation rates from the contextual sequences. The analyses, based on our interpretable metric named codon impact score, for the first time (to the best of our knowledge), revealed that in addition to the neighboring codons of the ribosomal A sites, the remote codons may also have an important impact on the translation elongation rates. This new finding may stimulate additional experiments and shed light on the regulatory mechanisms of protein synthesis.</abstract><cop>SAN FRANCISCO</cop><pub>Public Library Science</pub><pmid>33770074</pmid><doi>10.1371/journal.pcbi.1008842</doi><tpages>25</tpages><orcidid>https://orcid.org/0000-0003-4899-0632</orcidid><orcidid>https://orcid.org/0000-0003-0195-6031</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1553-734X
ispartof PLoS computational biology, 2021-03, Vol.17 (3), p.e1008842-e1008842, Article 1008842
issn 1553-734X
1553-7358
1553-7358
language eng
recordid cdi_proquest_miscellaneous_2506275200
source DOAJ Directory of Open Access Journals; Public Library of Science (PLoS) Journals Open Access; Web of Science - Science Citation Index Expanded - 2021<img src="https://exlibris-pub.s3.amazonaws.com/fromwos-v2.jpg" />; EZB-FREE-00999 freely available EZB journals; PubMed Central
subjects Biochemical Research Methods
Biochemistry & Molecular Biology
Biology and Life Sciences
Codons
Computational biology
Computer and Information Sciences
Datasets
Deep learning
Density
Elongation
Forecasts and trends
Gene sequencing
Genomes
Life Sciences & Biomedicine
Mathematical & Computational Biology
Measurement
Mechanical properties
Methods
Model testing
Neural networks
Physical Sciences
Proteins
Research and Analysis Methods
Ribosomes
RNA sequencing
Science & Technology
Transcription
Translation elongation
title Full-length ribosome density prediction by a multi-input and multi-output model
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-11T14%3A26%3A07IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-gale_proqu&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Full-length%20ribosome%20density%20prediction%20by%20a%20multi-input%20and%20multi-output%20model&rft.jtitle=PLoS%20computational%20biology&rft.au=Tian,%20Tingzhong&rft.date=2021-03-01&rft.volume=17&rft.issue=3&rft.spage=e1008842&rft.epage=e1008842&rft.pages=e1008842-e1008842&rft.artnum=1008842&rft.issn=1553-734X&rft.eissn=1553-7358&rft_id=info:doi/10.1371/journal.pcbi.1008842&rft_dat=%3Cgale_proqu%3EA658712976%3C/gale_proqu%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2513683884&rft_id=info:pmid/33770074&rft_galeid=A658712976&rft_doaj_id=oai_doaj_org_article_99c0101d8448448d9b9aa3d81c36629a&rfr_iscdi=true