A vast pool of lineage-specific microproteins encoded by long non-coding RNAs in plants

Pervasive transcription of eukaryotic genomes results in expression of long non-coding RNAs (lncRNAs) most of which are poorly conserved in evolution and appear to be non-functional. However, some lncRNAs have been shown to perform specific functions, in particular, transcription regulation. Thousan...

Bibliographic details
Published in: Nucleic acids research 2021-10, Vol.49 (18), p.10328-10346
Main authors: Fesenko, Igor, Shabalina, Svetlana A, Mamaeva, Anna, Knyazev, Andrey, Glushkevich, Anna, Lyapina, Irina, Ziganshin, Rustam, Kovalchuk, Sergey, Kharlampieva, Daria, Lazarev, Vassili, Taliansky, Michael, Koonin, Eugene V
Format: Article
Language: English
Online access: Full text
container_end_page 10346
container_issue 18
container_start_page 10328
container_title Nucleic acids research
container_volume 49
creator Fesenko, Igor
Shabalina, Svetlana A
Mamaeva, Anna
Knyazev, Andrey
Glushkevich, Anna
Lyapina, Irina
Ziganshin, Rustam
Kovalchuk, Sergey
Kharlampieva, Daria
Lazarev, Vassili
Taliansky, Michael
Koonin, Eugene V
description Pervasive transcription of eukaryotic genomes results in expression of long non-coding RNAs (lncRNAs) most of which are poorly conserved in evolution and appear to be non-functional. However, some lncRNAs have been shown to perform specific functions, in particular, transcription regulation. Thousands of small open reading frames (smORFs,
doi_str_mv 10.1093/nar/gkab816
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_8501992</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2576915583</sourcerecordid><originalsourceid>FETCH-LOGICAL-c381t-4029e8e97aa81f52ce6bed7d3db7db5dc86607c5e6ac0f962b685c32992f1d3b3</originalsourceid><addsrcrecordid>eNpVUU1LxDAQDaK468fJu-QoSDUfTZpehEX8AlEQxWNIk-ka7Sa16Qr-eyO7ip5mmHm8mfceQgeUnFBS89NghtP5m2kUlRtoSrlkRVlLtommhBNRUFKqCdpJ6ZUQWlJRbqMJL0VFGGdT9DzDHyaNuI-xw7HFnQ9g5lCkHqxvvcULb4fYD3EEHxKGYKMDh5tP3MUwxyGGIk98bh_uZgn7gPvOhDHtoa3WdAn213UXPV1ePJ5fF7f3Vzfns9vCckXHoiSsBgV1ZYyirWAWZAOuctw1lWuEs0pKUlkB0ljSZlWNVMJyVtespY43fBedrXj7ZbMAZyGMg-l0P_iFGT51NF7_3wT_oufxQytBaGbJBEdrgiG-LyGNeuGThS6rgLhMmolK1lQIxTP0eAXNjqQ0QPt7hhL9HYXOUeh1FBl9-PezX-yP9_wL66WHwA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2576915583</pqid></control><display><type>article</type><title>A vast pool of lineage-specific microproteins encoded by long non-coding RNAs in plants</title><source>Oxford Journals Open Access Collection</source><source>MEDLINE</source><source>DOAJ Directory of Open Access Journals</source><source>PubMed Central</source><source>Free Full-Text Journals in Chemistry</source><creator>Fesenko, Igor ; Shabalina, Svetlana A ; Mamaeva, Anna ; Knyazev, Andrey ; Glushkevich, Anna ; Lyapina, Irina ; Ziganshin, Rustam ; Kovalchuk, Sergey ; Kharlampieva, Daria ; Lazarev, Vassili ; Taliansky, Michael ; Koonin, Eugene V</creator><creatorcontrib>Fesenko, Igor ; Shabalina, Svetlana A ; Mamaeva, Anna ; Knyazev, Andrey ; Glushkevich, Anna ; Lyapina, Irina ; Ziganshin, Rustam ; Kovalchuk, Sergey ; Kharlampieva, Daria ; Lazarev, Vassili ; Taliansky, Michael ; Koonin, Eugene V</creatorcontrib><description>Pervasive transcription of eukaryotic genomes results in expression of long non-coding RNAs (lncRNAs) most of which are poorly conserved in 
evolution and appear to be non-functional. However, some lncRNAs have been shown to perform specific functions, in particular, transcription regulation. Thousands of small open reading frames (smORFs, &lt;100 codons) located on lncRNAs potentially might be translated into peptides or microproteins. We report a comprehensive analysis of the conservation and evolutionary trajectories of lncRNAs-smORFs from the moss Physcomitrium patens across transcriptomes of 479 plant species. Although thousands of smORFs are subject to substantial purifying selection, the majority of the smORFs appear to be evolutionary young and could represent a major pool for functional innovation. Using nanopore RNA sequencing, we show that, on average, the transcriptional level of conserved smORFs is higher than that of non-conserved smORFs. Proteomic analysis confirmed translation of 82 novel species-specific smORFs. Numerous conserved smORFs containing low complexity regions (LCRs) or transmembrane domains were identified, the biological functions of a selected LCR-smORF were demonstrated experimentally. Thus, microproteins encoded by smORFs are a major, functionally diverse component of the plant proteome.</description><identifier>ISSN: 0305-1048</identifier><identifier>EISSN: 1362-4962</identifier><identifier>DOI: 10.1093/nar/gkab816</identifier><identifier>PMID: 34570232</identifier><language>eng</language><publisher>England: Oxford University Press</publisher><subject>Bryopsida - genetics ; Computational Biology ; Open Reading Frames ; Proteome ; RNA, Long Noncoding ; Transcriptome</subject><ispartof>Nucleic acids research, 2021-10, Vol.49 (18), p.10328-10346</ispartof><rights>Published by Oxford University Press on behalf of Nucleic Acids Research 2021.</rights><rights>Published by Oxford University Press on behalf of Nucleic Acids Research 2021. 
2021</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c381t-4029e8e97aa81f52ce6bed7d3db7db5dc86607c5e6ac0f962b685c32992f1d3b3</citedby><cites>FETCH-LOGICAL-c381t-4029e8e97aa81f52ce6bed7d3db7db5dc86607c5e6ac0f962b685c32992f1d3b3</cites><orcidid>0000-0002-5757-8271 ; 0000-0003-3943-8299 ; 0000-0002-3774-819X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC8501992/pdf/$$EPDF$$P50$$Gpubmedcentral$$H</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC8501992/$$EHTML$$P50$$Gpubmedcentral$$H</linktohtml><link.rule.ids>230,314,723,776,780,860,881,27901,27902,53766,53768</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/34570232$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Fesenko, Igor</creatorcontrib><creatorcontrib>Shabalina, Svetlana A</creatorcontrib><creatorcontrib>Mamaeva, Anna</creatorcontrib><creatorcontrib>Knyazev, Andrey</creatorcontrib><creatorcontrib>Glushkevich, Anna</creatorcontrib><creatorcontrib>Lyapina, Irina</creatorcontrib><creatorcontrib>Ziganshin, Rustam</creatorcontrib><creatorcontrib>Kovalchuk, Sergey</creatorcontrib><creatorcontrib>Kharlampieva, Daria</creatorcontrib><creatorcontrib>Lazarev, Vassili</creatorcontrib><creatorcontrib>Taliansky, Michael</creatorcontrib><creatorcontrib>Koonin, Eugene V</creatorcontrib><title>A vast pool of lineage-specific microproteins encoded by long non-coding RNAs in plants</title><title>Nucleic acids research</title><addtitle>Nucleic Acids Res</addtitle><description>Pervasive transcription of eukaryotic genomes results in expression of long non-coding RNAs (lncRNAs) most of which are poorly conserved in evolution and appear to be non-functional. 
However, some lncRNAs have been shown to perform specific functions, in particular, transcription regulation. Thousands of small open reading frames (smORFs, &lt;100 codons) located on lncRNAs potentially might be translated into peptides or microproteins. We report a comprehensive analysis of the conservation and evolutionary trajectories of lncRNAs-smORFs from the moss Physcomitrium patens across transcriptomes of 479 plant species. Although thousands of smORFs are subject to substantial purifying selection, the majority of the smORFs appear to be evolutionary young and could represent a major pool for functional innovation. Using nanopore RNA sequencing, we show that, on average, the transcriptional level of conserved smORFs is higher than that of non-conserved smORFs. Proteomic analysis confirmed translation of 82 novel species-specific smORFs. Numerous conserved smORFs containing low complexity regions (LCRs) or transmembrane domains were identified, the biological functions of a selected LCR-smORF were demonstrated experimentally. 
Thus, microproteins encoded by smORFs are a major, functionally diverse component of the plant proteome.</description><subject>Bryopsida - genetics</subject><subject>Computational Biology</subject><subject>Open Reading Frames</subject><subject>Proteome</subject><subject>RNA, Long Noncoding</subject><subject>Transcriptome</subject><issn>0305-1048</issn><issn>1362-4962</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2021</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><recordid>eNpVUU1LxDAQDaK468fJu-QoSDUfTZpehEX8AlEQxWNIk-ka7Sa16Qr-eyO7ip5mmHm8mfceQgeUnFBS89NghtP5m2kUlRtoSrlkRVlLtommhBNRUFKqCdpJ6ZUQWlJRbqMJL0VFGGdT9DzDHyaNuI-xw7HFnQ9g5lCkHqxvvcULb4fYD3EEHxKGYKMDh5tP3MUwxyGGIk98bh_uZgn7gPvOhDHtoa3WdAn213UXPV1ePJ5fF7f3Vzfns9vCckXHoiSsBgV1ZYyirWAWZAOuctw1lWuEs0pKUlkB0ljSZlWNVMJyVtespY43fBedrXj7ZbMAZyGMg-l0P_iFGT51NF7_3wT_oufxQytBaGbJBEdrgiG-LyGNeuGThS6rgLhMmolK1lQIxTP0eAXNjqQ0QPt7hhL9HYXOUeh1FBl9-PezX-yP9_wL66WHwA</recordid><startdate>20211011</startdate><enddate>20211011</enddate><creator>Fesenko, Igor</creator><creator>Shabalina, Svetlana A</creator><creator>Mamaeva, Anna</creator><creator>Knyazev, Andrey</creator><creator>Glushkevich, Anna</creator><creator>Lyapina, Irina</creator><creator>Ziganshin, Rustam</creator><creator>Kovalchuk, Sergey</creator><creator>Kharlampieva, Daria</creator><creator>Lazarev, Vassili</creator><creator>Taliansky, Michael</creator><creator>Koonin, Eugene V</creator><general>Oxford University Press</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><scope>5PM</scope><orcidid>https://orcid.org/0000-0002-5757-8271</orcidid><orcidid>https://orcid.org/0000-0003-3943-8299</orcidid><orcidid>https://orcid.org/0000-0002-3774-819X</orcidid></search><sort><creationdate>20211011</creationdate><title>A vast pool of lineage-specific microproteins encoded by long non-coding RNAs in 
plants</title><author>Fesenko, Igor ; Shabalina, Svetlana A ; Mamaeva, Anna ; Knyazev, Andrey ; Glushkevich, Anna ; Lyapina, Irina ; Ziganshin, Rustam ; Kovalchuk, Sergey ; Kharlampieva, Daria ; Lazarev, Vassili ; Taliansky, Michael ; Koonin, Eugene V</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c381t-4029e8e97aa81f52ce6bed7d3db7db5dc86607c5e6ac0f962b685c32992f1d3b3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2021</creationdate><topic>Bryopsida - genetics</topic><topic>Computational Biology</topic><topic>Open Reading Frames</topic><topic>Proteome</topic><topic>RNA, Long Noncoding</topic><topic>Transcriptome</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Fesenko, Igor</creatorcontrib><creatorcontrib>Shabalina, Svetlana A</creatorcontrib><creatorcontrib>Mamaeva, Anna</creatorcontrib><creatorcontrib>Knyazev, Andrey</creatorcontrib><creatorcontrib>Glushkevich, Anna</creatorcontrib><creatorcontrib>Lyapina, Irina</creatorcontrib><creatorcontrib>Ziganshin, Rustam</creatorcontrib><creatorcontrib>Kovalchuk, Sergey</creatorcontrib><creatorcontrib>Kharlampieva, Daria</creatorcontrib><creatorcontrib>Lazarev, Vassili</creatorcontrib><creatorcontrib>Taliansky, Michael</creatorcontrib><creatorcontrib>Koonin, Eugene V</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>Nucleic acids research</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Fesenko, Igor</au><au>Shabalina, Svetlana A</au><au>Mamaeva, Anna</au><au>Knyazev, Andrey</au><au>Glushkevich, 
Anna</au><au>Lyapina, Irina</au><au>Ziganshin, Rustam</au><au>Kovalchuk, Sergey</au><au>Kharlampieva, Daria</au><au>Lazarev, Vassili</au><au>Taliansky, Michael</au><au>Koonin, Eugene V</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>A vast pool of lineage-specific microproteins encoded by long non-coding RNAs in plants</atitle><jtitle>Nucleic acids research</jtitle><addtitle>Nucleic Acids Res</addtitle><date>2021-10-11</date><risdate>2021</risdate><volume>49</volume><issue>18</issue><spage>10328</spage><epage>10346</epage><pages>10328-10346</pages><issn>0305-1048</issn><eissn>1362-4962</eissn><abstract>Pervasive transcription of eukaryotic genomes results in expression of long non-coding RNAs (lncRNAs) most of which are poorly conserved in evolution and appear to be non-functional. However, some lncRNAs have been shown to perform specific functions, in particular, transcription regulation. Thousands of small open reading frames (smORFs, &lt;100 codons) located on lncRNAs potentially might be translated into peptides or microproteins. We report a comprehensive analysis of the conservation and evolutionary trajectories of lncRNAs-smORFs from the moss Physcomitrium patens across transcriptomes of 479 plant species. Although thousands of smORFs are subject to substantial purifying selection, the majority of the smORFs appear to be evolutionary young and could represent a major pool for functional innovation. Using nanopore RNA sequencing, we show that, on average, the transcriptional level of conserved smORFs is higher than that of non-conserved smORFs. Proteomic analysis confirmed translation of 82 novel species-specific smORFs. Numerous conserved smORFs containing low complexity regions (LCRs) or transmembrane domains were identified, the biological functions of a selected LCR-smORF were demonstrated experimentally. 
Thus, microproteins encoded by smORFs are a major, functionally diverse component of the plant proteome.</abstract><cop>England</cop><pub>Oxford University Press</pub><pmid>34570232</pmid><doi>10.1093/nar/gkab816</doi><tpages>19</tpages><orcidid>https://orcid.org/0000-0002-5757-8271</orcidid><orcidid>https://orcid.org/0000-0003-3943-8299</orcidid><orcidid>https://orcid.org/0000-0002-3774-819X</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 0305-1048
ispartof Nucleic acids research, 2021-10, Vol.49 (18), p.10328-10346
issn 0305-1048
1362-4962
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_8501992
source Oxford Journals Open Access Collection; MEDLINE; DOAJ Directory of Open Access Journals; PubMed Central; Free Full-Text Journals in Chemistry
subjects Bryopsida - genetics
Computational Biology
Open Reading Frames
Proteome
RNA, Long Noncoding
Transcriptome
title A vast pool of lineage-specific microproteins encoded by long non-coding RNAs in plants
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-03T01%3A04%3A01IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=A%20vast%20pool%20of%20lineage-specific%20microproteins%20encoded%20by%20long%20non-coding%20RNAs%20in%20plants&rft.jtitle=Nucleic%20acids%20research&rft.au=Fesenko,%20Igor&rft.date=2021-10-11&rft.volume=49&rft.issue=18&rft.spage=10328&rft.epage=10346&rft.pages=10328-10346&rft.issn=0305-1048&rft.eissn=1362-4962&rft_id=info:doi/10.1093/nar/gkab816&rft_dat=%3Cproquest_pubme%3E2576915583%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2576915583&rft_id=info:pmid/34570232&rfr_iscdi=true