RNAseq versus genome-predicted transcriptomes: a large population of novel transcripts identified in an Illumina-454 Hydra transcriptome

Evolutionary studies benefit from deep sequencing technologies that generate genomic and transcriptomic sequences from a variety of organisms. Genome sequencing and RNAseq have complementary strengths. In this study, we present the assembly of the most complete Hydra transcriptome to date along with...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:BMC genomics 2013-03, Vol.14 (1), p.204-204
Hauptverfasser: Wenger, Yvan, Galliot, Brigitte
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 204
container_issue 1
container_start_page 204
container_title BMC genomics
container_volume 14
creator Wenger, Yvan
Galliot, Brigitte
description Evolutionary studies benefit from deep sequencing technologies that generate genomic and transcriptomic sequences from a variety of organisms. Genome sequencing and RNAseq have complementary strengths. In this study, we present the assembly of the most complete Hydra transcriptome to date along with a comparative analysis of the specific features of RNAseq and genome-predicted transcriptomes currently available in the freshwater hydrozoan Hydra vulgaris. To produce an accurate and extensive Hydra transcriptome, we combined Illumina and 454 Titanium reads, giving the primacy to Illumina over 454 reads to correct homopolymer errors. This strategy yielded an RNAseq transcriptome that contains 48'909 unique sequences including splice variants, representing approximately 24'450 distinct genes. Comparative analysis to the available genome-predicted transcriptomes identified 10'597 novel Hydra transcripts that encode 529 evolutionarily-conserved proteins. The annotation of 170 human orthologs points to critical functions in protein biosynthesis, FGF and TOR signaling, vesicle transport, immunity, cell cycle regulation, cell death, mitochondrial metabolism, transcription and chromatin regulation. However, a majority of these novel transcripts encodes short ORFs, at least 767 of them corresponding to pseudogenes. This RNAseq transcriptome also lacks 11'270 predicted transcripts that correspond either to silent genes or to genes expressed below the detection level of this study. We established a simple and powerful strategy to combine Illumina and 454 reads and we produced, with genome assistance, an extensive and accurate Hydra transcriptome. The comparative analysis of the RNAseq transcriptome with genome-predicted transcriptomes lead to the identification of large populations of novel as well as missing transcripts that might reflect Hydra-specific evolutionary events.
doi_str_mv 10.1186/1471-2164-14-204
format Article
fullrecord <record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_3764976</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1352285724</sourcerecordid><originalsourceid>FETCH-LOGICAL-b517t-c89a630c013d4b25ce88cbfb93c7528c28cfb5edb47f542beb10694a70d517bd3</originalsourceid><addsrcrecordid>eNqNkk2LFDEQhoMo7ofePUnAi5fWfHa6PQjL4LoLi4LoOeSrxyzppDfpHth_4M8246zDjCgIgRRVbz28VBUALzB6g3HXvsVM4IbgljWYNQSxR-B0n3p8EJ-As1JuEcKiI_wpOCGUU9QJfAp-fPl0Udwd3LhclgLXLqbRNVN21pvZWThnFYvJfpprvryDCgaV1w5OaVqCmn2KMA0wpo0LB9oCvXVx9oOvCB-hivA6hGX0UTWMM3h1b7M6Zj8DTwYVinv-8J-Db5cfvq6umpvPH69XFzeN5ljMjel61VJkEKaWacKN6zqjB91TIzjpTH2D5s5qJgbOiHYao7ZnSiBb-7Wl5-D9jjstenTWVJtZBTllP6p8L5Py8rgS_Xe5ThtJRct60VbAagfQPv0DcFwxaZTbTcjtJmok66Iq5fWDjZzuFldmOfpiXAgqurQUiSknpOOC_I-UtIK0vN96e_WH9DYtOdZ5_lIRxntGqwrtVCanUrIb9uYxktu7-pvdl4dT2zf8PiT6E5YLy5Q</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1326245943</pqid></control><display><type>article</type><title>RNAseq versus genome-predicted transcriptomes: a large population of novel transcripts identified in an Illumina-454 Hydra transcriptome</title><source>MEDLINE</source><source>DOAJ Directory of Open Access Journals</source><source>SpringerNature Journals</source><source>PubMed Central Open Access</source><source>Springer Nature OA Free Journals</source><source>EZB-FREE-00999 freely available EZB journals</source><source>PubMed Central</source><creator>Wenger, Yvan ; Galliot, Brigitte</creator><creatorcontrib>Wenger, Yvan ; Galliot, Brigitte</creatorcontrib><description>Evolutionary studies benefit from deep sequencing technologies that generate genomic and transcriptomic sequences from a variety of organisms. Genome sequencing and RNAseq have complementary strengths. In this study, we present the assembly of the most complete Hydra transcriptome to date along with a comparative analysis of the specific features of RNAseq and genome-predicted transcriptomes currently available in the freshwater hydrozoan Hydra vulgaris. To produce an accurate and extensive Hydra transcriptome, we combined Illumina and 454 Titanium reads, giving the primacy to Illumina over 454 reads to correct homopolymer errors. This strategy yielded an RNAseq transcriptome that contains 48'909 unique sequences including splice variants, representing approximately 24'450 distinct genes. Comparative analysis to the available genome-predicted transcriptomes identified 10'597 novel Hydra transcripts that encode 529 evolutionarily-conserved proteins. The annotation of 170 human orthologs points to critical functions in protein biosynthesis, FGF and TOR signaling, vesicle transport, immunity, cell cycle regulation, cell death, mitochondrial metabolism, transcription and chromatin regulation. However, a majority of these novel transcripts encodes short ORFs, at least 767 of them corresponding to pseudogenes. This RNAseq transcriptome also lacks 11'270 predicted transcripts that correspond either to silent genes or to genes expressed below the detection level of this study. We established a simple and powerful strategy to combine Illumina and 454 reads and we produced, with genome assistance, an extensive and accurate Hydra transcriptome. The comparative analysis of the RNAseq transcriptome with genome-predicted transcriptomes lead to the identification of large populations of novel as well as missing transcripts that might reflect Hydra-specific evolutionary events.</description><identifier>ISSN: 1471-2164</identifier><identifier>EISSN: 1471-2164</identifier><identifier>DOI: 10.1186/1471-2164-14-204</identifier><identifier>PMID: 23530871</identifier><language>eng</language><publisher>England: BioMed Central</publisher><subject>Animals ; Comparative analysis ; Comparative Genomic Hybridization ; Genetics ; Genome ; Genomes ; Genomics ; Hydra ; Hydra - classification ; Hydra - genetics ; Hydra vulgaris ; Life sciences ; Open Reading Frames ; Phylogenetics ; Phylogeny ; Sequence Analysis, RNA ; Transcriptome ; Trees</subject><ispartof>BMC genomics, 2013-03, Vol.14 (1), p.204-204</ispartof><rights>2013 Wenger and Galliot; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</rights><rights>Copyright © 2013 Wenger and Galliot; licensee BioMed Central Ltd. 2013 Wenger and Galliot; licensee BioMed Central Ltd.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-b517t-c89a630c013d4b25ce88cbfb93c7528c28cfb5edb47f542beb10694a70d517bd3</citedby><cites>FETCH-LOGICAL-b517t-c89a630c013d4b25ce88cbfb93c7528c28cfb5edb47f542beb10694a70d517bd3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC3764976/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC3764976/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,315,729,782,786,866,887,27931,27932,53798,53800</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/23530871$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Wenger, Yvan</creatorcontrib><creatorcontrib>Galliot, Brigitte</creatorcontrib><title>RNAseq versus genome-predicted transcriptomes: a large population of novel transcripts identified in an Illumina-454 Hydra transcriptome</title><title>BMC genomics</title><addtitle>BMC Genomics</addtitle><description>Evolutionary studies benefit from deep sequencing technologies that generate genomic and transcriptomic sequences from a variety of organisms. Genome sequencing and RNAseq have complementary strengths. In this study, we present the assembly of the most complete Hydra transcriptome to date along with a comparative analysis of the specific features of RNAseq and genome-predicted transcriptomes currently available in the freshwater hydrozoan Hydra vulgaris. To produce an accurate and extensive Hydra transcriptome, we combined Illumina and 454 Titanium reads, giving the primacy to Illumina over 454 reads to correct homopolymer errors. This strategy yielded an RNAseq transcriptome that contains 48'909 unique sequences including splice variants, representing approximately 24'450 distinct genes. Comparative analysis to the available genome-predicted transcriptomes identified 10'597 novel Hydra transcripts that encode 529 evolutionarily-conserved proteins. The annotation of 170 human orthologs points to critical functions in protein biosynthesis, FGF and TOR signaling, vesicle transport, immunity, cell cycle regulation, cell death, mitochondrial metabolism, transcription and chromatin regulation. However, a majority of these novel transcripts encodes short ORFs, at least 767 of them corresponding to pseudogenes. This RNAseq transcriptome also lacks 11'270 predicted transcripts that correspond either to silent genes or to genes expressed below the detection level of this study. We established a simple and powerful strategy to combine Illumina and 454 reads and we produced, with genome assistance, an extensive and accurate Hydra transcriptome. The comparative analysis of the RNAseq transcriptome with genome-predicted transcriptomes lead to the identification of large populations of novel as well as missing transcripts that might reflect Hydra-specific evolutionary events.</description><subject>Animals</subject><subject>Comparative analysis</subject><subject>Comparative Genomic Hybridization</subject><subject>Genetics</subject><subject>Genome</subject><subject>Genomes</subject><subject>Genomics</subject><subject>Hydra</subject><subject>Hydra - classification</subject><subject>Hydra - genetics</subject><subject>Hydra vulgaris</subject><subject>Life sciences</subject><subject>Open Reading Frames</subject><subject>Phylogenetics</subject><subject>Phylogeny</subject><subject>Sequence Analysis, RNA</subject><subject>Transcriptome</subject><subject>Trees</subject><issn>1471-2164</issn><issn>1471-2164</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2013</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNqNkk2LFDEQhoMo7ofePUnAi5fWfHa6PQjL4LoLi4LoOeSrxyzppDfpHth_4M8246zDjCgIgRRVbz28VBUALzB6g3HXvsVM4IbgljWYNQSxR-B0n3p8EJ-As1JuEcKiI_wpOCGUU9QJfAp-fPl0Udwd3LhclgLXLqbRNVN21pvZWThnFYvJfpprvryDCgaV1w5OaVqCmn2KMA0wpo0LB9oCvXVx9oOvCB-hivA6hGX0UTWMM3h1b7M6Zj8DTwYVinv-8J-Db5cfvq6umpvPH69XFzeN5ljMjel61VJkEKaWacKN6zqjB91TIzjpTH2D5s5qJgbOiHYao7ZnSiBb-7Wl5-D9jjstenTWVJtZBTllP6p8L5Py8rgS_Xe5ThtJRct60VbAagfQPv0DcFwxaZTbTcjtJmok66Iq5fWDjZzuFldmOfpiXAgqurQUiSknpOOC_I-UtIK0vN96e_WH9DYtOdZ5_lIRxntGqwrtVCanUrIb9uYxktu7-pvdl4dT2zf8PiT6E5YLy5Q</recordid><startdate>20130325</startdate><enddate>20130325</enddate><creator>Wenger, Yvan</creator><creator>Galliot, Brigitte</creator><general>BioMed Central</general><general>BioMed Central Ltd</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7QP</scope><scope>7QR</scope><scope>7SS</scope><scope>7TK</scope><scope>7U7</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>C1K</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>LK8</scope><scope>M0S</scope><scope>M1P</scope><scope>M7P</scope><scope>P64</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20130325</creationdate><title>RNAseq versus genome-predicted transcriptomes: a large population of novel transcripts identified in an Illumina-454 Hydra transcriptome</title><author>Wenger, Yvan ; Galliot, Brigitte</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-b517t-c89a630c013d4b25ce88cbfb93c7528c28cfb5edb47f542beb10694a70d517bd3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2013</creationdate><topic>Animals</topic><topic>Comparative analysis</topic><topic>Comparative Genomic Hybridization</topic><topic>Genetics</topic><topic>Genome</topic><topic>Genomes</topic><topic>Genomics</topic><topic>Hydra</topic><topic>Hydra - classification</topic><topic>Hydra - genetics</topic><topic>Hydra vulgaris</topic><topic>Life sciences</topic><topic>Open Reading Frames</topic><topic>Phylogenetics</topic><topic>Phylogeny</topic><topic>Sequence Analysis, RNA</topic><topic>Transcriptome</topic><topic>Trees</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Wenger, Yvan</creatorcontrib><creatorcontrib>Galliot, Brigitte</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Calcium &amp; Calcified Tissue Abstracts</collection><collection>Chemoreception Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Neurosciences Abstracts</collection><collection>Toxicology Abstracts</collection><collection>Health &amp; Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Natural Science Collection</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health &amp; Medical Complete (Alumni)</collection><collection>ProQuest Biological Science Collection</collection><collection>Health &amp; Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Biological Science Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Access via ProQuest (Open Access)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>BMC genomics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Wenger, Yvan</au><au>Galliot, Brigitte</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>RNAseq versus genome-predicted transcriptomes: a large population of novel transcripts identified in an Illumina-454 Hydra transcriptome</atitle><jtitle>BMC genomics</jtitle><addtitle>BMC Genomics</addtitle><date>2013-03-25</date><risdate>2013</risdate><volume>14</volume><issue>1</issue><spage>204</spage><epage>204</epage><pages>204-204</pages><issn>1471-2164</issn><eissn>1471-2164</eissn><abstract>Evolutionary studies benefit from deep sequencing technologies that generate genomic and transcriptomic sequences from a variety of organisms. Genome sequencing and RNAseq have complementary strengths. In this study, we present the assembly of the most complete Hydra transcriptome to date along with a comparative analysis of the specific features of RNAseq and genome-predicted transcriptomes currently available in the freshwater hydrozoan Hydra vulgaris. To produce an accurate and extensive Hydra transcriptome, we combined Illumina and 454 Titanium reads, giving the primacy to Illumina over 454 reads to correct homopolymer errors. This strategy yielded an RNAseq transcriptome that contains 48'909 unique sequences including splice variants, representing approximately 24'450 distinct genes. Comparative analysis to the available genome-predicted transcriptomes identified 10'597 novel Hydra transcripts that encode 529 evolutionarily-conserved proteins. The annotation of 170 human orthologs points to critical functions in protein biosynthesis, FGF and TOR signaling, vesicle transport, immunity, cell cycle regulation, cell death, mitochondrial metabolism, transcription and chromatin regulation. However, a majority of these novel transcripts encodes short ORFs, at least 767 of them corresponding to pseudogenes. This RNAseq transcriptome also lacks 11'270 predicted transcripts that correspond either to silent genes or to genes expressed below the detection level of this study. We established a simple and powerful strategy to combine Illumina and 454 reads and we produced, with genome assistance, an extensive and accurate Hydra transcriptome. The comparative analysis of the RNAseq transcriptome with genome-predicted transcriptomes lead to the identification of large populations of novel as well as missing transcripts that might reflect Hydra-specific evolutionary events.</abstract><cop>England</cop><pub>BioMed Central</pub><pmid>23530871</pmid><doi>10.1186/1471-2164-14-204</doi><tpages>1</tpages><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1471-2164
ispartof BMC genomics, 2013-03, Vol.14 (1), p.204-204
issn 1471-2164
1471-2164
language eng
recordid cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_3764976
source MEDLINE; DOAJ Directory of Open Access Journals; SpringerNature Journals; PubMed Central Open Access; Springer Nature OA Free Journals; EZB-FREE-00999 freely available EZB journals; PubMed Central
subjects Animals
Comparative analysis
Comparative Genomic Hybridization
Genetics
Genome
Genomes
Genomics
Hydra
Hydra - classification
Hydra - genetics
Hydra vulgaris
Life sciences
Open Reading Frames
Phylogenetics
Phylogeny
Sequence Analysis, RNA
Transcriptome
Trees
title RNAseq versus genome-predicted transcriptomes: a large population of novel transcripts identified in an Illumina-454 Hydra transcriptome
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-03T23%3A21%3A50IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=RNAseq%20versus%20genome-predicted%20transcriptomes:%20a%20large%20population%20of%20novel%20transcripts%20identified%20in%20an%20Illumina-454%20Hydra%20transcriptome&rft.jtitle=BMC%20genomics&rft.au=Wenger,%20Yvan&rft.date=2013-03-25&rft.volume=14&rft.issue=1&rft.spage=204&rft.epage=204&rft.pages=204-204&rft.issn=1471-2164&rft.eissn=1471-2164&rft_id=info:doi/10.1186/1471-2164-14-204&rft_dat=%3Cproquest_pubme%3E1352285724%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1326245943&rft_id=info:pmid/23530871&rfr_iscdi=true