RNAseq versus genome-predicted transcriptomes: a large population of novel transcripts identified in an Illumina-454 Hydra transcriptome

Evolutionary studies benefit from deep sequencing technologies that generate genomic and transcriptomic sequences from a variety of organisms. Genome sequencing and RNAseq have complementary strengths. In this study, we present the assembly of the most complete Hydra transcriptome to date along with...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	BMC genomics 2013-03, Vol.14 (1), p.204-204
Hauptverfasser:	Wenger, Yvan, Galliot, Brigitte
Format:	Artikel
Sprache:	eng
Schlagworte:	Animals Comparative analysis Comparative Genomic Hybridization Genetics Genome Genomes Genomics Hydra Hydra - classification Hydra - genetics Hydra vulgaris Life sciences Open Reading Frames Phylogenetics Phylogeny Sequence Analysis, RNA Transcriptome Trees
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	204
container_issue	1
container_start_page	204
container_title	BMC genomics
container_volume	14
creator	Wenger, Yvan Galliot, Brigitte
description	Evolutionary studies benefit from deep sequencing technologies that generate genomic and transcriptomic sequences from a variety of organisms. Genome sequencing and RNAseq have complementary strengths. In this study, we present the assembly of the most complete Hydra transcriptome to date along with a comparative analysis of the specific features of RNAseq and genome-predicted transcriptomes currently available in the freshwater hydrozoan Hydra vulgaris. To produce an accurate and extensive Hydra transcriptome, we combined Illumina and 454 Titanium reads, giving the primacy to Illumina over 454 reads to correct homopolymer errors. This strategy yielded an RNAseq transcriptome that contains 48'909 unique sequences including splice variants, representing approximately 24'450 distinct genes. Comparative analysis to the available genome-predicted transcriptomes identified 10'597 novel Hydra transcripts that encode 529 evolutionarily-conserved proteins. The annotation of 170 human orthologs points to critical functions in protein biosynthesis, FGF and TOR signaling, vesicle transport, immunity, cell cycle regulation, cell death, mitochondrial metabolism, transcription and chromatin regulation. However, a majority of these novel transcripts encodes short ORFs, at least 767 of them corresponding to pseudogenes. This RNAseq transcriptome also lacks 11'270 predicted transcripts that correspond either to silent genes or to genes expressed below the detection level of this study. We established a simple and powerful strategy to combine Illumina and 454 reads and we produced, with genome assistance, an extensive and accurate Hydra transcriptome. The comparative analysis of the RNAseq transcriptome with genome-predicted transcriptomes lead to the identification of large populations of novel as well as missing transcripts that might reflect Hydra-specific evolutionary events.
doi_str_mv	10.1186/1471-2164-14-204
format	Article
fullrecord	<record><control><sourceid>proquest_pubme</sourceid><recordid>TN_cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_3764976</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>1352285724</sourcerecordid><originalsourceid>FETCH-LOGICAL-b517t-c89a630c013d4b25ce88cbfb93c7528c28cfb5edb47f542beb10694a70d517bd3</originalsourceid><addsrcrecordid>eNqNkk2LFDEQhoMo7ofePUnAi5fWfHa6PQjL4LoLi4LoOeSrxyzppDfpHth_4M8246zDjCgIgRRVbz28VBUALzB6g3HXvsVM4IbgljWYNQSxR-B0n3p8EJ-As1JuEcKiI_wpOCGUU9QJfAp-fPl0Udwd3LhclgLXLqbRNVN21pvZWThnFYvJfpprvryDCgaV1w5OaVqCmn2KMA0wpo0LB9oCvXVx9oOvCB-hivA6hGX0UTWMM3h1b7M6Zj8DTwYVinv-8J-Db5cfvq6umpvPH69XFzeN5ljMjel61VJkEKaWacKN6zqjB91TIzjpTH2D5s5qJgbOiHYao7ZnSiBb-7Wl5-D9jjstenTWVJtZBTllP6p8L5Py8rgS_Xe5ThtJRct60VbAagfQPv0DcFwxaZTbTcjtJmok66Iq5fWDjZzuFldmOfpiXAgqurQUiSknpOOC_I-UtIK0vN96e_WH9DYtOdZ5_lIRxntGqwrtVCanUrIb9uYxktu7-pvdl4dT2zf8PiT6E5YLy5Q</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1326245943</pqid></control><display><type>article</type><title>RNAseq versus genome-predicted transcriptomes: a large population of novel transcripts identified in an Illumina-454 Hydra transcriptome</title><source>MEDLINE</source><source>DOAJ Directory of Open Access Journals</source><source>SpringerNature Journals</source><source>PubMed Central Open Access</source><source>Springer Nature OA Free Journals</source><source>EZB-FREE-00999 freely available EZB journals</source><source>PubMed Central</source><creator>Wenger, Yvan ; Galliot, Brigitte</creator><creatorcontrib>Wenger, Yvan ; Galliot, Brigitte</creatorcontrib><description>Evolutionary studies benefit from deep sequencing technologies that generate genomic and transcriptomic sequences from a variety of organisms. Genome sequencing and RNAseq have complementary strengths. In this study, we present the assembly of the most complete Hydra transcriptome to date along with a comparative analysis of the specific features of RNAseq and genome-predicted transcriptomes currently available in the freshwater hydrozoan Hydra vulgaris. To produce an accurate and extensive Hydra transcriptome, we combined Illumina and 454 Titanium reads, giving the primacy to Illumina over 454 reads to correct homopolymer errors. This strategy yielded an RNAseq transcriptome that contains 48'909 unique sequences including splice variants, representing approximately 24'450 distinct genes. Comparative analysis to the available genome-predicted transcriptomes identified 10'597 novel Hydra transcripts that encode 529 evolutionarily-conserved proteins. The annotation of 170 human orthologs points to critical functions in protein biosynthesis, FGF and TOR signaling, vesicle transport, immunity, cell cycle regulation, cell death, mitochondrial metabolism, transcription and chromatin regulation. However, a majority of these novel transcripts encodes short ORFs, at least 767 of them corresponding to pseudogenes. This RNAseq transcriptome also lacks 11'270 predicted transcripts that correspond either to silent genes or to genes expressed below the detection level of this study. We established a simple and powerful strategy to combine Illumina and 454 reads and we produced, with genome assistance, an extensive and accurate Hydra transcriptome. The comparative analysis of the RNAseq transcriptome with genome-predicted transcriptomes lead to the identification of large populations of novel as well as missing transcripts that might reflect Hydra-specific evolutionary events.</description><identifier>ISSN: 1471-2164</identifier><identifier>EISSN: 1471-2164</identifier><identifier>DOI: 10.1186/1471-2164-14-204</identifier><identifier>PMID: 23530871</identifier><language>eng</language><publisher>England: BioMed Central</publisher><subject>Animals ; Comparative analysis ; Comparative Genomic Hybridization ; Genetics ; Genome ; Genomes ; Genomics ; Hydra ; Hydra - classification ; Hydra - genetics ; Hydra vulgaris ; Life sciences ; Open Reading Frames ; Phylogenetics ; Phylogeny ; Sequence Analysis, RNA ; Transcriptome ; Trees</subject><ispartof>BMC genomics, 2013-03, Vol.14 (1), p.204-204</ispartof><rights>2013 Wenger and Galliot; licensee BioMed Central Ltd. This is an Open Access article distributed under the terms of the Creative Commons Attribution License (http://creativecommons.org/licenses/by/2.0), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited.</rights><rights>Copyright © 2013 Wenger and Galliot; licensee BioMed Central Ltd. 2013 Wenger and Galliot; licensee BioMed Central Ltd.</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-b517t-c89a630c013d4b25ce88cbfb93c7528c28cfb5edb47f542beb10694a70d517bd3</citedby><cites>FETCH-LOGICAL-b517t-c89a630c013d4b25ce88cbfb93c7528c28cfb5edb47f542beb10694a70d517bd3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC3764976/pdf/$$EPDF$$P50$$Gpubmedcentral$$Hfree_for_read</linktopdf><linktohtml>$$Uhttps://www.ncbi.nlm.nih.gov/pmc/articles/PMC3764976/$$EHTML$$P50$$Gpubmedcentral$$Hfree_for_read</linktohtml><link.rule.ids>230,315,729,782,786,866,887,27931,27932,53798,53800</link.rule.ids><backlink>$$Uhttps://www.ncbi.nlm.nih.gov/pubmed/23530871$$D View this record in MEDLINE/PubMed$$Hfree_for_read</backlink></links><search><creatorcontrib>Wenger, Yvan</creatorcontrib><creatorcontrib>Galliot, Brigitte</creatorcontrib><title>RNAseq versus genome-predicted transcriptomes: a large population of novel transcripts identified in an Illumina-454 Hydra transcriptome</title><title>BMC genomics</title><addtitle>BMC Genomics</addtitle><description>Evolutionary studies benefit from deep sequencing technologies that generate genomic and transcriptomic sequences from a variety of organisms. Genome sequencing and RNAseq have complementary strengths. In this study, we present the assembly of the most complete Hydra transcriptome to date along with a comparative analysis of the specific features of RNAseq and genome-predicted transcriptomes currently available in the freshwater hydrozoan Hydra vulgaris. To produce an accurate and extensive Hydra transcriptome, we combined Illumina and 454 Titanium reads, giving the primacy to Illumina over 454 reads to correct homopolymer errors. This strategy yielded an RNAseq transcriptome that contains 48'909 unique sequences including splice variants, representing approximately 24'450 distinct genes. Comparative analysis to the available genome-predicted transcriptomes identified 10'597 novel Hydra transcripts that encode 529 evolutionarily-conserved proteins. The annotation of 170 human orthologs points to critical functions in protein biosynthesis, FGF and TOR signaling, vesicle transport, immunity, cell cycle regulation, cell death, mitochondrial metabolism, transcription and chromatin regulation. However, a majority of these novel transcripts encodes short ORFs, at least 767 of them corresponding to pseudogenes. This RNAseq transcriptome also lacks 11'270 predicted transcripts that correspond either to silent genes or to genes expressed below the detection level of this study. We established a simple and powerful strategy to combine Illumina and 454 reads and we produced, with genome assistance, an extensive and accurate Hydra transcriptome. The comparative analysis of the RNAseq transcriptome with genome-predicted transcriptomes lead to the identification of large populations of novel as well as missing transcripts that might reflect Hydra-specific evolutionary events.</description><subject>Animals</subject><subject>Comparative analysis</subject><subject>Comparative Genomic Hybridization</subject><subject>Genetics</subject><subject>Genome</subject><subject>Genomes</subject><subject>Genomics</subject><subject>Hydra</subject><subject>Hydra - classification</subject><subject>Hydra - genetics</subject><subject>Hydra vulgaris</subject><subject>Life sciences</subject><subject>Open Reading Frames</subject><subject>Phylogenetics</subject><subject>Phylogeny</subject><subject>Sequence Analysis, RNA</subject><subject>Transcriptome</subject><subject>Trees</subject><issn>1471-2164</issn><issn>1471-2164</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2013</creationdate><recordtype>article</recordtype><sourceid>EIF</sourceid><sourceid>ABUWG</sourceid><sourceid>AFKRA</sourceid><sourceid>AZQEC</sourceid><sourceid>BENPR</sourceid><sourceid>CCPQU</sourceid><sourceid>DWQXO</sourceid><sourceid>GNUQQ</sourceid><recordid>eNqNkk2LFDEQhoMo7ofePUnAi5fWfHa6PQjL4LoLi4LoOeSrxyzppDfpHth_4M8246zDjCgIgRRVbz28VBUALzB6g3HXvsVM4IbgljWYNQSxR-B0n3p8EJ-As1JuEcKiI_wpOCGUU9QJfAp-fPl0Udwd3LhclgLXLqbRNVN21pvZWThnFYvJfpprvryDCgaV1w5OaVqCmn2KMA0wpo0LB9oCvXVx9oOvCB-hivA6hGX0UTWMM3h1b7M6Zj8DTwYVinv-8J-Db5cfvq6umpvPH69XFzeN5ljMjel61VJkEKaWacKN6zqjB91TIzjpTH2D5s5qJgbOiHYao7ZnSiBb-7Wl5-D9jjstenTWVJtZBTllP6p8L5Py8rgS_Xe5ThtJRct60VbAagfQPv0DcFwxaZTbTcjtJmok66Iq5fWDjZzuFldmOfpiXAgqurQUiSknpOOC_I-UtIK0vN96e_WH9DYtOdZ5_lIRxntGqwrtVCanUrIb9uYxktu7-pvdl4dT2zf8PiT6E5YLy5Q</recordid><startdate>20130325</startdate><enddate>20130325</enddate><creator>Wenger, Yvan</creator><creator>Galliot, Brigitte</creator><general>BioMed Central</general><general>BioMed Central Ltd</general><scope>CGR</scope><scope>CUY</scope><scope>CVF</scope><scope>ECM</scope><scope>EIF</scope><scope>NPM</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>3V.</scope><scope>7QP</scope><scope>7QR</scope><scope>7SS</scope><scope>7TK</scope><scope>7U7</scope><scope>7X7</scope><scope>7XB</scope><scope>88E</scope><scope>8AO</scope><scope>8FD</scope><scope>8FE</scope><scope>8FH</scope><scope>8FI</scope><scope>8FJ</scope><scope>8FK</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>AZQEC</scope><scope>BBNVY</scope><scope>BENPR</scope><scope>BHPHI</scope><scope>C1K</scope><scope>CCPQU</scope><scope>DWQXO</scope><scope>FR3</scope><scope>FYUFA</scope><scope>GHDGH</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>K9.</scope><scope>LK8</scope><scope>M0S</scope><scope>M1P</scope><scope>M7P</scope><scope>P64</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>RC3</scope><scope>7X8</scope><scope>5PM</scope></search><sort><creationdate>20130325</creationdate><title>RNAseq versus genome-predicted transcriptomes: a large population of novel transcripts identified in an Illumina-454 Hydra transcriptome</title><author>Wenger, Yvan ; Galliot, Brigitte</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-b517t-c89a630c013d4b25ce88cbfb93c7528c28cfb5edb47f542beb10694a70d517bd3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2013</creationdate><topic>Animals</topic><topic>Comparative analysis</topic><topic>Comparative Genomic Hybridization</topic><topic>Genetics</topic><topic>Genome</topic><topic>Genomes</topic><topic>Genomics</topic><topic>Hydra</topic><topic>Hydra - classification</topic><topic>Hydra - genetics</topic><topic>Hydra vulgaris</topic><topic>Life sciences</topic><topic>Open Reading Frames</topic><topic>Phylogenetics</topic><topic>Phylogeny</topic><topic>Sequence Analysis, RNA</topic><topic>Transcriptome</topic><topic>Trees</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Wenger, Yvan</creatorcontrib><creatorcontrib>Galliot, Brigitte</creatorcontrib><collection>Medline</collection><collection>MEDLINE</collection><collection>MEDLINE (Ovid)</collection><collection>MEDLINE</collection><collection>MEDLINE</collection><collection>PubMed</collection><collection>CrossRef</collection><collection>ProQuest Central (Corporate)</collection><collection>Calcium & Calcified Tissue Abstracts</collection><collection>Chemoreception Abstracts</collection><collection>Entomology Abstracts (Full archive)</collection><collection>Neurosciences Abstracts</collection><collection>Toxicology Abstracts</collection><collection>Health & Medical Collection</collection><collection>ProQuest Central (purchase pre-March 2016)</collection><collection>Medical Database (Alumni Edition)</collection><collection>ProQuest Pharma Collection</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Natural Science Collection</collection><collection>Hospital Premium Collection</collection><collection>Hospital Premium Collection (Alumni Edition)</collection><collection>ProQuest Central (Alumni) (purchase pre-March 2016)</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>ProQuest Central Essentials</collection><collection>Biological Science Collection</collection><collection>ProQuest Central</collection><collection>Natural Science Collection</collection><collection>Environmental Sciences and Pollution Management</collection><collection>ProQuest One Community College</collection><collection>ProQuest Central Korea</collection><collection>Engineering Research Database</collection><collection>Health Research Premium Collection</collection><collection>Health Research Premium Collection (Alumni)</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Health & Medical Complete (Alumni)</collection><collection>ProQuest Biological Science Collection</collection><collection>Health & Medical Collection (Alumni Edition)</collection><collection>Medical Database</collection><collection>Biological Science Database</collection><collection>Biotechnology and BioEngineering Abstracts</collection><collection>Access via ProQuest (Open Access)</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Genetics Abstracts</collection><collection>MEDLINE - Academic</collection><collection>PubMed Central (Full Participant titles)</collection><jtitle>BMC genomics</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Wenger, Yvan</au><au>Galliot, Brigitte</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>RNAseq versus genome-predicted transcriptomes: a large population of novel transcripts identified in an Illumina-454 Hydra transcriptome</atitle><jtitle>BMC genomics</jtitle><addtitle>BMC Genomics</addtitle><date>2013-03-25</date><risdate>2013</risdate><volume>14</volume><issue>1</issue><spage>204</spage><epage>204</epage><pages>204-204</pages><issn>1471-2164</issn><eissn>1471-2164</eissn><abstract>Evolutionary studies benefit from deep sequencing technologies that generate genomic and transcriptomic sequences from a variety of organisms. Genome sequencing and RNAseq have complementary strengths. In this study, we present the assembly of the most complete Hydra transcriptome to date along with a comparative analysis of the specific features of RNAseq and genome-predicted transcriptomes currently available in the freshwater hydrozoan Hydra vulgaris. To produce an accurate and extensive Hydra transcriptome, we combined Illumina and 454 Titanium reads, giving the primacy to Illumina over 454 reads to correct homopolymer errors. This strategy yielded an RNAseq transcriptome that contains 48'909 unique sequences including splice variants, representing approximately 24'450 distinct genes. Comparative analysis to the available genome-predicted transcriptomes identified 10'597 novel Hydra transcripts that encode 529 evolutionarily-conserved proteins. The annotation of 170 human orthologs points to critical functions in protein biosynthesis, FGF and TOR signaling, vesicle transport, immunity, cell cycle regulation, cell death, mitochondrial metabolism, transcription and chromatin regulation. However, a majority of these novel transcripts encodes short ORFs, at least 767 of them corresponding to pseudogenes. This RNAseq transcriptome also lacks 11'270 predicted transcripts that correspond either to silent genes or to genes expressed below the detection level of this study. We established a simple and powerful strategy to combine Illumina and 454 reads and we produced, with genome assistance, an extensive and accurate Hydra transcriptome. The comparative analysis of the RNAseq transcriptome with genome-predicted transcriptomes lead to the identification of large populations of novel as well as missing transcripts that might reflect Hydra-specific evolutionary events.</abstract><cop>England</cop><pub>BioMed Central</pub><pmid>23530871</pmid><doi>10.1186/1471-2164-14-204</doi><tpages>1</tpages><oa>free_for_read</oa></addata></record>
fulltext	fulltext
identifier	ISSN: 1471-2164
ispartof	BMC genomics, 2013-03, Vol.14 (1), p.204-204
issn	1471-2164 1471-2164
language	eng
recordid	cdi_pubmedcentral_primary_oai_pubmedcentral_nih_gov_3764976
source	MEDLINE; DOAJ Directory of Open Access Journals; SpringerNature Journals; PubMed Central Open Access; Springer Nature OA Free Journals; EZB-FREE-00999 freely available EZB journals; PubMed Central
subjects	Animals Comparative analysis Comparative Genomic Hybridization Genetics Genome Genomes Genomics Hydra Hydra - classification Hydra - genetics Hydra vulgaris Life sciences Open Reading Frames Phylogenetics Phylogeny Sequence Analysis, RNA Transcriptome Trees
title	RNAseq versus genome-predicted transcriptomes: a large population of novel transcripts identified in an Illumina-454 Hydra transcriptome
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-03T23%3A21%3A50IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_pubme&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=RNAseq%20versus%20genome-predicted%20transcriptomes:%20a%20large%20population%20of%20novel%20transcripts%20identified%20in%20an%20Illumina-454%20Hydra%20transcriptome&rft.jtitle=BMC%20genomics&rft.au=Wenger,%20Yvan&rft.date=2013-03-25&rft.volume=14&rft.issue=1&rft.spage=204&rft.epage=204&rft.pages=204-204&rft.issn=1471-2164&rft.eissn=1471-2164&rft_id=info:doi/10.1186/1471-2164-14-204&rft_dat=%3Cproquest_pubme%3E1352285724%3C/proquest_pubme%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1326245943&rft_id=info:pmid/23530871&rfr_iscdi=true