Full‐length de novo assembly of RNA‐seq data in pea (Pisum sativum L.) provides a gene expression atlas and gives insights into root nodulation in this species

Summary Next‐generation sequencing technologies allow an almost exhaustive survey of the transcriptome, even in species with no available genome sequence. To produce a Unigene set representing most of the expressed genes of pea, 20 cDNA libraries produced from various plant tissues harvested at vari...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:The Plant journal : for cell and molecular biology 2015-10, Vol.84 (1), p.1-19
Hauptverfasser: Alves‐Carvalho, Susete, Aubert, Grégoire, Carrère, Sébastien, Cruaud, Corinne, Brochot, Anne‐Lise, Jacquin, Françoise, Klein, Anthony, Martin, Chantal, Boucherot, Karen, Kreplak, Jonathan, Silva, Corinne, Moreau, Sandra, Gamas, Pascal, Wincker, Patrick, Gouzy, Jérôme, Burstin, Judith
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Summary Next‐generation sequencing technologies allow an almost exhaustive survey of the transcriptome, even in species with no available genome sequence. To produce a Unigene set representing most of the expressed genes of pea, 20 cDNA libraries produced from various plant tissues harvested at various developmental stages from plants grown under contrasting nitrogen conditions were sequenced. Around one billion reads and 100 Gb of sequence were de novo assembled. Following several steps of redundancy reduction, 46 099 contigs with N50 length of 1667 nt were identified. These constitute the ‘Caméor’ Unigene set. The high depth of sequencing allowed identification of rare transcripts and detected expression for approximately 80% of contigs in each library. The Unigene set is now available online (http://bios.dijon.inra.fr/FATAL/cgi/pscam.cgi), allowing (i) searches for pea orthologs of candidate genes based on gene sequences from other species, or based on annotation, (ii) determination of transcript expression patterns using various metrics, (iii) identification of uncharacterized genes with interesting patterns of expression, and (iv) comparison of gene ontology pathways between tissues. This resource has allowed identification of the pea orthologs of major nodulation genes characterized in recent years in model species, as a major step towards deciphering unresolved pea nodulation phenotypes. In addition to a remarkable conservation of the early transcriptome nodulation apparatus between pea and Medicago truncatula, some specific features were highlighted. The resource provides a reference for the pea exome, and will facilitate transcriptome and proteome approaches as well as SNP discovery in pea. Significance Statement From Mendel's discovery of the laws of genetics up to the advent of molecular biology, pea has been a valuable model for genetics and physiology. We present a comprehensive inventory of the expressed genes of pea in a readily searchable format. This resource strengthens pea as a model species and will facilitate searches for candidate gene sequences, microarray design, large‐scale proteomics studies, and identification of major genes in available mutant populations.
ISSN:0960-7412
1365-313X
DOI:10.1111/tpj.12967