A survey of the sorghum transcriptome using single-molecule long reads

Alternative splicing and alternative polyadenylation (APA) of pre-mRNAs greatly contribute to transcriptome diversity, coding capacity of a genome and gene regulatory mechanisms in eukaryotes. Second-generation sequencing technologies have been extensively used to analyse transcriptomes. However, a...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Nature communications 2016-06, Vol.7 (1), p.11706-11706, Article 11706
Hauptverfasser: Abdel-Ghany, Salah E., Hamilton, Michael, Jacobi, Jennifer L., Ngam, Peter, Devitt, Nicholas, Schilkey, Faye, Ben-Hur, Asa, Reddy, Anireddy S. N.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Alternative splicing and alternative polyadenylation (APA) of pre-mRNAs greatly contribute to transcriptome diversity, coding capacity of a genome and gene regulatory mechanisms in eukaryotes. Second-generation sequencing technologies have been extensively used to analyse transcriptomes. However, a major limitation of short-read data is that it is difficult to accurately predict full-length splice isoforms. Here we sequenced the sorghum transcriptome using Pacific Biosciences single-molecule real-time long-read isoform sequencing and developed a pipeline called TAPIS (Transcriptome Analysis Pipeline for Isoform Sequencing) to identify full-length splice isoforms and APA sites. Our analysis reveals transcriptome-wide full-length isoforms at an unprecedented scale with over 11,000 novel splice isoforms. Additionally, we uncover APA of ∼11,000 expressed genes and more than 2,100 novel genes. These results greatly enhance sorghum gene annotations and aid in studying gene regulation in this important bioenergy crop. The TAPIS pipeline will serve as a useful tool to analyse Iso-Seq data from any organism. Alternative splicing and alternative polyadenylation (APA) contribute to mRNA diversity but are difficult to assess using short read RNA-seq data. Here, the authors use single molecule long-read isoform sequencing and develop a computational pipeline to identify full-length splice isoforms and APA sites in sorghum.
ISSN:2041-1723
2041-1723
DOI:10.1038/ncomms11706