Sugarcane genome sequencing by methylation filtration provides tools for genomic research in the genus S accharum

Many economically important crops have large and complex genomes that hamper their sequencing by standard methods such as whole genome shotgun ( WGS ). Large tracts of methylated repeats occur in plant genomes that are interspersed by hypomethylated gene‐rich regions. Gene‐enrichment strategies base...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:The Plant journal : for cell and molecular biology 2014-07, Vol.79 (1), p.162-172
Hauptverfasser: Grativol, Clícia, Regulski, Michael, Bertalan, Marcelo, McCombie, W. Richard, da Silva, Felipe Rodrigues, Zerlotini Neto, Adhemar, Vicentini, Renato, Farinelli, Laurent, Hemerly, Adriana Silva, Martienssen, Robert A., Ferreira, Paulo Cavalcanti Gomes
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Many economically important crops have large and complex genomes that hamper their sequencing by standard methods such as whole genome shotgun ( WGS ). Large tracts of methylated repeats occur in plant genomes that are interspersed by hypomethylated gene‐rich regions. Gene‐enrichment strategies based on methylation profiles offer an alternative to sequencing repetitive genomes. Here, we have applied methyl filtration with Mcr BC endonuclease digestion to enrich for euchromatic regions in the sugarcane genome. To verify the efficiency of methylation filtration and the assembly quality of sequences submitted to gene‐enrichment strategy, we have compared assemblies using methyl‐filtered ( MF ) and unfiltered ( UF ) libraries. The use of methy filtration allowed a better assembly by filtering out 35% of the sugarcane genome and by producing 1.5× more scaffolds and 1.7× more assembled Mb in length compared with unfiltered dataset. The coverage of sorghum coding sequences ( CDS ) by MF scaffolds was at least 36% higher than by the use of UF scaffolds. Using MF technology, we increased by 134× the coverage of gene regions of the monoploid sugarcane genome. The MF reads assembled into scaffolds that covered all genes of the sugarcane bacterial artificial chromosomes ( BAC s), 97.2% of sugarcane expressed sequence tags ( EST s), 92.7% of sugarcane RNA ‐seq reads and 98.4% of sorghum protein sequences. Analysis of MF scaffolds from encoded enzymes of the sucrose/starch pathway discovered 291 single‐nucleotide polymorphisms ( SNP s) in the wild sugarcane species, S . spontaneum and S . officinarum . A large number of micro RNA genes was also identified in the MF scaffolds. The information achieved by the MF dataset provides a valuable tool for genomic research in the genus S accharum and for improvement of sugarcane as a biofuel crop.
ISSN:0960-7412
1365-313X
DOI:10.1111/tpj.12539