Sugarcane genome sequencing by methylation filtration provides tools for genomic research in the genus S accharum
Many economically important crops have large and complex genomes that hamper their sequencing by standard methods such as whole genome shotgun ( WGS ). Large tracts of methylated repeats occur in plant genomes that are interspersed by hypomethylated gene‐rich regions. Gene‐enrichment strategies base...
Gespeichert in:
Veröffentlicht in: | The Plant journal : for cell and molecular biology 2014-07, Vol.79 (1), p.162-172 |
---|---|
Hauptverfasser: | , , , , , , , , , , |
Format: | Artikel |
Sprache: | eng |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Many economically important crops have large and complex genomes that hamper their sequencing by standard methods such as whole genome shotgun (
WGS
). Large tracts of methylated repeats occur in plant genomes that are interspersed by hypomethylated gene‐rich regions. Gene‐enrichment strategies based on methylation profiles offer an alternative to sequencing repetitive genomes. Here, we have applied methyl filtration with Mcr
BC
endonuclease digestion to enrich for euchromatic regions in the sugarcane genome. To verify the efficiency of methylation filtration and the assembly quality of sequences submitted to gene‐enrichment strategy, we have compared assemblies using methyl‐filtered (
MF
) and unfiltered (
UF
) libraries. The use of methy filtration allowed a better assembly by filtering out 35% of the sugarcane genome and by producing 1.5× more scaffolds and 1.7× more assembled Mb in length compared with unfiltered dataset. The coverage of sorghum coding sequences (
CDS
) by
MF
scaffolds was at least 36% higher than by the use of
UF
scaffolds. Using
MF
technology, we increased by 134× the coverage of gene regions of the monoploid sugarcane genome. The
MF
reads assembled into scaffolds that covered all genes of the sugarcane bacterial artificial chromosomes (
BAC
s), 97.2% of sugarcane expressed sequence tags (
EST
s), 92.7% of sugarcane
RNA
‐seq reads and 98.4% of sorghum protein sequences. Analysis of
MF
scaffolds from encoded enzymes of the sucrose/starch pathway discovered 291 single‐nucleotide polymorphisms (
SNP
s) in the wild sugarcane species,
S
. spontaneum
and
S
. officinarum
. A large number of micro
RNA
genes was also identified in the
MF
scaffolds. The information achieved by the
MF
dataset provides a valuable tool for genomic research in the genus
S
accharum
and for improvement of sugarcane as a biofuel crop. |
---|---|
ISSN: | 0960-7412 1365-313X |
DOI: | 10.1111/tpj.12539 |