Genome-wide detection of cytosine methylations in plant from Nanopore data using deep learning

Bibliographic details
Published in: Nature Communications 2021-10, Vol.12 (1), p.5976-11, Article 5976
Main authors: Ni, Peng; Huang, Neng; Nie, Fan; Zhang, Jun; Zhang, Zhi; Wu, Bo; Bai, Lu; Liu, Wende; Xiao, Chuan-Le; Luo, Feng; Wang, Jianxin
Format: Article
Language: English
Description
Summary: In plants, cytosine DNA methylations (5mCs) can occur in three sequence contexts, CpG, CHG, and CHH (where H = A, C, or T), which play different roles in the regulation of biological processes. Although long Nanopore reads are advantageous for detecting 5mCs compared to short-read bisulfite sequencing, existing methods can only detect 5mCs in the CpG context, which limits their application in plants. Here, we develop DeepSignal-plant, a deep learning tool to detect genome-wide 5mCs of all three contexts in plants from Nanopore reads. We sequence Arabidopsis thaliana and Oryza sativa using both Nanopore and bisulfite sequencing. We develop a denoising process for training models, which enables DeepSignal-plant to achieve high correlations with bisulfite sequencing for 5mC detection in all three contexts. Furthermore, DeepSignal-plant can profile more 5mC sites, which will help to provide a more complete understanding of epigenetic mechanisms of different biological processes.
Editor's summary: Existing methods cannot profile genome-wide cytosine DNA methylations (5mCs) in all three contexts with acceptable accuracy. Here, the authors develop a deep learning tool to detect genome-wide 5mCs of all three contexts in plants with high accuracy from Nanopore reads.
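Illustration: the three sequence contexts named in the summary can be told apart from the two bases that follow a cytosine. The Python sketch below is only a reading aid for that definition, not DeepSignal-plant's code. The function name `cytosine_context` is hypothetical, and the sketch assumes a forward-strand sequence over the alphabet A, C, G, T; cytosines on the reverse strand would need the reverse complement first.

```python
from typing import Optional

# H stands for any base except G, as in the summary (H = A, C, or T).
H_BASES = frozenset("ACT")


def cytosine_context(seq: str, pos: int) -> Optional[str]:
    """Return "CpG", "CHG", or "CHH" for the forward-strand cytosine at seq[pos].

    Returns None when seq[pos] is not a cytosine, when the position is out
    of range, or when too few (or ambiguous) downstream bases are available
    to decide the context. Hypothetical helper for illustration only.
    """
    seq = seq.upper()
    if not 0 <= pos < len(seq) or seq[pos] != "C":
        return None

    downstream = seq[pos + 1 : pos + 3]  # up to two bases after the C

    # CpG: the cytosine is immediately followed by G.
    if len(downstream) >= 1 and downstream[0] == "G":
        return "CpG"

    # CHG and CHH both need two downstream bases, the first being H.
    if len(downstream) == 2 and downstream[0] in H_BASES:
        if downstream[1] == "G":
            return "CHG"
        if downstream[1] in H_BASES:
            return "CHH"

    return None
```

For example, in the sequence `CCG` the first cytosine is in the CHG context and the second is in the CpG context.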
ISSN: 2041-1723
DOI: 10.1038/s41467-021-26278-9