Strategies for analyzing bisulfite sequencing data

•Presentation of all the necessary steps of downstream analysis for bisulfite sequencing experiments starting from read alignment and quality check.•Comparison of differential methylation methods.•Comparison of methylome segmentation methods.•Suggestions for dealing with large data sets using on-dis...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of biotechnology 2017-11, Vol.261, p.105-115
Hauptverfasser: Wreczycka, Katarzyna, Gosdschan, Alexander, Yusuf, Dilmurat, Grüning, Björn, Assenov, Yassen, Akalin, Altuna
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•Presentation of all the necessary steps of downstream analysis for bisulfite sequencing experiments starting from read alignment and quality check.•Comparison of differential methylation methods.•Comparison of methylome segmentation methods.•Suggestions for dealing with large data sets using on-disk data structures.•Review of guided user interfaces for methylation analysis. DNA methylation is one of the main epigenetic modifications in the eukaryotic genome; it has been shown to play a role in cell-type specific regulation of gene expression, and therefore cell-type identity. Bisulfite sequencing is the gold-standard for measuring methylation over the genomes of interest. Here, we review several techniques used for the analysis of high-throughput bisulfite sequencing. We introduce specialized short-read alignment techniques as well as pre/post-alignment quality check methods to ensure data quality. Furthermore, we discuss subsequent analysis steps after alignment. We introduce various differential methylation methods and compare their performance using simulated and real bisulfite sequencing datasets. We also discuss the methods used to segment methylomes in order to pinpoint regulatory regions. We introduce annotation methods that can be used for further classification of regions returned by segmentation and differential methylation methods. Finally, we review software packages that implement strategies to efficiently deal with large bisulfite sequencing datasets locally and we discuss online analysis workflows that do not require any prior programming skills. The analysis strategies described in this review will guide researchers at any level to the best practices of bisulfite sequencing analysis.
ISSN:0168-1656
1873-4863
DOI:10.1016/j.jbiotec.2017.08.007