Highly accurate fluorogenic DNA sequencing with information theory–based error correction

The error rate of DNA sequencing is much reduced by combining an improved fluorogenic sequencing chemistry with a new error-correction scheme. Eliminating errors in next-generation DNA sequencing has proved challenging. Here we present error-correction code (ECC) sequencing, a method to greatly impr...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Nature biotechnology 2017-12, Vol.35 (12), p.1170-1178
Hauptverfasser: Chen, Zitian, Zhou, Wenxiong, Qiao, Shuo, Kang, Li, Duan, Haifeng, Xie, X Sunney, Huang, Yanyi
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The error rate of DNA sequencing is much reduced by combining an improved fluorogenic sequencing chemistry with a new error-correction scheme. Eliminating errors in next-generation DNA sequencing has proved challenging. Here we present error-correction code (ECC) sequencing, a method to greatly improve sequencing accuracy by combining fluorogenic sequencing-by-synthesis (SBS) with an information theory–based error-correction algorithm. ECC embeds redundancy in sequencing reads by creating three orthogonal degenerate sequences, generated by alternate dual-base reactions. This is similar to encoding and decoding strategies that have proved effective in detecting and correcting errors in information communication and storage. We show that, when combined with a fluorogenic SBS chemistry with raw accuracy of 98.1%, ECC sequencing provides single-end, error-free sequences up to 200 bp. ECC approaches should enable accurate identification of extremely rare genomic variations in various applications in biology and medicine.
ISSN:1087-0156
1546-1696
DOI:10.1038/nbt.3982