SquiggleNet: real-time, direct classification of nanopore signals

We present SquiggleNet, the first deep-learning model that can classify nanopore reads directly from their electrical signals. SquiggleNet operates faster than DNA passes through the pore, allowing real-time classification and read ejection. Using 1 s of sequencing data, the classifier achieves sign...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Genome Biology 2021-10, Vol.22 (1), p.298-16, Article 298
Hauptverfasser: Bao, Yuwei, Wadden, Jack, Erb-Downward, John R., Ranjan, Piyush, Zhou, Weichen, McDonald, Torrin L., Mills, Ryan E., Boyle, Alan P., Dickson, Robert P., Blaauw, David, Welch, Joshua D.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:We present SquiggleNet, the first deep-learning model that can classify nanopore reads directly from their electrical signals. SquiggleNet operates faster than DNA passes through the pore, allowing real-time classification and read ejection. Using 1 s of sequencing data, the classifier achieves significantly higher accuracy than base calling followed by sequence alignment. Our approach is also faster and requires an order of magnitude less memory than alignment-based approaches. SquiggleNet distinguished human from bacterial DNA with over 90% accuracy, generalized to unseen bacterial species in a human respiratory meta genome sample, and accurately classified sequences containing human long interspersed repeat elements.
ISSN:1474-760X
1474-7596
1474-760X
DOI:10.1186/s13059-021-02511-y