Accurate detection of mosaic variants in sequencing data without matched controls

Detection of mosaic mutations that arise in normal development is challenging, as such mutations are typically present in only a minute fraction of cells and there is no clear matched control for removing germline variants and systematic artifacts. We present MosaicForecast, a machine-learning metho...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Nature biotechnology 2020-03, Vol.38 (3), p.314-319
Hauptverfasser: Dou, Yanmei, Kwon, Minseok, Rodin, Rachel E., Cortés-Ciriano, Isidro, Doan, Ryan, Luquette, Lovelace J., Galor, Alon, Bohrson, Craig, Walsh, Christopher A., Park, Peter J.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Detection of mosaic mutations that arise in normal development is challenging, as such mutations are typically present in only a minute fraction of cells and there is no clear matched control for removing germline variants and systematic artifacts. We present MosaicForecast, a machine-learning method that leverages read-based phasing and read-level features to accurately detect mosaic single-nucleotide variants and indels, achieving a multifold increase in specificity compared with existing algorithms. Using single-cell sequencing and targeted sequencing, we validated 80–90% of the mosaic single-nucleotide variants and 60–80% of indels detected in human brain whole-genome sequencing data. Our method should help elucidate the contribution of mosaic somatic mutations to the origin and development of disease. MosaicForecast detects mosaic single-nucleotide variants and indels in human samples.
ISSN:1087-0156
1546-1696
DOI:10.1038/s41587-019-0368-8