SYSTEMS AND METHODS FOR ITERATIVE AND SCALABLE POPULATION-SCALE VARIANT ANALYSIS

An iterative process may be implemented for incrementally aggregating available batches of sample data with previously available batches to perform sequencing analysis. Genomic variant call files associated with one or more samples may be received in batches from sequencing devices and aggregated fo...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: HUANG, Zhuoyi, BIRNBAUM, Adam, SCHULZ-TRIEGLAFF, Ole Benjamin, MONTAÑO, Bernardo Ochoa, DE BEER, Jacobus
Format: Patent
Sprache:eng ; fre ; ger
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:An iterative process may be implemented for incrementally aggregating available batches of sample data with previously available batches to perform sequencing analysis. Genomic variant call files associated with one or more samples may be received in batches from sequencing devices and aggregated for performing sequencing analysis. The aggregated genomic variant call files may be used to generate cohort files and census files that comprise summary information related to the genomic variant call files in each batch. The census data in census files may be aggregated into a global census file that includes summary genome variant data. Multi-sample variant call files may be generated based on the global census file, cohort files, and census files. The genomic variant call files may be processed using parallel processing at multiple compute nodes. The files may be further compressed and overlapping data may be efficiently stored in buffer positions.