Comparison of high-throughput sequencing data compression tools

A team of international scientists benchmark current compression methods for high-throughput sequencing data. High-throughput sequencing (HTS) data are commonly stored as raw sequencing reads in FASTQ format or as reads mapped to a reference, in SAM format, both with large memory footprints. Worldwi...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Nature methods 2016-12, Vol.13 (12), p.1005-1008
Hauptverfasser: Numanagić, Ibrahim, Bonfield, James K, Hach, Faraz, Voges, Jan, Ostermann, Jörn, Alberti, Claudio, Mattavelli, Marco, Sahinalp, S Cenk
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:A team of international scientists benchmark current compression methods for high-throughput sequencing data. High-throughput sequencing (HTS) data are commonly stored as raw sequencing reads in FASTQ format or as reads mapped to a reference, in SAM format, both with large memory footprints. Worldwide growth of HTS data has prompted the development of compression methods that aim to significantly reduce HTS data size. Here we report on a benchmarking study of available compression methods on a comprehensive set of HTS data using an automated framework.
ISSN:1548-7091
1548-7105
DOI:10.1038/nmeth.4037