PyIR: a scalable wrapper for processing billions of immunoglobulin and T cell receptor sequences using IgBLAST

Bibliographic details
Published in:	BMC bioinformatics 2020-07, Vol.21 (1), p.314-314, Article 314
Main authors:	Soto, Cinque; Finn, Jessica A; Willis, Jordan R; Day, Samuel B; Sinkovits, Robert S; Jones, Taylor; Schmitz, Samuel; Meiler, Jens; Branchizio, Andre; Crowe, Jr, James E
Format:	Article
Language:	English
Subjects:	Annotations; Antibodies; Antibody; Application programming interface; Base Sequence; Bioinformatics; CDR3; Computer peripherals; Computer systems; Datasets; Deoxyribonucleic acid; DNA; DNA sequencing; Filtration; Genes; Humans; IgBLAST; Illumina; Immune repertoires; Immunoglobulins; Immunoglobulins - genetics; Lymphocytes; Lymphocytes T; Modularity; Multiprocessing; Nucleotide sequence; Receptors, Antigen, T-Cell - genetics; Researchers; Sequence Alignment; Sequence Analysis, DNA; Software; Spawning; T cell receptors; T cells; Time Factors; User-Computer Interface; Work stations

Description
Abstract:	Recent advances in DNA sequencing technologies have enabled significant leaps in capacity to generate large volumes of DNA sequence data, which has spurred a rapid growth in the use of bioinformatics as a means of interrogating antibody variable gene repertoires. Common tools used for annotation of antibody sequences are often limited in functionality, modularity and usability. We have developed PyIR, a Python wrapper and library for IgBLAST, which offers a minimal setup CLI and API, FASTQ support, file chunking for large sequence files, JSON and Python dictionary output, and built-in sequence filtering. PyIR offers improved processing speed over multithreaded IgBLAST (version 1.14) when spawning more than 16 processes on a single computer system. Its customizable filtering and data encapsulation allow it to be adapted to a wide range of computing environments. The API allows for IgBLAST to be used in customized bioinformatics workflows.
ISSN:	1471-2105
DOI:	10.1186/s12859-020-03649-5
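
The abstract mentions file chunking, spawning multiple processes, dictionary/JSON output, and built-in filtering. The following is a minimal Python sketch of that general pattern only; it does not use PyIR or IgBLAST and does not reflect PyIR's actual API. All names here (`read_fasta`, `chunked`, `annotate_chunk`, `keep`, `process`) are hypothetical, and `annotate_chunk` is a stand-in for the step where a real pipeline would run IgBLAST on one chunk.

```python
import json
from concurrent.futures import ProcessPoolExecutor, ThreadPoolExecutor
from itertools import islice


def read_fasta(lines):
    """Yield (header, sequence) pairs from an iterable of FASTA lines."""
    header, parts = None, []
    for line in lines:
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            if header is not None:
                yield header, "".join(parts)
            header, parts = line[1:], []
        else:
            parts.append(line)
    if header is not None:
        yield header, "".join(parts)


def chunked(records, size):
    """Group an iterable of records into lists of at most `size` items."""
    it = iter(records)
    while True:
        chunk = list(islice(it, size))
        if not chunk:
            return
        yield chunk


def annotate_chunk(chunk):
    """Hypothetical stand-in for annotating one chunk (e.g. with IgBLAST).

    Returns one dictionary per sequence.
    """
    return [{"sequence_id": h, "sequence": s, "length": len(s)} for h, s in chunk]


def keep(record, min_length=5):
    """Hypothetical filter: drop reads shorter than `min_length`."""
    return record["length"] >= min_length


def process(lines, chunk_size=2, workers=2, executor_cls=ProcessPoolExecutor):
    """Split records into chunks, annotate the chunks in parallel, then filter.

    `executor_cls` is a parameter so the same code can run with threads
    (ThreadPoolExecutor) where spawning processes is not practical.
    """
    chunks = list(chunked(read_fasta(lines), chunk_size))
    with executor_cls(max_workers=workers) as pool:
        results = pool.map(annotate_chunk, chunks)  # map preserves chunk order
        return [rec for chunk in results for rec in chunk if keep(rec)]


if __name__ == "__main__":
    fasta = [">seq1", "ACGTACGT", ">seq2", "ACG", ">seq3", "GGGTTTAAA"]
    for rec in process(fasta):
        print(json.dumps(rec))  # one JSON object per retained sequence
```

Chunking bounds the amount of input each worker handles at once, which is the property that lets this style of wrapper scale to very large sequence files; the chunk size and worker count shown are arbitrary illustration values.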