Abstract 5105: A plug-and-play infrastructure for scalable bioinformatics operations

Genome profiling represents a critical pillar for clinical, translational, and basic research studies. Hospitals, core facilities, and research enterprises invest significant resources to generate genomic data sets. Yet, data management and analysis is frequently manual, which demands significant op...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Cancer research (Chicago, Ill.) Ill.), 2019-07, Vol.79 (13_Supplement), p.5105-5105
Hauptverfasser:	Medina-Martínez, Juan S., Arango-Ossa, Juan E., Gundem, Gunes, Levine, Max F., Patel, Minal, Farnoud, Noushin R., Yellapantula, Venkata D., Teng, Gao, Mccarter, Joseph G., Bernard, Elsa, Rapaport, Franck, Glodzik, Dominik, Levine, Ross L., Kung, Andrew, Papaemmanuil, Elli
Format:	Artikel
Sprache:	eng
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Genome profiling represents a critical pillar for clinical, translational, and basic research studies. Hospitals, core facilities, and research enterprises invest significant resources to generate genomic data sets. Yet, data management and analysis is frequently manual, which demands significant operator time and often results in siloed resources rendering them as single-use assets. Centralization of the genomic capital in a framework that enables automated processing, metadata integration and continuous interrogation maximizes return for investment and serves as the critical catalyst for research innovation, clinical translation and reproducible research. We developed Isabl, a plug-and-play infrastructure for scalable bioinformatics operations. Isabl provides solutions for databasing, assets management, tracking, automated and reproducible data processing. Dynamic reporting and meta-analysis across data assets is enabled. Isabl is built on four main components. First, an individual-centric and extensible relational database with tracking support for samples (temporal, spatial, aliquot), experimental data (assays, platforms, sequencing runs), cohorts (clinical trials, research projects) and versioned bioinformatics applications (assembly aware, tools, results). Second, the database is exposed through a fully featured RESTful API that enables horizontal integration with information systems such as sequencing cores LIMS, variant visualization platforms like cBioPortal, and where applicable, clinical and biospecimen institutional databases. Third, a Software Development Kit (SDK) built for Next Generation Sequencing assets management. The SDK enables automated execution of data import and language-agnostic bioinformatics applications (alignment, variant calling, post-processing) with support for cohort and individual level reporting features. Furthermore, the SDK facilitates dynamic retrieval of results using vertical and horizontal queries (individual and cohort level, respectively). Lastly, Isabl comes with a Single Page Web Application that fosters user interaction with multidisciplinary teams (i.e. researchers, project coordinators, engineers, clinicians) facilitating tracking of analyses, results visualization, and dynamic query processing. Isabl is currently supporting the Memorial Sloan Kettering Genome Pediatrics Precision Medicine Initiative, a prototype platform that delivers integrated, real-time automated reporting of clinical targeted gene re-se
ISSN:	0008-5472 1538-7445
DOI:	10.1158/1538-7445.AM2019-5105