A scalable method for supporting multiple patient cohort discovery projects using i2b2

[Display omitted] •i2b2, a popular cohort discovery tool, can be used to support multiple projects.•Creating separate tables for these projects is time- and space-consuming.•Using views, we reduced processing time and disk space for multiple i2b2 projects.•This approach is generalizable to other dat...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Journal of biomedical informatics 2018-08, Vol.84, p.179-183
Hauptverfasser: Sholle, Evan T., Davila, Marcos A., Kabariti, Joseph, Schwartz, Julian Z., Varughese, Vinay I., Cole, Curtis L., Campion, Thomas R.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:[Display omitted] •i2b2, a popular cohort discovery tool, can be used to support multiple projects.•Creating separate tables for these projects is time- and space-consuming.•Using views, we reduced processing time and disk space for multiple i2b2 projects.•This approach is generalizable to other data models, such as the OMOP CDM. Although i2b2, a popular platform for patient cohort discovery using electronic health record (EHR) data, can support multiple projects specific to individual disease areas or research interests, the standard approach for doing so duplicates data across projects, requiring additional disk space and processing time, which limits scalability. To address this deficiency, we developed a novel approach that stored data in a single i2b2 fact table and used structured query language (SQL) views to access data for specific projects. Compared to the standard approach, the view-based approach reduced required disk space by 59% and extract-transfer-load (ETL) time by 46%, without substantially impacting query performance. The view-based approach has enabled scalability of multiple i2b2 projects and generalized to another data model at our institution. Other institutions may benefit from this approach, code of which is available on GitHub (https://github.com/wcmc-research-informatics/super-i2b2).
ISSN:1532-0464
1532-0480
DOI:10.1016/j.jbi.2018.07.010