Applying machine learning to identify pediatric patients with newly diagnosed acute lymphoblastic leukemia using administrative data

Case identification in administrative databases is challenging as diagnosis codes alone are not adequate for case ascertainment. We utilized machine learning (ML) to efficiently identify pediatric patients with newly diagnosed acute lymphoblastic leukemia. We tested nine ML models and validated the...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Pediatric blood & cancer 2024-03, Vol.71 (3), p.e30858-n/a
Hauptverfasser: Cao, Lusha, Huang, Yuan‐shung, Getz, Kelly D., Seif, Alix E., Ruiz, Jenny, Miller, Tamara P., Fisher, Brian T., Aplenc, Richard, Li, Yimei
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Case identification in administrative databases is challenging as diagnosis codes alone are not adequate for case ascertainment. We utilized machine learning (ML) to efficiently identify pediatric patients with newly diagnosed acute lymphoblastic leukemia. We tested nine ML models and validated the best model internally and externally. The optimal model had 97% positive predictive value (PPV) and 99% sensitivity in internal validation; 94% PPV and 82% sensitivity in external validation. Our ML model identified a large cohort of 21,044 patients, demonstrating an efficient approach for cohort assembly and enhancing the usability of administrative data.
ISSN:1545-5009
1545-5017
DOI:10.1002/pbc.30858