Early temporal prediction of Type 2 Diabetes Risk Condition from a General Practitioner Electronic Health Record: A Multiple Instance Boosting Approach
•MIL-Boost is applied to predict insulin resistance worsening.•MIL-Boost is applied to temporal EHR data stored by a general practitioner.•MIL-Boost overcomes other competitors (Recall from 0.70 and up to 0.83).•MIL-Boost identifies also non-conventional insulin resistance biomarkers.•MIL-Boost may...
Gespeichert in:
Veröffentlicht in: | Artificial intelligence in medicine 2020-05, Vol.105, p.101847-101847, Article 101847 |
---|---|
Hauptverfasser: | , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | •MIL-Boost is applied to predict insulin resistance worsening.•MIL-Boost is applied to temporal EHR data stored by a general practitioner.•MIL-Boost overcomes other competitors (Recall from 0.70 and up to 0.83).•MIL-Boost identifies also non-conventional insulin resistance biomarkers.•MIL-Boost may represent the main core of a clinical decision support system.
Early prediction of target patients at high risk of developing Type 2 diabetes (T2D) plays a significant role in preventing the onset of overt disease and its associated comorbidities. Although fundamental in early phases of T2D natural history, insulin resistance is not usually quantified by General Practitioners (GPs). Triglyceride-glucose (TyG) index has been proven useful in clinical studies for quantifying insulin resistance and for the early identification of individuals at T2D risk but still not applied by GPs for diagnostic purposes. The aim of this study is to propose a multiple instance learning boosting algorithm (MIL-Boost) for creating a predictive model capable of early prediction of worsening insulin resistance (low vs high T2D risk) in terms of TyG index. The MIL-Boost is applied to past electronic health record (EHR) patients’ information stored by a single GP. The proposed MIL-Boost algorithm proved to be effective in dealing with this task, by performing better than the other state-of-the-art ML competitors (Recall from 0.70 and up to 0.83). The proposed MIL-based approach is able to extract hidden patterns from past EHR temporal data, even not directly exploiting triglycerides and glucose measurements. The major advantages of our method can be found in its ability to model the temporal evolution of longitudinal EHR data while dealing with small sample size and variability in the observations (e.g., a small variable number of prescriptions for non-hospitalized patients). The proposed algorithm may represent the main core of a clinical decision support system. |
---|---|
ISSN: | 0933-3657 1873-2860 |
DOI: | 10.1016/j.artmed.2020.101847 |