Performance of explainable artificial intelligence in guiding the management of patients with a pancreatic cyst
Pancreatic cyst management can be distilled into three separate pathways – discharge, monitoring or surgery– based on the risk of malignant transformation. This study compares the performance of artificial intelligence (AI) models to clinical care for this task. Two explainable boosting machine (EBM...
Gespeichert in:
Veröffentlicht in: | Pancreatology : official journal of the International Association of Pancreatology (IAP) ... [et al.] 2024-11, Vol.24 (7), p.1182-1191 |
---|---|
Hauptverfasser: | , , , , , , , , , , , , , , , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Pancreatic cyst management can be distilled into three separate pathways – discharge, monitoring or surgery– based on the risk of malignant transformation. This study compares the performance of artificial intelligence (AI) models to clinical care for this task.
Two explainable boosting machine (EBM) models were developed and evaluated using clinical features only, or clinical features and cyst fluid molecular markers (CFMM) using a publicly available dataset, consisting of 850 cases (median age 64; 65 % female) with independent training (429 cases) and holdout test cohorts (421 cases). There were 137 cysts with no malignant potential, 114 malignant cysts, and 599 IPMNs and MCNs.
The EBM and EBM with CFMM models had higher accuracy for identifying patients requiring monitoring (0.88 and 0.82) and surgery (0.66 and 0.82) respectively compared with current clinical care (0.62 and 0.58). For discharge, the EBM with CFMM model had a higher accuracy (0.91) than either the EBM model (0.84) or current clinical care (0.86). In the cohort of patients who underwent surgical resection, use of the EBM-CFMM model would have decreased the number of unnecessary surgeries by 59 % (n = 92), increased correct surgeries by 7.5 % (n = 11), identified patients who require monitoring by 122 % (n = 76), and increased the number of patients correctly classified for discharge by 138 % (n = 18) compared to clinical care.
EBM models had greater sensitivity and specificity for identifying the correct management compared with either clinical management or previous AI models. The model predictions are demonstrated to be interpretable by clinicians. |
---|---|
ISSN: | 1424-3903 1424-3911 1424-3911 |
DOI: | 10.1016/j.pan.2024.09.001 |