Evaluating the capability of ChatGPT in predicting drug–drug interactions: Real‐world evidence using hospitalized patient data

Drug–drug interactions (DDIs) present a significant health burden, compounded by clinician time constraints and poor patient health literacy. We assessed the ability of ChatGPT (generative artificial intelligence‐based large language model) to predict DDIs in a real‐world setting. Demographics, diag...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:British journal of clinical pharmacology 2024-12, Vol.90 (12), p.3361-3366
Hauptverfasser: Radha Krishnan, Ramya Padmavathy, Hung, Euniss Hinyo, Ashford, Megan, Edillo, Clark Ethan, Gardner, Charlise, Hatrick, Hector Blake, Kim, Byungjun, Lai, Angel Wing Yan, Li, Xinran, Zhao, Yvonne Xinyi, Raubenheimer, Jacques Eugene
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Drug–drug interactions (DDIs) present a significant health burden, compounded by clinician time constraints and poor patient health literacy. We assessed the ability of ChatGPT (generative artificial intelligence‐based large language model) to predict DDIs in a real‐world setting. Demographics, diagnoses and prescribed medicines for 120 hospitalized patients were input through three standardized prompts to ChatGPT version 3.5 and compared against pharmacist DDI evaluation to estimate diagnostic accuracy. Area under receiver operating characteristic and inter‐rater reliability (Cohen's and Fleiss' kappa coefficients) were calculated. ChatGPT's responses differed based on prompt wording style, with higher sensitivity for prompts mentioning ‘drug interaction’. Confusion matrices displayed low true positive and high true negative rates, and there was minimal agreement between ChatGPT and pharmacists (Cohen's kappa values 0.077–0.143). Low sensitivity values suggest a lack of success in identifying DDIs by ChatGPT, and further development is required before it can reliably assess potential DDIs in real‐world scenarios.
ISSN:0306-5251
1365-2125
1365-2125
DOI:10.1111/bcp.16275