An automated information extraction model for unstructured discharge letters using large language models and GPT-4

The administrative burden of manually extracting clinical information from discharge letters is a common challenge in healthcare. This study aims to explore the use of Large Language Models (LLMs), specifically Generative Pretrained Transformer 4 (GPT-4) by OpenAI, for automated extraction of diagno...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Healthcare analytics (New York, N.Y.) N.Y.), 2025-06, Vol.7, p.100378, Article 100378
Hauptverfasser: Siepmann, Robert M., Baldini, Giulia, Schmidt, Cynthia S., Truhn, Daniel, Müller-Franzes, Gustav Anton, Dada, Amin, Kleesiek, Jens, Nensa, Felix, Hosch, René
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The administrative burden of manually extracting clinical information from discharge letters is a common challenge in healthcare. This study aims to explore the use of Large Language Models (LLMs), specifically Generative Pretrained Transformer 4 (GPT-4) by OpenAI, for automated extraction of diagnoses, medications, and allergies from discharge letters. Data for this study were sourced from two healthcare institutions in Germany, comprising discharge letters for ten patients from each institution. The first experiment is conducted using a standardized prompt for information extraction. However, challenges were encountered, and the prompt was fine-tuned in a second experiment to improve the results. We further tested whether open-source LLMs can achieve similar results. In the first experiment, primary diagnoses were identified with 85% accuracy and secondary diagnoses with 55.8%. Medications and allergies were extracted with 85.9% and 100% accuracy, respectively. The International Classification of Diseases, 10th revision (ICD-10) codes for the identified diagnoses achieved an accuracy of 85% for primary diagnoses and 60.7% for secondary diagnoses. Anatomical Therapeutic Chemical (ATC) codes were identified with an accuracy of 78.8%. On the other hand, open-source LLMs did not provide similar levels of accuracy and could not consistently fill the template. With prompt fine-tuning in the second experiment, the primary diagnoses, secondary diagnoses, and medications could be predicted with 95%, 88.9%, and 92.2% accuracy, respectively. GPT-4 shows excellent potential for automated extraction of crucial diagnostic and medication information from discharge letters, presumably lowering the administrative burden for healthcare professionals and improving patient outcomes. [Display omitted] •GPT-4 can be used for information extraction in unstructured discharge letters.•GPT-4 can extract diagnoses, medications, allergies using relevant coding systems.•Prompt fine-tuning improves the results and targets specific mistakes.•GPT-4 achieves 95% and 88.9% accuracy for the primary and secondary diagnoses.•GPT-4 predicts the medications with 92.2% accuracy.
ISSN:2772-4425
2772-4425
DOI:10.1016/j.health.2024.100378