Collaboration between explainable artificial intelligence and pulmonologists improves the accuracy of pulmonary function test interpretation

BACKGROUND: Few studies have investigated the collaborative potential between artificial intelligence (AI) and pulmonologists for diagnosing pulmonary disease. We hypothesised that the collaboration between a pulmonologist and AI with explanations (explainable AI (XAI)) is superior in diagnostic int...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:EUROPEAN RESPIRATORY JOURNAL 2023-05, Vol.61 (5)
Hauptverfasser: Das, Nilakash, Happaerts, Sofie, Gyselinck, Iwein, Staes, Michael, Derom, Eric, Brusselle, Guy, Burgos, Felip, Contoli, Marco, Dinh-Xuan, Anh Tuan, Franssen, Frits M.E, Gonem, Sherif, Greening, Neil, Haenebalcke, Christel, Man, William D.-C, Moises, Jorge, Peche, Rudi, Poberezhets, Vitalii, Quint, Jennifer K, Steiner, Michael C, Vanderhelst, Eef, Abdo, Mustafa, Topalovic, Marko, Janssens, Wim
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:BACKGROUND: Few studies have investigated the collaborative potential between artificial intelligence (AI) and pulmonologists for diagnosing pulmonary disease. We hypothesised that the collaboration between a pulmonologist and AI with explanations (explainable AI (XAI)) is superior in diagnostic interpretation of pulmonary function tests (PFTs) than the pulmonologist without support. METHODS: The study was conducted in two phases, a monocentre study (phase 1) and a multicentre intervention study (phase 2). Each phase utilised two different sets of 24 PFT reports of patients with a clinically validated gold standard diagnosis. Each PFT was interpreted without (control) and with XAI's suggestions (intervention). Pulmonologists provided a differential diagnosis consisting of a preferential diagnosis and optionally up to three additional diagnoses. The primary end-point compared accuracy of preferential and additional diagnoses between control and intervention. Secondary end-points were the number of diagnoses in differential diagnosis, diagnostic confidence and inter-rater agreement. We also analysed how XAI influenced pulmonologists' decisions. RESULTS: In phase 1 (n=16 pulmonologists), mean preferential and differential diagnostic accuracy significantly increased by 10.4% and 9.4%, respectively, between control and intervention (p
ISSN:0903-1936