Diagnostic and comparative performance for the prediction of tuberculous pleural effusion using machine learning algorithms

Early diagnosis and differential diagnosis of tuberculous pleural effusion (TPE) remains challenging and is critical to the patients' prognosis. The present study aimed to develop nine machine learning (ML) algorithms for early diagnosis of TPE and compare their performance. A total of 1435 unt...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:International journal of medical informatics (Shannon, Ireland) Ireland), 2024-02, Vol.182, p.105320-105320, Article 105320
Hauptverfasser: Liu, Yanqing, Liang, Zhigang, Yang, Jing, Yuan, Songbo, Wang, Shanshan, Huang, Weina, Wu, Aihua
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Early diagnosis and differential diagnosis of tuberculous pleural effusion (TPE) remains challenging and is critical to the patients' prognosis. The present study aimed to develop nine machine learning (ML) algorithms for early diagnosis of TPE and compare their performance. A total of 1435 untreated patients with pleural effusions (PEs) were retrospectively included and divided into the training set (80%) and the test set (20%). The demographic and laboratory variables were collected, preprocessed, and analyzed to select features, which were fed into nine ML algorithms to develop an optimal diagnostic model for TPE. The proposed model was validated by an independently external data. The decision curve analysis (DCA) and the SHapley Additive exPlanations (SHAP) were also applied. Support vector machine (SVM) was the best model in discriminating TPE from non-TPE, with a balanced accuracy of 87.7%, precision of 85.3%, area under the curve (AUC) of 0.914, sensitivity of 94.7%, specificity of 80.7%, and F1-score of 86.0% among the nine ML algorithms. The excellent diagnostic performance was also validated by the external data (a balanced accuracy of 87.7%, precision of 85.2%, and AUC of 0.898). Neural network (NN) and K-nearest neighbor (KNN) had better net benefits in clinical usefulness. Besides, PE adenosine deaminase (ADA), PE carcinoembryonic antigen (CEA), and serum CYFRA21-1 were identified as the top three important features for diagnosing TPE. This study developed and validated a SVM model for the early diagnosis of TPE, which might help clinicians provide better diagnosis and treatment for TPE patients.
ISSN:1386-5056
1872-8243
DOI:10.1016/j.ijmedinf.2023.105320