An efficient interpretable stacking ensemble model for lung cancer prognosis

Lung cancer significantly contributes to global cancer mortality, posing challenges in clinical management. Early detection and accurate prognosis are crucial for improving patient outcomes. This study develops an interpretable stacking ensemble model (SEM) for lung cancer prognosis prediction and i...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Computational biology and chemistry 2024-12, Vol.113, p.108248, Article 108248
Hauptverfasser: Arif, Umair, Zhang, Chunxia, Hussain, Sajid, Abbasi, Abdul Rauf
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Lung cancer significantly contributes to global cancer mortality, posing challenges in clinical management. Early detection and accurate prognosis are crucial for improving patient outcomes. This study develops an interpretable stacking ensemble model (SEM) for lung cancer prognosis prediction and identifies key risk factors. Using a Kaggle dataset of 1000 patients with 22 variables, the model classifies prognosis into Low, Medium, and High-risk categories. The bootstrap method was employed for evaluation metrics, while SHAP (Shapley Additive Explanations) and LIME (Local Interpretable Model-agnostic Explanations) assessed model interpretability. Results showed SEM's superior interpretability over traditional models, such as Random Forest, Logistic Regression, Decision Tree, Gradient Boosting Machine, Extreme Gradient Boosting Machine, and Light Gradient Boosting Machine. SEM achieved an accuracy of 98.90 %, precision of 98.70 %, F1 score of 98.85 %, sensitivity of 98.77 %, specificity of 95.45 %, Cohen’s kappa value of 94.56 %, and an AUC of 98.10 %. The SEM demonstrated robust performance in lung cancer prognosis, revealing chronic lung cancer and genetic risk as major factors. [Display omitted] •Developed an interpretable stacking ensemble model (SEM) for lung cancer prognosis prediction.•SEM outperformed traditional models with an accuracy of 98.90 %, precision of 98.70, sensitivity of 98.77 %, and AUC of 98.10 %.•Integrated SHAP and LIME techniques to enhance model interpretability and facilitate clinical decision-making.•Identified chronic lung cancer, genetic risk, and other key factors as significant predictors of lung cancer prognosis.
ISSN:1476-9271
1476-928X
1476-928X
DOI:10.1016/j.compbiolchem.2024.108248