An efficient interpretable stacking ensemble model for lung cancer prognosis
Lung cancer significantly contributes to global cancer mortality, posing challenges in clinical management. Early detection and accurate prognosis are crucial for improving patient outcomes. This study develops an interpretable stacking ensemble model (SEM) for lung cancer prognosis prediction and i...
Gespeichert in:
Veröffentlicht in: | Computational biology and chemistry 2024-12, Vol.113, p.108248, Article 108248 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | Lung cancer significantly contributes to global cancer mortality, posing challenges in clinical management. Early detection and accurate prognosis are crucial for improving patient outcomes. This study develops an interpretable stacking ensemble model (SEM) for lung cancer prognosis prediction and identifies key risk factors. Using a Kaggle dataset of 1000 patients with 22 variables, the model classifies prognosis into Low, Medium, and High-risk categories. The bootstrap method was employed for evaluation metrics, while SHAP (Shapley Additive Explanations) and LIME (Local Interpretable Model-agnostic Explanations) assessed model interpretability. Results showed SEM's superior interpretability over traditional models, such as Random Forest, Logistic Regression, Decision Tree, Gradient Boosting Machine, Extreme Gradient Boosting Machine, and Light Gradient Boosting Machine. SEM achieved an accuracy of 98.90 %, precision of 98.70 %, F1 score of 98.85 %, sensitivity of 98.77 %, specificity of 95.45 %, Cohen’s kappa value of 94.56 %, and an AUC of 98.10 %. The SEM demonstrated robust performance in lung cancer prognosis, revealing chronic lung cancer and genetic risk as major factors.
[Display omitted]
•Developed an interpretable stacking ensemble model (SEM) for lung cancer prognosis prediction.•SEM outperformed traditional models with an accuracy of 98.90 %, precision of 98.70, sensitivity of 98.77 %, and AUC of 98.10 %.•Integrated SHAP and LIME techniques to enhance model interpretability and facilitate clinical decision-making.•Identified chronic lung cancer, genetic risk, and other key factors as significant predictors of lung cancer prognosis. |
---|---|
ISSN: | 1476-9271 1476-928X 1476-928X |
DOI: | 10.1016/j.compbiolchem.2024.108248 |