Predictive Mortality and Gastric Cancer Risk Using Clinical and Socio-Economic Data: A Nationwide Multicenter Cohort Study

Gastric cancer is a leading cause of cancer-related mortality, particularly in East Asia, with a notable burden in Republic of Korea. This study aimed to construct and develop machine learning models for the prediction of gastric cancer mortality and the identification of risk factors. All data were...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Cancers 2024-12, Vol.17 (1), p.30
Hauptverfasser: Kang, Seong Uk, Nam, Seung-Joo, Kwon, Oh Beom, Yim, Inhyeok, Kim, Tae-Hoon, Yeo, Na Young, Lim, Myoung Nam, Kim, Woo Jin, Park, Sang Won
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Gastric cancer is a leading cause of cancer-related mortality, particularly in East Asia, with a notable burden in Republic of Korea. This study aimed to construct and develop machine learning models for the prediction of gastric cancer mortality and the identification of risk factors. All data were acquired from the Korean Clinical Data Utilization for Research Excellence by multiple medical centers in South Korea. A total of 23,717 gastric cancer patients were divided into two groups by cause of mortality (all-cause of 2664 and disease-specific of 1620) and investigated. We used comprehensive data integrating clinical, pathological, lifestyle, and socio-economic factors. Cox proportional hazards analysis was conducted to estimate hazard ratios for mortality. Five machine learning models (random forest, gradient boosting machine, XGBoost, light GBM, and cat boosting) were developed to predict mortality. The models were interpreted by SHAP, one of the explainable AI techniques. For all-cause mortality, the gradient-boosting machine learning model demonstrated the highest performance with an AUC-ROC of 0.795. For disease-specific mortality, the light GBM model outperformed others, achieving an AUC-ROC of 0.867. Significant predictors included the AJCC7 stage, tumor size, lymph node count, and lifestyle factors such as smoking, drinking, and diabetes. This study underscores the importance of integrating both clinical and lifestyle data to enhance mortality prediction accuracy in gastric cancer patients. The findings highlight the need for personalized treatment approaches in the Korean population and emphasize the role of demographic-specific data in predictive modeling.
ISSN:2072-6694
2072-6694
DOI:10.3390/cancers17010030