Imbalanced survival prediction for gastric cancer patients based on improved XGBoost with cost sensitive and focal loss

Accurate prediction of gastric cancer survival state is one of great significant tasks for clinical decision‐making. Many advanced machine learning classification techniques have been applied to predict the survival status of cancer patients in three or 5 years, however, many of them have a low sens...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Expert systems 2024-11, Vol.41 (11), p.n/a
Hauptverfasser: Xu, Liangchen, Guo, Chonghui
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page n/a
container_issue 11
container_start_page
container_title Expert systems
container_volume 41
creator Xu, Liangchen
Guo, Chonghui
description Accurate prediction of gastric cancer survival state is one of great significant tasks for clinical decision‐making. Many advanced machine learning classification techniques have been applied to predict the survival status of cancer patients in three or 5 years, however, many of them have a low sensitivity because of class imbalance. This is a non‐negligible problem due to the poor prognosis of gastric cancer patients. Furthermore, models in the medical domain require strong interpretability to increase their applicability. Due to the better performance and interpretability of the XGBoost model, we design a loss function taking into account cost sensitive and focal loss from the algorithm level for XGBoost to deal with the imbalance problem. We apply the improved model into the prediction of the survival status of gastric cancer patients and analyse the important related features. We use two types of indicators to evaluate the model, and we also design the confusion matrix of two models' predictive results to compare two models. The results show that the improved model has better performance. Furthermore, we calculate the importance of features related to survival with three different time periods and analyse their evolution, which are consistent with existing clinical research or further expand their research conclusions. These all support for clinically relevant decision‐making and has the potential to expand into survival prediction of other cancer patients.
doi_str_mv 10.1111/exsy.13666
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_3113450572</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>3113450572</sourcerecordid><originalsourceid>FETCH-LOGICAL-c1906-4102c26787f301a3657f0b35ae4c19edac67aba08012cc3572dfc511dc57b2aa3</originalsourceid><addsrcrecordid>eNp9kDFPwzAQhS0EEqWw8AsssSGl2HFityNUpVSqxABIZbIujgOu0jjYbkr_PQ5h5pa74Xvvnh5C15RMaJw7_e2PE8o45ydoRDM-TQibZadoRFLOk0yk5BxdeL8lhFAh-AgdVrsCamiULrHfu850UOPW6dKoYGyDK-vwB_jgjMKqxxxuIRjdBI8L8FEVIbNrne3ivVk-WOsDPpjwiVV_ed14E0ynMTRldFPRvrbeX6KzCmqvr_72GL09Ll7nT8n6ebma368TRWckJqYkVSkXU1ExQoHxXFSkYDnoLAK6BMUFFECmhKZKsVykZaVySkuViyIFYGN0M_jGhF977YPc2r1r4kvJKGVZTqImUrcDpVzM5nQlW2d24I6SEtkXK_ti5W-xEaYDfDC1Pv5DysXm5X3Q_ADrT31V</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3113450572</pqid></control><display><type>article</type><title>Imbalanced survival prediction for gastric cancer patients based on improved XGBoost with cost sensitive and focal loss</title><source>Wiley Online Library Journals Frontfile Complete</source><creator>Xu, Liangchen ; Guo, Chonghui</creator><creatorcontrib>Xu, Liangchen ; Guo, Chonghui</creatorcontrib><description>Accurate prediction of gastric cancer survival state is one of great significant tasks for clinical decision‐making. Many advanced machine learning classification techniques have been applied to predict the survival status of cancer patients in three or 5 years, however, many of them have a low sensitivity because of class imbalance. This is a non‐negligible problem due to the poor prognosis of gastric cancer patients. Furthermore, models in the medical domain require strong interpretability to increase their applicability. Due to the better performance and interpretability of the XGBoost model, we design a loss function taking into account cost sensitive and focal loss from the algorithm level for XGBoost to deal with the imbalance problem. We apply the improved model into the prediction of the survival status of gastric cancer patients and analyse the important related features. We use two types of indicators to evaluate the model, and we also design the confusion matrix of two models' predictive results to compare two models. The results show that the improved model has better performance. Furthermore, we calculate the importance of features related to survival with three different time periods and analyse their evolution, which are consistent with existing clinical research or further expand their research conclusions. These all support for clinically relevant decision‐making and has the potential to expand into survival prediction of other cancer patients.</description><identifier>ISSN: 0266-4720</identifier><identifier>EISSN: 1468-0394</identifier><identifier>DOI: 10.1111/exsy.13666</identifier><language>eng</language><publisher>Oxford: Blackwell Publishing Ltd</publisher><subject>Algorithms ; Cost analysis ; cost‐sensitive ; data imbalance ; Design analysis ; focal loss ; Gastric cancer ; Machine learning ; Medical prognosis ; Performance evaluation ; Performance prediction ; Survival ; survival prediction</subject><ispartof>Expert systems, 2024-11, Vol.41 (11), p.n/a</ispartof><rights>2024 John Wiley &amp; Sons Ltd.</rights><rights>2024 John Wiley &amp; Sons, Ltd.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><cites>FETCH-LOGICAL-c1906-4102c26787f301a3657f0b35ae4c19edac67aba08012cc3572dfc511dc57b2aa3</cites><orcidid>0000-0002-5155-1297</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://onlinelibrary.wiley.com/doi/pdf/10.1111%2Fexsy.13666$$EPDF$$P50$$Gwiley$$H</linktopdf><linktohtml>$$Uhttps://onlinelibrary.wiley.com/doi/full/10.1111%2Fexsy.13666$$EHTML$$P50$$Gwiley$$H</linktohtml><link.rule.ids>314,780,784,1416,27923,27924,45573,45574</link.rule.ids></links><search><creatorcontrib>Xu, Liangchen</creatorcontrib><creatorcontrib>Guo, Chonghui</creatorcontrib><title>Imbalanced survival prediction for gastric cancer patients based on improved XGBoost with cost sensitive and focal loss</title><title>Expert systems</title><description>Accurate prediction of gastric cancer survival state is one of great significant tasks for clinical decision‐making. Many advanced machine learning classification techniques have been applied to predict the survival status of cancer patients in three or 5 years, however, many of them have a low sensitivity because of class imbalance. This is a non‐negligible problem due to the poor prognosis of gastric cancer patients. Furthermore, models in the medical domain require strong interpretability to increase their applicability. Due to the better performance and interpretability of the XGBoost model, we design a loss function taking into account cost sensitive and focal loss from the algorithm level for XGBoost to deal with the imbalance problem. We apply the improved model into the prediction of the survival status of gastric cancer patients and analyse the important related features. We use two types of indicators to evaluate the model, and we also design the confusion matrix of two models' predictive results to compare two models. The results show that the improved model has better performance. Furthermore, we calculate the importance of features related to survival with three different time periods and analyse their evolution, which are consistent with existing clinical research or further expand their research conclusions. These all support for clinically relevant decision‐making and has the potential to expand into survival prediction of other cancer patients.</description><subject>Algorithms</subject><subject>Cost analysis</subject><subject>cost‐sensitive</subject><subject>data imbalance</subject><subject>Design analysis</subject><subject>focal loss</subject><subject>Gastric cancer</subject><subject>Machine learning</subject><subject>Medical prognosis</subject><subject>Performance evaluation</subject><subject>Performance prediction</subject><subject>Survival</subject><subject>survival prediction</subject><issn>0266-4720</issn><issn>1468-0394</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9kDFPwzAQhS0EEqWw8AsssSGl2HFityNUpVSqxABIZbIujgOu0jjYbkr_PQ5h5pa74Xvvnh5C15RMaJw7_e2PE8o45ydoRDM-TQibZadoRFLOk0yk5BxdeL8lhFAh-AgdVrsCamiULrHfu850UOPW6dKoYGyDK-vwB_jgjMKqxxxuIRjdBI8L8FEVIbNrne3ivVk-WOsDPpjwiVV_ed14E0ynMTRldFPRvrbeX6KzCmqvr_72GL09Ll7nT8n6ebma368TRWckJqYkVSkXU1ExQoHxXFSkYDnoLAK6BMUFFECmhKZKsVykZaVySkuViyIFYGN0M_jGhF977YPc2r1r4kvJKGVZTqImUrcDpVzM5nQlW2d24I6SEtkXK_ti5W-xEaYDfDC1Pv5DysXm5X3Q_ADrT31V</recordid><startdate>202411</startdate><enddate>202411</enddate><creator>Xu, Liangchen</creator><creator>Guo, Chonghui</creator><general>Blackwell Publishing Ltd</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7TB</scope><scope>8FD</scope><scope>F28</scope><scope>FR3</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-5155-1297</orcidid></search><sort><creationdate>202411</creationdate><title>Imbalanced survival prediction for gastric cancer patients based on improved XGBoost with cost sensitive and focal loss</title><author>Xu, Liangchen ; Guo, Chonghui</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c1906-4102c26787f301a3657f0b35ae4c19edac67aba08012cc3572dfc511dc57b2aa3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Algorithms</topic><topic>Cost analysis</topic><topic>cost‐sensitive</topic><topic>data imbalance</topic><topic>Design analysis</topic><topic>focal loss</topic><topic>Gastric cancer</topic><topic>Machine learning</topic><topic>Medical prognosis</topic><topic>Performance evaluation</topic><topic>Performance prediction</topic><topic>Survival</topic><topic>survival prediction</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Xu, Liangchen</creatorcontrib><creatorcontrib>Guo, Chonghui</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Mechanical &amp; Transportation Engineering Abstracts</collection><collection>Technology Research Database</collection><collection>ANTE: Abstracts in New Technology &amp; Engineering</collection><collection>Engineering Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Expert systems</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Xu, Liangchen</au><au>Guo, Chonghui</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Imbalanced survival prediction for gastric cancer patients based on improved XGBoost with cost sensitive and focal loss</atitle><jtitle>Expert systems</jtitle><date>2024-11</date><risdate>2024</risdate><volume>41</volume><issue>11</issue><epage>n/a</epage><issn>0266-4720</issn><eissn>1468-0394</eissn><abstract>Accurate prediction of gastric cancer survival state is one of great significant tasks for clinical decision‐making. Many advanced machine learning classification techniques have been applied to predict the survival status of cancer patients in three or 5 years, however, many of them have a low sensitivity because of class imbalance. This is a non‐negligible problem due to the poor prognosis of gastric cancer patients. Furthermore, models in the medical domain require strong interpretability to increase their applicability. Due to the better performance and interpretability of the XGBoost model, we design a loss function taking into account cost sensitive and focal loss from the algorithm level for XGBoost to deal with the imbalance problem. We apply the improved model into the prediction of the survival status of gastric cancer patients and analyse the important related features. We use two types of indicators to evaluate the model, and we also design the confusion matrix of two models' predictive results to compare two models. The results show that the improved model has better performance. Furthermore, we calculate the importance of features related to survival with three different time periods and analyse their evolution, which are consistent with existing clinical research or further expand their research conclusions. These all support for clinically relevant decision‐making and has the potential to expand into survival prediction of other cancer patients.</abstract><cop>Oxford</cop><pub>Blackwell Publishing Ltd</pub><doi>10.1111/exsy.13666</doi><tpages>18</tpages><orcidid>https://orcid.org/0000-0002-5155-1297</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0266-4720
ispartof Expert systems, 2024-11, Vol.41 (11), p.n/a
issn 0266-4720
1468-0394
language eng
recordid cdi_proquest_journals_3113450572
source Wiley Online Library Journals Frontfile Complete
subjects Algorithms
Cost analysis
cost‐sensitive
data imbalance
Design analysis
focal loss
Gastric cancer
Machine learning
Medical prognosis
Performance evaluation
Performance prediction
Survival
survival prediction
title Imbalanced survival prediction for gastric cancer patients based on improved XGBoost with cost sensitive and focal loss
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-13T09%3A50%3A28IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Imbalanced%20survival%20prediction%20for%20gastric%20cancer%20patients%20based%20on%20improved%20XGBoost%20with%20cost%20sensitive%20and%20focal%20loss&rft.jtitle=Expert%20systems&rft.au=Xu,%20Liangchen&rft.date=2024-11&rft.volume=41&rft.issue=11&rft.epage=n/a&rft.issn=0266-4720&rft.eissn=1468-0394&rft_id=info:doi/10.1111/exsy.13666&rft_dat=%3Cproquest_cross%3E3113450572%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3113450572&rft_id=info:pmid/&rfr_iscdi=true