Deep reinforcement learning-based control of chemo-drug dose in cancer treatment

•Proposing a deep RL-based controller for cancer chemotherapy, which handles states and action in infinite space.•The proposed approach does not need to have a mathematical model of the system, it just needs to have tumor volume.•Providing an adaptive control technique to respond to the special cond...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Computer methods and programs in biomedicine 2024-01, Vol.243, p.107884-107884, Article 107884
Hauptverfasser: Mashayekhi, Hoda, Nazari, Mostafa, Jafarinejad, Fatemeh, Meskin, Nader
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 107884
container_issue
container_start_page 107884
container_title Computer methods and programs in biomedicine
container_volume 243
creator Mashayekhi, Hoda
Nazari, Mostafa
Jafarinejad, Fatemeh
Meskin, Nader
description •Proposing a deep RL-based controller for cancer chemotherapy, which handles states and action in infinite space.•The proposed approach does not need to have a mathematical model of the system, it just needs to have tumor volume.•Providing an adaptive control technique to respond to the special conditions and diagnosis measurements of different categories of patients.•Evaluating and comparing the efficiency and robustness of the proposed method using patients with diverse conditions. Advancement in the treatment of cancer, as a leading cause of death worldwide, has promoted several research activities in various related fields. The development of effective treatment regimens with optimal drug dose administration using a mathematical modeling framework has received extensive research attention during the last decades. However, most of the control techniques presented for cancer chemotherapy are mainly model-based approaches. The available model-free techniques based on Reinforcement Learning (RL), commonly discretize the problem states and variables, which other than demanding expert supervision, cannot model the real-world conditions accurately. The more recent Deep Reinforcement Learning (DRL) methods, which enable modeling the problem in its original continuous space, are rarely applied in cancer chemotherapy. In this paper, we propose an effective and robust DRL-based, model-free method for the closed-loop control of cancer chemotherapy drug dosing. A nonlinear pharmacological cancer model is used for simulating the patient and capturing the cancer dynamics. In contrast to previous work, the state variables and control action are modeled in their original infinite spaces to avoid expert-guided discretization and provide a more realistic solution. The DRL network is trained to automatically adjust the drug dose based on the monitored states of the patient. The proposed method provides an adaptive control technique to respond to the special conditions and diagnosis measurements of different categories of patients. The performance of the proposed DRL-based controller is evaluated by numerical analysis of different diverse simulated patients. Comparison to the state-of-the-art RL-based method, which uses discretized state and action spaces, shows the superiority of the approach in the process and duration of cancer chemotherapy treatment. In the majority of the studied cases, the proposed model decreases the medication period and the total amount of adminis
doi_str_mv 10.1016/j.cmpb.2023.107884
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_2889242781</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0169260723005503</els_id><sourcerecordid>2889242781</sourcerecordid><originalsourceid>FETCH-LOGICAL-c333t-da922e10898878f9cfd881710788407910e433d51271a716c8491ac56aee650e3</originalsourceid><addsrcrecordid>eNp9kE9LAzEUxIMoWKtfwFOOXlKT7J9kwYtUq0JBD3oOafK2puwma5IKfnt3Wc-eHjzmN8wMQteMrhhl9e1hZfpht-KUF-NDSFmeoAWTghNR1dUpWoyihvCainN0kdKBUsqrql6gtweAAUdwvg3RQA8-4w509M7vyU4nsNgEn2PocGix-YQ-EBuPe2xDAuw8NtobiDhH0HmiL9FZq7sEV393iT42j-_rZ7J9fXpZ32-JKYoiE6sbzoFR2UgpZNuY1krJxJydioZRKIvCVowLpgWrjSwbpk1Va4C6olAs0c3sO8TwdYSUVe-Sga7THsIxKS5lw0suJBulfJaaGFKK0Kohul7HH8WomuZTBzXNp6b51BxhhO5mCMYS3w6iSsbB2NW6CCYrG9x_-C-cLHfO</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2889242781</pqid></control><display><type>article</type><title>Deep reinforcement learning-based control of chemo-drug dose in cancer treatment</title><source>Access via ScienceDirect (Elsevier)</source><creator>Mashayekhi, Hoda ; Nazari, Mostafa ; Jafarinejad, Fatemeh ; Meskin, Nader</creator><creatorcontrib>Mashayekhi, Hoda ; Nazari, Mostafa ; Jafarinejad, Fatemeh ; Meskin, Nader</creatorcontrib><description>•Proposing a deep RL-based controller for cancer chemotherapy, which handles states and action in infinite space.•The proposed approach does not need to have a mathematical model of the system, it just needs to have tumor volume.•Providing an adaptive control technique to respond to the special conditions and diagnosis measurements of different categories of patients.•Evaluating and comparing the efficiency and robustness of the proposed method using patients with diverse conditions. Advancement in the treatment of cancer, as a leading cause of death worldwide, has promoted several research activities in various related fields. The development of effective treatment regimens with optimal drug dose administration using a mathematical modeling framework has received extensive research attention during the last decades. However, most of the control techniques presented for cancer chemotherapy are mainly model-based approaches. The available model-free techniques based on Reinforcement Learning (RL), commonly discretize the problem states and variables, which other than demanding expert supervision, cannot model the real-world conditions accurately. The more recent Deep Reinforcement Learning (DRL) methods, which enable modeling the problem in its original continuous space, are rarely applied in cancer chemotherapy. In this paper, we propose an effective and robust DRL-based, model-free method for the closed-loop control of cancer chemotherapy drug dosing. A nonlinear pharmacological cancer model is used for simulating the patient and capturing the cancer dynamics. In contrast to previous work, the state variables and control action are modeled in their original infinite spaces to avoid expert-guided discretization and provide a more realistic solution. The DRL network is trained to automatically adjust the drug dose based on the monitored states of the patient. The proposed method provides an adaptive control technique to respond to the special conditions and diagnosis measurements of different categories of patients. The performance of the proposed DRL-based controller is evaluated by numerical analysis of different diverse simulated patients. Comparison to the state-of-the-art RL-based method, which uses discretized state and action spaces, shows the superiority of the approach in the process and duration of cancer chemotherapy treatment. In the majority of the studied cases, the proposed model decreases the medication period and the total amount of administrated drug, while increasing the rate of reduction in tumor cells.</description><identifier>ISSN: 0169-2607</identifier><identifier>EISSN: 1872-7565</identifier><identifier>DOI: 10.1016/j.cmpb.2023.107884</identifier><language>eng</language><publisher>Elsevier B.V</publisher><subject>Active drug dosing ; Actor-critic ; Cancer chemotherapy ; Deep reinforcement learning ; Optimal control ; Twin-delayed deep deterministic policy gradient</subject><ispartof>Computer methods and programs in biomedicine, 2024-01, Vol.243, p.107884-107884, Article 107884</ispartof><rights>2023 Elsevier B.V.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c333t-da922e10898878f9cfd881710788407910e433d51271a716c8491ac56aee650e3</citedby><cites>FETCH-LOGICAL-c333t-da922e10898878f9cfd881710788407910e433d51271a716c8491ac56aee650e3</cites><orcidid>0000-0003-0080-2743 ; 0000-0001-5745-4984 ; 0000-0003-3098-9369</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://dx.doi.org/10.1016/j.cmpb.2023.107884$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,780,784,3550,27924,27925,45995</link.rule.ids></links><search><creatorcontrib>Mashayekhi, Hoda</creatorcontrib><creatorcontrib>Nazari, Mostafa</creatorcontrib><creatorcontrib>Jafarinejad, Fatemeh</creatorcontrib><creatorcontrib>Meskin, Nader</creatorcontrib><title>Deep reinforcement learning-based control of chemo-drug dose in cancer treatment</title><title>Computer methods and programs in biomedicine</title><description>•Proposing a deep RL-based controller for cancer chemotherapy, which handles states and action in infinite space.•The proposed approach does not need to have a mathematical model of the system, it just needs to have tumor volume.•Providing an adaptive control technique to respond to the special conditions and diagnosis measurements of different categories of patients.•Evaluating and comparing the efficiency and robustness of the proposed method using patients with diverse conditions. Advancement in the treatment of cancer, as a leading cause of death worldwide, has promoted several research activities in various related fields. The development of effective treatment regimens with optimal drug dose administration using a mathematical modeling framework has received extensive research attention during the last decades. However, most of the control techniques presented for cancer chemotherapy are mainly model-based approaches. The available model-free techniques based on Reinforcement Learning (RL), commonly discretize the problem states and variables, which other than demanding expert supervision, cannot model the real-world conditions accurately. The more recent Deep Reinforcement Learning (DRL) methods, which enable modeling the problem in its original continuous space, are rarely applied in cancer chemotherapy. In this paper, we propose an effective and robust DRL-based, model-free method for the closed-loop control of cancer chemotherapy drug dosing. A nonlinear pharmacological cancer model is used for simulating the patient and capturing the cancer dynamics. In contrast to previous work, the state variables and control action are modeled in their original infinite spaces to avoid expert-guided discretization and provide a more realistic solution. The DRL network is trained to automatically adjust the drug dose based on the monitored states of the patient. The proposed method provides an adaptive control technique to respond to the special conditions and diagnosis measurements of different categories of patients. The performance of the proposed DRL-based controller is evaluated by numerical analysis of different diverse simulated patients. Comparison to the state-of-the-art RL-based method, which uses discretized state and action spaces, shows the superiority of the approach in the process and duration of cancer chemotherapy treatment. In the majority of the studied cases, the proposed model decreases the medication period and the total amount of administrated drug, while increasing the rate of reduction in tumor cells.</description><subject>Active drug dosing</subject><subject>Actor-critic</subject><subject>Cancer chemotherapy</subject><subject>Deep reinforcement learning</subject><subject>Optimal control</subject><subject>Twin-delayed deep deterministic policy gradient</subject><issn>0169-2607</issn><issn>1872-7565</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2024</creationdate><recordtype>article</recordtype><recordid>eNp9kE9LAzEUxIMoWKtfwFOOXlKT7J9kwYtUq0JBD3oOafK2puwma5IKfnt3Wc-eHjzmN8wMQteMrhhl9e1hZfpht-KUF-NDSFmeoAWTghNR1dUpWoyihvCainN0kdKBUsqrql6gtweAAUdwvg3RQA8-4w509M7vyU4nsNgEn2PocGix-YQ-EBuPe2xDAuw8NtobiDhH0HmiL9FZq7sEV393iT42j-_rZ7J9fXpZ32-JKYoiE6sbzoFR2UgpZNuY1krJxJydioZRKIvCVowLpgWrjSwbpk1Va4C6olAs0c3sO8TwdYSUVe-Sga7THsIxKS5lw0suJBulfJaaGFKK0Kohul7HH8WomuZTBzXNp6b51BxhhO5mCMYS3w6iSsbB2NW6CCYrG9x_-C-cLHfO</recordid><startdate>202401</startdate><enddate>202401</enddate><creator>Mashayekhi, Hoda</creator><creator>Nazari, Mostafa</creator><creator>Jafarinejad, Fatemeh</creator><creator>Meskin, Nader</creator><general>Elsevier B.V</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7X8</scope><orcidid>https://orcid.org/0000-0003-0080-2743</orcidid><orcidid>https://orcid.org/0000-0001-5745-4984</orcidid><orcidid>https://orcid.org/0000-0003-3098-9369</orcidid></search><sort><creationdate>202401</creationdate><title>Deep reinforcement learning-based control of chemo-drug dose in cancer treatment</title><author>Mashayekhi, Hoda ; Nazari, Mostafa ; Jafarinejad, Fatemeh ; Meskin, Nader</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c333t-da922e10898878f9cfd881710788407910e433d51271a716c8491ac56aee650e3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2024</creationdate><topic>Active drug dosing</topic><topic>Actor-critic</topic><topic>Cancer chemotherapy</topic><topic>Deep reinforcement learning</topic><topic>Optimal control</topic><topic>Twin-delayed deep deterministic policy gradient</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Mashayekhi, Hoda</creatorcontrib><creatorcontrib>Nazari, Mostafa</creatorcontrib><creatorcontrib>Jafarinejad, Fatemeh</creatorcontrib><creatorcontrib>Meskin, Nader</creatorcontrib><collection>CrossRef</collection><collection>MEDLINE - Academic</collection><jtitle>Computer methods and programs in biomedicine</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Mashayekhi, Hoda</au><au>Nazari, Mostafa</au><au>Jafarinejad, Fatemeh</au><au>Meskin, Nader</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Deep reinforcement learning-based control of chemo-drug dose in cancer treatment</atitle><jtitle>Computer methods and programs in biomedicine</jtitle><date>2024-01</date><risdate>2024</risdate><volume>243</volume><spage>107884</spage><epage>107884</epage><pages>107884-107884</pages><artnum>107884</artnum><issn>0169-2607</issn><eissn>1872-7565</eissn><abstract>•Proposing a deep RL-based controller for cancer chemotherapy, which handles states and action in infinite space.•The proposed approach does not need to have a mathematical model of the system, it just needs to have tumor volume.•Providing an adaptive control technique to respond to the special conditions and diagnosis measurements of different categories of patients.•Evaluating and comparing the efficiency and robustness of the proposed method using patients with diverse conditions. Advancement in the treatment of cancer, as a leading cause of death worldwide, has promoted several research activities in various related fields. The development of effective treatment regimens with optimal drug dose administration using a mathematical modeling framework has received extensive research attention during the last decades. However, most of the control techniques presented for cancer chemotherapy are mainly model-based approaches. The available model-free techniques based on Reinforcement Learning (RL), commonly discretize the problem states and variables, which other than demanding expert supervision, cannot model the real-world conditions accurately. The more recent Deep Reinforcement Learning (DRL) methods, which enable modeling the problem in its original continuous space, are rarely applied in cancer chemotherapy. In this paper, we propose an effective and robust DRL-based, model-free method for the closed-loop control of cancer chemotherapy drug dosing. A nonlinear pharmacological cancer model is used for simulating the patient and capturing the cancer dynamics. In contrast to previous work, the state variables and control action are modeled in their original infinite spaces to avoid expert-guided discretization and provide a more realistic solution. The DRL network is trained to automatically adjust the drug dose based on the monitored states of the patient. The proposed method provides an adaptive control technique to respond to the special conditions and diagnosis measurements of different categories of patients. The performance of the proposed DRL-based controller is evaluated by numerical analysis of different diverse simulated patients. Comparison to the state-of-the-art RL-based method, which uses discretized state and action spaces, shows the superiority of the approach in the process and duration of cancer chemotherapy treatment. In the majority of the studied cases, the proposed model decreases the medication period and the total amount of administrated drug, while increasing the rate of reduction in tumor cells.</abstract><pub>Elsevier B.V</pub><doi>10.1016/j.cmpb.2023.107884</doi><tpages>1</tpages><orcidid>https://orcid.org/0000-0003-0080-2743</orcidid><orcidid>https://orcid.org/0000-0001-5745-4984</orcidid><orcidid>https://orcid.org/0000-0003-3098-9369</orcidid></addata></record>
fulltext fulltext
identifier ISSN: 0169-2607
ispartof Computer methods and programs in biomedicine, 2024-01, Vol.243, p.107884-107884, Article 107884
issn 0169-2607
1872-7565
language eng
recordid cdi_proquest_miscellaneous_2889242781
source Access via ScienceDirect (Elsevier)
subjects Active drug dosing
Actor-critic
Cancer chemotherapy
Deep reinforcement learning
Optimal control
Twin-delayed deep deterministic policy gradient
title Deep reinforcement learning-based control of chemo-drug dose in cancer treatment
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-22T13%3A26%3A20IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Deep%20reinforcement%20learning-based%20control%20of%20chemo-drug%20dose%20in%20cancer%20treatment&rft.jtitle=Computer%20methods%20and%20programs%20in%20biomedicine&rft.au=Mashayekhi,%20Hoda&rft.date=2024-01&rft.volume=243&rft.spage=107884&rft.epage=107884&rft.pages=107884-107884&rft.artnum=107884&rft.issn=0169-2607&rft.eissn=1872-7565&rft_id=info:doi/10.1016/j.cmpb.2023.107884&rft_dat=%3Cproquest_cross%3E2889242781%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2889242781&rft_id=info:pmid/&rft_els_id=S0169260723005503&rfr_iscdi=true