Deep reinforcement learning control of combined chemotherapy and anti-angiogenic drug delivery for cancerous tumor treatment

Bibliographic details
Published in: Computers in biology and medicine, 2024-10, Vol. 181, p. 109041, Article 109041
Main authors: Niazmand, Vahid Reza; Raheb, Mohammad Ali; Eqra, Navid; Vatankhah, Ramin; Farrokhi, Amirmohammad
Format: Article
Language: English
Description
Summary: Given the chronic and dangerous nature of cancer, researchers have explored various approaches to managing the abnormal cell growth associated with this disease using novel treatment methods. This study introduces a control system based on normalized advantage function (NAF) reinforcement learning that aims to boost the body's immune response against cancer cell proliferation. For the first time, this control approach is applied to deliver a combination of chemotherapy and anti-angiogenic drugs without the need for complex, predefined mathematical models. It employs a model-free reinforcement learning technique that adapts to individual patients to determine optimal drug administration with minimum injection rates. To this end, a comprehensive and realistic simulation and training environment is employed, with the concentrations of normal cells, cancer cells, and endothelial cells, as well as the levels of chemotherapy and anti-angiogenic agents, as state variables. Furthermore, high levels of disturbance are included in the simulation to investigate the robustness of the proposed method against likely uncertainties in the treatment process or patient parameters. A practical reward function, aligned with medical objectives, has also been devised to ensure effective and safe treatment outcomes. The results demonstrate robustness and superior performance compared to existing methods. Simulations show that the proposed approach is a dependable strategy for reducing the concentration of cancer cells in the shortest duration using minimal doses of chemotherapy and anti-angiogenic drugs.
• A NAF RL agent is employed to design a model-free controller for drug delivery in cancerous tumors.
• A comprehensive dynamic model with five states and two inputs is considered as the training environment.
• Both chemotherapy and anti-angiogenic drugs are considered as control actions with minimum injection doses.
• A practical reward function is proposed for use with the algorithm in order to meet the defined control objectives.
• High levels of disturbance are applied to validate robustness against uncertainties in the treatment process.
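The normalized advantage function idea named in the summary is commonly formulated as Q(s, a) = V(s) + A(s, a), where the advantage is a quadratic that peaks at the greedy action mu(s). The sketch below illustrates only that general decomposition for a two-dimensional action (for example, a chemotherapy rate and an anti-angiogenic rate). It is not the authors' implementation: the function name `naf_q_value`, its parameters, and all numbers are hypothetical, and in practice `mu`, `l_entries`, and `value` would come from a neural network evaluated on the five-dimensional state.

```python
import numpy as np

def naf_q_value(action, mu, l_entries, value):
    """Return Q(s, a) = V(s) + A(s, a) with the quadratic NAF advantage.

    All arguments are hypothetical stand-ins for network outputs:
    mu is the greedy action, l_entries the flat lower-triangular
    entries of L, and value the scalar state value V(s).
    """
    action = np.asarray(action, dtype=float)
    mu = np.asarray(mu, dtype=float)
    n = action.shape[0]

    # Fill a lower-triangular matrix L from n(n+1)/2 flat entries.
    L = np.zeros((n, n))
    L[np.tril_indices(n)] = l_entries

    # Exponentiate the diagonal so that P = L L^T is positive definite.
    diag = np.diag_indices(n)
    L[diag] = np.exp(L[diag])
    P = L @ L.T

    # A(s, a) = -1/2 (a - mu)^T P (a - mu), which is never positive.
    diff = action - mu
    advantage = -0.5 * diff @ P @ diff
    return value + advantage

# Illustrative numbers only (assumed, not taken from the article).
mu = [0.3, 0.1]              # assumed greedy dose rates
l_entries = [0.0, 0.5, 0.0]  # order: L[0,0], L[1,0], L[1,1]
q_greedy = naf_q_value(mu, mu, l_entries, value=1.0)
q_other = naf_q_value([0.5, 0.1], mu, l_entries, value=1.0)
```

Because the advantage is zero at `mu` and negative elsewhere, the maximizing action is available in closed form, which is what makes this formulation convenient for continuous dose rates.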
ISSN:0010-4825
1879-0534
DOI:10.1016/j.compbiomed.2024.109041