Physiological control for left ventricular assist devices based on deep reinforcement learning

Background The improvement of controllers of left ventricular assist device (LVAD) technology supporting heart failure (HF) patients has enormous impact, given the high prevalence and mortality of HF in the population. The use of reinforcement learning for control applications in LVAD remains minima...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Artificial organs 2024-12, Vol.48 (12), p.1418-1429
Hauptverfasser: Fernández‐Zapico, Diego, Peirelinck, Thijs, Deconinck, Geert, Donker, Dirk W., Fresiello, Libera
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:Background The improvement of controllers of left ventricular assist device (LVAD) technology supporting heart failure (HF) patients has enormous impact, given the high prevalence and mortality of HF in the population. The use of reinforcement learning for control applications in LVAD remains minimally explored. This work introduces a preload‐based deep reinforcement learning control for LVAD based on the proximal policy optimization algorithm. Methods The deep reinforcement learning control is built upon data derived from a deterministic high‐fidelity cardiorespiratory simulator exposed to variations of total blood volume, heart rate, systemic vascular resistance, pulmonary vascular resistance, right ventricular end‐systolic elastance, and left ventricular end‐systolic elastance, to replicate realistic inter‐ and intra‐patient variability of patients with a severe HF supported by LVAD. The deep reinforcement learning control obtained in this work is trained to avoid ventricular suction and allow aortic valve opening by using left ventricular pressure signals: end‐diastolic pressure, maximum pressure in the left ventricle (LV), and maximum pressure in the aorta. Results The results show controller obtained in this work, compared to the constant speed LVAD alternative, assures a more stable end‐diastolic volume (EDV), with a standard deviation of 5 mL and 9 mL, respectively, and a higher degree of aortic flow, with an average flow of 1.1 L/min and 0.9 L/min, respectively. Conclusion This work implements a deep reinforcement learning controller in a high‐fidelity cardiorespiratory simulator, resulting in increases of flow through the aortic valve and increases of EDV stability, when compared to a constant speed LVAD strategy. This work focuses on a left ventricular assist device control, based on deep reinforcement learning trained with the goal to prevent suction and promote aortic valve opening. The control is built upon data derived from a deterministic high fidelity cardiorespiratory simulator recreating different (patho)physiological conditions. Results show that the control assures adequate perfusion to end organs, increases flow through the aortic valve and promotes end diastolic left ventricular volume stability, when compared to a constant LVAD speed strategy.
ISSN:0160-564X
1525-1594
1525-1594
DOI:10.1111/aor.14845