Adaptive dynamic programming and distributionally robust optimal control of linear stochastic system using the Wasserstein metric

Bibliographic details
Published in:	International journal of adaptive control and signal processing 2024-08, Vol.38 (8), p.2810-2832
Main authors:	Liang, Qingpeng; Hu, Jiangping; Xiang, Linying; Shi, Kaibo; Wu, Yanzhi
Format:	Article
Language:	English
Subjects:	Adaptive control; adaptive dynamic programming; Adaptive systems; Covariance matrix; Dynamic programming; Dynamical systems; Expected values; linear stochastic system; Matrix algebra; Optimal control; random disturbance; Riccati equation; Robust control; Stochastic systems; Vectors (mathematics)
Online access:	Full text

Description
Summary:	In this paper, we consider the optimal control of an unknown stochastic dynamical system for both the finite‐horizon and infinite‐horizon cases. The objective of this paper is to find an optimal controller that minimizes the expected value of a function which depends on the random disturbance. Throughout this paper, it is assumed that the mean vector and covariance matrix of the disturbance distribution are unknown. An uncertainty set in the space of mean vectors and covariance matrices is introduced. For the finite‐horizon case, we derive a closed‐form expression for the unique optimal policy and for the opponent's policy that generates the worst‐case distribution. For the infinite‐horizon case, we simplify the Riccati equation obtained in the finite‐horizon setting to an algebraic Riccati equation, which can guarantee the existence of the solution of the Riccati equation. It is shown that the resulting optimal policies obtained in these two cases can stabilize the expected value of the system state under the worst‐case distribution. Furthermore, the unknown system matrices can also be explicitly computed using the adaptive dynamic programming technique, which can help compute the optimal control policy by solving the algebraic Riccati equation. Finally, a simulation example is presented to demonstrate the effectiveness of our theoretical results.
ISSN:	0890-6327 1099-1115
DOI:	10.1002/acs.3830
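
Illustrative note: the summary describes passing from a finite‐horizon Riccati equation to an algebraic Riccati equation. This record does not give the paper's distributionally robust recursion, so the sketch below uses only the standard discrete-time LQR Riccati recursion as a stand-in to show the general idea: iterating the finite‐horizon step until it reaches a fixed point yields a solution of the algebraic equation. The matrices A, B, Q, R and all function names are hypothetical choices made for illustration and are not taken from the paper.

```python
import numpy as np

def lqr_gain(P, A, B, R):
    """Feedback gain K for u = -K x, given a cost-to-go matrix P (standard LQR)."""
    BtP = B.T @ P
    return np.linalg.solve(R + BtP @ B, BtP @ A)

def riccati_step(P, A, B, Q, R):
    """One backward step of the standard finite-horizon Riccati recursion."""
    K = lqr_gain(P, A, B, R)
    # Equivalent to Q + A'PA - A'PB (R + B'PB)^{-1} B'PA
    return Q + A.T @ P @ (A - B @ K)

def solve_are_by_iteration(A, B, Q, R, tol=1e-10, max_iter=10_000):
    """Iterate the finite-horizon step until it reaches a fixed point."""
    P = Q.copy()
    for _ in range(max_iter):
        P_next = riccati_step(P, A, B, Q, R)
        scale = max(1.0, float(np.max(np.abs(P_next))))
        if np.max(np.abs(P_next - P)) <= tol * scale:
            return P_next
        P = P_next
    raise RuntimeError("Riccati iteration did not converge")

# Hypothetical example system (a discretized double integrator), not from the paper.
A = np.array([[1.0, 0.1],
              [0.0, 1.0]])
B = np.array([[0.0],
              [0.1]])
Q = np.eye(2)
R = np.array([[1.0]])

P = solve_are_by_iteration(A, B, Q, R)
K = lqr_gain(P, A, B, R)

# Residual of the algebraic Riccati equation at the computed fixed point.
residual = riccati_step(P, A, B, Q, R) - P
# Spectral radius of the closed-loop matrix A - B K.
closed_loop_radius = float(np.max(np.abs(np.linalg.eigvals(A - B @ K))))
```

For this example system, a small `residual` and a `closed_loop_radius` below one would indicate that the fixed point solves the algebraic equation and that the resulting gain stabilizes the nominal closed loop. The worst‐case disturbance distribution and the Wasserstein uncertainty set treated in the paper are not modeled here.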