optimal control for semi‐Markov jump linear systems via TP‐free temporal difference () learning

In the present study, a temporal difference (TD) learning algorithm is proposed to solve the optimal control problem for semi‐Markov jump linear systems (S‐MJLSs). The proposed scheme is TP‐free so that it can be applied in cases without pre‐known transition probabilities of embedded Markov chain. C...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:International journal of robust and nonlinear control 2021-09, Vol.31 (14), p.6905-6916
Hauptverfasser: Chen, Yaogang, Wen, Jiwei, Luan, Xiaoli, Liu, Fei
Format: Artikel
Sprache:eng
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!