Online Policy Learning-Based Output-Feedback Optimal Control of Continuous-Time Systems

Although state-feedback optimal control of the continuous-time (CT) systems has been extensively studied, resolving optimal control online via output-feedback is still challenging, especially only input-output information can be used. In this brief, we develop an innovative technique to online desig...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on circuits and systems. II, Express briefs Express briefs, 2024-02, Vol.71 (2), p.652-656
Hauptverfasser:	Zhao, Jun, Lv, Yongfeng, Zeng, Qingliang, Wan, Lirong
Format:	Artikel
Sprache:	eng
Schlagworte:	Aircraft models Algorithms Atmospheric modeling Continuous time systems Control methods Control systems Convergence Heuristic algorithms Machine learning Mathematical models Observers Optimal control Output feedback Output-feedback control policy learning Riccati equation Riccati equations State feedback
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Although state-feedback optimal control of the continuous-time (CT) systems has been extensively studied, resolving optimal control online via output-feedback is still challenging, especially only input-output information can be used. In this brief, we develop an innovative technique to online design the output-feedback optimal control (OFOC) of the CT systems. Firstly, to synthesis the OFOC, an output-feedback algebraic Riccati equation (OARE) is constructed, which can be solved using input-output information. Then, an online policy learning (PL) algorithm is developed to compute the solution of the OARE, where only the input-output information is required and the conventional offline learning procedure is avoided. Simulations based on an aircraft model are provided to test the developed control method and online learning algorithm.
ISSN:	1549-7747 1558-3791
DOI:	10.1109/TCSII.2022.3211832