Initial Excitation-Based Iterative Algorithm for Approximate Optimal Control of Completely Unknown LTI Systems

This paper proposes an approximate/adaptive optimal control (AOC) design for completely unknown continuous-time linear time invariant systems, without requiring the restrictive persistence of excitation (PE) condition for parameter convergence. The proposed AOC algorithm utilizes two layers of filte...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on automatic control 2019-12, Vol.64 (12), p.5230-5237
Hauptverfasser:	Jha, Sumit Kumar, Roy, Sayan Basu, Bhasin, Shubhendu
Format:	Artikel
Sprache:	eng
Schlagworte:	Adaptive algorithms Adaptive control Algorithms Approximate/adaptive optimal control Computer simulation Convergence Data storage Excitation filter-based design Heuristic algorithms initial excitation Iterative algorithms Iterative methods Linear quadratic regulator Linear systems Microsoft Windows Optimal control Policies System dynamics Time invariant systems
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	This paper proposes an approximate/adaptive optimal control (AOC) design for completely unknown continuous-time linear time invariant systems, without requiring the restrictive persistence of excitation (PE) condition for parameter convergence. The proposed AOC algorithm utilizes two layers of filtering-the first layer filters strategically eliminate the need for state derivative information, while the second layer filters provide suitable algebraic relations for iteratively obtaining the optimal policy under a milder online-verifiable initial excitation assumption. Unlike previous AOC algorithms, the proposed method does not require finite window integrals, intelligent data-storage, and the restrictive PE assumption. Further, the proposed method relaxes the sufficient condition required for obtaining successive stabilizing control policies. The intermediate policies are proved to be stabilizing and converging to the optimal policy. Simulation results validate the efficacy of the proposed adaptive/approximate linear quadratic regulator algorithm.
ISSN:	0018-9286 1558-2523
DOI:	10.1109/TAC.2019.2912828