Value Iteration for (Switched) Homogeneous Systems

In this note, we prove that dynamic programming value iteration converges uniformly for discrete-time homogeneous systems and continuous-time switched homogeneous systems. For discrete-time homogeneous systems, rather than discounting the cost function (which exponentially decreases the weights of t...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:IEEE transactions on automatic control 2009-06, Vol.54 (6), p.1290-1294
Hauptverfasser: Rinehart, M., Dahleh, M., Kolmanovsky, I.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:In this note, we prove that dynamic programming value iteration converges uniformly for discrete-time homogeneous systems and continuous-time switched homogeneous systems. For discrete-time homogeneous systems, rather than discounting the cost function (which exponentially decreases the weights of the cost of future actions), we show that such systems satisfy approximate dynamic programming conditions recently developed by Rantzer, which provides a uniform bound on the convergence rate of value iteration over a compact set. For continuous-time switched homogeneous system, we present a transformation that generates an equivalent discrete-time homogeneous system with an additional ldquosamplingrdquo input for which discrete-time value iteration is compatible, and we further show that the inclusion of homogeneous switching costs results in a continuous value function.
ISSN:0018-9286
1558-2523
DOI:10.1109/TAC.2009.2013055