The stagewise Kuhn-Tucker condition and differential dynamic programming
The intention of this work is to describe and examine a differential dynamic programming (DDP) algorithm for constrained, discrete-time optimal control. This algorithm has performed successfully on a large-scale reservoir control problem [11]. The present paper gives conditions under which convergen...
Gespeichert in:
Veröffentlicht in: | IEEE transactions on automatic control 1986-01, Vol.31 (1), p.25-30 |
---|---|
1. Verfasser: | |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | The intention of this work is to describe and examine a differential dynamic programming (DDP) algorithm for constrained, discrete-time optimal control. This algorithm has performed successfully on a large-scale reservoir control problem [11]. The present paper gives conditions under which convergence to the stationary policy is assured. The convergence demonstration hinges upon a notion which we refer to as the "stagewise" Kuhn-Tucker condition. Strategies generated to satisfy this condition determine policies which satisfy the conventional Kuhn-Tucker condition. This observation may be of wider importance in discrete optimal control theory, for the stagewise condition might be a convenient criterion for constructing strategies. |
---|---|
ISSN: | 0018-9286 1558-2523 |
DOI: | 10.1109/TAC.1986.1104123 |