Event-Based Optimization of Markov Systems

Recent research indicates that Markov decision processes (MDPs) and perturbation analysis (PA) based optimization can be derived easily from two fundamental performance sensitivity formulas. With this sensitivity point of view, an event-based optimization approach, including event-based sensitivity...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	IEEE transactions on automatic control 2008-05, Vol.53 (4), p.1076-1082
Hauptverfasser:	CAO, Xi-Ren, JUNYU ZHANG
Format:	Artikel
Sprache:	eng
Schlagworte:	Applied sciences Automatic control Computer science control theory systems Control theory Control theory. Systems Decision analysis Eigenvalues and eigenfunctions Equations Exact sciences and technology Markov decision processes (MDPs) Markov processes Mathematical analysis Optimization Parameter estimation Performance analysis performance potentials perturbation analysis (PA) Perturbation methods policy gradients policy iteration Rivers Sensitivity analysis Stochastic processes Stochastic systems System identification
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Recent research indicates that Markov decision processes (MDPs) and perturbation analysis (PA) based optimization can be derived easily from two fundamental performance sensitivity formulas. With this sensitivity point of view, an event-based optimization approach, including event-based sensitivity analysis and event-based policy iteration, was proposed via an example by X. R. Cao (Discrete Event Dyn. Syst.: Theory Appl., vol. 15, pp. 169-197, 2005). This approach utilizes the special feature of a system and illustrates how the potentials can be aggregated using the special feature. The approach applies to many practical problems that do not fit well the standard MDP formulation. This note provides a mathematical formulation and proves the main results for this approach.
ISSN:	0018-9286 1558-2523
DOI:	10.1109/TAC.2008.919557