Improving quasi-optimal inventory and transportation policies using adaptive critic based approximate dynamic programming

We demonstrate the possibility of optimal control of physical inventory systems in a nonstationary fitness terrain, based on the combined application of evolutionary search and adaptive critic terrain following. We show that adaptive critic based approximate dynamic programming techniques based on p...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: Shervais, S., Shannon, T.T.
Format: Tagungsbericht
Sprache:eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:We demonstrate the possibility of optimal control of physical inventory systems in a nonstationary fitness terrain, based on the combined application of evolutionary search and adaptive critic terrain following. We show that adaptive critic based approximate dynamic programming techniques based on plant-controller Jacobeans can be used with systems characterized by discrete valued states and controls. Improvements upon a quasi-optimal policy found using a genetic algorithm in a high-penalty environment, average 66% under conditions both of stationary and non-stationary demand.
ISSN:1062-922X
2577-1655
DOI:10.1109/ICSMC.2000.886542