Opposition-Based Reinforcement Learning in the Management of Water Resources

Opposition-based learning (OBL) is a new scheme in machine intelligence. In this paper, an OBL version Q-learning which exploits opposite quantities to accelerate the learning is used for management of single reservoir operations. In this method, an agent takes an action, receives reward, and update...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Mahootchi, M., Tizhoosh, H.R., Ponnambalam, K.
Format:	Tagungsbericht
Sprache:	eng
Schlagworte:	Design engineering Dynamic programming Machine intelligence Machine learning Neural networks opposite action Q-learning reinforcement learning Reservoirs Resource management Stochastic processes Systems engineering and theory water reservoirs Water resources
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	Opposition-based learning (OBL) is a new scheme in machine intelligence. In this paper, an OBL version Q-learning which exploits opposite quantities to accelerate the learning is used for management of single reservoir operations. In this method, an agent takes an action, receives reward, and updates its knowledge in terms of action-value functions. Furthermore, the transition function which is the balance equation in the optimization model determines the next state and updates the action-value function pertinent to opposite action. Two type of opposite actions will be defined. It will be demonstrated that using OBL can significantly improve the efficiency of the operating policy within limited iterations. It is also shown that this technique is more robust than Q-Learning
ISSN:	2325-1824 2325-1867
DOI:	10.1109/ADPRL.2007.368191