Multi-Agent Safe Policy Learning for Power Management of Networked Microgrids


Full description

Bibliographic details
Published in: IEEE Transactions on Smart Grid, 2021-03, Vol. 12 (2), pp. 1048-1062
Main authors: Zhang, Qianzhi, Dehghanpour, Kaveh, Wang, Zhaoyu, Qiu, Feng, Zhao, Dongbo
Format: Article
Language: English
Description
Abstract: This article presents a supervised multi-agent safe policy learning (SMAS-PL) method for optimal power management of networked microgrids (MGs) in distribution systems. While unconstrained reinforcement learning (RL) algorithms are black-box decision models that could fail to satisfy grid operational constraints, the proposed method considers AC power flow equations and other operational limits. Accordingly, the training process employs the gradient information of the operational constraints to ensure that the optimal control policy functions generate safe and feasible decisions. Furthermore, the authors develop a distributed consensus-based optimization approach to train the agents' policy functions while maintaining the MGs' privacy and data ownership boundaries. After training, the learned optimal policy functions can be safely used by the MGs to dispatch their local resources, without the need to solve a complex optimization problem from scratch. Numerical experiments verify the performance of the proposed method.
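The abstract mentions a distributed consensus-based approach for training the agents' policy functions without centralizing their data. The following is a minimal hedged sketch of the consensus-averaging primitive such methods typically build on; the graph topology, weights, and variable names are illustrative assumptions, not the paper's actual algorithm.

```python
import numpy as np

# Illustrative sketch (not the paper's exact method): one synchronous
# consensus-averaging step, the basic primitive behind distributed
# consensus-based training. Each microgrid agent holds a local parameter
# and mixes it with its neighbours' values via a doubly stochastic weight
# matrix, so all agents converge to the network-wide average without
# sharing raw local data with a central coordinator.

def consensus_step(params, W):
    """One consensus iteration: agent i's value becomes a weighted average
    of its own value and its neighbours' (W is doubly stochastic)."""
    return W @ params

# Metropolis weights for an assumed 3-agent line graph 1-2-3.
W = np.array([[2/3, 1/3, 0.0],
              [1/3, 1/3, 1/3],
              [0.0, 1/3, 2/3]])

params = np.array([1.0, 2.0, 6.0])  # each agent's local (scalar) parameter
for _ in range(100):
    params = consensus_step(params, W)

# All agents approach the global mean (3.0); double stochasticity preserves
# the sum at every step, which makes the average the fixed point.
```

In the paper's setting the consensus variable would be a policy-parameter vector rather than a scalar, and each agent would interleave consensus mixing with local, constraint-aware gradient steps.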
ISSN: 1949-3053, 1949-3061
DOI: 10.1109/TSG.2020.3034827