Virtual power plant containing electric vehicles scheduling strategies based on deep reinforcement learning

•VPP agent and EV charging station agent games to obtain electricity price.•The VPP tends to use mixed strategy, while EVs tend to use pure strategies.•Using Stackelberg game to prevent VPP from obtaining excess profit from EV members. Virtual power plants (VPPs), which aggregate customer-side flexi...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Electric power systems research 2022-04, Vol.205, p.107714, Article 107714
Hauptverfasser:	Wang, Jianing, Guo, Chunlin, Yu, Changshu, Liang, Yanchang
Format:	Artikel
Sprache:	eng
Schlagworte:	Algorithms Customers Deep learning Deep reinforcement learning Distributed generation Electric vehicle Electric vehicle charging Electric vehicle charging stations Electric vehicles Electricity distribution Energy sources Game theory Machine learning Real-time dispatch Resource scheduling Scheduling Stackelberg game Stochastic strategy Strategy Virtual power plant Virtual power plants
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	•VPP agent and EV charging station agent games to obtain electricity price.•The VPP tends to use mixed strategy, while EVs tend to use pure strategies.•Using Stackelberg game to prevent VPP from obtaining excess profit from EV members. Virtual power plants (VPPs), which aggregate customer-side flexibility resources, provide an effective way for customers to participate in the electricity market, and provide a variety of flexible technologies and services to the market. Importantly, VPPs can provide services to electric vehicle (EV) charging stations. In this paper, we constructed a deep reinforcement learning (DRL) based Stackelberg game model for a VPP with EV charging stations. Considering the interests of both sides of the game, soft actor-critic (SAC) algorithm is used for the VPP agent and twin delay deep deterministic policy gradient (TD3) algorithm is used for the EV charging station agent. By alternately training the network parameters of the agents, the strategy and solution at the equilibrium of the game are calculated. Results of cases demonstrate that the VPP agent can learn the strategy of selling electricity to EVs, optimize the scheduling of distributed energy resources (DERs), and bidding strategy for participation in the electricity market. Meanwhile, the EV aggregation agent can learn scheduling strategies for charging and discharging EVs. When the EV aggregator uses a deterministic strategy and the virtual power plant uses a stochastic strategy, energy complementarity is achieved and the overall operating economy is improved.
ISSN:	0378-7796 1873-2046
DOI:	10.1016/j.epsr.2021.107714