Virtual power plant containing electric vehicles scheduling strategies based on deep reinforcement learning
•VPP agent and EV charging station agent games to obtain electricity price.•The VPP tends to use mixed strategy, while EVs tend to use pure strategies.•Using Stackelberg game to prevent VPP from obtaining excess profit from EV members. Virtual power plants (VPPs), which aggregate customer-side flexi...
Gespeichert in:
Veröffentlicht in: | Electric power systems research 2022-04, Vol.205, p.107714, Article 107714 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | •VPP agent and EV charging station agent games to obtain electricity price.•The VPP tends to use mixed strategy, while EVs tend to use pure strategies.•Using Stackelberg game to prevent VPP from obtaining excess profit from EV members.
Virtual power plants (VPPs), which aggregate customer-side flexibility resources, provide an effective way for customers to participate in the electricity market, and provide a variety of flexible technologies and services to the market. Importantly, VPPs can provide services to electric vehicle (EV) charging stations. In this paper, we constructed a deep reinforcement learning (DRL) based Stackelberg game model for a VPP with EV charging stations. Considering the interests of both sides of the game, soft actor-critic (SAC) algorithm is used for the VPP agent and twin delay deep deterministic policy gradient (TD3) algorithm is used for the EV charging station agent. By alternately training the network parameters of the agents, the strategy and solution at the equilibrium of the game are calculated. Results of cases demonstrate that the VPP agent can learn the strategy of selling electricity to EVs, optimize the scheduling of distributed energy resources (DERs), and bidding strategy for participation in the electricity market. Meanwhile, the EV aggregation agent can learn scheduling strategies for charging and discharging EVs. When the EV aggregator uses a deterministic strategy and the virtual power plant uses a stochastic strategy, energy complementarity is achieved and the overall operating economy is improved. |
---|---|
ISSN: | 0378-7796 1873-2046 |
DOI: | 10.1016/j.epsr.2021.107714 |