Solving large-scale combined heat and power economic dispatch problems by using deep reinforcement learning based crisscross optimization algorithm

•A novel deep reinforcement learning based crisscross optimization algorithm is proposed.•The search space of CSO algorithm is contracted by the DRL.•Dimensionality is reduced by selecting variables of CSO and setting them fixed.•DRL-CSO outperforms other algorithms for large-scale CHPED problems. T...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Applied thermal engineering 2024-05, Vol.245, p.122781, Article 122781
Hauptverfasser: Meng, Anbo, Rong, Jiayu, Yin, Hao, Luo, Jianqiang, Tang, Yanshu, Zhang, Haitao, Li, Chen, Zhu, Jianbin, Yin, Yiding, Li, Hanhong, Liu, Jiawei
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•A novel deep reinforcement learning based crisscross optimization algorithm is proposed.•The search space of CSO algorithm is contracted by the DRL.•Dimensionality is reduced by selecting variables of CSO and setting them fixed.•DRL-CSO outperforms other algorithms for large-scale CHPED problems. The extensive interconnection in bulk power systems brings great challenges in the economic dispatch problems, especially the large-scale combined heat and power economic dispatch (CHPED), which is quite intractable due to complex thermal and electrical couplings in cogeneration units. Most meta-heuristic algorithms cannot work well while considering non-convexity, discontinuity, and non-differentiability. Particularly, when it comes to the high dimensional and large-scale CHPED problems, these algorithms are either easy to trap in the local optimum or quite time-consuming. To address these issues, a novel deep reinforcement learning (DRL) based crisscross optimization (CSO) algorithm is proposed for the first time, which can (1) improve the searching ability of CSO, (2) enhance the overall searching efficiency, and (3) perform extraordinarily well for the large-scale CHPED problems. Firstly, to further improve the global search ability, DRL is employed to reduce dimensionality and narrow the search space of the initial population of the CSO algorithm. Then, DRL is updated by the deep deterministic policy gradient (DDPG) strategy, which can facilitate an instantaneous decision in accelerating CSO. Thereafter, by adopting each decision provided by DRL, a great difference may occur in the same scenario to speed up the crossover procedure and guide the upcoming search round. On this basis, the enhanced CSO algorithm could quickly carry out crossovers with search space contraction and dimensionality reduction. Finally, experiments on four test systems have substantiated the effectiveness of the proposed algorithm, especially the superiority over other state-of-the-art techniques in solving large-scale CHPED problems.
ISSN:1359-4311
DOI:10.1016/j.applthermaleng.2024.122781