Deep clustering of reinforcement learning based on the bang-bang principle to optimize the energy in multi-boiler for intelligent buildings

The bang-bang relays of the multiple-boiler system (MBS) control, are characterized by complex limiter saturation functions and classified as fixed parameters. Their action signals cannot precisely control the nonlinear dynamic building heating demand over their entire range of operation. Moreover,...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Applied energy 2024-02, Vol.356, p.122357, Article 122357
Hauptverfasser: Homod, Raad Z., Munahi, Basil Sh, Mohammed, Hayder Ibrahim, Albadr, Musatafa Abbas Abbood, Abderrahmane, AISSA, Mahdi, Jasim M., Ben Hamida, Mohamed Bechir, Alhasnawi, Bilal Naji, Albahri, A.S., Togun, Hussein, Alqsair, Umar F., Yaseen, Zaher Mundher
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:The bang-bang relays of the multiple-boiler system (MBS) control, are characterized by complex limiter saturation functions and classified as fixed parameters. Their action signals cannot precisely control the nonlinear dynamic building heating demand over their entire range of operation. Moreover, in a mono-boiler system, the bang-bang controller endures increasing short cycling over partial load time due to the heating system being considered to have an oversized boiler at most times of running, thus promoting high energy consumption and fluctuating indoor thermal comfort. So, it is difficult to cope with uncertainties in outdoor environments and indoor heating load. Hence, this study formulates the MBS control problem as a dynamic Markov decision process and applies a deep clustering of reinforcement learning approach to obtain the optimal control policy through interaction with the environment based on multi-agent learning according to bang-bang action. With such an approach, adopting a new boiler sequencing control (BSC) strategy using deep clustering of reinforcement learning based on a bang-bang (DCRLBB) manner. The deep clustering is configured to break Lagrangian trajectory curves into piecewise segments to represent the RL agent's action policy. The agent's action policy signals are configured from the bang-bang reward formula based on trade-off implications to be more adjustable than traditional fixed parameters such as fuzzy bang-bang controller (FBBC). The agent of BSC significantly affects the energy performance of the MBS, whereas the other agent resizes boiler capacity by acting to adjust the boiler solenoid fuel valve. The comparison of results between the proposed strategy and conventional FBBC shows distinct differences in the superior response of DCRLBB under dynamic indoor/outdoor actual conditions and energy saving by more than 32% while maintaining the indoor thermal in the comfortable range. [Display omitted] •Proposed strategy based on DCRLBB is used to minimize short cycling on oversized boilers.•DCRLBB provides a viable approach to handling various cooperative policies of multi-agent•Clustering technique helps the well-fitting of the Lagrangian model to optimal agent policy.•Online learning for active decision-making enables the agent to predict the power demand of a multi-boiler.•The comparison of results for the DCRLBB shows energy saving by more than 32%.
ISSN:0306-2619
1872-9118
DOI:10.1016/j.apenergy.2023.122357