Energy-efficient heating control for nearly zero energy residential buildings with deep reinforcement learning

Controlling Heating, Ventilation and Air Conditioning (HVAC) systems is critical to improving demand-side energy efficiency. In this paper, a model-free optimal control method based on deep reinforcement learning is proposed to control the heat pump start/stop and room temperature setting in resi...

Bibliographic details
Published in: Energy (Oxford) 2023-02, Vol. 264, p. 126209, Article 126209
Main authors: Qin, Haosen; Yu, Zhen; Li, Tailu; Liu, Xueliang; Li, Li
Format: Article
Language: eng
Subjects:
Online access: Full text
container_end_page
container_issue
container_start_page 126209
container_title Energy (Oxford)
container_volume 264
creator Qin, Haosen
Yu, Zhen
Li, Tailu
Liu, Xueliang
Li, Li
description Controlling Heating, Ventilation and Air Conditioning (HVAC) systems is critical to improving demand-side energy efficiency. In this paper, a model-free optimal control method based on deep reinforcement learning is proposed to control the heat pump start/stop and room temperature setting in residential buildings. The optimization goal of this method is to obtain the highest comprehensive reward, which considers both thermal comfort and energy cost. First, the randomness, learning process, thermal comfort and energy consumption of the model-free controller are systematically investigated using a simulation system based on measured data. The results show that randomness has a significant impact on the initial performance and convergence speed of the model-free controller; that the comprehensive reward of the model-free controller accumulates linearly during the learning process, so the slope of the accumulated comprehensive reward can be used to determine whether the controller has converged; and that the model-free controller coordinates monitoring data, weather forecasts and building thermal inertia to achieve the highest comprehensive reward. Afterwards, the model-free controller was verified in a nearly zero energy residential building in Beijing, China. The results show that the model-free controller improves the comprehensive reward by 15.3% compared with a rule-based method. Highlights: • A model-free controller based on deep reinforcement learning is developed. • Problem formulation including the state, action, and reward is carefully designed. • Randomness affects the initial performance and convergence speed of the controller. • The controller improved the comprehensive reward by 15.3% over the baseline.
doi_str_mv 10.1016/j.energy.2022.126209
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_3153840326</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S036054422203095X</els_id><sourcerecordid>3153840326</sourcerecordid><originalsourceid>FETCH-LOGICAL-c269t-c27bcfbd71e72b4245673d01bd86b9f792196d6c512844e999dcb2b762d963983</originalsourceid><addsrcrecordid>eNp9kD1PwzAQhj2ARCn8AwaPLAm24zjxgoSq8iFVYoHZiu1L68p1ip2Cyq_HJcwsd8M97yPdi9ANJSUlVNxtSwgQ18eSEcZKygQj8gzNSCVIUXPOLtBlSltCSN1KOUNh-UsX0PfOOAgj3kA3urDGZghjHDzuh4gDdNEf8TfEAU96HCE5m3nXeawPztucSfjLjRtsAfb57kKOGtidpD4LQiau0Hnf-QTXf3uO3h-Xb4vnYvX69LJ4WBWGCTnm2WjTa9tQaJjmjNeiqSyh2rZCy76RjEphhakpazkHKaU1mulGMCtFJdtqjm4n7z4OHwdIo9q5ZMD7LsBwSKqiddVyUjGRUT6hJg4pRejVPrpdF4-KEnWqVG3V9LM6VaqmSnPsfopBfuPTQVTp1J8B6yKYUdnB_S_4ATJ1hZk</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>3153840326</pqid></control><display><type>article</type><title>Energy-efficient heating control for nearly zero energy residential buildings with deep reinforcement learning</title><source>Elsevier ScienceDirect Journals</source><creator>Qin, Haosen ; Yu, Zhen ; Li, Tailu ; Liu, Xueliang ; Li, Li</creator><creatorcontrib>Qin, Haosen ; Yu, Zhen ; Li, Tailu ; Liu, Xueliang ; Li, Li</creatorcontrib><description>Controlling Heating, Ventilation and Air Conditioning (HVAC) systems is critical to improving energy efficiency of demand-side. In this paper, a model-free optimal control method based on deep reinforcement learning is proposed to control the heat pump start/stop and room temperature setting in residential buildings. The optimization goal of this method is to obtain the highest comprehensive reward which considering thermal comfort and energy cost. Firstly, the randomness, learning process, thermal comfort and energy consumption of the model-free controller are systematically investigated by a simulation system based on measured data. 
The results show that randomness has a significant impact on the initial performance and convergence speed of the model-free controller; The model-free controller has a linear accumulation of comprehensive rewards during the learning process, and the slope of the accumulated comprehensive rewards can be used to determine whether the controller converges; The model-free controller coordinates monitoring data, weather forecasts and building thermal inertia to achieve the highest comprehensive reward. Afterwards, the model-free controller was verified in a nearly zero energy residential building in Beijing, China. The results show that model-free controller improves the comprehensive reward by 15.3% compared to rule-based method. •A model-free controller based on deep reinforcement learning is developed.•Problem formulation including the state, action, and reward is carefully designed.•Randomness affects the initial performance and convergence speed of the controller.•The controller improved the comprehensive reward by 15.3% over the baseline.</description><identifier>ISSN: 0360-5442</identifier><identifier>DOI: 10.1016/j.energy.2022.126209</identifier><language>eng</language><publisher>Elsevier Ltd</publisher><subject>air ; ambient temperature ; China ; control methods ; Deep Q learning ; energy costs ; energy efficiency ; heat ; heat pumps ; HVAC ; Model-free control ; Optimal control ; Prioritized replay ; Reinforcement learning ; residential housing ; weather</subject><ispartof>Energy (Oxford), 2023-02, Vol.264, p.126209, Article 126209</ispartof><rights>2022 Elsevier 
Ltd</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c269t-c27bcfbd71e72b4245673d01bd86b9f792196d6c512844e999dcb2b762d963983</citedby><cites>FETCH-LOGICAL-c269t-c27bcfbd71e72b4245673d01bd86b9f792196d6c512844e999dcb2b762d963983</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S036054422203095X$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65306</link.rule.ids></links><search><creatorcontrib>Qin, Haosen</creatorcontrib><creatorcontrib>Yu, Zhen</creatorcontrib><creatorcontrib>Li, Tailu</creatorcontrib><creatorcontrib>Liu, Xueliang</creatorcontrib><creatorcontrib>Li, Li</creatorcontrib><title>Energy-efficient heating control for nearly zero energy residential buildings with deep reinforcement learning</title><title>Energy (Oxford)</title><description>Controlling Heating, Ventilation and Air Conditioning (HVAC) systems is critical to improving energy efficiency of demand-side. In this paper, a model-free optimal control method based on deep reinforcement learning is proposed to control the heat pump start/stop and room temperature setting in residential buildings. The optimization goal of this method is to obtain the highest comprehensive reward which considering thermal comfort and energy cost. Firstly, the randomness, learning process, thermal comfort and energy consumption of the model-free controller are systematically investigated by a simulation system based on measured data. 
The results show that randomness has a significant impact on the initial performance and convergence speed of the model-free controller; The model-free controller has a linear accumulation of comprehensive rewards during the learning process, and the slope of the accumulated comprehensive rewards can be used to determine whether the controller converges; The model-free controller coordinates monitoring data, weather forecasts and building thermal inertia to achieve the highest comprehensive reward. Afterwards, the model-free controller was verified in a nearly zero energy residential building in Beijing, China. The results show that model-free controller improves the comprehensive reward by 15.3% compared to rule-based method. •A model-free controller based on deep reinforcement learning is developed.•Problem formulation including the state, action, and reward is carefully designed.•Randomness affects the initial performance and convergence speed of the controller.•The controller improved the comprehensive reward by 15.3% over the baseline.</description><subject>air</subject><subject>ambient temperature</subject><subject>China</subject><subject>control methods</subject><subject>Deep Q learning</subject><subject>energy costs</subject><subject>energy efficiency</subject><subject>heat</subject><subject>heat pumps</subject><subject>HVAC</subject><subject>Model-free control</subject><subject>Optimal control</subject><subject>Prioritized replay</subject><subject>Reinforcement learning</subject><subject>residential 
housing</subject><subject>weather</subject><issn>0360-5442</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2023</creationdate><recordtype>article</recordtype><recordid>eNp9kD1PwzAQhj2ARCn8AwaPLAm24zjxgoSq8iFVYoHZiu1L68p1ip2Cyq_HJcwsd8M97yPdi9ANJSUlVNxtSwgQ18eSEcZKygQj8gzNSCVIUXPOLtBlSltCSN1KOUNh-UsX0PfOOAgj3kA3urDGZghjHDzuh4gDdNEf8TfEAU96HCE5m3nXeawPztucSfjLjRtsAfb57kKOGtidpD4LQiau0Hnf-QTXf3uO3h-Xb4vnYvX69LJ4WBWGCTnm2WjTa9tQaJjmjNeiqSyh2rZCy76RjEphhakpazkHKaU1mulGMCtFJdtqjm4n7z4OHwdIo9q5ZMD7LsBwSKqiddVyUjGRUT6hJg4pRejVPrpdF4-KEnWqVG3V9LM6VaqmSnPsfopBfuPTQVTp1J8B6yKYUdnB_S_4ATJ1hZk</recordid><startdate>20230201</startdate><enddate>20230201</enddate><creator>Qin, Haosen</creator><creator>Yu, Zhen</creator><creator>Li, Tailu</creator><creator>Liu, Xueliang</creator><creator>Li, Li</creator><general>Elsevier Ltd</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7S9</scope><scope>L.6</scope></search><sort><creationdate>20230201</creationdate><title>Energy-efficient heating control for nearly zero energy residential buildings with deep reinforcement learning</title><author>Qin, Haosen ; Yu, Zhen ; Li, Tailu ; Liu, Xueliang ; Li, Li</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c269t-c27bcfbd71e72b4245673d01bd86b9f792196d6c512844e999dcb2b762d963983</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2023</creationdate><topic>air</topic><topic>ambient temperature</topic><topic>China</topic><topic>control methods</topic><topic>Deep Q learning</topic><topic>energy costs</topic><topic>energy efficiency</topic><topic>heat</topic><topic>heat pumps</topic><topic>HVAC</topic><topic>Model-free control</topic><topic>Optimal control</topic><topic>Prioritized replay</topic><topic>Reinforcement learning</topic><topic>residential housing</topic><topic>weather</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Qin, 
Haosen</creatorcontrib><creatorcontrib>Yu, Zhen</creatorcontrib><creatorcontrib>Li, Tailu</creatorcontrib><creatorcontrib>Liu, Xueliang</creatorcontrib><creatorcontrib>Li, Li</creatorcontrib><collection>CrossRef</collection><collection>AGRICOLA</collection><collection>AGRICOLA - Academic</collection><jtitle>Energy (Oxford)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Qin, Haosen</au><au>Yu, Zhen</au><au>Li, Tailu</au><au>Liu, Xueliang</au><au>Li, Li</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Energy-efficient heating control for nearly zero energy residential buildings with deep reinforcement learning</atitle><jtitle>Energy (Oxford)</jtitle><date>2023-02-01</date><risdate>2023</risdate><volume>264</volume><spage>126209</spage><pages>126209-</pages><artnum>126209</artnum><issn>0360-5442</issn><abstract>Controlling Heating, Ventilation and Air Conditioning (HVAC) systems is critical to improving energy efficiency of demand-side. In this paper, a model-free optimal control method based on deep reinforcement learning is proposed to control the heat pump start/stop and room temperature setting in residential buildings. The optimization goal of this method is to obtain the highest comprehensive reward which considering thermal comfort and energy cost. Firstly, the randomness, learning process, thermal comfort and energy consumption of the model-free controller are systematically investigated by a simulation system based on measured data. 
The results show that randomness has a significant impact on the initial performance and convergence speed of the model-free controller; The model-free controller has a linear accumulation of comprehensive rewards during the learning process, and the slope of the accumulated comprehensive rewards can be used to determine whether the controller converges; The model-free controller coordinates monitoring data, weather forecasts and building thermal inertia to achieve the highest comprehensive reward. Afterwards, the model-free controller was verified in a nearly zero energy residential building in Beijing, China. The results show that model-free controller improves the comprehensive reward by 15.3% compared to rule-based method. •A model-free controller based on deep reinforcement learning is developed.•Problem formulation including the state, action, and reward is carefully designed.•Randomness affects the initial performance and convergence speed of the controller.•The controller improved the comprehensive reward by 15.3% over the baseline.</abstract><pub>Elsevier Ltd</pub><doi>10.1016/j.energy.2022.126209</doi></addata></record>
fulltext fulltext
identifier ISSN: 0360-5442
ispartof Energy (Oxford), 2023-02, Vol.264, p.126209, Article 126209
issn 0360-5442
language eng
recordid cdi_proquest_miscellaneous_3153840326
source Elsevier ScienceDirect Journals
subjects air
ambient temperature
China
control methods
Deep Q learning
energy costs
energy efficiency
heat
heat pumps
HVAC
Model-free control
Optimal control
Prioritized replay
Reinforcement learning
residential housing
weather
title Energy-efficient heating control for nearly zero energy residential buildings with deep reinforcement learning
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-30T05%3A46%3A24IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Energy-efficient%20heating%20control%20for%20nearly%20zero%20energy%20residential%20buildings%20with%20deep%20reinforcement%20learning&rft.jtitle=Energy%20(Oxford)&rft.au=Qin,%20Haosen&rft.date=2023-02-01&rft.volume=264&rft.spage=126209&rft.pages=126209-&rft.artnum=126209&rft.issn=0360-5442&rft_id=info:doi/10.1016/j.energy.2022.126209&rft_dat=%3Cproquest_cross%3E3153840326%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=3153840326&rft_id=info:pmid/&rft_els_id=S036054422203095X&rfr_iscdi=true