Sequential recovery method and device for power system based on deep reinforcement learning

The invention discloses a sequential recovery method and device for a power system based on deep reinforcement learning. The method comprises the following steps: constructing a power system recovery model which comprises a deep reinforcement learning Q value estimation network and a Target Q networ...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: GAO YUXIN, HUANG ZEZHEN, ZHANG TIANYI, CHENG WEI, HUANG WEI
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator GAO YUXIN
HUANG ZEZHEN
ZHANG TIANYI
CHENG WEI
HUANG WEI
description The invention discloses a sequential recovery method and device for a power system based on deep reinforcement learning. The method comprises the following steps: constructing a power system recovery model which comprises a deep reinforcement learning Q value estimation network and a Target Q network, and training the power system recovery model. According to the invention, based on the power network after a cascade failure and through a bus recovery sequence obtained after deep reinforcement learning, the recovery capability of the power network system to the cascade failure in a system recovery process is evaluated, reinforcement learning is combined with the power network, and the recovery problem of the power network is considered from the perspective of defenders; and through combination with a neural network, the implementation range of the power network is expanded; that is, an optimal recovery strategy of a large power grid can be found. 本发明公开了一种基于深度强化学习的电力系统顺序恢复方法及装置,通过构建包括深度强化学习Q值估计网络和Target Q网络的电力系
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN114048989A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN114048989A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN114048989A3</originalsourceid><addsrcrecordid>eNqNzLEOwiAUhWEWB6O-w_UBTGzs0I6m0Ti56ObQIJwqCVwQsKZvL4MP4HSW7_xzcbvg9QZnIy1FKD8iTuSQn16TZE0ao1GgwUcK_oNIaUoZju4yQZPnAhDK03AhCq6kyEJGNvxYitkgbcLqtwuxPh6u3WmD4HukIBUYue_OVVVv66Zt2v3uH_MFm_E73A</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Sequential recovery method and device for power system based on deep reinforcement learning</title><source>esp@cenet</source><creator>GAO YUXIN ; HUANG ZEZHEN ; ZHANG TIANYI ; CHENG WEI ; HUANG WEI</creator><creatorcontrib>GAO YUXIN ; HUANG ZEZHEN ; ZHANG TIANYI ; CHENG WEI ; HUANG WEI</creatorcontrib><description>The invention discloses a sequential recovery method and device for a power system based on deep reinforcement learning. The method comprises the following steps: constructing a power system recovery model which comprises a deep reinforcement learning Q value estimation network and a Target Q network, and training the power system recovery model. According to the invention, based on the power network after a cascade failure and through a bus recovery sequence obtained after deep reinforcement learning, the recovery capability of the power network system to the cascade failure in a system recovery process is evaluated, reinforcement learning is combined with the power network, and the recovery problem of the power network is considered from the perspective of defenders; and through combination with a neural network, the implementation range of the power network is expanded; that is, an optimal recovery strategy of a large power grid can be found. 本发明公开了一种基于深度强化学习的电力系统顺序恢复方法及装置,通过构建包括深度强化学习Q值估计网络和Target Q网络的电力系</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FORADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORYOR FORECASTING PURPOSES ; PHYSICS ; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE,COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTINGPURPOSES, NOT OTHERWISE PROVIDED FOR</subject><creationdate>2022</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20220215&amp;DB=EPODOC&amp;CC=CN&amp;NR=114048989A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20220215&amp;DB=EPODOC&amp;CC=CN&amp;NR=114048989A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>GAO YUXIN</creatorcontrib><creatorcontrib>HUANG ZEZHEN</creatorcontrib><creatorcontrib>ZHANG TIANYI</creatorcontrib><creatorcontrib>CHENG WEI</creatorcontrib><creatorcontrib>HUANG WEI</creatorcontrib><title>Sequential recovery method and device for power system based on deep reinforcement learning</title><description>The invention discloses a sequential recovery method and device for a power system based on deep reinforcement learning. The method comprises the following steps: constructing a power system recovery model which comprises a deep reinforcement learning Q value estimation network and a Target Q network, and training the power system recovery model. According to the invention, based on the power network after a cascade failure and through a bus recovery sequence obtained after deep reinforcement learning, the recovery capability of the power network system to the cascade failure in a system recovery process is evaluated, reinforcement learning is combined with the power network, and the recovery problem of the power network is considered from the perspective of defenders; and through combination with a neural network, the implementation range of the power network is expanded; that is, an optimal recovery strategy of a large power grid can be found. 本发明公开了一种基于深度强化学习的电力系统顺序恢复方法及装置,通过构建包括深度强化学习Q值估计网络和Target Q网络的电力系</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FORADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORYOR FORECASTING PURPOSES</subject><subject>PHYSICS</subject><subject>SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE,COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTINGPURPOSES, NOT OTHERWISE PROVIDED FOR</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2022</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNzLEOwiAUhWEWB6O-w_UBTGzs0I6m0Ti56ObQIJwqCVwQsKZvL4MP4HSW7_xzcbvg9QZnIy1FKD8iTuSQn16TZE0ao1GgwUcK_oNIaUoZju4yQZPnAhDK03AhCq6kyEJGNvxYitkgbcLqtwuxPh6u3WmD4HukIBUYue_OVVVv66Zt2v3uH_MFm_E73A</recordid><startdate>20220215</startdate><enddate>20220215</enddate><creator>GAO YUXIN</creator><creator>HUANG ZEZHEN</creator><creator>ZHANG TIANYI</creator><creator>CHENG WEI</creator><creator>HUANG WEI</creator><scope>EVB</scope></search><sort><creationdate>20220215</creationdate><title>Sequential recovery method and device for power system based on deep reinforcement learning</title><author>GAO YUXIN ; HUANG ZEZHEN ; ZHANG TIANYI ; CHENG WEI ; HUANG WEI</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN114048989A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2022</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FORADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORYOR FORECASTING PURPOSES</topic><topic>PHYSICS</topic><topic>SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE,COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTINGPURPOSES, NOT OTHERWISE PROVIDED FOR</topic><toplevel>online_resources</toplevel><creatorcontrib>GAO YUXIN</creatorcontrib><creatorcontrib>HUANG ZEZHEN</creatorcontrib><creatorcontrib>ZHANG TIANYI</creatorcontrib><creatorcontrib>CHENG WEI</creatorcontrib><creatorcontrib>HUANG WEI</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>GAO YUXIN</au><au>HUANG ZEZHEN</au><au>ZHANG TIANYI</au><au>CHENG WEI</au><au>HUANG WEI</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Sequential recovery method and device for power system based on deep reinforcement learning</title><date>2022-02-15</date><risdate>2022</risdate><abstract>The invention discloses a sequential recovery method and device for a power system based on deep reinforcement learning. The method comprises the following steps: constructing a power system recovery model which comprises a deep reinforcement learning Q value estimation network and a Target Q network, and training the power system recovery model. According to the invention, based on the power network after a cascade failure and through a bus recovery sequence obtained after deep reinforcement learning, the recovery capability of the power network system to the cascade failure in a system recovery process is evaluated, reinforcement learning is combined with the power network, and the recovery problem of the power network is considered from the perspective of defenders; and through combination with a neural network, the implementation range of the power network is expanded; that is, an optimal recovery strategy of a large power grid can be found. 本发明公开了一种基于深度强化学习的电力系统顺序恢复方法及装置,通过构建包括深度强化学习Q值估计网络和Target Q网络的电力系</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language chi ; eng
recordid cdi_epo_espacenet_CN114048989A
source esp@cenet
subjects CALCULATING
COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS
COMPUTING
COUNTING
DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FORADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORYOR FORECASTING PURPOSES
PHYSICS
SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE,COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTINGPURPOSES, NOT OTHERWISE PROVIDED FOR
title Sequential recovery method and device for power system based on deep reinforcement learning
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-25T19%3A21%3A09IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=GAO%20YUXIN&rft.date=2022-02-15&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN114048989A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true