Sequential recovery method and device for power system based on deep reinforcement learning
The invention discloses a sequential recovery method and device for a power system based on deep reinforcement learning. The method comprises the following steps: constructing a power system recovery model which comprises a deep reinforcement learning Q value estimation network and a Target Q networ...
Gespeichert in:
Hauptverfasser: | , , , , |
---|---|
Format: | Patent |
Sprache: | chi ; eng |
Schlagworte: | |
Online-Zugang: | Volltext bestellen |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | |
---|---|
container_issue | |
container_start_page | |
container_title | |
container_volume | |
creator | GAO YUXIN HUANG ZEZHEN ZHANG TIANYI CHENG WEI HUANG WEI |
description | The invention discloses a sequential recovery method and device for a power system based on deep reinforcement learning. The method comprises the following steps: constructing a power system recovery model which comprises a deep reinforcement learning Q value estimation network and a Target Q network, and training the power system recovery model. According to the invention, based on the power network after a cascade failure and through a bus recovery sequence obtained after deep reinforcement learning, the recovery capability of the power network system to the cascade failure in a system recovery process is evaluated, reinforcement learning is combined with the power network, and the recovery problem of the power network is considered from the perspective of defenders; and through combination with a neural network, the implementation range of the power network is expanded; that is, an optimal recovery strategy of a large power grid can be found.
本发明公开了一种基于深度强化学习的电力系统顺序恢复方法及装置,通过构建包括深度强化学习Q值估计网络和Target Q网络的电力系 |
format | Patent |
fullrecord | <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN114048989A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN114048989A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN114048989A3</originalsourceid><addsrcrecordid>eNqNzLEOwiAUhWEWB6O-w_UBTGzs0I6m0Ti56ObQIJwqCVwQsKZvL4MP4HSW7_xzcbvg9QZnIy1FKD8iTuSQn16TZE0ao1GgwUcK_oNIaUoZju4yQZPnAhDK03AhCq6kyEJGNvxYitkgbcLqtwuxPh6u3WmD4HukIBUYue_OVVVv66Zt2v3uH_MFm_E73A</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Sequential recovery method and device for power system based on deep reinforcement learning</title><source>esp@cenet</source><creator>GAO YUXIN ; HUANG ZEZHEN ; ZHANG TIANYI ; CHENG WEI ; HUANG WEI</creator><creatorcontrib>GAO YUXIN ; HUANG ZEZHEN ; ZHANG TIANYI ; CHENG WEI ; HUANG WEI</creatorcontrib><description>The invention discloses a sequential recovery method and device for a power system based on deep reinforcement learning. The method comprises the following steps: constructing a power system recovery model which comprises a deep reinforcement learning Q value estimation network and a Target Q network, and training the power system recovery model. According to the invention, based on the power network after a cascade failure and through a bus recovery sequence obtained after deep reinforcement learning, the recovery capability of the power network system to the cascade failure in a system recovery process is evaluated, reinforcement learning is combined with the power network, and the recovery problem of the power network is considered from the perspective of defenders; and through combination with a neural network, the implementation range of the power network is expanded; that is, an optimal recovery strategy of a large power grid can be found.
本发明公开了一种基于深度强化学习的电力系统顺序恢复方法及装置,通过构建包括深度强化学习Q值估计网络和Target Q网络的电力系</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FORADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORYOR FORECASTING PURPOSES ; PHYSICS ; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE,COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTINGPURPOSES, NOT OTHERWISE PROVIDED FOR</subject><creationdate>2022</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220215&DB=EPODOC&CC=CN&NR=114048989A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220215&DB=EPODOC&CC=CN&NR=114048989A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>GAO YUXIN</creatorcontrib><creatorcontrib>HUANG ZEZHEN</creatorcontrib><creatorcontrib>ZHANG TIANYI</creatorcontrib><creatorcontrib>CHENG WEI</creatorcontrib><creatorcontrib>HUANG WEI</creatorcontrib><title>Sequential recovery method and device for power system based on deep reinforcement learning</title><description>The invention discloses a sequential recovery method and device for a power system based on deep reinforcement learning. The method comprises the following steps: constructing a power system recovery model which comprises a deep reinforcement learning Q value estimation network and a Target Q network, and training the power system recovery model. According to the invention, based on the power network after a cascade failure and through a bus recovery sequence obtained after deep reinforcement learning, the recovery capability of the power network system to the cascade failure in a system recovery process is evaluated, reinforcement learning is combined with the power network, and the recovery problem of the power network is considered from the perspective of defenders; and through combination with a neural network, the implementation range of the power network is expanded; that is, an optimal recovery strategy of a large power grid can be found.
本发明公开了一种基于深度强化学习的电力系统顺序恢复方法及装置,通过构建包括深度强化学习Q值估计网络和Target Q网络的电力系</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FORADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORYOR FORECASTING PURPOSES</subject><subject>PHYSICS</subject><subject>SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE,COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTINGPURPOSES, NOT OTHERWISE PROVIDED FOR</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2022</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNzLEOwiAUhWEWB6O-w_UBTGzs0I6m0Ti56ObQIJwqCVwQsKZvL4MP4HSW7_xzcbvg9QZnIy1FKD8iTuSQn16TZE0ao1GgwUcK_oNIaUoZju4yQZPnAhDK03AhCq6kyEJGNvxYitkgbcLqtwuxPh6u3WmD4HukIBUYue_OVVVv66Zt2v3uH_MFm_E73A</recordid><startdate>20220215</startdate><enddate>20220215</enddate><creator>GAO YUXIN</creator><creator>HUANG ZEZHEN</creator><creator>ZHANG TIANYI</creator><creator>CHENG WEI</creator><creator>HUANG WEI</creator><scope>EVB</scope></search><sort><creationdate>20220215</creationdate><title>Sequential recovery method and device for power system based on deep reinforcement learning</title><author>GAO YUXIN ; HUANG ZEZHEN ; ZHANG TIANYI ; CHENG WEI ; HUANG WEI</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN114048989A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2022</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FORADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORYOR FORECASTING PURPOSES</topic><topic>PHYSICS</topic><topic>SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE,COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTINGPURPOSES, NOT OTHERWISE PROVIDED FOR</topic><toplevel>online_resources</toplevel><creatorcontrib>GAO YUXIN</creatorcontrib><creatorcontrib>HUANG ZEZHEN</creatorcontrib><creatorcontrib>ZHANG TIANYI</creatorcontrib><creatorcontrib>CHENG WEI</creatorcontrib><creatorcontrib>HUANG WEI</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>GAO YUXIN</au><au>HUANG ZEZHEN</au><au>ZHANG TIANYI</au><au>CHENG WEI</au><au>HUANG WEI</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Sequential recovery method and device for power system based on deep reinforcement learning</title><date>2022-02-15</date><risdate>2022</risdate><abstract>The invention discloses a sequential recovery method and device for a power system based on deep reinforcement learning. The method comprises the following steps: constructing a power system recovery model which comprises a deep reinforcement learning Q value estimation network and a Target Q network, and training the power system recovery model. According to the invention, based on the power network after a cascade failure and through a bus recovery sequence obtained after deep reinforcement learning, the recovery capability of the power network system to the cascade failure in a system recovery process is evaluated, reinforcement learning is combined with the power network, and the recovery problem of the power network is considered from the perspective of defenders; and through combination with a neural network, the implementation range of the power network is expanded; that is, an optimal recovery strategy of a large power grid can be found.
本发明公开了一种基于深度强化学习的电力系统顺序恢复方法及装置,通过构建包括深度强化学习Q值估计网络和Target Q网络的电力系</abstract><oa>free_for_read</oa></addata></record> |
fulltext | fulltext_linktorsrc |
identifier | |
ispartof | |
issn | |
language | chi ; eng |
recordid | cdi_epo_espacenet_CN114048989A |
source | esp@cenet |
subjects | CALCULATING COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS COMPUTING COUNTING DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FORADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORYOR FORECASTING PURPOSES PHYSICS SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE,COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTINGPURPOSES, NOT OTHERWISE PROVIDED FOR |
title | Sequential recovery method and device for power system based on deep reinforcement learning |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-25T19%3A21%3A09IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=GAO%20YUXIN&rft.date=2022-02-15&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN114048989A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true |