Sequential recovery method and device for power system based on deep reinforcement learning

The invention discloses a sequential recovery method and device for a power system based on deep reinforcement learning. The method comprises the following steps: constructing a power system recovery model which comprises a deep reinforcement learning Q value estimation network and a Target Q networ...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	GAO YUXIN, HUANG ZEZHEN, ZHANG TIANYI, CHENG WEI, HUANG WEI
Format:	Patent
Sprache:	chi ; eng
Schlagworte:	CALCULATING COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS COMPUTING COUNTING DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FORADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORYOR FORECASTING PURPOSES PHYSICS SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE,COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTINGPURPOSES, NOT OTHERWISE PROVIDED FOR
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	GAO YUXIN HUANG ZEZHEN ZHANG TIANYI CHENG WEI HUANG WEI
description	The invention discloses a sequential recovery method and device for a power system based on deep reinforcement learning. The method comprises the following steps: constructing a power system recovery model which comprises a deep reinforcement learning Q value estimation network and a Target Q network, and training the power system recovery model. According to the invention, based on the power network after a cascade failure and through a bus recovery sequence obtained after deep reinforcement learning, the recovery capability of the power network system to the cascade failure in a system recovery process is evaluated, reinforcement learning is combined with the power network, and the recovery problem of the power network is considered from the perspective of defenders; and through combination with a neural network, the implementation range of the power network is expanded; that is, an optimal recovery strategy of a large power grid can be found. 本发明公开了一种基于深度强化学习的电力系统顺序恢复方法及装置，通过构建包括深度强化学习Q值估计网络和Target Q网络的电力系
format	Patent
fullrecord	<record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN114048989A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN114048989A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN114048989A3</originalsourceid><addsrcrecordid>eNqNzLEOwiAUhWEWB6O-w_UBTGzs0I6m0Ti56ObQIJwqCVwQsKZvL4MP4HSW7_xzcbvg9QZnIy1FKD8iTuSQn16TZE0ao1GgwUcK_oNIaUoZju4yQZPnAhDK03AhCq6kyEJGNvxYitkgbcLqtwuxPh6u3WmD4HukIBUYue_OVVVv66Zt2v3uH_MFm_E73A</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Sequential recovery method and device for power system based on deep reinforcement learning</title><source>esp@cenet</source><creator>GAO YUXIN ; HUANG ZEZHEN ; ZHANG TIANYI ; CHENG WEI ; HUANG WEI</creator><creatorcontrib>GAO YUXIN ; HUANG ZEZHEN ; ZHANG TIANYI ; CHENG WEI ; HUANG WEI</creatorcontrib><description>The invention discloses a sequential recovery method and device for a power system based on deep reinforcement learning. The method comprises the following steps: constructing a power system recovery model which comprises a deep reinforcement learning Q value estimation network and a Target Q network, and training the power system recovery model. According to the invention, based on the power network after a cascade failure and through a bus recovery sequence obtained after deep reinforcement learning, the recovery capability of the power network system to the cascade failure in a system recovery process is evaluated, reinforcement learning is combined with the power network, and the recovery problem of the power network is considered from the perspective of defenders; and through combination with a neural network, the implementation range of the power network is expanded; that is, an optimal recovery strategy of a large power grid can be found. 本发明公开了一种基于深度强化学习的电力系统顺序恢复方法及装置，通过构建包括深度强化学习Q值估计网络和Target Q网络的电力系</description><language>chi ; eng</language><subject>CALCULATING ; COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS ; COMPUTING ; COUNTING ; DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FORADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORYOR FORECASTING PURPOSES ; PHYSICS ; SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE,COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTINGPURPOSES, NOT OTHERWISE PROVIDED FOR</subject><creationdate>2022</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220215&DB=EPODOC&CC=CN&NR=114048989A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,780,885,25564,76547</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220215&DB=EPODOC&CC=CN&NR=114048989A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>GAO YUXIN</creatorcontrib><creatorcontrib>HUANG ZEZHEN</creatorcontrib><creatorcontrib>ZHANG TIANYI</creatorcontrib><creatorcontrib>CHENG WEI</creatorcontrib><creatorcontrib>HUANG WEI</creatorcontrib><title>Sequential recovery method and device for power system based on deep reinforcement learning</title><description>The invention discloses a sequential recovery method and device for a power system based on deep reinforcement learning. The method comprises the following steps: constructing a power system recovery model which comprises a deep reinforcement learning Q value estimation network and a Target Q network, and training the power system recovery model. According to the invention, based on the power network after a cascade failure and through a bus recovery sequence obtained after deep reinforcement learning, the recovery capability of the power network system to the cascade failure in a system recovery process is evaluated, reinforcement learning is combined with the power network, and the recovery problem of the power network is considered from the perspective of defenders; and through combination with a neural network, the implementation range of the power network is expanded; that is, an optimal recovery strategy of a large power grid can be found. 本发明公开了一种基于深度强化学习的电力系统顺序恢复方法及装置，通过构建包括深度强化学习Q值估计网络和Target Q网络的电力系</description><subject>CALCULATING</subject><subject>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</subject><subject>COMPUTING</subject><subject>COUNTING</subject><subject>DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FORADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORYOR FORECASTING PURPOSES</subject><subject>PHYSICS</subject><subject>SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE,COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTINGPURPOSES, NOT OTHERWISE PROVIDED FOR</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2022</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNzLEOwiAUhWEWB6O-w_UBTGzs0I6m0Ti56ObQIJwqCVwQsKZvL4MP4HSW7_xzcbvg9QZnIy1FKD8iTuSQn16TZE0ao1GgwUcK_oNIaUoZju4yQZPnAhDK03AhCq6kyEJGNvxYitkgbcLqtwuxPh6u3WmD4HukIBUYue_OVVVv66Zt2v3uH_MFm_E73A</recordid><startdate>20220215</startdate><enddate>20220215</enddate><creator>GAO YUXIN</creator><creator>HUANG ZEZHEN</creator><creator>ZHANG TIANYI</creator><creator>CHENG WEI</creator><creator>HUANG WEI</creator><scope>EVB</scope></search><sort><creationdate>20220215</creationdate><title>Sequential recovery method and device for power system based on deep reinforcement learning</title><author>GAO YUXIN ; HUANG ZEZHEN ; ZHANG TIANYI ; CHENG WEI ; HUANG WEI</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN114048989A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2022</creationdate><topic>CALCULATING</topic><topic>COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS</topic><topic>COMPUTING</topic><topic>COUNTING</topic><topic>DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FORADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORYOR FORECASTING PURPOSES</topic><topic>PHYSICS</topic><topic>SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE,COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTINGPURPOSES, NOT OTHERWISE PROVIDED FOR</topic><toplevel>online_resources</toplevel><creatorcontrib>GAO YUXIN</creatorcontrib><creatorcontrib>HUANG ZEZHEN</creatorcontrib><creatorcontrib>ZHANG TIANYI</creatorcontrib><creatorcontrib>CHENG WEI</creatorcontrib><creatorcontrib>HUANG WEI</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>GAO YUXIN</au><au>HUANG ZEZHEN</au><au>ZHANG TIANYI</au><au>CHENG WEI</au><au>HUANG WEI</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Sequential recovery method and device for power system based on deep reinforcement learning</title><date>2022-02-15</date><risdate>2022</risdate><abstract>The invention discloses a sequential recovery method and device for a power system based on deep reinforcement learning. The method comprises the following steps: constructing a power system recovery model which comprises a deep reinforcement learning Q value estimation network and a Target Q network, and training the power system recovery model. According to the invention, based on the power network after a cascade failure and through a bus recovery sequence obtained after deep reinforcement learning, the recovery capability of the power network system to the cascade failure in a system recovery process is evaluated, reinforcement learning is combined with the power network, and the recovery problem of the power network is considered from the perspective of defenders; and through combination with a neural network, the implementation range of the power network is expanded; that is, an optimal recovery strategy of a large power grid can be found. 本发明公开了一种基于深度强化学习的电力系统顺序恢复方法及装置，通过构建包括深度强化学习Q值估计网络和Target Q网络的电力系</abstract><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier
ispartof
issn
language	chi ; eng
recordid	cdi_epo_espacenet_CN114048989A
source	esp@cenet
subjects	CALCULATING COMPUTER SYSTEMS BASED ON SPECIFIC COMPUTATIONAL MODELS COMPUTING COUNTING DATA PROCESSING SYSTEMS OR METHODS, SPECIALLY ADAPTED FORADMINISTRATIVE, COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORYOR FORECASTING PURPOSES PHYSICS SYSTEMS OR METHODS SPECIALLY ADAPTED FOR ADMINISTRATIVE,COMMERCIAL, FINANCIAL, MANAGERIAL, SUPERVISORY OR FORECASTINGPURPOSES, NOT OTHERWISE PROVIDED FOR
title	Sequential recovery method and device for power system based on deep reinforcement learning
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2024-12-25T19%3A21%3A09IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=GAO%20YUXIN&rft.date=2022-02-15&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN114048989A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true