Safety reinforcement learning quadrotor control system and method based on control barrier function

The invention discloses a safety reinforcement learning four-rotor control system based on a control barrier function, which comprises a simulation platform and a controller, and is characterized in that the controller is used for receiving a state quantity output by a simulation model and outputtin...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Hauptverfasser: LIU MINGCHENG, ZHOU TIANZE, WANG YAKAI, SUN ZHIWEN, WANG ZHAOSHUN, LIN DEFU, ZHANG FUBIAO, MO LI, CHEN QI, SONG TAO, LANG SHUAIPENG
Format: Patent
Sprache:chi ; eng
Schlagworte:
Online-Zugang:Volltext bestellen
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page
container_issue
container_start_page
container_title
container_volume
creator LIU MINGCHENG
ZHOU TIANZE
WANG YAKAI
SUN ZHIWEN
WANG ZHAOSHUN
LIN DEFU
ZHANG FUBIAO
MO LI
CHEN QI
SONG TAO
LANG SHUAIPENG
description The invention discloses a safety reinforcement learning four-rotor control system based on a control barrier function, which comprises a simulation platform and a controller, and is characterized in that the controller is used for receiving a state quantity output by a simulation model and outputting a control instruction to an unmanned aerial vehicle or the simulation model, and the controller comprises a reinforcement learning sub-controller and a control barrier function sub-controller; through the combination of the control barrier function and the near-end strategy optimization method, the problem of low safety of a reinforcement learning controller is solved, and the stability of the system is improved. 本发明公开了一种基于控制障碍函数的安全强化学习四旋翼控制系统,包括仿真平台和控制器,所述接收仿真模型输出的状态量,向无人机或仿真模型输出控制指令,所述控制器包括强化学习子控制器和控制障碍函数子控制器,通过控制障碍函数与近端策略优化法结合的方式,解决了强化学习类的控制器安全性低的问题,提高了系统的稳定性。
format Patent
fullrecord <record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN114326438A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN114326438A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN114326438A3</originalsourceid><addsrcrecordid>eNqNyjEOwjAMAMAsDAj4g3kAQ0mFWFEFYmKBvXITByK1dnHcob9nQcxMt9zShTsmshmUMifRQAOxQU-onPkJ7wmjiolCEDaVHspcjAZAjjCQvSRCh4UiCP9Kh6qZFNLEwbLw2i0S9oU2X1duezk_muuORmmpjBiIydrmVlW13x9qfzz5f84HIeU_XA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Safety reinforcement learning quadrotor control system and method based on control barrier function</title><source>esp@cenet</source><creator>LIU MINGCHENG ; ZHOU TIANZE ; WANG YAKAI ; SUN ZHIWEN ; WANG ZHAOSHUN ; LIN DEFU ; ZHANG FUBIAO ; MO LI ; CHEN QI ; SONG TAO ; LANG SHUAIPENG</creator><creatorcontrib>LIU MINGCHENG ; ZHOU TIANZE ; WANG YAKAI ; SUN ZHIWEN ; WANG ZHAOSHUN ; LIN DEFU ; ZHANG FUBIAO ; MO LI ; CHEN QI ; SONG TAO ; LANG SHUAIPENG</creatorcontrib><description>The invention discloses a safety reinforcement learning four-rotor control system based on a control barrier function, which comprises a simulation platform and a controller, and is characterized in that the controller is used for receiving a state quantity output by a simulation model and outputting a control instruction to an unmanned aerial vehicle or the simulation model, and the controller comprises a reinforcement learning sub-controller and a control barrier function sub-controller; through the combination of the control barrier function and the near-end strategy optimization method, the problem of low safety of a reinforcement learning controller is solved, and the stability of the system is improved. 本发明公开了一种基于控制障碍函数的安全强化学习四旋翼控制系统,包括仿真平台和控制器,所述接收仿真模型输出的状态量,向无人机或仿真模型输出控制指令,所述控制器包括强化学习子控制器和控制障碍函数子控制器,通过控制障碍函数与近端策略优化法结合的方式,解决了强化学习类的控制器安全性低的问题,提高了系统的稳定性。</description><language>chi ; eng</language><subject>CONTROL OR REGULATING SYSTEMS IN GENERAL ; CONTROLLING ; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS ; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS ORELEMENTS ; PHYSICS ; REGULATING</subject><creationdate>2022</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20220412&amp;DB=EPODOC&amp;CC=CN&amp;NR=114326438A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25542,76289</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&amp;date=20220412&amp;DB=EPODOC&amp;CC=CN&amp;NR=114326438A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>LIU MINGCHENG</creatorcontrib><creatorcontrib>ZHOU TIANZE</creatorcontrib><creatorcontrib>WANG YAKAI</creatorcontrib><creatorcontrib>SUN ZHIWEN</creatorcontrib><creatorcontrib>WANG ZHAOSHUN</creatorcontrib><creatorcontrib>LIN DEFU</creatorcontrib><creatorcontrib>ZHANG FUBIAO</creatorcontrib><creatorcontrib>MO LI</creatorcontrib><creatorcontrib>CHEN QI</creatorcontrib><creatorcontrib>SONG TAO</creatorcontrib><creatorcontrib>LANG SHUAIPENG</creatorcontrib><title>Safety reinforcement learning quadrotor control system and method based on control barrier function</title><description>The invention discloses a safety reinforcement learning four-rotor control system based on a control barrier function, which comprises a simulation platform and a controller, and is characterized in that the controller is used for receiving a state quantity output by a simulation model and outputting a control instruction to an unmanned aerial vehicle or the simulation model, and the controller comprises a reinforcement learning sub-controller and a control barrier function sub-controller; through the combination of the control barrier function and the near-end strategy optimization method, the problem of low safety of a reinforcement learning controller is solved, and the stability of the system is improved. 本发明公开了一种基于控制障碍函数的安全强化学习四旋翼控制系统,包括仿真平台和控制器,所述接收仿真模型输出的状态量,向无人机或仿真模型输出控制指令,所述控制器包括强化学习子控制器和控制障碍函数子控制器,通过控制障碍函数与近端策略优化法结合的方式,解决了强化学习类的控制器安全性低的问题,提高了系统的稳定性。</description><subject>CONTROL OR REGULATING SYSTEMS IN GENERAL</subject><subject>CONTROLLING</subject><subject>FUNCTIONAL ELEMENTS OF SUCH SYSTEMS</subject><subject>MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS ORELEMENTS</subject><subject>PHYSICS</subject><subject>REGULATING</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2022</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNyjEOwjAMAMAsDAj4g3kAQ0mFWFEFYmKBvXITByK1dnHcob9nQcxMt9zShTsmshmUMifRQAOxQU-onPkJ7wmjiolCEDaVHspcjAZAjjCQvSRCh4UiCP9Kh6qZFNLEwbLw2i0S9oU2X1duezk_muuORmmpjBiIydrmVlW13x9qfzz5f84HIeU_XA</recordid><startdate>20220412</startdate><enddate>20220412</enddate><creator>LIU MINGCHENG</creator><creator>ZHOU TIANZE</creator><creator>WANG YAKAI</creator><creator>SUN ZHIWEN</creator><creator>WANG ZHAOSHUN</creator><creator>LIN DEFU</creator><creator>ZHANG FUBIAO</creator><creator>MO LI</creator><creator>CHEN QI</creator><creator>SONG TAO</creator><creator>LANG SHUAIPENG</creator><scope>EVB</scope></search><sort><creationdate>20220412</creationdate><title>Safety reinforcement learning quadrotor control system and method based on control barrier function</title><author>LIU MINGCHENG ; ZHOU TIANZE ; WANG YAKAI ; SUN ZHIWEN ; WANG ZHAOSHUN ; LIN DEFU ; ZHANG FUBIAO ; MO LI ; CHEN QI ; SONG TAO ; LANG SHUAIPENG</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN114326438A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2022</creationdate><topic>CONTROL OR REGULATING SYSTEMS IN GENERAL</topic><topic>CONTROLLING</topic><topic>FUNCTIONAL ELEMENTS OF SUCH SYSTEMS</topic><topic>MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS ORELEMENTS</topic><topic>PHYSICS</topic><topic>REGULATING</topic><toplevel>online_resources</toplevel><creatorcontrib>LIU MINGCHENG</creatorcontrib><creatorcontrib>ZHOU TIANZE</creatorcontrib><creatorcontrib>WANG YAKAI</creatorcontrib><creatorcontrib>SUN ZHIWEN</creatorcontrib><creatorcontrib>WANG ZHAOSHUN</creatorcontrib><creatorcontrib>LIN DEFU</creatorcontrib><creatorcontrib>ZHANG FUBIAO</creatorcontrib><creatorcontrib>MO LI</creatorcontrib><creatorcontrib>CHEN QI</creatorcontrib><creatorcontrib>SONG TAO</creatorcontrib><creatorcontrib>LANG SHUAIPENG</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>LIU MINGCHENG</au><au>ZHOU TIANZE</au><au>WANG YAKAI</au><au>SUN ZHIWEN</au><au>WANG ZHAOSHUN</au><au>LIN DEFU</au><au>ZHANG FUBIAO</au><au>MO LI</au><au>CHEN QI</au><au>SONG TAO</au><au>LANG SHUAIPENG</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Safety reinforcement learning quadrotor control system and method based on control barrier function</title><date>2022-04-12</date><risdate>2022</risdate><abstract>The invention discloses a safety reinforcement learning four-rotor control system based on a control barrier function, which comprises a simulation platform and a controller, and is characterized in that the controller is used for receiving a state quantity output by a simulation model and outputting a control instruction to an unmanned aerial vehicle or the simulation model, and the controller comprises a reinforcement learning sub-controller and a control barrier function sub-controller; through the combination of the control barrier function and the near-end strategy optimization method, the problem of low safety of a reinforcement learning controller is solved, and the stability of the system is improved. 本发明公开了一种基于控制障碍函数的安全强化学习四旋翼控制系统,包括仿真平台和控制器,所述接收仿真模型输出的状态量,向无人机或仿真模型输出控制指令,所述控制器包括强化学习子控制器和控制障碍函数子控制器,通过控制障碍函数与近端策略优化法结合的方式,解决了强化学习类的控制器安全性低的问题,提高了系统的稳定性。</abstract><oa>free_for_read</oa></addata></record>
fulltext fulltext_linktorsrc
identifier
ispartof
issn
language chi ; eng
recordid cdi_epo_espacenet_CN114326438A
source esp@cenet
subjects CONTROL OR REGULATING SYSTEMS IN GENERAL
CONTROLLING
FUNCTIONAL ELEMENTS OF SUCH SYSTEMS
MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS ORELEMENTS
PHYSICS
REGULATING
title Safety reinforcement learning quadrotor control system and method based on control barrier function
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-08T23%3A52%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=LIU%20MINGCHENG&rft.date=2022-04-12&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN114326438A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true