Safety reinforcement learning quadrotor control system and method based on control barrier function

The invention discloses a safety reinforcement learning four-rotor control system based on a control barrier function, which comprises a simulation platform and a controller, and is characterized in that the controller is used for receiving a state quantity output by a simulation model and outputtin...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	LIU MINGCHENG, ZHOU TIANZE, WANG YAKAI, SUN ZHIWEN, WANG ZHAOSHUN, LIN DEFU, ZHANG FUBIAO, MO LI, CHEN QI, SONG TAO, LANG SHUAIPENG
Format:	Patent
Sprache:	chi ; eng
Schlagworte:	CONTROL OR REGULATING SYSTEMS IN GENERAL CONTROLLING FUNCTIONAL ELEMENTS OF SUCH SYSTEMS MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS ORELEMENTS PHYSICS REGULATING
Online-Zugang:	Volltext bestellen
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page
container_issue
container_start_page
container_title
container_volume
creator	LIU MINGCHENG ZHOU TIANZE WANG YAKAI SUN ZHIWEN WANG ZHAOSHUN LIN DEFU ZHANG FUBIAO MO LI CHEN QI SONG TAO LANG SHUAIPENG
description	The invention discloses a safety reinforcement learning four-rotor control system based on a control barrier function, which comprises a simulation platform and a controller, and is characterized in that the controller is used for receiving a state quantity output by a simulation model and outputting a control instruction to an unmanned aerial vehicle or the simulation model, and the controller comprises a reinforcement learning sub-controller and a control barrier function sub-controller; through the combination of the control barrier function and the near-end strategy optimization method, the problem of low safety of a reinforcement learning controller is solved, and the stability of the system is improved. 本发明公开了一种基于控制障碍函数的安全强化学习四旋翼控制系统，包括仿真平台和控制器，所述接收仿真模型输出的状态量，向无人机或仿真模型输出控制指令，所述控制器包括强化学习子控制器和控制障碍函数子控制器，通过控制障碍函数与近端策略优化法结合的方式，解决了强化学习类的控制器安全性低的问题，提高了系统的稳定性。
format	Patent
fullrecord	<record><control><sourceid>epo_EVB</sourceid><recordid>TN_cdi_epo_espacenet_CN114326438A</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>CN114326438A</sourcerecordid><originalsourceid>FETCH-epo_espacenet_CN114326438A3</originalsourceid><addsrcrecordid>eNqNyjEOwjAMAMAsDAj4g3kAQ0mFWFEFYmKBvXITByK1dnHcob9nQcxMt9zShTsmshmUMifRQAOxQU-onPkJ7wmjiolCEDaVHspcjAZAjjCQvSRCh4UiCP9Kh6qZFNLEwbLw2i0S9oU2X1duezk_muuORmmpjBiIydrmVlW13x9qfzz5f84HIeU_XA</addsrcrecordid><sourcetype>Open Access Repository</sourcetype><iscdi>true</iscdi><recordtype>patent</recordtype></control><display><type>patent</type><title>Safety reinforcement learning quadrotor control system and method based on control barrier function</title><source>esp@cenet</source><creator>LIU MINGCHENG ; ZHOU TIANZE ; WANG YAKAI ; SUN ZHIWEN ; WANG ZHAOSHUN ; LIN DEFU ; ZHANG FUBIAO ; MO LI ; CHEN QI ; SONG TAO ; LANG SHUAIPENG</creator><creatorcontrib>LIU MINGCHENG ; ZHOU TIANZE ; WANG YAKAI ; SUN ZHIWEN ; WANG ZHAOSHUN ; LIN DEFU ; ZHANG FUBIAO ; MO LI ; CHEN QI ; SONG TAO ; LANG SHUAIPENG</creatorcontrib><description>The invention discloses a safety reinforcement learning four-rotor control system based on a control barrier function, which comprises a simulation platform and a controller, and is characterized in that the controller is used for receiving a state quantity output by a simulation model and outputting a control instruction to an unmanned aerial vehicle or the simulation model, and the controller comprises a reinforcement learning sub-controller and a control barrier function sub-controller; through the combination of the control barrier function and the near-end strategy optimization method, the problem of low safety of a reinforcement learning controller is solved, and the stability of the system is improved. 本发明公开了一种基于控制障碍函数的安全强化学习四旋翼控制系统，包括仿真平台和控制器，所述接收仿真模型输出的状态量，向无人机或仿真模型输出控制指令，所述控制器包括强化学习子控制器和控制障碍函数子控制器，通过控制障碍函数与近端策略优化法结合的方式，解决了强化学习类的控制器安全性低的问题，提高了系统的稳定性。</description><language>chi ; eng</language><subject>CONTROL OR REGULATING SYSTEMS IN GENERAL ; CONTROLLING ; FUNCTIONAL ELEMENTS OF SUCH SYSTEMS ; MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS ORELEMENTS ; PHYSICS ; REGULATING</subject><creationdate>2022</creationdate><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220412&DB=EPODOC&CC=CN&NR=114326438A$$EHTML$$P50$$Gepo$$Hfree_for_read</linktohtml><link.rule.ids>230,308,776,881,25542,76289</link.rule.ids><linktorsrc>$$Uhttps://worldwide.espacenet.com/publicationDetails/biblio?FT=D&date=20220412&DB=EPODOC&CC=CN&NR=114326438A$$EView_record_in_European_Patent_Office$$FView_record_in_$$GEuropean_Patent_Office$$Hfree_for_read</linktorsrc></links><search><creatorcontrib>LIU MINGCHENG</creatorcontrib><creatorcontrib>ZHOU TIANZE</creatorcontrib><creatorcontrib>WANG YAKAI</creatorcontrib><creatorcontrib>SUN ZHIWEN</creatorcontrib><creatorcontrib>WANG ZHAOSHUN</creatorcontrib><creatorcontrib>LIN DEFU</creatorcontrib><creatorcontrib>ZHANG FUBIAO</creatorcontrib><creatorcontrib>MO LI</creatorcontrib><creatorcontrib>CHEN QI</creatorcontrib><creatorcontrib>SONG TAO</creatorcontrib><creatorcontrib>LANG SHUAIPENG</creatorcontrib><title>Safety reinforcement learning quadrotor control system and method based on control barrier function</title><description>The invention discloses a safety reinforcement learning four-rotor control system based on a control barrier function, which comprises a simulation platform and a controller, and is characterized in that the controller is used for receiving a state quantity output by a simulation model and outputting a control instruction to an unmanned aerial vehicle or the simulation model, and the controller comprises a reinforcement learning sub-controller and a control barrier function sub-controller; through the combination of the control barrier function and the near-end strategy optimization method, the problem of low safety of a reinforcement learning controller is solved, and the stability of the system is improved. 本发明公开了一种基于控制障碍函数的安全强化学习四旋翼控制系统，包括仿真平台和控制器，所述接收仿真模型输出的状态量，向无人机或仿真模型输出控制指令，所述控制器包括强化学习子控制器和控制障碍函数子控制器，通过控制障碍函数与近端策略优化法结合的方式，解决了强化学习类的控制器安全性低的问题，提高了系统的稳定性。</description><subject>CONTROL OR REGULATING SYSTEMS IN GENERAL</subject><subject>CONTROLLING</subject><subject>FUNCTIONAL ELEMENTS OF SUCH SYSTEMS</subject><subject>MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS ORELEMENTS</subject><subject>PHYSICS</subject><subject>REGULATING</subject><fulltext>true</fulltext><rsrctype>patent</rsrctype><creationdate>2022</creationdate><recordtype>patent</recordtype><sourceid>EVB</sourceid><recordid>eNqNyjEOwjAMAMAsDAj4g3kAQ0mFWFEFYmKBvXITByK1dnHcob9nQcxMt9zShTsmshmUMifRQAOxQU-onPkJ7wmjiolCEDaVHspcjAZAjjCQvSRCh4UiCP9Kh6qZFNLEwbLw2i0S9oU2X1duezk_muuORmmpjBiIydrmVlW13x9qfzz5f84HIeU_XA</recordid><startdate>20220412</startdate><enddate>20220412</enddate><creator>LIU MINGCHENG</creator><creator>ZHOU TIANZE</creator><creator>WANG YAKAI</creator><creator>SUN ZHIWEN</creator><creator>WANG ZHAOSHUN</creator><creator>LIN DEFU</creator><creator>ZHANG FUBIAO</creator><creator>MO LI</creator><creator>CHEN QI</creator><creator>SONG TAO</creator><creator>LANG SHUAIPENG</creator><scope>EVB</scope></search><sort><creationdate>20220412</creationdate><title>Safety reinforcement learning quadrotor control system and method based on control barrier function</title><author>LIU MINGCHENG ; ZHOU TIANZE ; WANG YAKAI ; SUN ZHIWEN ; WANG ZHAOSHUN ; LIN DEFU ; ZHANG FUBIAO ; MO LI ; CHEN QI ; SONG TAO ; LANG SHUAIPENG</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-epo_espacenet_CN114326438A3</frbrgroupid><rsrctype>patents</rsrctype><prefilter>patents</prefilter><language>chi ; eng</language><creationdate>2022</creationdate><topic>CONTROL OR REGULATING SYSTEMS IN GENERAL</topic><topic>CONTROLLING</topic><topic>FUNCTIONAL ELEMENTS OF SUCH SYSTEMS</topic><topic>MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS ORELEMENTS</topic><topic>PHYSICS</topic><topic>REGULATING</topic><toplevel>online_resources</toplevel><creatorcontrib>LIU MINGCHENG</creatorcontrib><creatorcontrib>ZHOU TIANZE</creatorcontrib><creatorcontrib>WANG YAKAI</creatorcontrib><creatorcontrib>SUN ZHIWEN</creatorcontrib><creatorcontrib>WANG ZHAOSHUN</creatorcontrib><creatorcontrib>LIN DEFU</creatorcontrib><creatorcontrib>ZHANG FUBIAO</creatorcontrib><creatorcontrib>MO LI</creatorcontrib><creatorcontrib>CHEN QI</creatorcontrib><creatorcontrib>SONG TAO</creatorcontrib><creatorcontrib>LANG SHUAIPENG</creatorcontrib><collection>esp@cenet</collection></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext_linktorsrc</fulltext></delivery><addata><au>LIU MINGCHENG</au><au>ZHOU TIANZE</au><au>WANG YAKAI</au><au>SUN ZHIWEN</au><au>WANG ZHAOSHUN</au><au>LIN DEFU</au><au>ZHANG FUBIAO</au><au>MO LI</au><au>CHEN QI</au><au>SONG TAO</au><au>LANG SHUAIPENG</au><format>patent</format><genre>patent</genre><ristype>GEN</ristype><title>Safety reinforcement learning quadrotor control system and method based on control barrier function</title><date>2022-04-12</date><risdate>2022</risdate><abstract>The invention discloses a safety reinforcement learning four-rotor control system based on a control barrier function, which comprises a simulation platform and a controller, and is characterized in that the controller is used for receiving a state quantity output by a simulation model and outputting a control instruction to an unmanned aerial vehicle or the simulation model, and the controller comprises a reinforcement learning sub-controller and a control barrier function sub-controller; through the combination of the control barrier function and the near-end strategy optimization method, the problem of low safety of a reinforcement learning controller is solved, and the stability of the system is improved. 本发明公开了一种基于控制障碍函数的安全强化学习四旋翼控制系统，包括仿真平台和控制器，所述接收仿真模型输出的状态量，向无人机或仿真模型输出控制指令，所述控制器包括强化学习子控制器和控制障碍函数子控制器，通过控制障碍函数与近端策略优化法结合的方式，解决了强化学习类的控制器安全性低的问题，提高了系统的稳定性。</abstract><oa>free_for_read</oa></addata></record>
fulltext	fulltext_linktorsrc
identifier
ispartof
issn
language	chi ; eng
recordid	cdi_epo_espacenet_CN114326438A
source	esp@cenet
subjects	CONTROL OR REGULATING SYSTEMS IN GENERAL CONTROLLING FUNCTIONAL ELEMENTS OF SUCH SYSTEMS MONITORING OR TESTING ARRANGEMENTS FOR SUCH SYSTEMS ORELEMENTS PHYSICS REGULATING
title	Safety reinforcement learning quadrotor control system and method based on control barrier function
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-08T23%3A52%3A58IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-epo_EVB&rft_val_fmt=info:ofi/fmt:kev:mtx:patent&rft.genre=patent&rft.au=LIU%20MINGCHENG&rft.date=2022-04-12&rft_id=info:doi/&rft_dat=%3Cepo_EVB%3ECN114326438A%3C/epo_EVB%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_id=info:pmid/&rfr_iscdi=true