Adaptive dynamic programming for optimal control of discrete‐time nonlinear system with state constraints based on control barrier function

Adaptive dynamic programming (ADP) methods have demonstrated their efficiency. However, many of the applications for which ADP offers great potential, are also safety‐critical and need to meet safety specifications in the presence of physical constraints. In this article, an optimal controller for s...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	International journal of robust and nonlinear control 2022-04, Vol.32 (6), p.3408-3424
Hauptverfasser:	Xu, Jiahui, Wang, Jingcheng, Rao, Jun, Zhong, Yanjiu, Wang, Hongyuan
Format:	Artikel
Sprache:	eng
Schlagworte:	Adaptive algorithms Adaptive control adaptive dynamic programming control barrier function Dynamic programming Neural networks Nonlinear systems Optimal control Safety state constraints
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	3424
container_issue	6
container_start_page	3408
container_title	International journal of robust and nonlinear control
container_volume	32
creator	Xu, Jiahui Wang, Jingcheng Rao, Jun Zhong, Yanjiu Wang, Hongyuan
description	Adaptive dynamic programming (ADP) methods have demonstrated their efficiency. However, many of the applications for which ADP offers great potential, are also safety‐critical and need to meet safety specifications in the presence of physical constraints. In this article, an optimal controller for solving discrete‐time nonlinear systems with state constraints is proposed. By introducing the control barrier function into the utility function, the problem with state constraints is transformed into an unconstrained optimal control problem, addressing state constraints which are difficult to handle by traditional ADP methods. The constructed sequence of value function is shown to be monotonically non‐increasing and converges to the optimal value. Besides, this article gives the stability proof of the developed algorithm, as well as the conditions for satisfying the state constraints. To implement and approximate the control barrier function based adaptive dynamic programming algorithm, an actor‐critic network structure is built. During the training process, two neural networks are used for approximation separately. The performance of the proposed method is validated by testing it on a simulation example.
doi_str_mv	10.1002/rnc.5955
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2640244159</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2640244159</sourcerecordid><originalsourceid>FETCH-LOGICAL-c2935-705b93c54fd2411032ae7d4ad0ef3c9cb04106dbecfb0a900dbbbac03e77bd533</originalsourceid><addsrcrecordid>eNp1kEtKBDEQhhtR8AkeIeDGTY-VRztmKYMvEAXRdZNHtUa6kzHJKLPzBOIRPItH8SRmHHHnqgr-r6qor6p2KYwoADuI3owa2TQr1QYFKWvKuFxd9ELWR5Lx9WozpUeAkjGxUb0dWzXN7hmJnXs1OEOmMdxHNQzO35MuRBJKPKiemOBzDD0JHbEumYgZv17fS4bEB987jyqSNE8ZB_Li8gNJWWVcjKUclfM5Ea0S2s-P4P-WaRWjw0i6mTfZBb9drXWqT7jzW7equ9OT28l5fXl9djE5vqwNk7ypx9BoyU0jOssEpcCZwrEVygJ23EijQVA4tBpNp0FJAKu1VgY4jsfaNpxvVXvLveXbpxmm3D6GWfTlZMsOBTAhaCMLtb-kTAwpRezaaSwu4ryl0C5st8V2u7Bd0HqJvrge5_9y7c3V5If_BhdhhzA</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2640244159</pqid></control><display><type>article</type><title>Adaptive dynamic programming for optimal control of discrete‐time nonlinear system with state constraints based on control barrier function</title><source>Wiley Online Library Journals Frontfile Complete</source><creator>Xu, Jiahui ; Wang, Jingcheng ; Rao, Jun ; Zhong, Yanjiu ; Wang, Hongyuan</creator><creatorcontrib>Xu, Jiahui ; Wang, Jingcheng ; Rao, Jun ; Zhong, Yanjiu ; Wang, Hongyuan</creatorcontrib><description>Adaptive dynamic programming (ADP) methods have demonstrated their efficiency. However, many of the applications for which ADP offers great potential, are also safety‐critical and need to meet safety specifications in the presence of physical constraints. In this article, an optimal controller for solving discrete‐time nonlinear systems with state constraints is proposed. By introducing the control barrier function into the utility function, the problem with state constraints is transformed into an unconstrained optimal control problem, addressing state constraints which are difficult to handle by traditional ADP methods. The constructed sequence of value function is shown to be monotonically non‐increasing and converges to the optimal value. Besides, this article gives the stability proof of the developed algorithm, as well as the conditions for satisfying the state constraints. To implement and approximate the control barrier function based adaptive dynamic programming algorithm, an actor‐critic network structure is built. During the training process, two neural networks are used for approximation separately. The performance of the proposed method is validated by testing it on a simulation example.</description><identifier>ISSN: 1049-8923</identifier><identifier>EISSN: 1099-1239</identifier><identifier>DOI: 10.1002/rnc.5955</identifier><language>eng</language><publisher>Bognor Regis: Wiley Subscription Services, Inc</publisher><subject>Adaptive algorithms ; Adaptive control ; adaptive dynamic programming ; control barrier function ; Dynamic programming ; Neural networks ; Nonlinear systems ; Optimal control ; Safety ; state constraints</subject><ispartof>International journal of robust and nonlinear control, 2022-04, Vol.32 (6), p.3408-3424</ispartof><rights>2021 John Wiley & Sons Ltd.</rights><rights>2022 John Wiley & Sons, Ltd.</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c2935-705b93c54fd2411032ae7d4ad0ef3c9cb04106dbecfb0a900dbbbac03e77bd533</citedby><cites>FETCH-LOGICAL-c2935-705b93c54fd2411032ae7d4ad0ef3c9cb04106dbecfb0a900dbbbac03e77bd533</cites><orcidid>0000-0002-4058-0447 ; 0000-0002-1689-0146</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktopdf>$$Uhttps://onlinelibrary.wiley.com/doi/pdf/10.1002%2Frnc.5955$$EPDF$$P50$$Gwiley$$H</linktopdf><linktohtml>$$Uhttps://onlinelibrary.wiley.com/doi/full/10.1002%2Frnc.5955$$EHTML$$P50$$Gwiley$$H</linktohtml><link.rule.ids>314,776,780,1411,27901,27902,45550,45551</link.rule.ids></links><search><creatorcontrib>Xu, Jiahui</creatorcontrib><creatorcontrib>Wang, Jingcheng</creatorcontrib><creatorcontrib>Rao, Jun</creatorcontrib><creatorcontrib>Zhong, Yanjiu</creatorcontrib><creatorcontrib>Wang, Hongyuan</creatorcontrib><title>Adaptive dynamic programming for optimal control of discrete‐time nonlinear system with state constraints based on control barrier function</title><title>International journal of robust and nonlinear control</title><description>Adaptive dynamic programming (ADP) methods have demonstrated their efficiency. However, many of the applications for which ADP offers great potential, are also safety‐critical and need to meet safety specifications in the presence of physical constraints. In this article, an optimal controller for solving discrete‐time nonlinear systems with state constraints is proposed. By introducing the control barrier function into the utility function, the problem with state constraints is transformed into an unconstrained optimal control problem, addressing state constraints which are difficult to handle by traditional ADP methods. The constructed sequence of value function is shown to be monotonically non‐increasing and converges to the optimal value. Besides, this article gives the stability proof of the developed algorithm, as well as the conditions for satisfying the state constraints. To implement and approximate the control barrier function based adaptive dynamic programming algorithm, an actor‐critic network structure is built. During the training process, two neural networks are used for approximation separately. The performance of the proposed method is validated by testing it on a simulation example.</description><subject>Adaptive algorithms</subject><subject>Adaptive control</subject><subject>adaptive dynamic programming</subject><subject>control barrier function</subject><subject>Dynamic programming</subject><subject>Neural networks</subject><subject>Nonlinear systems</subject><subject>Optimal control</subject><subject>Safety</subject><subject>state constraints</subject><issn>1049-8923</issn><issn>1099-1239</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><recordid>eNp1kEtKBDEQhhtR8AkeIeDGTY-VRztmKYMvEAXRdZNHtUa6kzHJKLPzBOIRPItH8SRmHHHnqgr-r6qor6p2KYwoADuI3owa2TQr1QYFKWvKuFxd9ELWR5Lx9WozpUeAkjGxUb0dWzXN7hmJnXs1OEOmMdxHNQzO35MuRBJKPKiemOBzDD0JHbEumYgZv17fS4bEB987jyqSNE8ZB_Li8gNJWWVcjKUclfM5Ea0S2s-P4P-WaRWjw0i6mTfZBb9drXWqT7jzW7equ9OT28l5fXl9djE5vqwNk7ypx9BoyU0jOssEpcCZwrEVygJ23EijQVA4tBpNp0FJAKu1VgY4jsfaNpxvVXvLveXbpxmm3D6GWfTlZMsOBTAhaCMLtb-kTAwpRezaaSwu4ryl0C5st8V2u7Bd0HqJvrge5_9y7c3V5If_BhdhhzA</recordid><startdate>202204</startdate><enddate>202204</enddate><creator>Xu, Jiahui</creator><creator>Wang, Jingcheng</creator><creator>Rao, Jun</creator><creator>Zhong, Yanjiu</creator><creator>Wang, Hongyuan</creator><general>Wiley Subscription Services, Inc</general><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>7TB</scope><scope>8FD</scope><scope>FR3</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope><orcidid>https://orcid.org/0000-0002-4058-0447</orcidid><orcidid>https://orcid.org/0000-0002-1689-0146</orcidid></search><sort><creationdate>202204</creationdate><title>Adaptive dynamic programming for optimal control of discrete‐time nonlinear system with state constraints based on control barrier function</title><author>Xu, Jiahui ; Wang, Jingcheng ; Rao, Jun ; Zhong, Yanjiu ; Wang, Hongyuan</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c2935-705b93c54fd2411032ae7d4ad0ef3c9cb04106dbecfb0a900dbbbac03e77bd533</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Adaptive algorithms</topic><topic>Adaptive control</topic><topic>adaptive dynamic programming</topic><topic>control barrier function</topic><topic>Dynamic programming</topic><topic>Neural networks</topic><topic>Nonlinear systems</topic><topic>Optimal control</topic><topic>Safety</topic><topic>state constraints</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Xu, Jiahui</creatorcontrib><creatorcontrib>Wang, Jingcheng</creatorcontrib><creatorcontrib>Rao, Jun</creatorcontrib><creatorcontrib>Zhong, Yanjiu</creatorcontrib><creatorcontrib>Wang, Hongyuan</creatorcontrib><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Mechanical & Transportation Engineering Abstracts</collection><collection>Technology Research Database</collection><collection>Engineering Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>International journal of robust and nonlinear control</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Xu, Jiahui</au><au>Wang, Jingcheng</au><au>Rao, Jun</au><au>Zhong, Yanjiu</au><au>Wang, Hongyuan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Adaptive dynamic programming for optimal control of discrete‐time nonlinear system with state constraints based on control barrier function</atitle><jtitle>International journal of robust and nonlinear control</jtitle><date>2022-04</date><risdate>2022</risdate><volume>32</volume><issue>6</issue><spage>3408</spage><epage>3424</epage><pages>3408-3424</pages><issn>1049-8923</issn><eissn>1099-1239</eissn><abstract>Adaptive dynamic programming (ADP) methods have demonstrated their efficiency. However, many of the applications for which ADP offers great potential, are also safety‐critical and need to meet safety specifications in the presence of physical constraints. In this article, an optimal controller for solving discrete‐time nonlinear systems with state constraints is proposed. By introducing the control barrier function into the utility function, the problem with state constraints is transformed into an unconstrained optimal control problem, addressing state constraints which are difficult to handle by traditional ADP methods. The constructed sequence of value function is shown to be monotonically non‐increasing and converges to the optimal value. Besides, this article gives the stability proof of the developed algorithm, as well as the conditions for satisfying the state constraints. To implement and approximate the control barrier function based adaptive dynamic programming algorithm, an actor‐critic network structure is built. During the training process, two neural networks are used for approximation separately. The performance of the proposed method is validated by testing it on a simulation example.</abstract><cop>Bognor Regis</cop><pub>Wiley Subscription Services, Inc</pub><doi>10.1002/rnc.5955</doi><tpages>17</tpages><orcidid>https://orcid.org/0000-0002-4058-0447</orcidid><orcidid>https://orcid.org/0000-0002-1689-0146</orcidid></addata></record>
fulltext	fulltext
identifier	ISSN: 1049-8923
ispartof	International journal of robust and nonlinear control, 2022-04, Vol.32 (6), p.3408-3424
issn	1049-8923 1099-1239
language	eng
recordid	cdi_proquest_journals_2640244159
source	Wiley Online Library Journals Frontfile Complete
subjects	Adaptive algorithms Adaptive control adaptive dynamic programming control barrier function Dynamic programming Neural networks Nonlinear systems Optimal control Safety state constraints
title	Adaptive dynamic programming for optimal control of discrete‐time nonlinear system with state constraints based on control barrier function
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-08T12%3A05%3A43IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Adaptive%20dynamic%20programming%20for%20optimal%20control%20of%20discrete%E2%80%90time%20nonlinear%20system%20with%20state%20constraints%20based%C2%A0on%20control%20barrier%20function&rft.jtitle=International%20journal%20of%20robust%20and%20nonlinear%20control&rft.au=Xu,%20Jiahui&rft.date=2022-04&rft.volume=32&rft.issue=6&rft.spage=3408&rft.epage=3424&rft.pages=3408-3424&rft.issn=1049-8923&rft.eissn=1099-1239&rft_id=info:doi/10.1002/rnc.5955&rft_dat=%3Cproquest_cross%3E2640244159%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2640244159&rft_id=info:pmid/&rfr_iscdi=true