Stochastic optimal control of unknown linear networked control system in the presence of random delays and packet losses

In this paper, the stochastic optimal control of linear networked control system (NCS) with uncertain system dynamics and in the presence of network imperfections such as random delays and packet losses is derived. The proposed stochastic optimal control method uses an adaptive estimator (AE) and id...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Automatica (Oxford) 2012-06, Vol.48 (6), p.1017-1030
Hauptverfasser:	Xu, Hao, Jagannathan, S., Lewis, F.L.
Format:	Artikel
Sprache:	eng
Schlagworte:	Adaptive estimator Applied sciences Asymptotic properties Computer science control theory systems Computer systems and distributed systems. User interface Control Control systems Control theory. Systems Delay Exact sciences and technology Modelling and identification Networked control system (NCS) Optimal control Optimization Packets (communication) Q-function Software Stochasticity
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

container_end_page	1030
container_issue	6
container_start_page	1017
container_title	Automatica (Oxford)
container_volume	48
creator	Xu, Hao Jagannathan, S. Lewis, F.L.
description	In this paper, the stochastic optimal control of linear networked control system (NCS) with uncertain system dynamics and in the presence of network imperfections such as random delays and packet losses is derived. The proposed stochastic optimal control method uses an adaptive estimator (AE) and ideas from Q-learning to solve the infinite horizon optimal regulation of unknown NCS with time-varying system matrices. Next, a stochastic suboptimal control scheme which uses AE and Q-learning is introduced for the regulation of unknown linear time-invariant NCS that is derived using certainty equivalence property. Update laws for online tuning the unknown parameters of the AE to obtain the Q-function are derived. Lyapunov theory is used to show that all signals are asymptotically stable (AS) and that the estimated control signals converge to optimal or suboptimal control inputs. Simulation results are included to show the effectiveness of the proposed schemes. The result is an optimal control scheme that operates forward-in-time manner for unknown linear systems in contrast with standard Riccati equation-based schemes which function backward-in-time.
doi_str_mv	10.1016/j.automatica.2012.03.007
format	Article
fullrecord	<record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1022850673</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0005109812001033</els_id><sourcerecordid>1022850673</sourcerecordid><originalsourceid>FETCH-LOGICAL-c447t-fa0800090e0392e89016a3feeadd4973c56118fde7acb78599588af40b0cdf0f3</originalsourceid><addsrcrecordid>eNqFkMFu1DAQhq0KpC6l7-ALEpeEcZxsnCNUUJAqcQDO1tQZq9517GB7Kfv29WqrcuRkjfT9nvk_xriAVoDYfti1eChxweIMth2IrgXZAowXbCPUKJtOye0rtgGAoREwqUv2JuddHXuhug37-6NE84C5xnlci1vQcxNDSdHzaPkh7EN8DNy7QJh4oPIY057mFyYfc6GFu8DLA_E1UaZg6BRNGOa48Jk8HjOvA1_R7KlwH3Om_Ja9tugzXT-_V-zXl88_b742d99vv918vGtM34-lsQiqHjsBgZw6UlPtjNIS4Tz30yjNsBVC2ZlGNPejGqZpUAptD_dgZgtWXrH353_XFH8fKBe9uGzIewwUD1kL6Do1wHaUFVVn1KR6YiKr11SFpGOF9Em23ul_svVJtgapq-waffe8BbNBb2t54_JLvhvUWNdMlft05qhW_uMo6WzcydjsEpmi5-j-v-wJmlGdTw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1022850673</pqid></control><display><type>article</type><title>Stochastic optimal control of unknown linear networked control system in the presence of random delays and packet losses</title><source>Elsevier ScienceDirect Journals</source><creator>Xu, Hao ; Jagannathan, S. ; Lewis, F.L.</creator><creatorcontrib>Xu, Hao ; Jagannathan, S. ; Lewis, F.L.</creatorcontrib><description>In this paper, the stochastic optimal control of linear networked control system (NCS) with uncertain system dynamics and in the presence of network imperfections such as random delays and packet losses is derived. The proposed stochastic optimal control method uses an adaptive estimator (AE) and ideas from Q-learning to solve the infinite horizon optimal regulation of unknown NCS with time-varying system matrices. Next, a stochastic suboptimal control scheme which uses AE and Q-learning is introduced for the regulation of unknown linear time-invariant NCS that is derived using certainty equivalence property. Update laws for online tuning the unknown parameters of the AE to obtain the Q-function are derived. Lyapunov theory is used to show that all signals are asymptotically stable (AS) and that the estimated control signals converge to optimal or suboptimal control inputs. Simulation results are included to show the effectiveness of the proposed schemes. The result is an optimal control scheme that operates forward-in-time manner for unknown linear systems in contrast with standard Riccati equation-based schemes which function backward-in-time.</description><identifier>ISSN: 0005-1098</identifier><identifier>EISSN: 1873-2836</identifier><identifier>DOI: 10.1016/j.automatica.2012.03.007</identifier><identifier>CODEN: ATCAA9</identifier><language>eng</language><publisher>Kidlington: Elsevier Ltd</publisher><subject>Adaptive estimator ; Applied sciences ; Asymptotic properties ; Computer science; control theory; systems ; Computer systems and distributed systems. User interface ; Control ; Control systems ; Control theory. Systems ; Delay ; Exact sciences and technology ; Modelling and identification ; Networked control system (NCS) ; Optimal control ; Optimization ; Packets (communication) ; Q-function ; Software ; Stochasticity</subject><ispartof>Automatica (Oxford), 2012-06, Vol.48 (6), p.1017-1030</ispartof><rights>2012 Elsevier Ltd</rights><rights>2015 INIST-CNRS</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c447t-fa0800090e0392e89016a3feeadd4973c56118fde7acb78599588af40b0cdf0f3</citedby><cites>FETCH-LOGICAL-c447t-fa0800090e0392e89016a3feeadd4973c56118fde7acb78599588af40b0cdf0f3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0005109812001033$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65306</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=25872289$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Xu, Hao</creatorcontrib><creatorcontrib>Jagannathan, S.</creatorcontrib><creatorcontrib>Lewis, F.L.</creatorcontrib><title>Stochastic optimal control of unknown linear networked control system in the presence of random delays and packet losses</title><title>Automatica (Oxford)</title><description>In this paper, the stochastic optimal control of linear networked control system (NCS) with uncertain system dynamics and in the presence of network imperfections such as random delays and packet losses is derived. The proposed stochastic optimal control method uses an adaptive estimator (AE) and ideas from Q-learning to solve the infinite horizon optimal regulation of unknown NCS with time-varying system matrices. Next, a stochastic suboptimal control scheme which uses AE and Q-learning is introduced for the regulation of unknown linear time-invariant NCS that is derived using certainty equivalence property. Update laws for online tuning the unknown parameters of the AE to obtain the Q-function are derived. Lyapunov theory is used to show that all signals are asymptotically stable (AS) and that the estimated control signals converge to optimal or suboptimal control inputs. Simulation results are included to show the effectiveness of the proposed schemes. The result is an optimal control scheme that operates forward-in-time manner for unknown linear systems in contrast with standard Riccati equation-based schemes which function backward-in-time.</description><subject>Adaptive estimator</subject><subject>Applied sciences</subject><subject>Asymptotic properties</subject><subject>Computer science; control theory; systems</subject><subject>Computer systems and distributed systems. User interface</subject><subject>Control</subject><subject>Control systems</subject><subject>Control theory. Systems</subject><subject>Delay</subject><subject>Exact sciences and technology</subject><subject>Modelling and identification</subject><subject>Networked control system (NCS)</subject><subject>Optimal control</subject><subject>Optimization</subject><subject>Packets (communication)</subject><subject>Q-function</subject><subject>Software</subject><subject>Stochasticity</subject><issn>0005-1098</issn><issn>1873-2836</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2012</creationdate><recordtype>article</recordtype><recordid>eNqFkMFu1DAQhq0KpC6l7-ALEpeEcZxsnCNUUJAqcQDO1tQZq9517GB7Kfv29WqrcuRkjfT9nvk_xriAVoDYfti1eChxweIMth2IrgXZAowXbCPUKJtOye0rtgGAoREwqUv2JuddHXuhug37-6NE84C5xnlci1vQcxNDSdHzaPkh7EN8DNy7QJh4oPIY057mFyYfc6GFu8DLA_E1UaZg6BRNGOa48Jk8HjOvA1_R7KlwH3Om_Ja9tugzXT-_V-zXl88_b742d99vv918vGtM34-lsQiqHjsBgZw6UlPtjNIS4Tz30yjNsBVC2ZlGNPejGqZpUAptD_dgZgtWXrH353_XFH8fKBe9uGzIewwUD1kL6Do1wHaUFVVn1KR6YiKr11SFpGOF9Em23ul_svVJtgapq-waffe8BbNBb2t54_JLvhvUWNdMlft05qhW_uMo6WzcydjsEpmi5-j-v-wJmlGdTw</recordid><startdate>20120601</startdate><enddate>20120601</enddate><creator>Xu, Hao</creator><creator>Jagannathan, S.</creator><creator>Lewis, F.L.</creator><general>Elsevier Ltd</general><general>Elsevier</general><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20120601</creationdate><title>Stochastic optimal control of unknown linear networked control system in the presence of random delays and packet losses</title><author>Xu, Hao ; Jagannathan, S. ; Lewis, F.L.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c447t-fa0800090e0392e89016a3feeadd4973c56118fde7acb78599588af40b0cdf0f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2012</creationdate><topic>Adaptive estimator</topic><topic>Applied sciences</topic><topic>Asymptotic properties</topic><topic>Computer science; control theory; systems</topic><topic>Computer systems and distributed systems. User interface</topic><topic>Control</topic><topic>Control systems</topic><topic>Control theory. Systems</topic><topic>Delay</topic><topic>Exact sciences and technology</topic><topic>Modelling and identification</topic><topic>Networked control system (NCS)</topic><topic>Optimal control</topic><topic>Optimization</topic><topic>Packets (communication)</topic><topic>Q-function</topic><topic>Software</topic><topic>Stochasticity</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Xu, Hao</creatorcontrib><creatorcontrib>Jagannathan, S.</creatorcontrib><creatorcontrib>Lewis, F.L.</creatorcontrib><collection>Pascal-Francis</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Automatica (Oxford)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Xu, Hao</au><au>Jagannathan, S.</au><au>Lewis, F.L.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Stochastic optimal control of unknown linear networked control system in the presence of random delays and packet losses</atitle><jtitle>Automatica (Oxford)</jtitle><date>2012-06-01</date><risdate>2012</risdate><volume>48</volume><issue>6</issue><spage>1017</spage><epage>1030</epage><pages>1017-1030</pages><issn>0005-1098</issn><eissn>1873-2836</eissn><coden>ATCAA9</coden><abstract>In this paper, the stochastic optimal control of linear networked control system (NCS) with uncertain system dynamics and in the presence of network imperfections such as random delays and packet losses is derived. The proposed stochastic optimal control method uses an adaptive estimator (AE) and ideas from Q-learning to solve the infinite horizon optimal regulation of unknown NCS with time-varying system matrices. Next, a stochastic suboptimal control scheme which uses AE and Q-learning is introduced for the regulation of unknown linear time-invariant NCS that is derived using certainty equivalence property. Update laws for online tuning the unknown parameters of the AE to obtain the Q-function are derived. Lyapunov theory is used to show that all signals are asymptotically stable (AS) and that the estimated control signals converge to optimal or suboptimal control inputs. Simulation results are included to show the effectiveness of the proposed schemes. The result is an optimal control scheme that operates forward-in-time manner for unknown linear systems in contrast with standard Riccati equation-based schemes which function backward-in-time.</abstract><cop>Kidlington</cop><pub>Elsevier Ltd</pub><doi>10.1016/j.automatica.2012.03.007</doi><tpages>14</tpages></addata></record>
fulltext	fulltext
identifier	ISSN: 0005-1098
ispartof	Automatica (Oxford), 2012-06, Vol.48 (6), p.1017-1030
issn	0005-1098 1873-2836
language	eng
recordid	cdi_proquest_miscellaneous_1022850673
source	Elsevier ScienceDirect Journals
subjects	Adaptive estimator Applied sciences Asymptotic properties Computer science control theory systems Computer systems and distributed systems. User interface Control Control systems Control theory. Systems Delay Exact sciences and technology Modelling and identification Networked control system (NCS) Optimal control Optimization Packets (communication) Q-function Software Stochasticity
title	Stochastic optimal control of unknown linear networked control system in the presence of random delays and packet losses
url	https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-03T11%3A36%3A57IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Stochastic%20optimal%20control%20of%20unknown%20linear%20networked%20control%20system%20in%20the%20presence%20of%20random%20delays%20and%20packet%20losses&rft.jtitle=Automatica%20(Oxford)&rft.au=Xu,%20Hao&rft.date=2012-06-01&rft.volume=48&rft.issue=6&rft.spage=1017&rft.epage=1030&rft.pages=1017-1030&rft.issn=0005-1098&rft.eissn=1873-2836&rft.coden=ATCAA9&rft_id=info:doi/10.1016/j.automatica.2012.03.007&rft_dat=%3Cproquest_cross%3E1022850673%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1022850673&rft_id=info:pmid/&rft_els_id=S0005109812001033&rfr_iscdi=true