Stochastic optimal control of unknown linear networked control system in the presence of random delays and packet losses

In this paper, the stochastic optimal control of linear networked control system (NCS) with uncertain system dynamics and in the presence of network imperfections such as random delays and packet losses is derived. The proposed stochastic optimal control method uses an adaptive estimator (AE) and id...
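
The abstract refers to a Q-function for infinite horizon optimal regulation and contrasts the approach with Riccati equation-based schemes. As a rough illustration only, the sketch below iterates on the Q-function kernel H of a standard discrete-time linear quadratic regulator and reads the feedback gain off its blocks. It is not the paper's method: it assumes a known, time-invariant, delay-free model with no packet losses, and the matrices A, B, Qx, Ru and the function name q_kernel_value_iteration are hypothetical choices made for this example.

```python
import numpy as np


def q_kernel_value_iteration(A, B, Qx, Ru, iters=500):
    """Iterate on the kernel H of Q(x, u) = [x; u]^T H [x; u].

    Assumption: A and B are known here purely for illustration; the
    abstract concerns the case where they are unknown.
    """
    n = A.shape[0]
    P = np.zeros((n, n))  # value-function kernel, V(x) = x^T P x
    for _ in range(iters):
        # Q-function kernel built from the current value-function kernel.
        H = np.block([
            [Qx + A.T @ P @ A, A.T @ P @ B],
            [B.T @ P @ A, Ru + B.T @ P @ B],
        ])
        Hxx, Hxu = H[:n, :n], H[:n, n:]
        Hux, Huu = H[n:, :n], H[n:, n:]
        # Minimising Q(x, u) over u gives u = -K x with K = Huu^{-1} Hux.
        K = np.linalg.solve(Huu, Hux)
        # Value-function kernel implied by the minimising input.
        P = Hxx - Hxu @ K
    return H, K, P


# Hypothetical example system: a sampled double integrator.
A = np.array([[1.0, 0.1], [0.0, 1.0]])
B = np.array([[0.0], [0.1]])
Qx = np.eye(2)           # state weighting (assumed)
Ru = np.array([[1.0]])   # input weighting (assumed)

H, K, P = q_kernel_value_iteration(A, B, Qx, Ru)
print("feedback gain K =", K)
```

In the setting the abstract describes, the system matrices are unknown, so the kernel would have to be estimated from measured data rather than computed from A and B as it is here.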

Bibliographic details
Published in: Automatica (Oxford) 2012-06, Vol.48 (6), p.1017-1030
Main authors: Xu, Hao; Jagannathan, S.; Lewis, F.L.
Format: Article
Language: eng
Online access: Full text
container_end_page 1030
container_issue 6
container_start_page 1017
container_title Automatica (Oxford)
container_volume 48
creator Xu, Hao
Jagannathan, S.
Lewis, F.L.
description In this paper, the stochastic optimal control of linear networked control system (NCS) with uncertain system dynamics and in the presence of network imperfections such as random delays and packet losses is derived. The proposed stochastic optimal control method uses an adaptive estimator (AE) and ideas from Q-learning to solve the infinite horizon optimal regulation of unknown NCS with time-varying system matrices. Next, a stochastic suboptimal control scheme which uses AE and Q-learning is introduced for the regulation of unknown linear time-invariant NCS that is derived using certainty equivalence property. Update laws for online tuning the unknown parameters of the AE to obtain the Q-function are derived. Lyapunov theory is used to show that all signals are asymptotically stable (AS) and that the estimated control signals converge to optimal or suboptimal control inputs. Simulation results are included to show the effectiveness of the proposed schemes. The result is an optimal control scheme that operates forward-in-time manner for unknown linear systems in contrast with standard Riccati equation-based schemes which function backward-in-time.
doi_str_mv 10.1016/j.automatica.2012.03.007
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1022850673</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0005109812001033</els_id><sourcerecordid>1022850673</sourcerecordid><originalsourceid>FETCH-LOGICAL-c447t-fa0800090e0392e89016a3feeadd4973c56118fde7acb78599588af40b0cdf0f3</originalsourceid><addsrcrecordid>eNqFkMFu1DAQhq0KpC6l7-ALEpeEcZxsnCNUUJAqcQDO1tQZq9517GB7Kfv29WqrcuRkjfT9nvk_xriAVoDYfti1eChxweIMth2IrgXZAowXbCPUKJtOye0rtgGAoREwqUv2JuddHXuhug37-6NE84C5xnlci1vQcxNDSdHzaPkh7EN8DNy7QJh4oPIY057mFyYfc6GFu8DLA_E1UaZg6BRNGOa48Jk8HjOvA1_R7KlwH3Om_Ja9tugzXT-_V-zXl88_b742d99vv918vGtM34-lsQiqHjsBgZw6UlPtjNIS4Tz30yjNsBVC2ZlGNPejGqZpUAptD_dgZgtWXrH353_XFH8fKBe9uGzIewwUD1kL6Do1wHaUFVVn1KR6YiKr11SFpGOF9Em23ul_svVJtgapq-waffe8BbNBb2t54_JLvhvUWNdMlft05qhW_uMo6WzcydjsEpmi5-j-v-wJmlGdTw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1022850673</pqid></control><display><type>article</type><title>Stochastic optimal control of unknown linear networked control system in the presence of random delays and packet losses</title><source>Elsevier ScienceDirect Journals</source><creator>Xu, Hao ; Jagannathan, S. ; Lewis, F.L.</creator><creatorcontrib>Xu, Hao ; Jagannathan, S. ; Lewis, F.L.</creatorcontrib><description>In this paper, the stochastic optimal control of linear networked control system (NCS) with uncertain system dynamics and in the presence of network imperfections such as random delays and packet losses is derived. The proposed stochastic optimal control method uses an adaptive estimator (AE) and ideas from Q-learning to solve the infinite horizon optimal regulation of unknown NCS with time-varying system matrices. Next, a stochastic suboptimal control scheme which uses AE and Q-learning is introduced for the regulation of unknown linear time-invariant NCS that is derived using certainty equivalence property. 
Update laws for online tuning the unknown parameters of the AE to obtain the Q-function are derived. Lyapunov theory is used to show that all signals are asymptotically stable (AS) and that the estimated control signals converge to optimal or suboptimal control inputs. Simulation results are included to show the effectiveness of the proposed schemes. The result is an optimal control scheme that operates forward-in-time manner for unknown linear systems in contrast with standard Riccati equation-based schemes which function backward-in-time.</description><identifier>ISSN: 0005-1098</identifier><identifier>EISSN: 1873-2836</identifier><identifier>DOI: 10.1016/j.automatica.2012.03.007</identifier><identifier>CODEN: ATCAA9</identifier><language>eng</language><publisher>Kidlington: Elsevier Ltd</publisher><subject>Adaptive estimator ; Applied sciences ; Asymptotic properties ; Computer science; control theory; systems ; Computer systems and distributed systems. User interface ; Control ; Control systems ; Control theory. 
Systems ; Delay ; Exact sciences and technology ; Modelling and identification ; Networked control system (NCS) ; Optimal control ; Optimization ; Packets (communication) ; Q-function ; Software ; Stochasticity</subject><ispartof>Automatica (Oxford), 2012-06, Vol.48 (6), p.1017-1030</ispartof><rights>2012 Elsevier Ltd</rights><rights>2015 INIST-CNRS</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c447t-fa0800090e0392e89016a3feeadd4973c56118fde7acb78599588af40b0cdf0f3</citedby><cites>FETCH-LOGICAL-c447t-fa0800090e0392e89016a3feeadd4973c56118fde7acb78599588af40b0cdf0f3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0005109812001033$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65306</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&amp;idt=25872289$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Xu, Hao</creatorcontrib><creatorcontrib>Jagannathan, S.</creatorcontrib><creatorcontrib>Lewis, F.L.</creatorcontrib><title>Stochastic optimal control of unknown linear networked control system in the presence of random delays and packet losses</title><title>Automatica (Oxford)</title><description>In this paper, the stochastic optimal control of linear networked control system (NCS) with uncertain system dynamics and in the presence of network imperfections such as random delays and packet losses is derived. The proposed stochastic optimal control method uses an adaptive estimator (AE) and ideas from Q-learning to solve the infinite horizon optimal regulation of unknown NCS with time-varying system matrices. 
Next, a stochastic suboptimal control scheme which uses AE and Q-learning is introduced for the regulation of unknown linear time-invariant NCS that is derived using certainty equivalence property. Update laws for online tuning the unknown parameters of the AE to obtain the Q-function are derived. Lyapunov theory is used to show that all signals are asymptotically stable (AS) and that the estimated control signals converge to optimal or suboptimal control inputs. Simulation results are included to show the effectiveness of the proposed schemes. The result is an optimal control scheme that operates forward-in-time manner for unknown linear systems in contrast with standard Riccati equation-based schemes which function backward-in-time.</description><subject>Adaptive estimator</subject><subject>Applied sciences</subject><subject>Asymptotic properties</subject><subject>Computer science; control theory; systems</subject><subject>Computer systems and distributed systems. User interface</subject><subject>Control</subject><subject>Control systems</subject><subject>Control theory. 
Systems</subject><subject>Delay</subject><subject>Exact sciences and technology</subject><subject>Modelling and identification</subject><subject>Networked control system (NCS)</subject><subject>Optimal control</subject><subject>Optimization</subject><subject>Packets (communication)</subject><subject>Q-function</subject><subject>Software</subject><subject>Stochasticity</subject><issn>0005-1098</issn><issn>1873-2836</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2012</creationdate><recordtype>article</recordtype><recordid>eNqFkMFu1DAQhq0KpC6l7-ALEpeEcZxsnCNUUJAqcQDO1tQZq9517GB7Kfv29WqrcuRkjfT9nvk_xriAVoDYfti1eChxweIMth2IrgXZAowXbCPUKJtOye0rtgGAoREwqUv2JuddHXuhug37-6NE84C5xnlci1vQcxNDSdHzaPkh7EN8DNy7QJh4oPIY057mFyYfc6GFu8DLA_E1UaZg6BRNGOa48Jk8HjOvA1_R7KlwH3Om_Ja9tugzXT-_V-zXl88_b742d99vv918vGtM34-lsQiqHjsBgZw6UlPtjNIS4Tz30yjNsBVC2ZlGNPejGqZpUAptD_dgZgtWXrH353_XFH8fKBe9uGzIewwUD1kL6Do1wHaUFVVn1KR6YiKr11SFpGOF9Em23ul_svVJtgapq-waffe8BbNBb2t54_JLvhvUWNdMlft05qhW_uMo6WzcydjsEpmi5-j-v-wJmlGdTw</recordid><startdate>20120601</startdate><enddate>20120601</enddate><creator>Xu, Hao</creator><creator>Jagannathan, S.</creator><creator>Lewis, F.L.</creator><general>Elsevier Ltd</general><general>Elsevier</general><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20120601</creationdate><title>Stochastic optimal control of unknown linear networked control system in the presence of random delays and packet losses</title><author>Xu, Hao ; Jagannathan, S. 
; Lewis, F.L.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c447t-fa0800090e0392e89016a3feeadd4973c56118fde7acb78599588af40b0cdf0f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2012</creationdate><topic>Adaptive estimator</topic><topic>Applied sciences</topic><topic>Asymptotic properties</topic><topic>Computer science; control theory; systems</topic><topic>Computer systems and distributed systems. User interface</topic><topic>Control</topic><topic>Control systems</topic><topic>Control theory. Systems</topic><topic>Delay</topic><topic>Exact sciences and technology</topic><topic>Modelling and identification</topic><topic>Networked control system (NCS)</topic><topic>Optimal control</topic><topic>Optimization</topic><topic>Packets (communication)</topic><topic>Q-function</topic><topic>Software</topic><topic>Stochasticity</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Xu, Hao</creatorcontrib><creatorcontrib>Jagannathan, S.</creatorcontrib><creatorcontrib>Lewis, F.L.</creatorcontrib><collection>Pascal-Francis</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics &amp; Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts – Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Automatica (Oxford)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Xu, Hao</au><au>Jagannathan, S.</au><au>Lewis, F.L.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Stochastic optimal control 
of unknown linear networked control system in the presence of random delays and packet losses</atitle><jtitle>Automatica (Oxford)</jtitle><date>2012-06-01</date><risdate>2012</risdate><volume>48</volume><issue>6</issue><spage>1017</spage><epage>1030</epage><pages>1017-1030</pages><issn>0005-1098</issn><eissn>1873-2836</eissn><coden>ATCAA9</coden><abstract>In this paper, the stochastic optimal control of linear networked control system (NCS) with uncertain system dynamics and in the presence of network imperfections such as random delays and packet losses is derived. The proposed stochastic optimal control method uses an adaptive estimator (AE) and ideas from Q-learning to solve the infinite horizon optimal regulation of unknown NCS with time-varying system matrices. Next, a stochastic suboptimal control scheme which uses AE and Q-learning is introduced for the regulation of unknown linear time-invariant NCS that is derived using certainty equivalence property. Update laws for online tuning the unknown parameters of the AE to obtain the Q-function are derived. Lyapunov theory is used to show that all signals are asymptotically stable (AS) and that the estimated control signals converge to optimal or suboptimal control inputs. Simulation results are included to show the effectiveness of the proposed schemes. The result is an optimal control scheme that operates forward-in-time manner for unknown linear systems in contrast with standard Riccati equation-based schemes which function backward-in-time.</abstract><cop>Kidlington</cop><pub>Elsevier Ltd</pub><doi>10.1016/j.automatica.2012.03.007</doi><tpages>14</tpages></addata></record>
fulltext fulltext
identifier ISSN: 0005-1098
ispartof Automatica (Oxford), 2012-06, Vol.48 (6), p.1017-1030
issn 0005-1098
1873-2836
language eng
recordid cdi_proquest_miscellaneous_1022850673
source Elsevier ScienceDirect Journals
subjects Adaptive estimator
Applied sciences
Asymptotic properties
Computer science; control theory; systems
Computer systems and distributed systems. User interface
Control
Control systems
Control theory. Systems
Delay
Exact sciences and technology
Modelling and identification
Networked control system (NCS)
Optimal control
Optimization
Packets (communication)
Q-function
Software
Stochasticity
title Stochastic optimal control of unknown linear networked control system in the presence of random delays and packet losses
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-03T11%3A36%3A57IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Stochastic%20optimal%20control%20of%20unknown%20linear%20networked%20control%20system%20in%20the%20presence%20of%20random%20delays%20and%20packet%20losses&rft.jtitle=Automatica%20(Oxford)&rft.au=Xu,%20Hao&rft.date=2012-06-01&rft.volume=48&rft.issue=6&rft.spage=1017&rft.epage=1030&rft.pages=1017-1030&rft.issn=0005-1098&rft.eissn=1873-2836&rft.coden=ATCAA9&rft_id=info:doi/10.1016/j.automatica.2012.03.007&rft_dat=%3Cproquest_cross%3E1022850673%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1022850673&rft_id=info:pmid/&rft_els_id=S0005109812001033&rfr_iscdi=true