Stochastic optimal control of unknown linear networked control system in the presence of random delays and packet losses
In this paper, the stochastic optimal control of linear networked control system (NCS) with uncertain system dynamics and in the presence of network imperfections such as random delays and packet losses is derived. The proposed stochastic optimal control method uses an adaptive estimator (AE) and id...
Gespeichert in:
Veröffentlicht in: | Automatica (Oxford) 2012-06, Vol.48 (6), p.1017-1030 |
---|---|
Hauptverfasser: | , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
container_end_page | 1030 |
---|---|
container_issue | 6 |
container_start_page | 1017 |
container_title | Automatica (Oxford) |
container_volume | 48 |
creator | Xu, Hao Jagannathan, S. Lewis, F.L. |
description | In this paper, the stochastic optimal control of linear networked control system (NCS) with uncertain system dynamics and in the presence of network imperfections such as random delays and packet losses is derived. The proposed stochastic optimal control method uses an adaptive estimator (AE) and ideas from Q-learning to solve the infinite horizon optimal regulation of unknown NCS with time-varying system matrices. Next, a stochastic suboptimal control scheme which uses AE and Q-learning is introduced for the regulation of unknown linear time-invariant NCS that is derived using certainty equivalence property. Update laws for online tuning the unknown parameters of the AE to obtain the Q-function are derived. Lyapunov theory is used to show that all signals are asymptotically stable (AS) and that the estimated control signals converge to optimal or suboptimal control inputs. Simulation results are included to show the effectiveness of the proposed schemes. The result is an optimal control scheme that operates forward-in-time manner for unknown linear systems in contrast with standard Riccati equation-based schemes which function backward-in-time. |
doi_str_mv | 10.1016/j.automatica.2012.03.007 |
format | Article |
fullrecord | <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_miscellaneous_1022850673</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><els_id>S0005109812001033</els_id><sourcerecordid>1022850673</sourcerecordid><originalsourceid>FETCH-LOGICAL-c447t-fa0800090e0392e89016a3feeadd4973c56118fde7acb78599588af40b0cdf0f3</originalsourceid><addsrcrecordid>eNqFkMFu1DAQhq0KpC6l7-ALEpeEcZxsnCNUUJAqcQDO1tQZq9517GB7Kfv29WqrcuRkjfT9nvk_xriAVoDYfti1eChxweIMth2IrgXZAowXbCPUKJtOye0rtgGAoREwqUv2JuddHXuhug37-6NE84C5xnlci1vQcxNDSdHzaPkh7EN8DNy7QJh4oPIY057mFyYfc6GFu8DLA_E1UaZg6BRNGOa48Jk8HjOvA1_R7KlwH3Om_Ja9tugzXT-_V-zXl88_b742d99vv918vGtM34-lsQiqHjsBgZw6UlPtjNIS4Tz30yjNsBVC2ZlGNPejGqZpUAptD_dgZgtWXrH353_XFH8fKBe9uGzIewwUD1kL6Do1wHaUFVVn1KR6YiKr11SFpGOF9Em23ul_svVJtgapq-waffe8BbNBb2t54_JLvhvUWNdMlft05qhW_uMo6WzcydjsEpmi5-j-v-wJmlGdTw</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>1022850673</pqid></control><display><type>article</type><title>Stochastic optimal control of unknown linear networked control system in the presence of random delays and packet losses</title><source>Elsevier ScienceDirect Journals</source><creator>Xu, Hao ; Jagannathan, S. ; Lewis, F.L.</creator><creatorcontrib>Xu, Hao ; Jagannathan, S. ; Lewis, F.L.</creatorcontrib><description>In this paper, the stochastic optimal control of linear networked control system (NCS) with uncertain system dynamics and in the presence of network imperfections such as random delays and packet losses is derived. The proposed stochastic optimal control method uses an adaptive estimator (AE) and ideas from Q-learning to solve the infinite horizon optimal regulation of unknown NCS with time-varying system matrices. Next, a stochastic suboptimal control scheme which uses AE and Q-learning is introduced for the regulation of unknown linear time-invariant NCS that is derived using certainty equivalence property. Update laws for online tuning the unknown parameters of the AE to obtain the Q-function are derived. Lyapunov theory is used to show that all signals are asymptotically stable (AS) and that the estimated control signals converge to optimal or suboptimal control inputs. Simulation results are included to show the effectiveness of the proposed schemes. The result is an optimal control scheme that operates forward-in-time manner for unknown linear systems in contrast with standard Riccati equation-based schemes which function backward-in-time.</description><identifier>ISSN: 0005-1098</identifier><identifier>EISSN: 1873-2836</identifier><identifier>DOI: 10.1016/j.automatica.2012.03.007</identifier><identifier>CODEN: ATCAA9</identifier><language>eng</language><publisher>Kidlington: Elsevier Ltd</publisher><subject>Adaptive estimator ; Applied sciences ; Asymptotic properties ; Computer science; control theory; systems ; Computer systems and distributed systems. User interface ; Control ; Control systems ; Control theory. Systems ; Delay ; Exact sciences and technology ; Modelling and identification ; Networked control system (NCS) ; Optimal control ; Optimization ; Packets (communication) ; Q-function ; Software ; Stochasticity</subject><ispartof>Automatica (Oxford), 2012-06, Vol.48 (6), p.1017-1030</ispartof><rights>2012 Elsevier Ltd</rights><rights>2015 INIST-CNRS</rights><lds50>peer_reviewed</lds50><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c447t-fa0800090e0392e89016a3feeadd4973c56118fde7acb78599588af40b0cdf0f3</citedby><cites>FETCH-LOGICAL-c447t-fa0800090e0392e89016a3feeadd4973c56118fde7acb78599588af40b0cdf0f3</cites></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><linktohtml>$$Uhttps://www.sciencedirect.com/science/article/pii/S0005109812001033$$EHTML$$P50$$Gelsevier$$H</linktohtml><link.rule.ids>314,776,780,3537,27901,27902,65306</link.rule.ids><backlink>$$Uhttp://pascal-francis.inist.fr/vibad/index.php?action=getRecordDetail&idt=25872289$$DView record in Pascal Francis$$Hfree_for_read</backlink></links><search><creatorcontrib>Xu, Hao</creatorcontrib><creatorcontrib>Jagannathan, S.</creatorcontrib><creatorcontrib>Lewis, F.L.</creatorcontrib><title>Stochastic optimal control of unknown linear networked control system in the presence of random delays and packet losses</title><title>Automatica (Oxford)</title><description>In this paper, the stochastic optimal control of linear networked control system (NCS) with uncertain system dynamics and in the presence of network imperfections such as random delays and packet losses is derived. The proposed stochastic optimal control method uses an adaptive estimator (AE) and ideas from Q-learning to solve the infinite horizon optimal regulation of unknown NCS with time-varying system matrices. Next, a stochastic suboptimal control scheme which uses AE and Q-learning is introduced for the regulation of unknown linear time-invariant NCS that is derived using certainty equivalence property. Update laws for online tuning the unknown parameters of the AE to obtain the Q-function are derived. Lyapunov theory is used to show that all signals are asymptotically stable (AS) and that the estimated control signals converge to optimal or suboptimal control inputs. Simulation results are included to show the effectiveness of the proposed schemes. The result is an optimal control scheme that operates forward-in-time manner for unknown linear systems in contrast with standard Riccati equation-based schemes which function backward-in-time.</description><subject>Adaptive estimator</subject><subject>Applied sciences</subject><subject>Asymptotic properties</subject><subject>Computer science; control theory; systems</subject><subject>Computer systems and distributed systems. User interface</subject><subject>Control</subject><subject>Control systems</subject><subject>Control theory. Systems</subject><subject>Delay</subject><subject>Exact sciences and technology</subject><subject>Modelling and identification</subject><subject>Networked control system (NCS)</subject><subject>Optimal control</subject><subject>Optimization</subject><subject>Packets (communication)</subject><subject>Q-function</subject><subject>Software</subject><subject>Stochasticity</subject><issn>0005-1098</issn><issn>1873-2836</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2012</creationdate><recordtype>article</recordtype><recordid>eNqFkMFu1DAQhq0KpC6l7-ALEpeEcZxsnCNUUJAqcQDO1tQZq9517GB7Kfv29WqrcuRkjfT9nvk_xriAVoDYfti1eChxweIMth2IrgXZAowXbCPUKJtOye0rtgGAoREwqUv2JuddHXuhug37-6NE84C5xnlci1vQcxNDSdHzaPkh7EN8DNy7QJh4oPIY057mFyYfc6GFu8DLA_E1UaZg6BRNGOa48Jk8HjOvA1_R7KlwH3Om_Ja9tugzXT-_V-zXl88_b742d99vv918vGtM34-lsQiqHjsBgZw6UlPtjNIS4Tz30yjNsBVC2ZlGNPejGqZpUAptD_dgZgtWXrH353_XFH8fKBe9uGzIewwUD1kL6Do1wHaUFVVn1KR6YiKr11SFpGOF9Em23ul_svVJtgapq-waffe8BbNBb2t54_JLvhvUWNdMlft05qhW_uMo6WzcydjsEpmi5-j-v-wJmlGdTw</recordid><startdate>20120601</startdate><enddate>20120601</enddate><creator>Xu, Hao</creator><creator>Jagannathan, S.</creator><creator>Lewis, F.L.</creator><general>Elsevier Ltd</general><general>Elsevier</general><scope>IQODW</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7SC</scope><scope>7SP</scope><scope>8FD</scope><scope>JQ2</scope><scope>L7M</scope><scope>L~C</scope><scope>L~D</scope></search><sort><creationdate>20120601</creationdate><title>Stochastic optimal control of unknown linear networked control system in the presence of random delays and packet losses</title><author>Xu, Hao ; Jagannathan, S. ; Lewis, F.L.</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c447t-fa0800090e0392e89016a3feeadd4973c56118fde7acb78599588af40b0cdf0f3</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2012</creationdate><topic>Adaptive estimator</topic><topic>Applied sciences</topic><topic>Asymptotic properties</topic><topic>Computer science; control theory; systems</topic><topic>Computer systems and distributed systems. User interface</topic><topic>Control</topic><topic>Control systems</topic><topic>Control theory. Systems</topic><topic>Delay</topic><topic>Exact sciences and technology</topic><topic>Modelling and identification</topic><topic>Networked control system (NCS)</topic><topic>Optimal control</topic><topic>Optimization</topic><topic>Packets (communication)</topic><topic>Q-function</topic><topic>Software</topic><topic>Stochasticity</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Xu, Hao</creatorcontrib><creatorcontrib>Jagannathan, S.</creatorcontrib><creatorcontrib>Lewis, F.L.</creatorcontrib><collection>Pascal-Francis</collection><collection>CrossRef</collection><collection>Computer and Information Systems Abstracts</collection><collection>Electronics & Communications Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest Computer Science Collection</collection><collection>Advanced Technologies Database with Aerospace</collection><collection>Computer and Information Systems Abstracts Academic</collection><collection>Computer and Information Systems Abstracts Professional</collection><jtitle>Automatica (Oxford)</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Xu, Hao</au><au>Jagannathan, S.</au><au>Lewis, F.L.</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>Stochastic optimal control of unknown linear networked control system in the presence of random delays and packet losses</atitle><jtitle>Automatica (Oxford)</jtitle><date>2012-06-01</date><risdate>2012</risdate><volume>48</volume><issue>6</issue><spage>1017</spage><epage>1030</epage><pages>1017-1030</pages><issn>0005-1098</issn><eissn>1873-2836</eissn><coden>ATCAA9</coden><abstract>In this paper, the stochastic optimal control of linear networked control system (NCS) with uncertain system dynamics and in the presence of network imperfections such as random delays and packet losses is derived. The proposed stochastic optimal control method uses an adaptive estimator (AE) and ideas from Q-learning to solve the infinite horizon optimal regulation of unknown NCS with time-varying system matrices. Next, a stochastic suboptimal control scheme which uses AE and Q-learning is introduced for the regulation of unknown linear time-invariant NCS that is derived using certainty equivalence property. Update laws for online tuning the unknown parameters of the AE to obtain the Q-function are derived. Lyapunov theory is used to show that all signals are asymptotically stable (AS) and that the estimated control signals converge to optimal or suboptimal control inputs. Simulation results are included to show the effectiveness of the proposed schemes. The result is an optimal control scheme that operates forward-in-time manner for unknown linear systems in contrast with standard Riccati equation-based schemes which function backward-in-time.</abstract><cop>Kidlington</cop><pub>Elsevier Ltd</pub><doi>10.1016/j.automatica.2012.03.007</doi><tpages>14</tpages></addata></record> |
fulltext | fulltext |
identifier | ISSN: 0005-1098 |
ispartof | Automatica (Oxford), 2012-06, Vol.48 (6), p.1017-1030 |
issn | 0005-1098 1873-2836 |
language | eng |
recordid | cdi_proquest_miscellaneous_1022850673 |
source | Elsevier ScienceDirect Journals |
subjects | Adaptive estimator Applied sciences Asymptotic properties Computer science control theory systems Computer systems and distributed systems. User interface Control Control systems Control theory. Systems Delay Exact sciences and technology Modelling and identification Networked control system (NCS) Optimal control Optimization Packets (communication) Q-function Software Stochasticity |
title | Stochastic optimal control of unknown linear networked control system in the presence of random delays and packet losses |
url | https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-02-03T11%3A36%3A57IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=Stochastic%20optimal%20control%20of%20unknown%20linear%20networked%20control%20system%20in%20the%20presence%20of%20random%20delays%20and%20packet%20losses&rft.jtitle=Automatica%20(Oxford)&rft.au=Xu,%20Hao&rft.date=2012-06-01&rft.volume=48&rft.issue=6&rft.spage=1017&rft.epage=1030&rft.pages=1017-1030&rft.issn=0005-1098&rft.eissn=1873-2836&rft.coden=ATCAA9&rft_id=info:doi/10.1016/j.automatica.2012.03.007&rft_dat=%3Cproquest_cross%3E1022850673%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=1022850673&rft_id=info:pmid/&rft_els_id=S0005109812001033&rfr_iscdi=true |