U-Model-Based Adaptive Sliding Mode Control Using a Deep Deterministic Policy Gradient

This paper presents a U-model-based adaptive sliding mode control (SMC) using a deep deterministic policy gradient (DDPG) for uncertain nonlinear systems. The configuration of the proposed methodology consisted of a U-model framework and an SMC with a variable boundary layer. The U-model framework f...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Mathematical problems in engineering 2022-10, Vol.2022, p.1-14
Hauptverfasser: Lei, Changyi, Zhu, Quanmin
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
container_end_page 14
container_issue
container_start_page 1
container_title Mathematical problems in engineering
container_volume 2022
creator Lei, Changyi
Zhu, Quanmin
description This paper presents a U-model-based adaptive sliding mode control (SMC) using a deep deterministic policy gradient (DDPG) for uncertain nonlinear systems. The configuration of the proposed methodology consisted of a U-model framework and an SMC with a variable boundary layer. The U-model framework forms the outer feedback loop that adjusts the overall performance of the nonlinear system, while SMC serves as a robust dynamic inverter that cancels the nonlinearity of the original plant. Besides, to alleviate the chattering problem while maintaining the intrinsic advantages of SMC, a DDPG network is designed to adaptively tune the boundary and switching gain. From the control perspective, this controller combines the interpretability of the U-model and the robustness of the SMC. From the deep reinforcement learning (DRL) point of view, the DDPG calculates nearly optimal parameters for SMC based on current states and maximizes its favourable features while minimizing the unfavourable ones. The simulation results of the single-pendulum system are compared with those of a U-model-based SMC optimized by the particle swarm optimization (PSO) algorithm. The comparison, as well as model visualization, demonstrates the superiority of the proposed methodology.
doi_str_mv 10.1155/2022/8980664
format Article
fullrecord <record><control><sourceid>proquest_cross</sourceid><recordid>TN_cdi_proquest_journals_2725126918</recordid><sourceformat>XML</sourceformat><sourcesystem>PC</sourcesystem><sourcerecordid>2725126918</sourcerecordid><originalsourceid>FETCH-LOGICAL-c337t-8f8f9578fa5e9bc51400bb84997e40979dbce103ba9c92f1cdbafcd2781e97913</originalsourceid><addsrcrecordid>eNp9kEFPwzAMhSMEEmNw4wdE4ghlcdo0yXEMGEhDIMEQtypNUsjUNSXpQPv3tNrOXGzL75Of9RA6B3INwNiEEkonQgqS59kBGgHL04RBxg_7mdAsAZp-HKOTGFeEUGAgRuh9mTx5Y-vkRkVr8NSotnM_Fr_WzrjmEw8invmmC77GyzisFL61tu1LZ8PaNS52TuMXXzu9xfOgjLNNd4qOKlVHe7bvY7S8v3ubPSSL5_njbLpIdJryLhGVqCTjolLMylL3vxJSliKTktuMSC5NqS2QtFRSS1qBNqWqtKFcgO1VSMfoYne3Df57Y2NXrPwmNL1lQTllQHMJoqeudpQOPsZgq6INbq3CtgBSDMkVQ3LFPrkev9zhX64x6tf9T_8B5k5ssg</addsrcrecordid><sourcetype>Aggregation Database</sourcetype><iscdi>true</iscdi><recordtype>article</recordtype><pqid>2725126918</pqid></control><display><type>article</type><title>U-Model-Based Adaptive Sliding Mode Control Using a Deep Deterministic Policy Gradient</title><source>Wiley Online Library</source><source>EZB-FREE-00999 freely available EZB journals</source><source>Alma/SFX Local Collection</source><creator>Lei, Changyi ; Zhu, Quanmin</creator><contributor>Khan, Abdul Qadeer ; Abdul Qadeer Khan</contributor><creatorcontrib>Lei, Changyi ; Zhu, Quanmin ; Khan, Abdul Qadeer ; Abdul Qadeer Khan</creatorcontrib><description>This paper presents a U-model-based adaptive sliding mode control (SMC) using a deep deterministic policy gradient (DDPG) for uncertain nonlinear systems. The configuration of the proposed methodology consisted of a U-model framework and an SMC with a variable boundary layer. The U-model framework forms the outer feedback loop that adjusts the overall performance of the nonlinear system, while SMC serves as a robust dynamic inverter that cancels the nonlinearity of the original plant. Besides, to alleviate the chattering problem while maintaining the intrinsic advantages of SMC, a DDPG network is designed to adaptively tune the boundary and switching gain. From the control perspective, this controller combines the interpretability of the U-model and the robustness of the SMC. From the deep reinforcement learning (DRL) point of view, the DDPG calculates nearly optimal parameters for SMC based on current states and maximizes its favourable features while minimizing the unfavourable ones. The simulation results of the single-pendulum system are compared with those of a U-model-based SMC optimized by the particle swarm optimization (PSO) algorithm. The comparison, as well as model visualization, demonstrates the superiority of the proposed methodology.</description><identifier>ISSN: 1024-123X</identifier><identifier>EISSN: 1563-5147</identifier><identifier>DOI: 10.1155/2022/8980664</identifier><language>eng</language><publisher>New York: Hindawi</publisher><subject>Adaptive control ; Algorithms ; Boundary layers ; Control theory ; Controllers ; Feedback loops ; Machine learning ; Methods ; Neural networks ; Nonlinear systems ; Nonlinearity ; Particle swarm optimization ; Simulation ; Sliding mode control ; System theory ; Variables</subject><ispartof>Mathematical problems in engineering, 2022-10, Vol.2022, p.1-14</ispartof><rights>Copyright © 2022 Changyi Lei and Quanmin Zhu.</rights><rights>Copyright © 2022 Changyi Lei and Quanmin Zhu. This is an open access article distributed under the Creative Commons Attribution License (the “License”), which permits unrestricted use, distribution, and reproduction in any medium, provided the original work is properly cited. Notwithstanding the ProQuest Terms and Conditions, you may use this content in accordance with the terms of the License. https://creativecommons.org/licenses/by/4.0</rights><lds50>peer_reviewed</lds50><oa>free_for_read</oa><woscitedreferencessubscribed>false</woscitedreferencessubscribed><citedby>FETCH-LOGICAL-c337t-8f8f9578fa5e9bc51400bb84997e40979dbce103ba9c92f1cdbafcd2781e97913</citedby><cites>FETCH-LOGICAL-c337t-8f8f9578fa5e9bc51400bb84997e40979dbce103ba9c92f1cdbafcd2781e97913</cites><orcidid>0000-0001-8173-1179 ; 0000-0002-9743-693X</orcidid></display><links><openurl>$$Topenurl_article</openurl><openurlfulltext>$$Topenurlfull_article</openurlfulltext><thumbnail>$$Tsyndetics_thumb_exl</thumbnail><link.rule.ids>314,776,780,27901,27902</link.rule.ids></links><search><contributor>Khan, Abdul Qadeer</contributor><contributor>Abdul Qadeer Khan</contributor><creatorcontrib>Lei, Changyi</creatorcontrib><creatorcontrib>Zhu, Quanmin</creatorcontrib><title>U-Model-Based Adaptive Sliding Mode Control Using a Deep Deterministic Policy Gradient</title><title>Mathematical problems in engineering</title><description>This paper presents a U-model-based adaptive sliding mode control (SMC) using a deep deterministic policy gradient (DDPG) for uncertain nonlinear systems. The configuration of the proposed methodology consisted of a U-model framework and an SMC with a variable boundary layer. The U-model framework forms the outer feedback loop that adjusts the overall performance of the nonlinear system, while SMC serves as a robust dynamic inverter that cancels the nonlinearity of the original plant. Besides, to alleviate the chattering problem while maintaining the intrinsic advantages of SMC, a DDPG network is designed to adaptively tune the boundary and switching gain. From the control perspective, this controller combines the interpretability of the U-model and the robustness of the SMC. From the deep reinforcement learning (DRL) point of view, the DDPG calculates nearly optimal parameters for SMC based on current states and maximizes its favourable features while minimizing the unfavourable ones. The simulation results of the single-pendulum system are compared with those of a U-model-based SMC optimized by the particle swarm optimization (PSO) algorithm. The comparison, as well as model visualization, demonstrates the superiority of the proposed methodology.</description><subject>Adaptive control</subject><subject>Algorithms</subject><subject>Boundary layers</subject><subject>Control theory</subject><subject>Controllers</subject><subject>Feedback loops</subject><subject>Machine learning</subject><subject>Methods</subject><subject>Neural networks</subject><subject>Nonlinear systems</subject><subject>Nonlinearity</subject><subject>Particle swarm optimization</subject><subject>Simulation</subject><subject>Sliding mode control</subject><subject>System theory</subject><subject>Variables</subject><issn>1024-123X</issn><issn>1563-5147</issn><fulltext>true</fulltext><rsrctype>article</rsrctype><creationdate>2022</creationdate><recordtype>article</recordtype><sourceid>RHX</sourceid><sourceid>BENPR</sourceid><recordid>eNp9kEFPwzAMhSMEEmNw4wdE4ghlcdo0yXEMGEhDIMEQtypNUsjUNSXpQPv3tNrOXGzL75Of9RA6B3INwNiEEkonQgqS59kBGgHL04RBxg_7mdAsAZp-HKOTGFeEUGAgRuh9mTx5Y-vkRkVr8NSotnM_Fr_WzrjmEw8invmmC77GyzisFL61tu1LZ8PaNS52TuMXXzu9xfOgjLNNd4qOKlVHe7bvY7S8v3ubPSSL5_njbLpIdJryLhGVqCTjolLMylL3vxJSliKTktuMSC5NqS2QtFRSS1qBNqWqtKFcgO1VSMfoYne3Df57Y2NXrPwmNL1lQTllQHMJoqeudpQOPsZgq6INbq3CtgBSDMkVQ3LFPrkev9zhX64x6tf9T_8B5k5ssg</recordid><startdate>20221007</startdate><enddate>20221007</enddate><creator>Lei, Changyi</creator><creator>Zhu, Quanmin</creator><general>Hindawi</general><general>Hindawi Limited</general><scope>RHU</scope><scope>RHW</scope><scope>RHX</scope><scope>AAYXX</scope><scope>CITATION</scope><scope>7TB</scope><scope>8FD</scope><scope>8FE</scope><scope>8FG</scope><scope>ABJCF</scope><scope>ABUWG</scope><scope>AFKRA</scope><scope>ARAPS</scope><scope>AZQEC</scope><scope>BENPR</scope><scope>BGLVJ</scope><scope>CCPQU</scope><scope>COVID</scope><scope>CWDGH</scope><scope>DWQXO</scope><scope>FR3</scope><scope>GNUQQ</scope><scope>HCIFZ</scope><scope>JQ2</scope><scope>K7-</scope><scope>KR7</scope><scope>L6V</scope><scope>M7S</scope><scope>P5Z</scope><scope>P62</scope><scope>PIMPY</scope><scope>PQEST</scope><scope>PQQKQ</scope><scope>PQUKI</scope><scope>PRINS</scope><scope>PTHSS</scope><orcidid>https://orcid.org/0000-0001-8173-1179</orcidid><orcidid>https://orcid.org/0000-0002-9743-693X</orcidid></search><sort><creationdate>20221007</creationdate><title>U-Model-Based Adaptive Sliding Mode Control Using a Deep Deterministic Policy Gradient</title><author>Lei, Changyi ; Zhu, Quanmin</author></sort><facets><frbrtype>5</frbrtype><frbrgroupid>cdi_FETCH-LOGICAL-c337t-8f8f9578fa5e9bc51400bb84997e40979dbce103ba9c92f1cdbafcd2781e97913</frbrgroupid><rsrctype>articles</rsrctype><prefilter>articles</prefilter><language>eng</language><creationdate>2022</creationdate><topic>Adaptive control</topic><topic>Algorithms</topic><topic>Boundary layers</topic><topic>Control theory</topic><topic>Controllers</topic><topic>Feedback loops</topic><topic>Machine learning</topic><topic>Methods</topic><topic>Neural networks</topic><topic>Nonlinear systems</topic><topic>Nonlinearity</topic><topic>Particle swarm optimization</topic><topic>Simulation</topic><topic>Sliding mode control</topic><topic>System theory</topic><topic>Variables</topic><toplevel>peer_reviewed</toplevel><toplevel>online_resources</toplevel><creatorcontrib>Lei, Changyi</creatorcontrib><creatorcontrib>Zhu, Quanmin</creatorcontrib><collection>Hindawi Publishing Complete</collection><collection>Hindawi Publishing Subscription Journals</collection><collection>Hindawi Publishing Open Access</collection><collection>CrossRef</collection><collection>Mechanical &amp; Transportation Engineering Abstracts</collection><collection>Technology Research Database</collection><collection>ProQuest SciTech Collection</collection><collection>ProQuest Technology Collection</collection><collection>Materials Science &amp; Engineering Collection</collection><collection>ProQuest Central (Alumni Edition)</collection><collection>ProQuest Central UK/Ireland</collection><collection>Advanced Technologies &amp; Aerospace Collection</collection><collection>ProQuest Central Essentials</collection><collection>ProQuest Central</collection><collection>Technology Collection</collection><collection>ProQuest One Community College</collection><collection>Coronavirus Research Database</collection><collection>Middle East &amp; Africa Database</collection><collection>ProQuest Central Korea</collection><collection>Engineering Research Database</collection><collection>ProQuest Central Student</collection><collection>SciTech Premium Collection</collection><collection>ProQuest Computer Science Collection</collection><collection>Computer Science Database</collection><collection>Civil Engineering Abstracts</collection><collection>ProQuest Engineering Collection</collection><collection>Engineering Database</collection><collection>Advanced Technologies &amp; Aerospace Database</collection><collection>ProQuest Advanced Technologies &amp; Aerospace Collection</collection><collection>Publicly Available Content Database</collection><collection>ProQuest One Academic Eastern Edition (DO NOT USE)</collection><collection>ProQuest One Academic</collection><collection>ProQuest One Academic UKI Edition</collection><collection>ProQuest Central China</collection><collection>Engineering Collection</collection><jtitle>Mathematical problems in engineering</jtitle></facets><delivery><delcategory>Remote Search Resource</delcategory><fulltext>fulltext</fulltext></delivery><addata><au>Lei, Changyi</au><au>Zhu, Quanmin</au><au>Khan, Abdul Qadeer</au><au>Abdul Qadeer Khan</au><format>journal</format><genre>article</genre><ristype>JOUR</ristype><atitle>U-Model-Based Adaptive Sliding Mode Control Using a Deep Deterministic Policy Gradient</atitle><jtitle>Mathematical problems in engineering</jtitle><date>2022-10-07</date><risdate>2022</risdate><volume>2022</volume><spage>1</spage><epage>14</epage><pages>1-14</pages><issn>1024-123X</issn><eissn>1563-5147</eissn><abstract>This paper presents a U-model-based adaptive sliding mode control (SMC) using a deep deterministic policy gradient (DDPG) for uncertain nonlinear systems. The configuration of the proposed methodology consisted of a U-model framework and an SMC with a variable boundary layer. The U-model framework forms the outer feedback loop that adjusts the overall performance of the nonlinear system, while SMC serves as a robust dynamic inverter that cancels the nonlinearity of the original plant. Besides, to alleviate the chattering problem while maintaining the intrinsic advantages of SMC, a DDPG network is designed to adaptively tune the boundary and switching gain. From the control perspective, this controller combines the interpretability of the U-model and the robustness of the SMC. From the deep reinforcement learning (DRL) point of view, the DDPG calculates nearly optimal parameters for SMC based on current states and maximizes its favourable features while minimizing the unfavourable ones. The simulation results of the single-pendulum system are compared with those of a U-model-based SMC optimized by the particle swarm optimization (PSO) algorithm. The comparison, as well as model visualization, demonstrates the superiority of the proposed methodology.</abstract><cop>New York</cop><pub>Hindawi</pub><doi>10.1155/2022/8980664</doi><tpages>14</tpages><orcidid>https://orcid.org/0000-0001-8173-1179</orcidid><orcidid>https://orcid.org/0000-0002-9743-693X</orcidid><oa>free_for_read</oa></addata></record>
fulltext fulltext
identifier ISSN: 1024-123X
ispartof Mathematical problems in engineering, 2022-10, Vol.2022, p.1-14
issn 1024-123X
1563-5147
language eng
recordid cdi_proquest_journals_2725126918
source Wiley Online Library; EZB-FREE-00999 freely available EZB journals; Alma/SFX Local Collection
subjects Adaptive control
Algorithms
Boundary layers
Control theory
Controllers
Feedback loops
Machine learning
Methods
Neural networks
Nonlinear systems
Nonlinearity
Particle swarm optimization
Simulation
Sliding mode control
System theory
Variables
title U-Model-Based Adaptive Sliding Mode Control Using a Deep Deterministic Policy Gradient
url https://sfx.bib-bvb.de/sfx_tum?ctx_ver=Z39.88-2004&ctx_enc=info:ofi/enc:UTF-8&ctx_tim=2025-01-29T15%3A15%3A41IST&url_ver=Z39.88-2004&url_ctx_fmt=infofi/fmt:kev:mtx:ctx&rfr_id=info:sid/primo.exlibrisgroup.com:primo3-Article-proquest_cross&rft_val_fmt=info:ofi/fmt:kev:mtx:journal&rft.genre=article&rft.atitle=U-Model-Based%20Adaptive%20Sliding%20Mode%20Control%20Using%20a%20Deep%20Deterministic%20Policy%20Gradient&rft.jtitle=Mathematical%20problems%20in%20engineering&rft.au=Lei,%20Changyi&rft.date=2022-10-07&rft.volume=2022&rft.spage=1&rft.epage=14&rft.pages=1-14&rft.issn=1024-123X&rft.eissn=1563-5147&rft_id=info:doi/10.1155/2022/8980664&rft_dat=%3Cproquest_cross%3E2725126918%3C/proquest_cross%3E%3Curl%3E%3C/url%3E&disable_directlink=true&sfx.directlink=off&sfx.report_link=0&rft_id=info:oai/&rft_pqid=2725126918&rft_id=info:pmid/&rfr_iscdi=true