Incremental Q-learning strategy for adaptive PID control of mobile robots

•Adaptive PID control strategy of mobile robots.•Integration of Reinforcement Learning with PID control for complex systems.•Incremental Q-learning algorithm for real-time tuning of multiples PID controllers.•Managing the adaptation process by temporal memories comparison. Expert and intelligent sys...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Expert systems with applications 2017-09, Vol.80, p.183-199
Hauptverfasser: Carlucho, Ignacio, De Paula, Mariano, Villar, Sebastian A., Acosta, Gerardo G.
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•Adaptive PID control strategy of mobile robots.•Integration of Reinforcement Learning with PID control for complex systems.•Incremental Q-learning algorithm for real-time tuning of multiples PID controllers.•Managing the adaptation process by temporal memories comparison. Expert and intelligent systems are being developed to control many technological systems including mobile robots. However, the PID (Proportional-Integral-Derivative) controller is a fast low-level control strategy widely used in many control engineering tasks. Classic control theory has contributed with different tuning methods to obtain the gains of PID controllers for specific operation conditions. Nevertheless, when the system is not fully known and the operative conditions are variable and not previously known, classical techniques are not entirely suitable for the PID tuning. To overcome these drawbacks many adaptive approaches have been arisen, mainly from the field of artificial intelligent. In this work, we propose an incremental Q-learning strategy for adaptive PID control. In order to improve the learning efficiency we define a temporal memory into the learning process. While the memory remains invariant, a non-uniform specialization process is carried out generating new limited subspaces of learning. An implementation on a real mobile robot demonstrates the applicability of the proposed approach for a real-time simultaneous tuning of multiples adaptive PID controllers for a real system operating under variable conditions in a real environment.
ISSN:0957-4174
1873-6793
DOI:10.1016/j.eswa.2017.03.002