Incremental Q-learning strategy for adaptive PID control of mobile robots
•Adaptive PID control strategy of mobile robots.•Integration of Reinforcement Learning with PID control for complex systems.•Incremental Q-learning algorithm for real-time tuning of multiples PID controllers.•Managing the adaptation process by temporal memories comparison. Expert and intelligent sys...
Gespeichert in:
Veröffentlicht in: | Expert systems with applications 2017-09, Vol.80, p.183-199 |
---|---|
Hauptverfasser: | , , , |
Format: | Artikel |
Sprache: | eng |
Schlagworte: | |
Online-Zugang: | Volltext |
Tags: |
Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
|
Zusammenfassung: | •Adaptive PID control strategy of mobile robots.•Integration of Reinforcement Learning with PID control for complex systems.•Incremental Q-learning algorithm for real-time tuning of multiples PID controllers.•Managing the adaptation process by temporal memories comparison.
Expert and intelligent systems are being developed to control many technological systems including mobile robots. However, the PID (Proportional-Integral-Derivative) controller is a fast low-level control strategy widely used in many control engineering tasks. Classic control theory has contributed with different tuning methods to obtain the gains of PID controllers for specific operation conditions. Nevertheless, when the system is not fully known and the operative conditions are variable and not previously known, classical techniques are not entirely suitable for the PID tuning. To overcome these drawbacks many adaptive approaches have been arisen, mainly from the field of artificial intelligent. In this work, we propose an incremental Q-learning strategy for adaptive PID control. In order to improve the learning efficiency we define a temporal memory into the learning process. While the memory remains invariant, a non-uniform specialization process is carried out generating new limited subspaces of learning. An implementation on a real mobile robot demonstrates the applicability of the proposed approach for a real-time simultaneous tuning of multiples adaptive PID controllers for a real system operating under variable conditions in a real environment. |
---|---|
ISSN: | 0957-4174 1873-6793 |
DOI: | 10.1016/j.eswa.2017.03.002 |