Reinforcement learning approach to autonomous PID tuning

•Reinforcement Learning for autonomous PID tuning.•Constrained learning for safe operation, improved safety by weight coefficient adjustment.•Simulated and experimental studies together with industrial standard distributed control system implementation.•Comparison of commonly used techniques.•Quanti...

Ausführliche Beschreibung

Gespeichert in:
Bibliographische Detailangaben
Veröffentlicht in:Computers & chemical engineering 2022-05, Vol.161, p.107760, Article 107760
Hauptverfasser: Dogru, Oguzhan, Velswamy, Kirubakaran, Ibrahim, Fadi, Wu, Yuqi, Sundaramoorthy, Arun Senthil, Huang, Biao, Xu, Shu, Nixon, Mark, Bell, Noel
Format: Artikel
Sprache:eng
Schlagworte:
Online-Zugang:Volltext
Tags: Tag hinzufügen
Keine Tags, Fügen Sie den ersten Tag hinzu!
Beschreibung
Zusammenfassung:•Reinforcement Learning for autonomous PID tuning.•Constrained learning for safe operation, improved safety by weight coefficient adjustment.•Simulated and experimental studies together with industrial standard distributed control system implementation.•Comparison of commonly used techniques.•Quantitative analysis by means of sensitivity analysis. Many industrial processes utilize proportional-integral-derivative (PID) controllers due to their practicality and often satisfactory performance. The proper controller parameters depend highly on the operational conditions and process uncertainties. This study combines the recent developments in computer sciences and control theory to address the tuning problem. It formulates the PID tuning problem as a reinforcement learning task with constraints. The proposed scheme identifies an initial approximate step-response model and lets the agent learn dynamics off-line from the model with minimal effort. After achieving a satisfactory training performance on the model, the agent is fine-tuned on-line on the actual process to adapt to the real dynamics, thereby minimizing the training time on the real process and avoiding unnecessary wear, which can be beneficial for industrial applications. This sample efficient method is tested and demonstrated through a pilot-scale multi-modal tank system. The performance of the method is verified through setpoint tracking and disturbance regulatory experiments.
ISSN:0098-1354
1873-4375
DOI:10.1016/j.compchemeng.2022.107760