On-line set-point optimization for intelligent supervisory control and improvement of Q-learning convergence

This paper proposes a design method of the Q-learning based intelligent supervisory control system (ISCS) for optimal operation of the three step kiln process and a new practical method for improvement of Q-learning convergence. First, the Q-learning based intelligent supervisory control system with...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Control engineering practice 2021-09, Vol.114, p.104859, Article 104859
Hauptverfasser:	Kim, Song Ho, Song, Kwang Rim, Kang, Il Yong, Hyon, Chung Il
Format:	Artikel
Sprache:	eng
Schlagworte:	Convergence rate Intelligent supervisory control Operational experience rules Q-learning Set-point optimization
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	This paper proposes a design method of the Q-learning based intelligent supervisory control system (ISCS) for optimal operation of the three step kiln process and a new practical method for improvement of Q-learning convergence. First, the Q-learning based intelligent supervisory control system with two layer-structures is designed to find the on-line optimal set-points of control loops for the kiln process. Next, C4.5 is used to extract automatically the operational experience rules of the human operator from the historical data in the lower layer (i.e., process control layer) and the Q-function value is initialized by using the extracted rules in order to determine the optimal initial point of Q-learning in the higher layer (i.e., supervisory control layer). Hence, the convergence rate of Q-learning is extremely accelerated, so that the hierarchical ISCS can replace the human operator in the kiln process in which trial-and-error operation is not allowed. Through simulations and experiments, Q-learning convergence and the stability of the process operation have been evaluated sufficiently under the variable conditions of the states. •We proposed new method guaranteeing fast Q-learning convergence.•Method extracting experience rules from process database automatically was studied.•Optimal initialization algorithm of Q-value using experience rules was proposed.•Q-learning agent based Intelligent supervisory control system was designed.•Proposed system replaces human operator determining optimal set-points of controller.
ISSN:	0967-0661 1873-6939
DOI:	10.1016/j.conengprac.2021.104859