Actor-critic reinforcement learning to estimate the optimal operating conditions of the hydrocracking process

•Employed an actor-critic reinforcement learning to optimize a hydrocracking unit.•Generated data from a rigorous mathematical model of marginal error less than 2%.•A DNN-surrogate model with high flexibility was formulated from a mathematical model.•Achieved consistent optimal operating parameter e...

Ausführliche Beschreibung

Gespeichert in:

Bibliographische Detailangaben
Veröffentlicht in:	Computers & chemical engineering 2021-06, Vol.149, p.107280, Article 107280
Hauptverfasser:	Oh, Dong-Hoon, Adams, Derrick, Vo, Nguyen Dat, Gbadago, Dela Quarme, Lee, Chang-Ha, Oh, Min
Format:	Artikel
Sprache:	eng
Schlagworte:	Actor-critic reinforcement learning Deep neural network Hydrocracking process Mathematical modeling Optimization of operating conditions Surrogate model
Online-Zugang:	Volltext
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Beschreibung
Zusammenfassung:	•Employed an actor-critic reinforcement learning to optimize a hydrocracking unit.•Generated data from a rigorous mathematical model of marginal error less than 2%.•A DNN-surrogate model with high flexibility was formulated from a mathematical model.•Achieved consistent optimal operating parameter estimation at 97.86% and 98.5% accuracy.•The optimization technique demonstrated adaptability and customization advantages. Determining the optimal operating conditions for hydrocracking units is imperative due to the changing nature of production requirements. However, it is expensive to optimize the hydrocracking process with mathematical models because hydrocracking units have a limited capacity for quick response and customization. This study proposes an actor-critic reinforcement learning optimization strategy using a DNN surrogate model, which was developed from a validated mathematical model with a marginal error of less than 2%. The surrogate model interacted with the A2C algorithm and the optimal operating conditions were determined with an accuracy of 97.86% and 98.5%. To demonstrate the reliability, case studies were executed; the strategy was found to be consistent, with an average efficiency of 98%. The proposed approach offers the advantages of quick response time, low computational burden and customizability for online implementation, which are essential for practical optimization problems. It can be extended beyond hydrocracking to other chemical industries.
ISSN:	0098-1354 1873-4375
DOI:	10.1016/j.compchemeng.2021.107280