Batch process control based on reinforcement learning with segmented prioritized experience replay

Published:2024-02-01 Issue:5 Volume:35 Page:056202
ISSN:0957-0233
Container-title:Measurement Science and Technology
Short-container-title:Meas. Sci. Technol.

Author:

Xu Chen, Ma Junwei, Tao Hongfeng

Abstract

Batch processes are difficult to control accurately because of their complex nonlinear dynamics and unstable operating conditions. Traditional methods such as model predictive control suffer severely degraded control performance when the process model is inaccurate. In contrast, reinforcement learning (RL) provides a viable alternative by interacting directly with the environment to learn an optimal strategy. This paper proposes a batch process controller based on soft actor-critic (SAC) with segmented prioritized experience replay (SPER). SAC combines off-policy updates and maximum-entropy RL in an actor-critic formulation, which can yield a more robust control strategy than other RL methods. To improve the efficiency of the experience replay mechanism in tasks with long episodes and multiple phases, a new experience sampling method, SPER, is designed for SAC. In addition, a novel reward function is designed for the SPER-SAC-based controller to address sparse rewards. Finally, the effectiveness of the SPER-SAC-based controller is demonstrated on batch process examples by comparison with conventional RL-based control methods.
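
The abstract does not say how SPER partitions or weights stored experience, so the following is only a rough sketch of one way a segmented, priority-weighted replay buffer could look. It assumes an episode is split into equal time segments and that the same number of transitions is drawn from each segment with probability proportional to priority raised to a power. The class name `SegmentedPrioritizedBuffer`, its parameters (`episode_len`, `num_segments`, `alpha`), and the placeholder priorities are all hypothetical and are not taken from the paper.

```python
import random


class SegmentedPrioritizedBuffer:
    """Illustrative assumption, not the paper's SPER: one list per time segment."""

    def __init__(self, episode_len, num_segments, alpha=0.6, seed=0):
        self.episode_len = episode_len
        self.num_segments = num_segments
        self.alpha = alpha  # exponent controlling how strongly priority skews sampling
        self.segments = [[] for _ in range(num_segments)]  # (priority, transition) pairs
        self.rng = random.Random(seed)

    def segment_of(self, t):
        # Map a time step t in [0, episode_len) to a segment index.
        return min(t * self.num_segments // self.episode_len, self.num_segments - 1)

    def add(self, t, transition, priority=1.0):
        # Priorities are assumed to be strictly positive.
        self.segments[self.segment_of(t)].append((priority, transition))

    def sample(self, per_segment):
        # Draw the same number of transitions from every non-empty segment,
        # weighted by priority ** alpha (sampling with replacement).
        batch = []
        for seg in self.segments:
            if not seg:
                continue
            weights = [p ** self.alpha for p, _ in seg]
            picks = self.rng.choices(seg, weights=weights, k=per_segment)
            batch.extend(tr for _, tr in picks)
        return batch


# Usage: a 100-step episode split into 4 segments, with placeholder priorities.
buf = SegmentedPrioritizedBuffer(episode_len=100, num_segments=4)
for t in range(100):
    buf.add(t, {"t": t}, priority=1.0 + t % 5)
batch = buf.sample(per_segment=8)
```

Drawing equally from every segment keeps early and late phases of a long episode represented in each batch, which is one plausible reading of the motivation stated in the abstract. A real implementation would typically derive priorities from temporal-difference errors and add importance-sampling corrections.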

Funder

National Key Laboratory of Science and Technology on Helicopter Transmission

Natural Science Foundation of Jiangsu Province

National Natural Science Foundation of China

Publisher

IOP Publishing

Subject

Applied Mathematics, Instrumentation, Engineering (miscellaneous)

Link

https://iopscience.iop.org/article/10.1088/1361-6501/ad21cf/pdf
