Parallel Bootstrap-Based On-Policy Deep Reinforcement Learning for Continuous Fluid Flow Control Applications-Reference-Cited by-同舟云学术

Parallel Bootstrap-Based On-Policy Deep Reinforcement Learning for Continuous Fluid Flow Control Applications

Published:2023-07-14 Issue:7 Volume:8 Page:208
ISSN:2311-5521
Container-title:Fluids
language:en
Short-container-title:Fluids

Author:

Viquerat Jonathan¹,Hachem Elie¹

Affiliation:

1. MINES Paristech, CEMEF, PSL—Research University, 06904 Sophia Antipolis, France

Abstract

The coupling of deep reinforcement learning to numerical flow control problems has recently received considerable attention, leading to groundbreaking results and opening new perspectives for the domain. Due to the usually high computational cost of fluid dynamics solvers, the use of parallel environments during the learning process represents an essential ingredient to attain efficient control in a reasonable time. Yet, most of the deep reinforcement learning literature for flow control relies on on-policy algorithms, for which the massively parallel transition collection may break theoretical assumptions and lead to suboptimal control models. To overcome this issue, we propose a parallelism pattern relying on partial-trajectory buffers terminated by a return bootstrapping step, allowing a flexible use of parallel environments while preserving the on-policiness of the updates. This approach is illustrated on a CPU-intensive continuous flow control problem from the literature.

Funder

ERC

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Mechanical Engineering,Condensed Matter Physics

Link

https://www.mdpi.com/2311-5521/8/7/208/pdf

Reference33 articles.

1. Deep convolutional neural networks for image classification: A comprehensive review;Rawat;Neural Comput.,2017

2. A survey of the recent architectures of deep convolutional neural networks;Khan;Artif. Intell. Rev.,2020

3. Speech recognition using deep neural networks: A systematic review;Nassif;IEEE Access,2019

4. Gui, J., Sun, Z., Wen, Y., Tao, D., and Ye, J. (2020). A review on generative adversarial networks: Algorithms, theory, and applications. arXiv.

5. Ramesh, A., Dhariwal, P., Nichol, A., Chu, C., and Chen, M. (2022). Hierarchical text-conditional image generation with CLIP latents. arXiv.