Policy transfer of reinforcement learning-based flow control: From two- to three-dimensional environment

Author:

Abstract

In the current paper, the zero-mass synthetic jet flow control combined with a proximal policy optimization (PPO) algorithm in deep reinforcement learning is constructed, and a policy transfer strategy which is trained in two-dimensional (2D) environment and migrated to three-dimensional (3D) environment is proposed and analyzed. By policy, we mean the flow control strategy of the agent learned by interacting with environment through deep reinforcement learning (DRL) algorithm. Through comprehensive evaluations of vortex separation in the cylindrical boundary layer and wake region at different Reynolds (Re) numbers, the PPO model trained in the 2D environment can reduce the drag coefficient by approximately 6.3%, 18.6%, and 23.7% at Re = 100, 200, and 300, respectively, when the spanwise length of the 3D environment is equal to the cylinder's diameter. Moreover, when the spanwise length is three times the diameter, the drag reduction capability is about 5.8%, 15.4%, and 13.1% at the three Re numbers, respectively. Additionally, the PPO model trained in the 2D environment also demonstrated outstanding migration learning capability in a new 3D flow field environment with varying Re numbers, successfully suppressing vortex shedding and reducing drag coefficient. Furthermore, the results illustrate that the model trained at high Re numbers could still reduce the drag coefficient in the 3D environment with low Re numbers, while the model trained at low Re numbers was not as effective at achieving drag reduction in the environments under high Re numbers. Overall, the proposed policy transfer strategy has been proven to be an effective method applying DRL agent trained in 2D flow to a new 3D environment.

Funder

Natural Science Foundation of Jiangsu Province

Publisher

AIP Publishing

Subject

Condensed Matter Physics,Fluid Flow and Transfer Processes,Mechanics of Materials,Computational Mechanics,Mechanical Engineering

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3