Actor-critic reinforcement learning leads decision-making in energy systems optimization—steam injection optimization-Reference-Cited by-同舟云学术

Actor-critic reinforcement learning leads decision-making in energy systems optimization—steam injection optimization

Published:2023-04-27 Issue:22 Volume:35 Page:16633-16647
ISSN:0941-0643
Container-title:Neural Computing and Applications
language:en
Short-container-title:Neural Comput & Applic

Author:

Abdalla Ramez^ORCID,Hollstein Wolfgang,Carvajal Carlos Paz,Jaeger Philip

Abstract

AbstractSteam injection is a popular technique to enhance oil recovery in mature oil fields. However, the conventional approach of using a constant steam rate over an extended period can lead to sub-optimal performance due to the complex nature of the problem and reservoir heterogeneity. To address this issue, the Markov decision process can be employed to formulate the problem for reinforcement learning (RL) applications. The RL agent is trained to optimize the steam injection rate by interacting with a reservoir simulation model and receives rewards for each action. The agent’s policy and value functions are updated through continuous interaction with the environment until convergence is achieved, leading to a more efficient steam injection strategy for enhancing oil recovery. In this study, an actor-critic RL architecture was employed to train the agent to find the optimal strategy (i.e., policy). The environment was represented by a reservoir simulation model, and the agent’s actions were based on the observed state. The policy function gave a probability distribution of the actions that the agent could take, while the value function determined the expected yield for an agent starting from a given state. The agent interacted with the environment for several episodes until convergence was achieved. The improvement in net present value (NPV) achieved by the agent was a significant indication of the effectiveness of the RL-based approach. The NPV reflects the economic benefits of the optimized steam injection strategy. The agent was able to achieve this improvement by finding the optimal policies. One of the key advantages of the optimal policy was the decrease in total field heat losses. This is a critical factor in the efficiency of the steam injection process. Heat loss can reduce the efficiency of the process and lead to lower oil recovery rates. By minimizing heat loss, the agent was able to optimize the steam injection process and increase oil recovery rates. The optimal policy had four regions characterized by slight changes in a stable injection rate to increase the average reservoir pressure, increasing the injection rate to a maximum value, steeply decreasing the injection rate, and slightly changing the injection rate to maintain the average reservoir temperature. These regions reflect the different phases of the steam injection process and demonstrate the complexity of the problem. Overall, the results of this study demonstrate the effectiveness of RL in optimizing steam injection in mature oil fields. The use of RL can help address the complexity of the problem and improve the efficiency of the oil recovery process. This study provides a framework for future research in this area and highlights the potential of RL for addressing other complex problems in the energy industry.

Funder

Technische Universität Clausthal

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence,Software

Link

https://link.springer.com/content/pdf/10.1007/s00521-023-08537-6.pdf

Reference54 articles.

1. Hou J, Zhou K, Zhang X-S, Kang X-D, Xie H (2015) A review of closed-loop reservoir management. Pet Sci 12(1):114–128. https://doi.org/10.1007/s12182-014-0005-6

2. Foss BA, Grimstad B, Gunnerud V (2015) Production optimization—facilitated by divide and conquer strategies. IFAC-PapersOnLine 48:1–8

3. Ali SMF, Meldau RF (1979) Current steamflood technology. J Pet Technol 31:1332–1342

4. Zhang J, Chen Z (2018) Formation damage by thermal methods applied to heavy oil reservoirs. In: Yuan B, Wood DA (eds) Formation damage during improved oil recovery. Gulf Professional Publishing, Houston, pp 361–384. https://doi.org/10.1016/B978-0-12-813782-6

5. Shafiei A, Dusseault MB (2013) Geomechanics of thermal viscous oil production in sandstones. J Pet Sci Eng 103:121–139

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Physics-Informed Reinforcement Learning for Motor Frequency Optimization in Electrical Submersible Pumps: Enhancing AI-Led Decision-Making in Production Optimization;SPE Artificial Lift Conference and Exhibition - Americas;2024-08-19

2. Machine learning-assisted in-situ adaptive strategies for the control of defects and anomalies in metal additive manufacturing;Additive Manufacturing;2024-02

3. Autonomous air traffic separation assurance through machine learning;Journal of Industrial and Management Optimization;2024