Value Creation in Sustainable Energy Transition Using Reinforcement Learning

Author:

Cheraghi Yasaman1,Bratvold Reidar B.1,Muhammad Ressi B.1,Hong Aojie2

Affiliation:

1. Department of Energy Resources, University of Stavanger, Norway

2. Independent Researcher, Stavanger, Norway

Abstract

Abstract The global challenge of climate change has prompted significant steps to reduce CO2 emissions, guided by international agreements like the Paris Agreement, which set targets for the transition from fossil fuels to renewable sources. Acting too slowly could result in future losses and reputational issues, while moving too swiftly could jeopardize shareholder value due to the marginal profitability or potential losses resulting from technology challenges and the immaturity of many renewable projects. To navigate this complex landscape wisely, energy companies will benefit from developing Sequential Decision Making (SDM) policies to maximize value creation from decision flexibility under uncertainties. However, achieving this goal is not trivial. This necessitates the use of decision-analysis and optimization methods that deliver robust decision insights without prohibitive computational costs or delays. In this research, we propose a multi-objective SDM framework to model the dynamic energy landscape up to 2050, allowing exploration of various decision strategies related to different portfolios for allocating funds across three sectors: hydrocarbon, renewables, and CO2 reduction investments. This approach aims to maximize value during the transformation while accounting for uncertainties in hydrocarbon and renewable energy productions, energy prices, and production costs. The framework targets three main objectives: maximizing profit, minimizing CO2 emission, and enhancing competitive advantage. Our research evaluates the benefits of using Reinforcement Learning (RL) to solve for the optimal investment decision policy for the SDM context. The sequential decisions of the energy company shape the virtual dynamic environment by influencing various variables. Subsequently, the environment responds with new variable states and immediate value feedback, reflecting the three defined objectives. Through repeated interactions, the RL algorithm navigates the extensive state space, learning the optimal decision policy amidst uncertainties.

Publisher

SPE

Reference39 articles.

1. Assessing "dangerous climate change": Required reduction of carbon emissions to protect young people, future generations and nature;Hansen;PloS one,2013

2. Global warming, climate change and greenhouse gas mitigation;Kumar;Biofuels: greenhouse gas mitigation and global warming: next generation biofuels and role of biotechnology,2018

3. Paris agreement on climate change, in Reducing Greenhouse Gas Emissions and Improving Air Quality;Erickson,2019

4. Energy Statistics Data Browser;IEA,2023

5. The Energy Transition: Navigating the Shift Towards Renewables in the Oil and Gas Industry;Alagoz;Journal of Energy and Natural Resources,2023

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3