Unveiling the Potential of Random Undersampling in Geothermal Lithology Classification for Improved Geothermal Resource Exploration

Author:

Obika F. C.1,Okereke N. U.1,Eze F. M.1,Ekeh B. C.1

Affiliation:

1. Department of Petroleum Engineering, Federal University of Technology, Owerri, FUTO, Owerri, Imo State

Abstract

Abstract Lithology classification in geothermal exploration has been of great significance in the understanding of subsurface geology and geophysics, which can enhance the exploration and exploitation of geothermal resources. Alongside other known industrial means of classifying lithologies, the application of machine learning models has shown viable prospects in this regard. However, there seems to be poor accuracy in the performance of some of these models due to class imbalance associated with the lithologies to be classified. Hence, in this study, robust class imbalance handling techniques were investigated to efficiently classify lithology in a geothermal field. The investigated techniques which involved Synthetic Minority Oversampling Technique (SMOTE), Random Oversampling (RO), Random Undersampling (RU), and the Near Miss Undersampling (NMU) Techniques, were each employed with two ensemble bagging methods; Random Forest Classifier (RFC) and Balanced Bagging Classifier (BBC). F1 score was the key evaluation metric, as it considers both precision and recall, giving a more comprehensive picture of the models’ performance. It was observed that by leveraging real-time drilling data such as mud flow in, rate of penetration (ROP), surface torque, pump pressure and rotary speed as input parameters, RFC performed better with the resampling techniques than BBC did. Moreover, RFC combined with RU greatly outperformed other combination techniques in the prediction of the geothermal lithology with an F1 score of 93.6% for the minority class (Plutonic) and 99.3% for the majority class (Alluvium) on the testing dataset, while other combinations had F1 scores of less than 37%. This solution alongside other vital insights from this study, showed that class imbalance handling techniques can be efficiently adopted towards building more robust machine learning models for geothermal resource exploration with prevailing high temperature and unfavorable subsurface conditions that limit the use of known traditional methods.

Publisher

SPE

Reference34 articles.

1. SMOTE and Nearmiss Methods for Disease Classification with Unbalanced Data;Alamsyah;Proceedings of The International Conference on Data Science and Official Statistics,2022

2. The Challenge of Correcting Bottom-Hole Temperatures - An Example from FORGE 58-32, near Milford, Utah;Allis;43rd Workshop on Geothermal Reservoir Engineering,2018

3. Balanced training of a hybrid ensemble method for imbalanced datasets: a case of emergency department readmission prediction;Artetxe;Neural Computing and Applications,2020

4. Principal component analysis (PCA) based hybrid models for the accurate estimation of reservoir water saturation;Asante-Okyere;Computers and Geosciences,2020

5. Learning from imbalanced data using methods of sample selection;Chairi;Proceedings of 2012 International Conference on Multimedia Computing and Systems, ICMCS 2012,2012

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3