Application of Data Mining to Small Data Sets: Identification of Key Production Drivers in Heterogeneous Unconventional Resources

Author:

Ning Yanrui1ORCID,Schumann Harrison2,Jin Ge2

Affiliation:

1. Colorado School of Mines (Corresponding author)

2. Colorado School of Mines

Abstract

Summary In this study, we developed a data mining-based multivariate analysis (MVA) workflow to identify correlations in complex high-dimensional data sets of small size. The research was motivated by the integration analysis of geologic, geophysical, completion, and production data from a 4-square-mile study field located in the Northern Denver-Julesburg (DJ) Basin, Colorado, USA. The goal is to establish a workflow that can extract learnings from a small data set to guide the future development of surrounding acreages. In this research, we propose an MVA workflow, which is modified significantly based on the random forest algorithm and assessed using the R2 score from K-fold cross-validation (CV). The MVA workflow performs significantly better in small data sets compared to traditional feature selection methods. This is because the MVA workflow includes (1) the selection of top-performing feature combinations at each step, (2) iterations embedded, (3) avoidance of random correlation, and (4) the summarization of each feature’s occurrence at the end. When the MVA workflow was initially applied on a complex synthetic small data set that included numerical and categorical variables, linear and nonlinear relationships, relationships within independent variables, and high dimensionality, it correctly identified all correlating variables and outperformed traditional feature selection methods. Following that, a field data set consisting of the information from 23 wells was investigated using the MVA workflow aiming at identifying the key factors that affect the production performance in the study area. The MVA workflow reveals the weak correlation between production and legacy well effect. The results show that the key factors affecting production in this study area are total organic carbon (TOC) percentage, open fracture densities, clay content, and legacy well effect, which should receive significant attention when developing neighboring acreage of the DJ Basin. More importantly, this MVA method can be implemented in other basins. Considering the heterogeneity of unconventional resources, it is worthwhile to identify the key production drivers on a small scale. The outperformance of this MVA method on small data sets makes it possible to provide valuable insights for each specific acreage.

Publisher

Society of Petroleum Engineers (SPE)

Subject

Geology,Energy Engineering and Power Technology,Fuel Technology

Reference16 articles.

1. Testing XLE For Cost Savings in the DJ Basin: A Fiber Optic Case Study;Barhaug,2022

2. Stimulating Unconventional Reservoirs: Lessons Learned, Successful Practices, Areas for Improvement;Cramer,2008

3. Downard, A . 2021. Faulting and Natural Fracturing Across the DJ Basin: Impacts on Production from Hereford Field, Northern Colorado. Master Thesis, Colorado School of Mines, Golden, Colorado, USA.

4. Functional Approach to Data Mining, Forecasting, and Uncertainty Quantification in Unconventional Reservoirs;Grujic,2015

5. Understanding Well Performance of Unconventional Extended Laterals in New Mexico, Delaware Basin;Han,2021

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3