Predictive lithology mapping using semisupervised learning: Practical insights using a case study from New South Wales, Australia

Author:

Dunham Michael W. (1), Malcolm Alison (2), Welford J. Kim (2)

Affiliation:

1. Formerly Memorial University of Newfoundland, Department of Earth Sciences, St. John’s, Newfoundland, Canada; presently ALS Goldspot Discoveries, North Vancouver, British Columbia, Canada. (corresponding author)

2. Memorial University of Newfoundland, Department of Earth Sciences, St. John’s, Newfoundland, Canada.

Abstract

We present a comprehensive study of three types of machine learning (unsupervised, supervised, and semisupervised, with an emphasis on the last) for bedrock-lithology classification using a publicly available data set from New South Wales, Australia. The goal of this work is to demonstrate (1) the value that each type of machine learning can provide and (2) which type(s) may be preferable under different circumstances. Training data are characteristically limited for geoscience problems, which makes supervised techniques susceptible to overfitting; we explore whether semisupervised methods can perform better in these circumstances. Using the geophysical data and geologic map provided for the study area, we compare the performance of two supervised methods (the Light Gradient Boosting Machine and eXtreme Gradient Boosting) with one semisupervised algorithm (label propagation [LP]) in three scenarios, each with a different limited set of a priori lithologic constraints (i.e., the training data). Hyperparameter tuning is an essential component of supervised and semisupervised techniques, and the default procedure is to choose the hyperparameter combination with the largest mean cross-validation score. Instead, we use a new hyperparameter selection strategy that considers the mean and the standard deviation of the cross-validation scores simultaneously, and we test it for both supervised and semisupervised methods. The results indicate (1) that the new hyperparameter selection technique can improve the performance of supervised and semisupervised methods by 1%–2% compared with the standard selection approach and (2) that LP can outperform the two supervised methods by up to 10%, although this depends on how the training data are distributed. As for the unsupervised analysis, the clusters indicate heterogeneous regions that correlate well with the high-entropy areas in the supervised and semisupervised results. The clustering thus complements the other two types of machine learning and provides supporting evidence for where more in-depth field mapping may be needed.
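
The abstract does not state how the mean and standard deviation of the cross-validation scores are combined, so the following is only an illustrative sketch of the general idea, not the authors' method. It assumes a hypothetical rule that ranks each hyperparameter combination by its mean score minus a penalty times its standard deviation across folds; the function names, the `penalty` parameter, and the fold scores are all made up for illustration.

```python
from statistics import mean, pstdev


def select_by_mean(cv_scores):
    """Standard rule: pick the combination with the largest mean CV score."""
    return max(cv_scores, key=lambda name: mean(cv_scores[name]))


def select_by_mean_and_std(cv_scores, penalty=1.0):
    """Hypothetical rule (an assumption, not from the paper):
    pick the largest value of mean - penalty * std across folds,
    so that unstable combinations are penalized."""
    def stability_score(name):
        folds = cv_scores[name]
        return mean(folds) - penalty * pstdev(folds)
    return max(cv_scores, key=stability_score)


# Made-up fold scores for two hypothetical hyperparameter combinations.
cv_scores = {
    "combo_a": [0.90, 0.60, 0.95, 0.65, 0.92],  # higher mean, very variable
    "combo_b": [0.79, 0.80, 0.78, 0.81, 0.80],  # slightly lower mean, stable
}

best_mean = select_by_mean(cv_scores)
best_stable = select_by_mean_and_std(cv_scores)
print(best_mean, best_stable)
```

With these invented scores, the mean-only rule favors the combination whose folds vary widely, while the mean-and-standard-deviation rule favors the more consistent one. This is the kind of trade-off such a strategy is meant to expose when training data are limited.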

Funder

Chevron

Natural Sciences and Engineering Research Council of Canada

InnovateNL

Publisher

Society of Exploration Geophysicists

Subject

Geochemistry and Petrology, Geophysics
