Choosing Appropriate Regularization Parameters by Splitting Data into Training and Validation Sets—Application in Global Surface-Wave Tomography

Author:

Ma Zhitu1,Mir Ramees R.1ORCID,Dalton Colleen A.2ORCID,Godfrey Karen E.3ORCID

Affiliation:

1. 1State Key Laboratory of Marine Geology, Tongji University, Shanghai, China

2. 2Department of Earth, Environmental and Planetary Sciences, Brown University, Providence, Rhode Island, U.S.A.

3. 3Shell International Exploration and Production Inc., Houston, Texas, U.S.A.

Abstract

Abstract Many linear(ized) geophysical inverse problems cannot be solved without regularization. Finding the regularization parameter that best balances the model complexity and data misfit is often a key step in the inversion problem. Traditionally, this is done by first plotting the measure of model complexity versus data misfit for different values of regularization parameter, which manifests as an L-shaped curve, and then choosing the regularization parameter corresponding to the corner point on the L-curve. For this approach, the difference in units between model complexity and data misfit must be considered, otherwise the result will be strongly affected by the scaling between these two quantities. Inspired by the machine learning literature, we here propose an extension to the traditional L-curve method. We first split the raw dataset into training and validation sets, obtain a solution by performing inversion on the training set only, and calculate data misfits on the validation set. We demonstrate the efficacy of this approach with a toy example and with two synthetic datasets. In realistic global surface-wave tomography studies where sampling of the Earth is nonuniform, we devise a procedure to generate a validation dataset with sampling as uniform as possible. We then show that the regularization parameter can be determined using this validation set, and this determination is apparently robust to the ratio of data split between training and validation sets. For both synthetic tests and realistic inversions, we find that our procedure can produce a minimal point that can be easily identified on the misfit curves calculated on the validation sets, and avoids the nuances encountered in the traditional L-curve analysis.

Publisher

Seismological Society of America (SSA)

Reference38 articles.

1. The resolving power of gross earth data;Backus;Geophy. J. Int.,1968

2. Uniqueness in inversion of inaccurate gross earth data;Backus;Phil. Trans. Roy. Soc. Lond. A,1970

3. The current limits of resolution for surface wave tomography in North America;Bassin;Eos Trans. AGU,2000

4. Velocity variations and uncertainty from transdimensional P-wave tomography of North America;Burdick;Geophys. J. Int.,2017

5. Occam’s inversion: A practical algorithm for generating smooth models from electromagnetic sounding data;Constable;Geophysics,1987

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3