Finding the best-fit background function for whole-powder-pattern fitting using LASSO combined with tree search

Author:

Toraya Hideo

Abstract

A new linear function for modelling the background in whole-powder-pattern fitting has been derived by applying LASSO (least absolute shrinkage and selection operator) and the technique of tree search. The background function (BGF) consists of terms b n L(2θ/180)n/2 and b n H(1 − 2θ/180)n/2 for the low- and high-angle sides, respectively. Some variable parameters of the BGF should be fixed at zero while others should be varied in order to find the best fit for a given data set without inducing overfitting. The LASSO algorithm can automatically select the variables in linear regression analysis. However, it finds the best-fit BGF with a set of adjustable parameters for a given data set while it derives a different set of parameters for a different data set. Thus, LASSO derives multiple solutions depending on the data set used. By regarding the individual solutions from LASSO as nodes of trees, tree structures were constructed from these solutions. The root node has the maximum number of adjustable parameters, P. P decreases with descending levels of the tree one by one, and leaf nodes have just one parameter. By evaluating individual solutions (nodes) by their χ2 index, the best-fit single path from a root node to a leaf node was found. The present BGF can be used simply by varying P in the range 1–10. The BGF thus derived as a final single solution was incorporated into computer programs for Pawley-based whole-powder-pattern decomposition and Rietveld refinement, and the performance of the BGF was tested in comparison with the polynomials currently widely used as the BGF. The present BGF has been demonstrated to be stable and to give an excellent fit, comparable to polynomials but with a smaller number of adjustable parameters and without introducing undulation into the calculated background curve. Basic algorithms used in statistics and machine learning have been demonstrated to be useful in developing an analytical model in X-ray crystallography.

Publisher

International Union of Crystallography (IUCr)

Subject

General Biochemistry, Genetics and Molecular Biology

Cited by 3 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3