Determination of bioavailable arsenic threshold and validation of modeled permissible total arsenic in paddy soil using machine learning

Author:

Mandal Jajati12ORCID,Jain Vinay3,Sengupta Sudip45,Rahman Md. Aminur67,Bhattacharyya Kallol4,Rahman Mohammad Mahmudur68,Golui Debasis910ORCID,Wood Michael D.1ORCID,Mondal Debapriya11ORCID

Affiliation:

1. School of Science, Engineering and Environment University of Salford Salford UK

2. CSIRO, Land and Water Waite Campus Urrbrae SA Australia

3. Centre of Excellence Agilent Technologies (International) Pvt. Ltd Manesar Haryana India

4. Department of Agricultural Chemistry and Soil Science Bidhan Chandra Krishi Viswavidyalaya Nadia West Bengal India

5. School of Agriculture Swami Vivekananda University Barrackpore West Bengal India

6. Global Centre for Environmental Remediation (GCER), College of Engineering, Science and Environment The University of Newcastle Callaghan New South Wales Australia

7. Department of Public Health Engineering (DPHE) Zonal Laboratory Jashore Khulna Bangladesh

8. Department of General Educational Development, Faculty of Science & Information Technology Daffodil International University Savar, Dhaka Bangladesh

9. Department of Civil, Construction and Environmental Engineering North Dakota State University Fargo North Dakota USA

10. Division of Soil Science and Agricultural Chemistry ICAR‐Indian Agricultural Research Institute New Delhi India

11. Department of Population Health, Faculty of Epidemiology and Population Health School of Hygiene & Tropical Medicine London, England UK

Abstract

AbstractMinimizing arsenic intake from food consumption is a key aspect of the public health response in arsenic (As)‐contaminated regions. In many of these regions, rice is the predominant staple food. Here, we present a validated maximum allowable concentration of total As in paddy soil and provide the first derivation of a maximum allowable soil concentration for bioavailable As. We have previously used meta‐analysis to predict the maximum allowable total As in soil based on decision tree (DT) and logistic regression (LR) models. The models were defined using the maximum tolerable concentration (MTC) of As in rice grains as per the codex recommendation. In the present study, we validated these models using three test data sets derived from purposely collected field data. The DT model performed better than the LR in terms of accuracy and Matthews correlation coefficient (MCC). Therefore, the DT estimated maximum allowable total As in paddy soil of 14 mg kg−1 could confidently be used as an appropriate guideline value. We further used the purposely collected field data to predict the concentration of bioavailable As in the paddy soil with the help of random forest (RF), gradient boosting machine (GBM), and LR models. The category of grain As (<MTC and >MTC) was considered as the dependent variable; bioavailable As (BAs), total As (TAs), pH, organic carbon (OC), available phosphorus (AvP), and available iron (AvFe) were the predictor variables. LR performed better than RF and GBM in terms of accuracy, sensitivity, specificity, kappa, precision, log loss, F1score, and MCC. From the better‐performing LR model, bioavailable As (BAs), TAs, AvFe, and OC were significant variables for grain As. From the partial dependence plots (PDP) and individual conditional expectation (ICE) of the LR model, 5.70 mg kg−1 was estimated to be the limit for BAs in soil.

Publisher

Wiley

Subject

Management, Monitoring, Policy and Law,Pollution,Waste Management and Disposal,Water Science and Technology,Environmental Engineering

Reference53 articles.

1. Akosa J. S.(2017).Predictive accuracy: A misleading performance measure for highly imbalanced data.Proceedings of the SAS Global Forum 2017 Conference.SAS Institute Inc.

2. The Matthews correlation coefficient (MCC) is more reliable than balanced accuracy, bookmaker informedness, and markedness in two‐class confusion matrix evaluation;Chicco D.;BioData Mining,2021

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3