Evaluation of machine learning algorithms for groundwater quality modeling

Author:

Sahour Soheil (1), Khanbeyki Matin (2), Gholami Vahid (3), Sahour Hossein (4), Kahvazade Irene (4), Karimi Hadi (4)

Affiliation:

1. Islamic Azad University Sari Branch

2. University of Tehran

3. University of Guilan

4. Western Michigan University

Abstract

Groundwater quality is measured through water sampling and laboratory analysis. These field-based measurements are costly and time-consuming when applied over a large domain. In this study, we developed a machine learning-based framework to map groundwater quality in an unconfined aquifer in the north of Iran. Groundwater samples were obtained from 248 monitoring wells across the region. The groundwater quality index (GWQI) at each well was determined and assigned to one of four classes (Very poor, Poor, Good, or Excellent) according to the class cut-off values. Factors affecting groundwater quality, including distance to industrial centers, distance to residential areas, population density, aquifer transmissivity, precipitation, evaporation, geology, and elevation, were identified and prepared in a GIS environment. Six machine learning classifiers, namely extreme gradient boosting (XGB), random forest (RF), support vector machine (SVM), artificial neural networks (ANN), k-nearest neighbor (KNN), and Gaussian classifier model (GCM), were used to establish relationships between GWQI and its controlling factors. The algorithms were evaluated using the receiver operating characteristic (ROC) curve and statistical efficiency measures (overall accuracy, precision, recall, and F1 score). The accuracy assessment showed that the ML algorithms predicted groundwater quality with high accuracy. RF was selected as the optimum model because it achieved the highest accuracy (overall accuracy, precision, and recall = 0.92; ROC = 0.95). The trained RF model was used to map GWQI classes across the entire region. Results showed that the Poor GWQI class is dominant in the study area, that Good GWQI is found in the southwest, and that an area of Very poor GWQI occurs in the north. The findings indicated that distance to industrial locations is the main factor affecting groundwater quality in the area. The study provides a cost-effective methodology for groundwater quality modeling that can be replicated in other regions with similar hydrological and geological settings.
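The statistical efficiency measures named in the abstract can be illustrated with a minimal sketch. It is not the authors' code: the function name `classification_scores`, the eight example well labels, and the use of macro averaging across the four GWQI classes are all assumptions made for illustration, since the abstract does not state how per-class scores were averaged. The ROC analysis is omitted because it requires predicted class probabilities rather than hard labels.

```python
from collections import Counter

# The four GWQI classes named in the abstract.
CLASSES = ["Very poor", "Poor", "Good", "Excellent"]


def classification_scores(y_true, y_pred, classes=CLASSES):
    """Overall accuracy plus macro-averaged precision, recall, and F1.

    Macro averaging (an unweighted mean over classes) is an assumption;
    the abstract does not say which averaging scheme was used.
    All labels are assumed to belong to `classes`.
    """
    if len(y_true) != len(y_pred) or len(y_true) == 0:
        raise ValueError("y_true and y_pred must be non-empty and equal length")

    # Confusion counts keyed by (true class, predicted class).
    pairs = Counter(zip(y_true, y_pred))
    accuracy = sum(pairs[(c, c)] for c in classes) / len(y_true)

    precisions, recalls, f1s = [], [], []
    for c in classes:
        tp = pairs[(c, c)]
        predicted = sum(pairs[(t, c)] for t in classes)  # column total
        actual = sum(pairs[(c, p)] for p in classes)     # row total
        precision = tp / predicted if predicted else 0.0
        recall = tp / actual if actual else 0.0
        denom = precision + recall
        f1 = 2 * precision * recall / denom if denom else 0.0
        precisions.append(precision)
        recalls.append(recall)
        f1s.append(f1)

    n = len(classes)
    return {
        "accuracy": accuracy,
        "precision": sum(precisions) / n,
        "recall": sum(recalls) / n,
        "f1": sum(f1s) / n,
    }


# Hypothetical labels for eight wells (invented for illustration, not study data).
y_true = ["Very poor", "Poor", "Poor", "Poor", "Good", "Good", "Excellent", "Excellent"]
y_pred = ["Very poor", "Poor", "Poor", "Good", "Good", "Good", "Excellent", "Good"]

scores = classification_scores(y_true, y_pred)
print(scores)
```

In practice, a library routine such as scikit-learn's `classification_report` would normally be used to compute these measures; the hand-written version only makes the definitions explicit.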

Publisher

Research Square Platform LLC

