Abstract
Displaying clustered data spatially as regions (bordered areas) using machine learning (ML) is currently unfeasible. This problem arises in many research fields whose workflows rely on clustering algorithms. In this study, we present an approach that uses ML models, which can be trained on any specific dataset, to produce decision boundaries. These boundaries are overlaid onto the geographic coordinate system (GCS) to generate geographic clustering regions. The approach is distributed through the Python Package Index (PyPI) as a geovisualization library called geographic decision zones (GeoZ). The efficiency of GeoZ was tested using a dataset of groundwater wells in the State of California. We experimented with 13 different ML models to determine which one best predicts the existing regional distribution (subbasins). The support vector machine (SVM) algorithm achieved a relatively high accuracy score and met the required criteria better than the other models. Consequently, the tested SVM model with optimized parameters was implemented in the GeoZ open-source library. Limitations in applying GeoZ may nevertheless arise from the nature of the SVM algorithm and from the volume, discontinuity, and distribution of the data. We have attempted to address these limitations through various suggestions and solutions.
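The idea of turning a trained classifier's decision boundaries into map regions can be illustrated with a minimal sketch. It does not use the GeoZ API; it assumes scikit-learn's `SVC` as the classifier, and the synthetic coordinates, the three region centers, and the values of `C` and `gamma` are all hypothetical choices made for illustration only.

```python
import numpy as np
from sklearn.svm import SVC

rng = np.random.default_rng(0)

# Hypothetical (longitude, latitude) centers of three labeled regions.
centers = np.array([[-121.5, 38.5], [-119.5, 36.5], [-117.5, 34.5]])

# Synthetic point data scattered around each center (50 points per region).
points = np.vstack([c + rng.normal(scale=0.3, size=(50, 2)) for c in centers])
labels = np.repeat(np.arange(len(centers)), 50)

# Train an RBF-kernel SVM that maps coordinates to a region label.
# C and gamma are illustrative, not the optimized values from the study.
model = SVC(kernel="rbf", C=10.0, gamma="scale")
model.fit(points, labels)

# Build a regular longitude/latitude grid covering the data extent.
lon = np.linspace(points[:, 0].min() - 0.5, points[:, 0].max() + 0.5, 200)
lat = np.linspace(points[:, 1].min() - 0.5, points[:, 1].max() + 0.5, 200)
lon_grid, lat_grid = np.meshgrid(lon, lat)

# Predict a label for every grid cell; cells sharing a label form one zone,
# and the borders between zones are the model's decision boundaries.
grid_points = np.column_stack([lon_grid.ravel(), lat_grid.ravel()])
zones = model.predict(grid_points).reshape(lon_grid.shape)

# Accuracy on the training points, as a rough sanity check only.
train_accuracy = model.score(points, labels)
```

The resulting `zones` array can be drawn over a base map with a filled contour plot (for example, matplotlib's `contourf` called with `lon_grid`, `lat_grid`, and `zones`), which yields bordered regions in geographic coordinates.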
Funder
Research Affairs Office, UAE University
Publisher
Springer Science and Business Media LLC
Subject
Earth and Planetary Sciences (miscellaneous),Computers in Earth Sciences,Geography, Planning and Development
Cited by
7 articles.