An approach for good modelling and forecasting of sea surface salinity in a coastal zone using machine learning LASSO regression models built with sparse satellite time-series datasets

Author:

Ajibola-James Opeyemi1ORCID,Okeke Francis I.2

Affiliation:

1. Geo Inheritance Limited

2. University of Nigeria, Enugu Campus, Nigeria

Abstract

Abstract The risks of upstream seawater intrusion from coastal zones, particularly to the environment and people’s health,are gradually becoming serious issues thatrequire proactive environmental monitoring and good modellingapproaches. However, the temporal resolutions of relevant contemporary all-weather satellites that detect SSS are unable to support real-time applicationsthat can provide the required early warning information for mitigating such risks. Our current practical knowledge of the efficiency of machine learning (ML) least absolute shrinkage and selection operator (LASSO) regression modelsbuilt with relatively sparse all-weather satellite data for achieving relatively accurate predictor variable selection,collinearity detection,and high SSS prediction accuracy is still limited. In this paper, we utilized relatively sparse time series all-weather satellite datasets consisting of 6 potential predictor variables (PPVs), wind speed (WS), high wind speed (HWS), sea surface temperature (SST), absolute dynamic topography (ADT), sea level anomalies (SLAs) and precipitation (PRECIP) (January 2016-December 2020) to construct an ML LASSO model (using the forecastMLlibrary in R/R-studio) to predict SSS ona tropical coast (Nigerian coastal zone). We utilized the same datasets for building the L0-regularized regression (L0) model (using the L0Learnlibrary) to determine the relative importance of the PPVs for the ML time series forecasting of the SSS and to detect collinearity. The output was used to determinethe abilityof the LASSO model to determinethe relative importance of the PPVs for forecasting SSS and detecting collinearity. We determinedthe best combination of lookback (LB) and h-step-ahead (H) parametervalues for building a relatively accurate ML LASSO model with the datasets. We determinedand validatedthe relative importance of the PPVs for forecasting the monthly SSS using the LASSO model with the best combination of parametervalues. We predict and validate the monthly SSS values for January-December 2021 with a relatively accurate model. We show that the LB:24 and H:12 parametervalues,with an RMSE of 0.54437, are the best for building a relatively accurate LASSO model with such datasets. We show that the WS, HWS, and SLA are the most important PPVs for achieving relatively accurate SSS forecasts with the model. However, we show the limitations of such a LASSO model in achieving relatively accurate predictor variable selection and collinearity detection. We show practical solutions to such limitations by utilizing the L0 model to assist the LASSO model in achieving relatively high SSS prediction accuracy. Finally, we predict the monthly SSS values using the relatively accurate LASSO model and validate them with the observed SSS (January-December 2021) and obtain an RMSE of 0.7428 and a MAPE of 1.9039%. AMAPE value approximately5 times less than 10% implies a high SSS prediction accuracy that can be replicated to provide useful early warning information for mitigating such risks in any coastal zone. The results imply that the good practice for using such satellite datasets to build a relatively accurate ML LASSO model for forecasting should begin with rigorous supervised-automatic deletion of observation records with null values and outliers,followed by unbiased selection of appropriate parametervalues and important predictor variables and collinearity assessment.

Publisher

Research Square Platform LLC

Reference31 articles.

1. Experimental investigation and prediction of performance and emission responses of a CI engine fuelled with different metal-oxide based nanoparticles–diesel blends using different machine learning algorithms;Ağbulut Ü;Energy,2021

2. Ajibola-James O, Okeke FI, Ojinnaka OC (2023) Assessment of variability of sea surface salinity using integrated all-weather satellite data in a tropical coast (Nigerian coastal zone). Research Square. Preprint. https://doi.org/10.21203/rs.3.rs-3449318/v1

3. Anyikwa OB, Martinez N (2012) Continental Shelf Act, 2012. A legislation drafting project submitted in partial fulfilment of the requirements for the award of the Degree of Master of Laws (LL.M.) in International Maritime Law at the International Maritime Law Institute, IMO. 1–40. https://imli.org/wp-content/uploads/2021/03/Obiora-Bede-Anyikwa.pdf

4. Aquarius SAC-D (2018) Aquarius salinity validation analysis. Aquarius Project Document: AQ-014-PS-0016, 1–45. https://salinity.oceansciences.org/docs/AQ-014-PS-0016_AquariusSalinityDataValidationAnalysis_DatasetVersion5.0.pdf

5. Bindoff NL, Cheung WWL, Kairo JG, Arístegui J, Guinder VA, Hallberg R, Hilmi N, Jiao N, Karim MS, Levin L, O’Donoghue S, Purca Cuicapusa SR, Rinkevich B, Suga T, Tagliabue A, Williamson P (2019) Changing ocean, marine ecosystems, and dependent communities. In H.-O. Pörtner, D.C. Roberts, V. Masson-Delmotte, P. Zhai, M. Tignor, E. Poloczanska, K. Mintenbeck, A. Alegría, M. Nicolai, A. Okem, J. Petzold, B. Rama, N.M. Weyer (Eds.), IPCC Special Report on the Ocean and Cryosphere in a Changing Climate, 447–588. In press. https://www.ipcc.ch/site/assets/uploads/sites/3/2019/11/09_SROCC_Ch05_FINAL-1.pdf

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3