Affiliation:
1. University of Seoul
2. National Institute of Environmental Research
3. Pusan National University
Abstract
Abstract
Machine learning models (MLMs) are increasingly used with remotely sensed data to monitor chlorophyll-a (Chl-a). MLMs require large amounts of remotely sensed data to monitor Chl-a effectively. However, weather conditions, satellite revisit cycles, and coverage constraints can impede the collection of adequate remotely sensed data. To address this, we tested whether MLMs effectively improved the predictions of Chl-a concentrations within the 16 lakes of the Nakdong River in South Korea using two remotely sensed datasets (Sentinel-2 and Landsat-8). This study evaluated four MLMs: Multi-Layer Perceptron (MLP), Support Vector Machine (SVM), Random Forest (RF), and eXetreme Gradient Boosting (XGB) using three different input datasets: 1) two remotely sensed datasets (Sentinel-2 and Landsat-8), 2) Sentinel-2, and 3) Landsat-8. The MLP model with multiple remotely sensed datasets outperformed other MLMs affected by data imbalance. The predictive map of the spatial distribution of Chl-a generated by the MLP model highlighted areas with high and low Chl-a concentrations. In conclusion, this study emphasizes the value of multiple remotely sensed datasets in enhancing the accuracy and reliability of Chl-a estimation, mainly when using the MLP model. This also highlights the need to address data imbalances when predicting Chl-a concentrations using MLMs. These findings provide valuable insights into utilizing MLMs effectively for Chl-a monitoring.
Publisher
Research Square Platform LLC
Reference97 articles.
1. Tree-based ensemble methods for predicting PV power generation and their comparison with support vector regression;Ahmad MW;Energy,2018
2. Alawadi F (2010) Detection of surface algal blooms using the newly developed algorithm surface algal bloom index (SABI). In: Bostater, Jr. CR, Mertikas SP, Neyt X, Velez-Reyes M (eds) Remote Sensing of the Ocean, Sea Ice, and Large Water Regions 2010. 782506
3. The effect of gamma value on support vector machine performance with different kernels;Al-Mejibli IS;Int J Electr Comput Eng,2020
4. Ball JE, Anderson DT, Wei P (2018) State-of-the-Art and Gaps for Deep Learning on Limited Training Data in Remote Sensing. In: IGARSS 2018–2018 IEEE International Geoscience and Remote Sensing Symposium. IEEE, pp 4119–4122
5. Bergstra J, Yamins D, Cox DD (2013) Making a Science of Model Search: Hyperparameter Optimization in Hundreds of Dimensions for Vision Architectures. TProc of the 30th International Conference on Machine Learning (ICML 2013) 115–123