Affiliation:
1. a National Institute of Technology, Warangal, Telangana, India
2. c Sreenidhi Institute of Science and Technology, Hyderabad, Telangana 501301
3. b Indian Institute of Technology, Kharagpur, West Bengal, India
Abstract
ABSTRACT
In this study, various regression models were utilized to predict total sediment yield in tons, while their performance was evaluated for accuracy and reliability. The dataset utilized contains numerous predictors that have been standardized and processed through principal component analysis to improve model performance. Models evaluated here include linear regression, normalized linear regression, Principal Component Analysis (PCA), Pearson Correlation Coefficient (PCC) with generalized ridge regression, kernel ridge regression, multivariate regression, lasso regression approaches such as artificial neural network Cellular Automata-Artificial Neural Network (CA-ANN or ANN), and more. Results suggest that the ANN model achieved the lowest mean squared error (MSE), 113.641; this suggests superior predictive capability compared to other models. Although environmental data were complex and relationships complex, an ANN model showed less error, followed closely by CA-ANN with an MSE of 124.83. Traditional models such as linear or lasso regression revealed larger errors with negative squared values that indicated poor fits to data. This analysis highlights the effectiveness of advanced machine learning in environmental modeling, emphasizing the importance of selecting models suited to data and specific phenomena, aiding environmental planners in predicting and managing soil erosion and sediment transport.