Affiliation:
1. MUS ALPARSLAN UNIVERSITY
2. VAN YUZUNCU YIL UNIVERSITY
Abstract
The presence of Salmonella in agricultural waters may be a source of produce contamination. Recently, the performance of various algorithms has been tested for the prediction of indicator bacteria population and pathogen occurrence in agricultural water sources. The purpose of this study was to evaluate the effect of meta-heuristic optimization algorithms for feature selection to increase the Salmonella occurrence prediction performance of commonly used algorithms in agricultural waters. Previously collected dataset from six agricultural ponds in Central Florida included the population of indicator microorganisms, physicochemical water attributes, and weather station measurements. Salmonella presence was also reported with PCR-confirmed method in data set. Features were selected by binary meta-heuristic optimization methods including differential evolution optimization (DEO), grey wolf optimization (GWO), Harris hawks optimization (HHO) and particle swarm optimization (PSO). Each meta-heuristic method was run for 100 times for the extraction of features before classification analysis. Selected features after optimization were used in the K-nearest neighbor algorithm (kNN), support vector machine (SVM) and decision tree (DT) classification methods. Microbiological indicators were ranked as the first or second features by all optimization algorithms. Generic Escherichia coli was selected as the first feature 81 and 91 times out of 100 by GWO and DEO, respectively. The meta-heuristic optimization algorithms for the feature selection process followed by machine learning classification methods yielded the prediction accuracy between 93.57 and 95.55%. Meta-heuristic optimization algorithms had a positive effect to improve Salmonella prediction success in agricultural waters despite spatio-temporal variations in agricultural water quality.
Publisher
Ankara University Faculty of Agriculture
Reference50 articles.
1. Abimbola O P, Mittelstet A R, Messer T L, Berry E D, Bartelt-Hunt S L & Hansen S P (2020). Predicting Escherichia coli loads in cascading dams with machine learning: An integration of hydrometeorology, animal density and grazing pattern. The Science of the Total Environment 722: 137894. https://doi.org/10.1016/j.scitotenv.2020.137894
2. Akinola O O, Ezugwu A E, Agushaka J O, Zitar R A & Abualigah L (2022). Multiclass feature selection with metaheuristic optimization algorithms: a review. Neural Computing and Applications 34: 19751-19790. https://doi.org/10.1007/s00521-022-07705-4
3. Agrawal P, Abutarboush H F, Ganesh T & Mohamed A W (2021). Metaheuristic algorithms on feature selection: A survey of one decade of research (2009-2019). IEEE Access 9: 26766-26791. https://doi.org/10.1109/ACCESS.2021.3056407
4. Ashbolt N, Grabow W O K & Snozzi M (2001). Indicators of microbial water quality. In: L Fewtrell & J Bartram (Eds.), Water Quality: Guidelines, Standards and Health, World Health Organization (WHO) IWA Publishing pp. 289-316
5. Ayhan S & Erdoğmuş Ş (2014). Kernel function selection for the solution of classification problems via support vector machines. Destek vektör makineleriyle sınıflandırma problemlerinin çözümü için çekirdek fonksiyonu seçimi (In Turkish). Eskişehir Osmangazi University Journal of Economics and Administrative Sciences 9:175-201