Ensemble machine learning-based recommendation system for effective prediction of suitable agricultural crop cultivation

Author:

Hasan Mahmudul,Marjan Md Abu,Uddin Md Palash,Afjal Masud Ibn,Kardy Seifedine,Ma Shaoqi,Nam Yunyoung

Abstract

Agriculture is the most critical sector for food supply on the earth, and it is also responsible for supplying raw materials for other industrial productions. Currently, the growth in agricultural production is not sufficient to keep up with the growing population, which may result in a food shortfall for the world’s inhabitants. As a result, increasing food production is crucial for developing nations with limited land and resources. It is essential to select a suitable crop for a specific region to increase its production rate. Effective crop production forecasting in that area based on historical data, including environmental and cultivation areas, and crop production amount, is required. However, the data for such forecasting are not publicly available. As such, in this paper, we take a case study of a developing country, Bangladesh, whose economy relies on agriculture. We first gather and preprocess the data from the relevant research institutions of Bangladesh and then propose an ensemble machine learning approach, called K-nearest Neighbor Random Forest Ridge Regression (KRR), to effectively predict the production of the major crops (three different kinds of rice, potato, and wheat). KRR is designed after investigating five existing traditional machine learning (Support Vector Regression, Naïve Bayes, and Ridge Regression) and ensemble learning (Random Forest and CatBoost) algorithms. We consider four classical evaluation metrics, i.e., mean absolute error, mean square error (MSE), root MSE, and R2, to evaluate the performance of the proposed KRR over the other machine learning models. It shows 0.009 MSE, 99% R2 for Aus; 0.92 MSE, 90% R2 for Aman; 0.246 MSE, 99% R2 for Boro; 0.062 MSE, 99% R2 for wheat; and 0.016 MSE, 99% R2 for potato production prediction. The Diebold–Mariano test is conducted to check the robustness of the proposed ensemble model, KRR. In most cases, it shows 1% and 5% significance compared to the benchmark ML models. Lastly, we design a recommender system that suggests suitable crops for a specific land area for cultivation in the next season. We believe that the proposed paradigm will help the farmers and personnel in the agricultural sector leverage proper crop cultivation and production.

Publisher

Frontiers Media SA

Subject

Plant Science

Reference65 articles.

1. Impact of climate change on dryland agricultural systems: A review of current status, potentials, and further work need;Ahmed;Int. J. Plant Production.,2022

2. Prediction of potato crop yield using precision agriculture techniques;Al-Gaadi;PloS One,2016

3. An adaptive spatiotemporal agricultural cropland temperature prediction system based on ground and satellite measurements;Bagis,2012

4. Assessment of the effect of climate change on boro rice production in Bangladesh using DSSAT model;Basak;J. Civil Eng. (IEB).,2010

5. D-swoosh: A family of algorithms for generic, distributed entity resolution;Benjelloun,2007

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3