Machine Learning-Based Crop Yield Prediction in South India: Performance Analysis of Various Models

Author:

Nikhil Uppugunduri1,Pandiyan Athiya1ORCID,Raja S.1,Stamenkovic Zoran23

Affiliation:

1. School of Computer Science and Engineering, Vellore Institute of Technology, Vellore 632 014, Tamil Nadu, India

2. Institute for Computational Science, University of Potsdam, An der Bahn 2, 14476 Potsdam, Germany

3. IHP—Leibniz-Institut für Innovative Mikroelektronik, Im Technologiepark 25, 15236 Frankfurt, Germany

Abstract

Agriculture is one of the most important activities that produces crop and food that is crucial for the sustenance of a human being. In the present day, agricultural products and crops are not only used for local demand, but globalization has allowed us to export produce to other countries and import from other countries. India is an agricultural nation and depends a lot on its agricultural activities. Prediction of crop production and yield is a necessary activity that allows farmers to estimate storage, optimize resources, increase efficiency and decrease costs. However, farmers usually predict crops based on the region, soil, weather conditions and the crop itself based on experience and estimates which may not be very accurate especially with the constantly changing and unpredictable climactic conditions of the present day. To solve this problem, we aim to predict the production and yield of various crops such as rice, sorghum, cotton, sugarcane and rabi using Machine Learning (ML) models. We train these models with the weather, soil and crop data to predict future crop production and yields of these crops. We have compiled a dataset of attributes that impact crop production and yield from specific states in India and performed a comprehensive study of the performance of various ML Regression Models in predicting crop production and yield. The results indicated that the Extra Trees Regressor achieved the highest performance among the models examined. It attained a R-Squared score of 0.9615 and showed lowest Mean Absolute Error (MAE) and Root Mean Squared Error (RMSE) of 21.06 and 33.99. Following closely behind are the Random Forest Regressor and LGBM Regressor, achieving R-Squared scores of 0.9437 and 0.9398 respectively. Moreover, additional analysis revealed that tree-based models, showing a R-Squared score of 0.9353, demonstrate better performance compared to linear and neighbors-based models, which achieved R-Squared scores of 0.8568 and 0.9002 respectively.

Funder

German Federal Ministry for Education and Research

Publisher

MDPI AG

Reference43 articles.

1. (2023, November 01). Available online: https://pib.gov.in/PressReleaseIframePage.aspx?PRID=1909213.

2. (2023, November 01). Available online: https://pib.gov.in/PressReleasePage.aspx?PRID=1601902.

3. (2023, November 01). Available online: https://pib.gov.in/PressReleasePage.aspx?PRID=1968931.

4. Venugopal, A., Aparna, S., Mani, J., Mathew, R., and Williams, V. (2021). Crop Yield Prediction using Machine Learning Algorithms. Int. J. Eng. Res. Technol. IJERT, 9, Available online: https://ieeexplore.ieee.org/abstract/document/8985951.

5. Crop yield prediction using machine learning: A systematic literature review;Kassahun;Comput. Electron. Agric.,2020

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3