A study on comparison of various machine learning models for the best prediction of 305 days first lactation milk yield-Reference-Cited by-同舟云学术

A study on comparison of various machine learning models for the best prediction of 305 days first lactation milk yield

Published:2024-06-11 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

FRAZ NAYLA¹,SHAHI B. N.¹,BARWAL R. S.¹,GHOSH A. K.¹,SINGH C. V.¹,KUMAR PANKAJ¹

Affiliation:

1. Govind Ballabh Pant University of Agriculture and Technology

Abstract

Machine learning models can be used in dairy industries for the prediction of milk yield in dairy cattle to increase the efficiency of dairy farms and early culling of animals based on 305 days milk yield. Analysis and evaluation of the performances of Multiple linear regression (MLR), Random forest (RF), Gradient boosting regression (GBR), Extreme gradient boosting (XGboost) and Light gradient boosting (lightGBM) were done on the basis of root mean square errors (RMSE) and coefficient of determination (R²) values. The values of RMSE for MLR, RF, GBR, XGboost and lightGBM for the training period were 478.82, 176.52, 229.65, 271.44 and 214.97 and for the testing period were 469.02, 267.13, 288.10, 338.36 and 293.80, respectively. Similarly, the values of R² for the training period were 0.76, 0.92, 0.86, 0.81 and 0.88 and for the testing period were 0.55, 0.85, 0.82, 0.76 and 0.82, respectively. The results obtained suggested that the accuracy and precision of RF, LightGBM, GBR and XGboost models were adequate in predicting first lactation 305 days milk yield, but the best results were obtained by RF in both training and testing period; it outperformed other regression models in predicting first lactation 305 days milk yield. Further, an increase in accuracy and precision can be done by increasing the number of independent variables with a high correlation with the dependent variable and by also increasing the number of observations.

Publisher

Research Square Platform LLC

Reference27 articles.

1. Comparison of lactation curve models for fortnightly test day milk yield;Arya V;Indian Journal of Animal Science,2020

2. Random forests;Breiman L;Machine Learning Sci. Technology,2001

3. Prediction and analysis of net ecosystem carbon exchange based on gradient boosting regression and random forest;Cai J;Applied Energy,2020

4. Assessing the transferability of support vector machine model for estimation of global solar radiation from air temperature;Chen J;Energy Convers Management,2015

5. XGBoost: A scalable tree boosting system;Chen T;CoRR.,2016