Author:
Shahhosseini Mohsen,Hu Guiping,Huber Isaiah,Archontoulis Sotirios V.
Abstract
AbstractThis study investigates whether coupling crop modeling and machine learning (ML) improves corn yield predictions in the US Corn Belt. The main objectives are to explore whether a hybrid approach (crop modeling + ML) would result in better predictions, investigate which combinations of hybrid models provide the most accurate predictions, and determine the features from the crop modeling that are most effective to be integrated with ML for corn yield prediction. Five ML models (linear regression, LASSO, LightGBM, random forest, and XGBoost) and six ensemble models have been designed to address the research question. The results suggest that adding simulation crop model variables (APSIM) as input features to ML models can decrease yield prediction root mean squared error (RMSE) from 7 to 20%. Furthermore, we investigated partial inclusion of APSIM features in the ML prediction models and we found soil moisture related APSIM variables are most influential on the ML predictions followed by crop-related and phenology-related variables. Finally, based on feature importance measure, it has been observed that simulated APSIM average drought stress and average water table depth during the growing season are the most important APSIM inputs to ML. This result indicates that weather information alone is not sufficient and ML models need more hydrological inputs to make improved yield predictions.
Funder
National Science Foundation
Publisher
Springer Science and Business Media LLC
Reference80 articles.
1. Archontoulis, S. V. et al. Predicting crop yields and soil–plant nitrogen dynamics in the US Corn Belt. Crop Sci. 60, 721–738 (2020).
2. Bogard, M. et al. Linking genetic maps and simulation to optimize breeding for wheat flowering time in current and future climates. Crop Sci. 60, 678–699 (2020).
3. Ersoz, E. S., Martin, N. F. & Stapleton, A. E. On to the next chapter for crop breeding: Convergence with data science. Crop Sci. 60, 639–655 (2020).
4. Washburn, J. D., Burch, M. B. & Franco, J. A. V. Predictive breeding for maize: Making use of molecular phenotypes, machine learning, and physiological crop models. Crop Sci. 60, 622–638 (2020).
5. Karpatne, A., Watkins, W., Read, J. & Kumar, V. Physics-guided neural networks (pgnn): An application in lake temperature modeling. arXiv Preprint arXiv:1710.11431 (2017).
Cited by
205 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献