Genomic Prediction of Wheat Grain Yield Using Machine Learning-Reference-Cited by-同舟云学术

Genomic Prediction of Wheat Grain Yield Using Machine Learning

Published:2022-09-06 Issue:9 Volume:12 Page:1406
ISSN:2077-0472
Container-title:Agriculture
language:en
Short-container-title:Agriculture

Author:

Sirsat Manisha Sanjay^ORCID,Oblessuc Paula Rodrigues^ORCID,Ramiro Ricardo S.^ORCID

Abstract

Genomic Prediction (GP) is a powerful approach for inferring complex phenotypes from genetic markers. GP is critical for improving grain yield, particularly for staple crops such as wheat and rice, which are crucial to feeding the world. While machine learning (ML) models have recently started to be applied in GP, it is often unclear what are the best algorithms and how their results are affected by the feature selection (FS) methods. Here, we compared ML and deep learning (DL) algorithms with classical Bayesian approaches, across a range of different FS methods, for their performance in predicting wheat grain yield (in three datasets). Model performance was generally more affected by the prediction algorithm than the FS method. Among all models, the best performance was obtained for tree-based ML methods (random forests and gradient boosting) and for classical Bayesian methods. However, the latter was prone to fitting problems. This issue was also observed for models developed with features selected by BayesA, the only Bayesian FS method used here. Nonetheless, the three other FS methods led to models with no fitting problem but similar performance. Thus, our results indicate that the choice of prediction algorithm is more important than the choice of FS method for developing highly predictive models. Moreover, we concluded that random forests and gradient boosting algorithms generate highly predictive and robust wheat grain yield GP models.

Funder

European Social Fund

Fundação para a Ciência e a Tecnologia

Publisher

MDPI AG

Subject

Plant Science,Agronomy and Crop Science,Food Science

Link

https://www.mdpi.com/2077-0472/12/9/1406/pdf

Reference61 articles.

1. Prediction of Total Genetic Value Using Genome-Wide Dense Marker Maps

2. Molecular Markers and Selection for Complex Traits in Plants: Learning from the Last 20 Years