Author:
Kismiantini ,Montesinos-López Abelardo,Cano-Páez Bernabe,Montesinos-López J. Cricelio,Chavira-Flores Moisés,Montesinos-López Osval A.,Crossa José
Abstract
While genomic selection (GS) began revolutionizing plant breeding when it was proposed around 20 years ago, its practical implementation is still challenging as many factors affect its accuracy. One such factor is the choice of the statistical machine learning method. For this reason, we explore the tuning process under a multi-trait framework using the Gaussian kernel with a multi-trait Bayesian Best Linear Unbiased Predictor (GBLUP) model. We explored three methods of tuning (manual, grid search and Bayesian optimization) using 5 real datasets of breeding programs. We found that using grid search and Bayesian optimization improve between 1.9 and 6.8% the prediction accuracy regarding of using manual tuning. While the improvement in prediction accuracy in some cases can be marginal, it is very important to carry out the tuning process carefully to improve the accuracy of the GS methodology, even though this entails greater computational resources.
Funder
Bill and Melinda Gates Foundation
SAID projects
CIMMYT CRP
Foundation for Research Levy on Agricultural Products
Subject
Genetics (clinical),Genetics
Reference23 articles.
1. lme4GS: An R-Package for Genomic Selection;Crossa;Front. Genet.,2021
2. Montesinos-López, O.A., Montesinos-López, A., Cano-Paez, B., Hernández-Suárez, C.M., Santana-Mancilla, P.C., and Crossa, J. (2022). A Comparison of Three Machine Learning Methods for Multivariate Genomic Prediction Using the Sparse Kernels Method (SKM) Library. Genes, 13.
3. Epistasis: What it means, what it doesn’t mean, and statistical methods to detect it in humans;Cordell;Hum. Mol. Genet.,2002
4. Effective genetic-risk prediction using mixed models;Golan;Am. J. Hum. Genet.,2014
5. Genomic-assisted prediction of genetic value with semi parametric procedures;Gianola;Genetics,2006