Author:
Nascimento Moyses, Nascimento Ana Carolina Campana, Azevedo Camila Ferreira, Oliveira Antonio Carlos Baiao de, Caixeta Eveline Teixeira, Jarquin Diego
Abstract
Coffee breeding programs have traditionally relied on observing plant characteristics over many years, a slow and costly process. Genomic selection (GS) offers a DNA-based alternative for faster selection of superior cultivars. Stacking Ensemble Learning (SEL) combines the predictions of multiple models and may make selection more accurate still. This study explores the potential of SEL in coffee breeding, aiming to improve prediction accuracy for important traits in Coffea arabica: yield (YL), total number of fruits (NF), leaf miner infestation (LM), and cercosporiosis incidence (Cer). We analyzed data from 195 individuals genotyped for 21,211 single-nucleotide polymorphism (SNP) markers. To assess model performance, we employed a cross-validation (CV) scheme. Genomic Best Linear Unbiased Prediction (GBLUP), Multivariate Adaptive Regression Splines (MARS), Quantile Random Forest (QRF), and Random Forest (RF) served as base learners. For the meta-learner within the SEL framework, several options were explored: Ridge Regression, RF, GBLUP, and Single Average. SEL was able to predict the important traits in Coffea arabica, and its predictive ability (PA) was higher than that of every base learner. Relative to GBLUP, the gains in PA (computed from the ratio between the PA of the best stacking model and that of GBLUP) were 87.44%, 37.83%, 199.82%, and 14.59% for YL, NF, LM, and Cer, respectively. Overall, SEL is a promising approach for GS: by combining predictions from multiple models, it can potentially enhance the PA of GS for complex traits.
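The stacking workflow summarized in the abstract (base learners, out-of-fold predictions, then a meta-learner) can be illustrated with a minimal Python sketch. It is not the authors' pipeline. Everything in it is an assumption made for illustration: the data are simulated, the sample sizes are hypothetical, three ridge regressions with different penalties stand in for GBLUP, MARS, QRF, and RF, and PA is taken to be the Pearson correlation between observed and predicted values, a common convention in GS that the abstract does not spell out.

```python
import numpy as np

def ridge_fit(X, y, lam):
    """Closed-form ridge regression with an unpenalized intercept."""
    x_mean, y_mean = X.mean(axis=0), y.mean()
    Xc = X - x_mean
    beta = np.linalg.solve(Xc.T @ Xc + lam * np.eye(X.shape[1]), Xc.T @ (y - y_mean))
    return beta, y_mean - x_mean @ beta

def ridge_predict(model, X):
    beta, intercept = model
    return X @ beta + intercept

def out_of_fold(X, y, lam, folds):
    """Predict each individual with a model trained without its own fold."""
    pred = np.empty(len(y))
    for held_out in folds:
        train = np.ones(len(y), dtype=bool)
        train[held_out] = False
        model = ridge_fit(X[train], y[train], lam)
        pred[held_out] = ridge_predict(model, X[held_out])
    return pred

def predictive_ability(y_true, y_pred):
    """Assumed PA definition: Pearson correlation of observed vs. predicted."""
    return float(np.corrcoef(y_true, y_pred)[0, 1])

# Simulated data (hypothetical sizes; SNPs coded 0/1/2).
rng = np.random.default_rng(0)
n, p = 120, 300
X = rng.integers(0, 3, size=(n, p)).astype(float)
y = X @ rng.normal(0.0, 0.1, size=p) + rng.normal(0.0, 1.0, size=n)

# Hold out 30 individuals for validation; split the rest into 5 folds.
X_tr, y_tr, X_te, y_te = X[:90], y[:90], X[90:], y[90:]
folds = np.array_split(rng.permutation(len(y_tr)), 5)

# Stand-in base learners: ridge regressions with different penalties.
base_lams = [1.0, 100.0, 10000.0]

# Level-1 training features: out-of-fold predictions of each base learner,
# so the meta-learner never sees predictions made on a model's own training data.
Z_tr = np.column_stack([out_of_fold(X_tr, y_tr, lam, folds) for lam in base_lams])

# Level-1 validation features: base learners refit on the full training set.
Z_te = np.column_stack(
    [ridge_predict(ridge_fit(X_tr, y_tr, lam), X_te) for lam in base_lams]
)

# Two meta-learners: ridge regression on the base predictions, and a plain average.
stack_pred = ridge_predict(ridge_fit(Z_tr, y_tr, 1.0), Z_te)
avg_pred = Z_te.mean(axis=1)

pa_base = [predictive_ability(y_te, Z_te[:, j]) for j in range(Z_te.shape[1])]
pa_stack = predictive_ability(y_te, stack_pred)
pa_avg = predictive_ability(y_te, avg_pred)
print("base PA:", pa_base, "stacked PA:", pa_stack, "average PA:", pa_avg)
```

Fitting the meta-learner on out-of-fold predictions, rather than on in-sample fits, is the usual way to keep it from rewarding base learners that merely overfit the training set. On simulated data like this, the stacked PA is not guaranteed to exceed that of the best base learner.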