Evaluation of approaches for estimating the accuracy of genomic prediction in plant breeding

Published:2013-12 Issue:1 Volume:14 Page:
ISSN:1471-2164
Container-title:BMC Genomics
language:en
Short-container-title:BMC Genomics

Author:

Ould Estaghvirou Sidi Boubacar,Ogutu Joseph O,Schulz-Streeck Torben,Knaak Carsten,Ouzunova Milena,Gordillo Andres,Piepho Hans-Peter

Abstract

Background: In genomic prediction, an important measure of accuracy is the correlation between the predicted and the true breeding values. Direct computation of this quantity for real datasets is not possible, because the true breeding value is unknown. Instead, the correlation between the predicted breeding values and the observed phenotypic values, called predictive ability, is often computed. In order to indirectly estimate predictive accuracy, this latter correlation is usually divided by an estimate of the square root of heritability. In this study we use simulation to evaluate estimates of predictive accuracy for seven methods, four (1 to 4) of which use an estimate of heritability to divide predictive ability computed by cross-validation. Between them the seven methods cover balanced and unbalanced datasets as well as correlated and uncorrelated genotypes. We propose one new indirect method (4) and two direct methods (5 and 6) for estimating predictive accuracy and compare their performances and those of four other existing approaches (three indirect (1 to 3) and one direct (7)) with simulated true predictive accuracy as the benchmark and with each other.

Results: The size of the estimated genetic variance and hence heritability exerted the strongest influence on the variation in the estimated predictive accuracy. Increasing the number of genotypes considerably increases the time required to compute predictive accuracy by all the seven methods, most notably for the five methods that require cross-validation (Methods 1, 2, 3, 4 and 6). A new method that we propose (Method 5) and an existing method (Method 7) used in animal breeding programs were the fastest and gave the least biased, most precise and stable estimates of predictive accuracy. Of the methods that use cross-validation, Methods 4 and 6 were often the best.

Conclusions: The estimated genetic variance and the number of genotypes had the greatest influence on predictive accuracy. Methods 5 and 7 were the fastest and produced the least biased, the most precise, robust and stable estimates of predictive accuracy. These properties argue for routinely using Methods 5 and 7 to assess predictive accuracy in genomic selection studies.
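
The indirect estimate described in the abstract (predictive ability divided by the square root of heritability) can be illustrated with a small simulation. The sketch below is not any of the paper's seven methods. It assumes a toy model in which phenotypes are true breeding values plus independent noise, predictions are true breeding values plus independent prediction error, and heritability is known rather than estimated. All names and parameter values (simulate, pred_noise_sd, n = 2000, h2 = 0.5) are hypothetical choices made for illustration.

```python
import math
import random


def pearson(x, y):
    """Pearson correlation between two equal-length sequences."""
    n = len(x)
    mx = sum(x) / n
    my = sum(y) / n
    sxy = sum((a - mx) * (b - my) for a, b in zip(x, y))
    sxx = sum((a - mx) ** 2 for a in x)
    syy = sum((b - my) ** 2 for b in y)
    return sxy / math.sqrt(sxx * syy)


def simulate(n=2000, h2=0.5, pred_noise_sd=0.8, seed=1):
    """Toy data (assumed model, not the paper's simulation design)."""
    rng = random.Random(seed)
    # True breeding values with unit genetic variance.
    g = [rng.gauss(0.0, 1.0) for _ in range(n)]
    # Phenotype = breeding value + residual; the residual variance is
    # chosen so that var(g) / var(y) equals the heritability h2.
    sd_e = math.sqrt((1.0 - h2) / h2)
    y = [gi + rng.gauss(0.0, sd_e) for gi in g]
    # Stand-in predictions: breeding value + error that is independent of
    # the phenotypic residual (the situation cross-validation aims for).
    g_hat = [gi + rng.gauss(0.0, pred_noise_sd) for gi in g]
    return g, y, g_hat


h2 = 0.5  # assumed known here; in practice it has to be estimated
g, y, g_hat = simulate(h2=h2)

predictive_ability = pearson(g_hat, y)   # computable from real data
true_accuracy = pearson(g_hat, g)        # only available in simulation
indirect_accuracy = predictive_ability / math.sqrt(h2)

print("predictive ability:", round(predictive_ability, 3))
print("indirect accuracy :", round(indirect_accuracy, 3))
print("true accuracy     :", round(true_accuracy, 3))
```

Under these assumptions the indirect estimate should land close to the simulated true accuracy. Replacing the known h2 with a noisy estimate shifts the indirect estimate accordingly, which is consistent with the abstract's finding that the estimated genetic variance, and hence heritability, drives most of the variation in estimated predictive accuracy.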

Publisher

Springer Science and Business Media LLC

Subject

Genetics,Biotechnology

Link

https://link.springer.com/content/pdf/10.1186/1471-2164-14-860.pdf

Reference: 27 articles.

1. Meuwissen THE, Hayes BJ, Goddard ME: Prediction of total genetic value using genome-wide dense marker maps. Genetics. 2001, 157: 1819-1829.

2. Piepho HP: Ridge regression and extensions for genomewide selection in maize. Crop Sci. 2009, 49: 1165-1176. 10.2135/cropsci2008.10.0595.

3. Whittaker JC, Thomson R, Denham MC: Marker-assisted selection using ridge regression. Genetical Research. 2000, 75: 249-252. 10.1017/S0016672399004462.

4. Bernardo R, Yu J: Prospects for genomewide selection for quantitative traits in maize. Crop Sci. 2007, 47: 1082-1090. 10.2135/cropsci2006.11.0690.

5. Goddard ME, Hayes BJ: Genomic selection. J Anim Breed Genet. 2007, 124: 323-330. 10.1111/j.1439-0388.2007.00702.x.

Cited by 45 articles.

1. Machine learning methods for genomic prediction of cow behavioral traits measured by automatic milking systems in North American Holstein cattle;Journal of Dairy Science;2024-07

2. Revisiting superiority and stability metrics of cultivar performances using genomic data: derivations of new estimators;Plant Methods;2024-06-06

3. Genomic prediction for agronomic traits in a diverse Flax (Linum usitatissimum L.) germplasm collection;Scientific Reports;2024-02-08

4. Genomic prediction using machine learning: a comparison of the performance of regularized regression, ensemble, instance-based and deep learning methods on synthetic and empirical data;BMC Genomics;2024-02-07

5. Improved genomic prediction using machine learning with Variational Bayesian sparsity;Plant Methods;2023-09-02