Improvement of Predictive Ability by Uniform Coverage of the Target Genetic Space-Reference-Cited by-同舟云学术

Improvement of Predictive Ability by Uniform Coverage of the Target Genetic Space

Published:2016-11-01 Issue:11 Volume:6 Page:3733-3747
ISSN:2160-1836
Container-title:G3 Genes|Genomes|Genetics
language:en
Short-container-title:

Author:

Bustos-Korts Daniela¹²,Malosetti Marcos²,Chapman Scott³,Biddulph Ben⁴,van Eeuwijk Fred¹²

Affiliation:

1. C.T. de Wit Graduate School for Production Ecology and Resource Conservation (PE&RC), Wageningen, The Netherlands

2. Biometris, Wageningen University and Research, The Netherlands

3. Commonwealth Scientific and Industrial Research Organisation (CSIRO) Agriculture, Queensland Bioscience Precinct, St. Lucia, Queensland 4067, Australia

4. Department of Agriculture and Food, Western Australia, South Perth, Western Australia 6151, Australia

Abstract

Abstract Genome-enabled prediction provides breeders with the means to increase the number of genotypes that can be evaluated for selection. One of the major challenges in genome-enabled prediction is how to construct a training set of genotypes from a calibration set that represents the target population of genotypes, where the calibration set is composed of a training and validation set. A random sampling protocol of genotypes from the calibration set will lead to low quality coverage of the total genetic space by the training set when the calibration set contains population structure. As a consequence, predictive ability will be affected negatively, because some parts of the genotypic diversity in the target population will be under-represented in the training set, whereas other parts will be over-represented. Therefore, we propose a training set construction method that uniformly samples the genetic space spanned by the target population of genotypes, thereby increasing predictive ability. To evaluate our method, we constructed training sets alongside with the identification of corresponding genomic prediction models for four genotype panels that differed in the amount of population structure they contained (maize Flint, maize Dent, wheat, and rice). Training sets were constructed using uniform sampling, stratified-uniform sampling, stratified sampling and random sampling. We compared these methods with a method that maximizes the generalized coefficient of determination (CD). Several training set sizes were considered. We investigated four genomic prediction models: multi-locus QTL models, GBLUP models, combinations of QTL and GBLUPs, and Reproducing Kernel Hilbert Space (RKHS) models. For the maize and wheat panels, construction of the training set under uniform sampling led to a larger predictive ability than under stratified and random sampling. The results of our methods were similar to those of the CD method. For the rice panel, all training set construction methods led to similar predictive ability, a reflection of the very strong population structure in this panel.

Publisher

Oxford University Press (OUP)

Subject

Genetics (clinical),Genetics,Molecular Biology

Link

http://academic.oup.com/g3journal/article-pdf/6/11/3733/40641540/g3journal3733.pdf

Reference75 articles.

1. Genome-based prediction of testcross values in maize.;Albrecht;Theor. Appl. Genet.,2011

2. Genome-based prediction of maize hybrid performance across genetic groups, testers, locations, and years.;Albrecht;Theor. Appl. Genet.,2014

3. Population structure and cryptic relatedness in genetic association studies.;Astle;Stat. Sci.,2009

Cited by 31 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Training set optimization is a feasible alternative for perennial orphan crop domestication and germplasm management: an Acrocomia aculeata example;Frontiers in Plant Science;2024-09-10

2. Genomic selection in plant breeding: Key factors shaping two decades of progress;Molecular Plant;2024-04

3. Maximizing efficiency in sunflower breeding through historical data optimization;Plant Methods;2024-03-16

4. Genomic selection for salinity tolerance in japonica rice;PLOS ONE;2023-09-27

5. A one-dimensional mixed model genome scan approach for detecting QTL-by-genetic-background interactions in diallel and nested association mapping designs;2023-06-01