Enlarging a training set for genomic selection by imputation of un-genotyped animals in populations of varying genetic architecture-Reference-Cited by-同舟云学术

Enlarging a training set for genomic selection by imputation of un-genotyped animals in populations of varying genetic architecture

Published:2013-04-26 Issue:1 Volume:45 Page:
ISSN:1297-9686
Container-title:Genetics Selection Evolution
language:en
Short-container-title:Genet Sel Evol

Author:

Pimentel Eduardo CG,Wensch-Dorendorf Monika,König Sven,Swalve Hermann H

Abstract

Abstract Background The most common application of imputation is to infer genotypes of a high-density panel of markers on animals that are genotyped for a low-density panel. However, the increase in accuracy of genomic predictions resulting from an increase in the number of markers tends to reach a plateau beyond a certain density. Another application of imputation is to increase the size of the training set with un-genotyped animals. This strategy can be particularly successful when a set of closely related individuals are genotyped. Methods Imputation on completely un-genotyped dams was performed using known genotypes from the sire of each dam, one offspring and the offspring’s sire. Two methods were applied based on either allele or haplotype frequencies to infer genotypes at ambiguous loci. Results of these methods and of two available software packages were compared. Quality of imputation under different population structures was assessed. The impact of using imputed dams to enlarge training sets on the accuracy of genomic predictions was evaluated for different populations, heritabilities and sizes of training sets. Results Imputation accuracy ranged from 0.52 to 0.93 depending on the population structure and the method used. The method that used allele frequencies performed better than the method based on haplotype frequencies. Accuracy of imputation was higher for populations with higher levels of linkage disequilibrium and with larger proportions of markers with more extreme allele frequencies. Inclusion of imputed dams in the training set increased the accuracy of genomic predictions. Gains in accuracy ranged from close to zero to 37.14%, depending on the simulated scenario. Generally, the larger the accuracy already obtained with the genotyped training set, the lower the increase in accuracy achieved by adding imputed dams. Conclusions Whenever a reference population resembling the family configuration considered here is available, imputation can be used to achieve an extra increase in accuracy of genomic predictions by enlarging the training set with completely un-genotyped dams. This strategy was shown to be particularly useful for populations with lower levels of linkage disequilibrium, for genomic selection on traits with low heritability, and for species or breeds for which the size of the reference population is limited.

Publisher

Springer Science and Business Media LLC

Subject

Genetics,Animal Science and Zoology,General Medicine,Ecology, Evolution, Behavior and Systematics

Link

https://link.springer.com/content/pdf/10.1186/1297-9686-45-12.pdf

Reference39 articles.

1. Meuwissen THE, Hayes BJ, Goddard ME: Prediction of total genetic value using genome-wide dense marker maps. Genetics. 2001, 157: 1819-1829.

2. Sargolzaei M, Schenkel FS, Jansen GB, Schaeffer LR: Extent of linkage disequilibrium in Holstein cattle in North America. J Dairy Sci. 2008, 91: 2106-2117. 10.3168/jds.2007-0553.

3. Pimentel ECG, Erbe M, König S, Simianer H: Genome partitioning of genetic variation for milk production and composition traits in Holstein cattle. Front Genet. 2011, 2: 19-

4. Erbe M, Hayes BJ, Matukumalli LK, Goswami S, Bowman PJ, Reich CM, Mason BA, Goddard ME: Improving accuracy of genomic predictions within and between dairy cattle breeds with imputed high-density single nucleotide polymorphism panels. J Dairy Sci. 2012, 95: 4114-4129. 10.3168/jds.2011-5019.

5. Ober U, Ayroles JF, Stone EA, Richards S, Zhu D, Gibbs RA, Stricker C, Gianola D, Schlather M, Mackay TFC, Simianer H: Using whole-genome sequence data to predict quantitative trait phenotypes in Drosophila melanogaster. PLoS Genet. 2012, 8: e1002685-10.1371/journal.pgen.1002685.

Cited by 28 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. 793. Filling information gaps in swine crossbreeding schemes by imputing non-genotyped F₁ animals to improve genetic gain;Proceedings of 12th World Congress on Genetics Applied to Livestock Production (WCGALP);2022-12-31

2. Imputation of non-genotyped F1 dams to improve genetic gain in swine crossbreeding programs;Journal of Animal Science;2022-04-22

3. Identification of genomic regions affecting production traits in pigs divergently selected for feed efficiency;Genetics Selection Evolution;2021-06-14

4. Genotype Imputation to Improve the Cost-Efficiency of Genomic Selection in Rabbits;Animals;2021-03-13

5. The importance of disease incidence rate on performance of GBLUP, threshold BayesA and machine learning methods in original and imputed data set;Spanish Journal of Agricultural Research;2020-12-29