Effect of minor allele frequency and density of single nucleotide polymorphism marker arrays on imputation performance and prediction ability using the single-step genomic Best Linear Unbiased Prediction in a simulated beef cattle population

Author:

Rodríguez Juan DiegoORCID,Peripolli ElisaORCID,Londoño-Gil MarisolORCID,Espigolan RafaelORCID,Lôbo Raysildo BarbosaORCID,López-Correa RodrigoORCID,Aguilar IgnacioORCID,Baldi FernandoORCID

Abstract

Context In beef cattle populations, there is little evidence regarding the minimum number of genetic markers needed to obtain reliable genomic prediction and imputed genotypes. Aims This study aimed to evaluate the impact of single nucleotide polymorphism (SNP) marker density and minor allele frequency (MAF), on genomic predictions and imputation performance for high and low heritability traits using the single-step genomic Best Linear Unbiased Prediction methodology (ssGBLUP) in a simulated beef cattle population. Methods The simulated genomic and phenotypic data were obtained through QMsim software. 735 293 SNPs markers and 7000 quantitative trait loci (QTL) were randomly simulated. The mutation rate (10−5), QTL effects distribution (gamma distribution with shape parameter = 0.4) and minor allele frequency (MAF ≥ 0.02) of markers were used for quality control. A total of 335k SNPs (high density, HD) and 1000 QTLs were finally considered. Densities of 33 500 (35k), 16 750 (16k), 4186 (4k) and 2093 (2k) SNPs were customised through windows of 10, 20, 80 and 160 SNPs by chromosome, respectively. Three marker selection criteria were used within windows: (1) informative markers with MAF values close to 0.5 (HI); (2) less informative markers with the lowest MAF values (LI); (3) markers evenly distributed (ED). We evaluated the prediction of the high-density array and of 12 scenarios of customised SNP arrays, further the imputation performance of them. The genomic predictions and imputed genotypes were obtained with Blupf90 and FImpute software, respectively, and statistics parameters were applied to evaluate the accuracy of genotypes imputed. The Pearson’s correlation, the coefficient of regression, and the difference between genomic predictions and true breeding values were used to evaluate the prediction ability (PA), inflation (b), and bias (d), respectively. Key results Densities above 16k SNPs using HI and ED criteria displayed lower b, higher PA and higher imputation accuracy. Consequently, similar values of PA, b and d were observed with the use of imputed genotypes. The LI criterion with densities higher than 35k SNPs, showed higher PA and similar predictions using imputed genotypes, however lower b and quality of imputed genotypes were observed. Conclusion The results obtained showed that at least 5% of HI or ED SNPs available in the HD array are necessary to obtain reliable genomic predictions and imputed genotypes. Implications The development of low-density customised arrays based on criteria of MAF and even distribution of SNPs, might be a cost-effective and feasible approach to implement genomic selection in beef cattle.

Funder

PEEPg/AUGM

PECPG-CAPES

Publisher

CSIRO Publishing

Subject

Animal Science and Zoology,Food Science

Reference64 articles.

1. : a unified approach to utilize phenotypic, full pedigree, and genomic information for genetic evaluation of Holstein final score.;Journal of Dairy Science,2010

2. Aguilar I, Misztal I, Tsuruta S, Legarra A, Wang H, Aguilar I, Misztal I, Tsuruta S, Legarra A, Wang H (2014) PREGSF90-POSTGSF90: Computational tools for the implementation of single-step genomic selection and genome-wide association with ungenotyped individuals in BLUPF90 programs. In ‘Proceedings, 10th World congress of genetics applied to livestock production, Vancouver, BC, Canada.’ (American Society of Animal Science)

3. The feasibility of using low-density marker panels for genotype imputation and genomic prediction of crossbred dairy cattle of East Africa.;Journal of Dairy Science,2018

4. Aliloo H, Mrode R, Okeyo M, Ojango J, Dessie T, Rege E, Goddard M, Gibson J (2018) Optimal design of low density marker panels for genotype imputation. In ‘Proceedings of the world congress on genetics applied to livestock production’. vol. 11, p. 146. Available at

5. Comparing different marker densities and various reference populations using pedigree-marker Best Linear Unbiased Prediction (BLUP) model.;Iranian Journal of Applied Animal Science,2020

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3