Marker effect p-values for single-step GWAS with the algorithm for proven and young in large genotyped populations

Author:

Leite Natália Galoro, Bermann Matias, Tsuruta Shogo, Misztal Ignacy, Lourenco Daniela

Abstract

Background: Although single-step GBLUP (ssGBLUP) is a breeding value method, single-nucleotide polymorphism (SNP) effects can be backsolved from ssGBLUP genomic estimated breeding values (GEBV), and p-values can be obtained as a measure of estimation certainty. This enables single-step genome-wide association studies (ssGWAS). However, obtaining p-values for ssGWAS relies on the inversion of dense matrices, which poses computational limitations in large genotyped populations. In this study, we present an algorithm to approximate p-values for SNPs in ssGWAS with many genotyped animals. The approximation relies on the algorithm for proven and young (APY) and submatrices for core animals. To test it, we first compared SNP p-values obtained with an exact inversion using the genomic relationship matrix (G⁻¹) for 50K genotyped animals to those estimated with an exact inversion using […] and those obtained with the proposed approximation based on […]. Then, we compared these results with those obtained with the proposed approximation using 450K genotyped animals.

Results: The same genomic regions on chromosomes 7 and 20 were identified with p-values obtained with G⁻¹, […], and the approximation based on […] when using 50K genotyped animals and 1.5M animals in the pedigree. In terms of computational requirements, obtaining p-values with the approximation based on […] reduced wall-clock time 38-fold and memory requirements tenfold compared to using the exact inversion with […]. When the approximation was applied to a population of 450K genotyped animals and 1.8M animals in the pedigree, two new significant regions on chromosomes 6 and 14 were uncovered in addition to the two regions on chromosomes 7 and 20 previously identified with the smaller genotyped population, indicating an increase in GWAS detection power when more genotypes are included in the analyses. Obtaining p-values with the approximation and 450K genotyped individuals took 24.5 wall-clock hours and 87.66 GB of memory, and both are expected to increase linearly with the addition of noncore genotyped individuals.

Conclusions: With an algorithm that approximates the prediction error variance of SNP effects based on APY, ssGWAS with p-values for SNPs is possible in large genotyped populations. The computational cost of obtaining p-values in ssGWAS is no longer a limitation in extensive populations with many genotyped animals.
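
To illustrate how a p-value follows from a SNP effect estimate and its prediction error variance, here is a minimal sketch. The abstract does not state the test statistic, so the two-sided normal test used here is an assumption, as are the function names and the toy numbers; none of them come from the study.

```python
import math

def snp_p_values(effects, pev):
    """Two-sided p-values from SNP effect estimates and their
    prediction error variances (hypothetical helper, normal test assumed)."""
    p_values = []
    for a_hat, var in zip(effects, pev):
        if var <= 0:
            raise ValueError("prediction error variance must be positive")
        # Standardize the effect estimate by its standard error
        z = a_hat / math.sqrt(var)
        # 2 * (1 - Phi(|z|)) equals erfc(|z| / sqrt(2)) for a standard normal
        p_values.append(math.erfc(abs(z) / math.sqrt(2.0)))
    return p_values

def bonferroni_threshold(n_snps, alpha=0.05):
    """Genome-wide significance threshold under a Bonferroni correction
    (one common choice; assumed here, not taken from the abstract)."""
    return alpha / n_snps

# Toy values only: three SNPs with standardized effects of about 0, 1 and -5
effects = [0.00, 0.02, -0.15]
pev = [0.0004, 0.0004, 0.0009]
p = snp_p_values(effects, pev)
significant = [pi < bonferroni_threshold(len(p)) for pi in p]
```

In this sketch only the prediction error variances would change between an exact inversion and an APY-based approximation; the conversion to p-values stays the same.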

Publisher

Cold Spring Harbor Laboratory
