Abstract
ABSTRACTSo far SNP heritability (;variance explained by all SNP s used in genome-wide association study) has explained most of genetic variation for many traits but still there is a gap between GWAS heritability (; variance explained by genome-wide significant SNPs) andthat is named hidden heritability.There are several methods for estimating(linear_mixed_model (LMM), PRS, multiple_linear_regression (MLR) and simple_linear_regression(SLR)). However, it is unclear which methods are more accurate under different circumstances. This study proposes a PRS based method for estimatingthat uses pseudo summary statistics. It compares this method with existing methods using both simulated and real data (10 traits from UKBB) to determine when they are realistic and can be trusted as a final estimate.Simulation results showed that PRS-based methods underestimatenear 20% when considering all causal SNPs. But they are relatively accurate when using a subset of causal SNPs. Their performance is much better than SLR method for all 10 traits, although when applied to real data, they do not follow a stable trend of overestimation or underestimation compared to the base model (LMM).My suggestion is to use LMM or adjusted_R2from MLR for reportingwhen an independent data set is available. In cases where only summary statistics is available, the PRS-PSS is relatively an accurate alternative, especially compared to SLR, which tends to overestimateby 20-50% when applying it on real data.
Publisher
Cold Spring Harbor Laboratory