Statistical learning for sparser fine-mapped polygenic models: the prediction of LDL-cholesterol-Reference-Cited by-同舟云学术

Statistical learning for sparser fine-mapped polygenic models: the prediction of LDL-cholesterol

Published:2022-04-10 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Maj Carlo^ORCID,Staerk Christian^ORCID,Borisov Oleg,Klinkhammer Hannah^ORCID,Yeung Ming Wai^ORCID,Krawitz Peter^ORCID,Mayr Andreas^ORCID

Abstract

AbstractPolygenic risk scores quantify the individual genetic predisposition regarding a particular trait. We propose and illustrate the application of existing statistical learning methods to derive sparser models for genome-wide data with a polygenic signal. Our approach is based on three consecutive steps. First, potentially informative loci are identified by a marginal screening approach. Then, fine-mapping is independently applied for blocks of variants in linkage disequilibrium, where informative variants are retrieved by using variable selection methods including boosting with probing and stochastic searches with the Adaptive Subspace method. Finally, joint prediction models with the selected variants are derived using statistical boosting. In contrast to alternative approaches relying on univariate summary statistics from genome-wide association studies, our three-step approach enables to select and fit multivariable regression models on large-scale genotype data. Based on UK Biobank data, we develop prediction models for LDL-cholesterol as a continuous trait. Additionally, we consider a recent scalable algorithm for the Lasso. Results show that statistical learning approaches based on fine-mapping of genetic signals result in a competitive prediction performance compared to classical polygenic risk approaches, while yielding sparser risk models that tend to be more robust regarding deviations from the target population.

Publisher

Cold Spring Harbor Laboratory

Reference53 articles.

1. Patterns of linkage disequilibrium in the human genome

2. FINEMAP: efficient variable selection using summary data from genome-wide association studies

3. Approximately independent linkage disequilibrium blocks in human populations

4. Boosting Algorithms: Regularization, Prediction and Model Fitting

5. Boosting With theL2Loss

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Boosting polygenic risk scores;2022-05-01