Deep Learning captures the effect of epistasis in multifactorial diseases

Author:

Perelygin Vladislav1,Kamelin Alexey1,Syzrantsev Nikita2,Shaheen Layal2,Kim Anna2,Plotnikov Nikolay2,Ilinskaya Anna3,Ilinsky Valery3,Rakitko Alexander2,Poptsova Maria1

Affiliation:

1. HSE University

2. Genotek Ltd

3. Eligens SIA

Abstract

Abstract

Background Polygenic risk score (PRS) prediction is widely used to assess the risk of diagnosis and progression of many diseases. Routinely, the weights of individual SNPs are estimated by the linear regression model that assumes independent and linear contribution of each SNP to the phenotype. However, for complex multifactorial diseases such as Alzheimer's disease, diabetes, cardiovascular disease, cancer, and others, association between individual SNPs and disease could be non-linear due to epistatic interactions. The aim of the presented study is to explore the power of non-linear machine learning algorithms and deep learning models to predict the risk of multifactorial diseases with epistasis. Results First, we tested ensemble tree methods and deep learning neural networks against LASSO linear regression model on simulated data with different types and strength of epistasis. The results showed that with the increase of strength of epistasis effect, non-linear models significantly outperform linear. Then the higher performance of non-linear models over linear was confirmed on real genetic data for multifactorial phenotypes such as obesity, type 1 diabetes, and psoriasis. From non-linear models, gradient boosting appeared to be the best model in obesity and psoriasis while deep learning methods significantly outperform linear approaches in type 1 diabetes. Conclusions Overall, our study underscores the efficacy of non-linear models and deep learning approaches in more accurately accounting for the effects of epistasis in simulations with specific configurations and in the context of certain diseases.

Publisher

Research Square Platform LLC

Reference57 articles.

1. Machine Learning SNP Based Prediction for Precision Medicine;Ho DSW;Front Genet,2019

2. 10 Years of GWAS Discovery: Biology, Function, and Translation;Visscher PM;Am J Hum Genet,2017

3. XV.—The Correlation between Relatives on the Supposition of Mendelian Inheritance;Fisher RA;Earth Environ Sci Trans Royal Soc Edinb,2012

4. Clément C, Samuel L, Vincent T, Cedric C, Deepak R, Franck A. Atlas of epistasis. medRxiv 2021:2021.2003.2017.21253794.

5. Deep neural network improves the estimation of polygenic risk scores for breast cancer;Badré A;J Hum Genet,2021

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3