Author:
Silva Princess P.,Gaudillo Joverlyn D.,Vilela Julianne A.,Roxas-Villanueva Ranzivelle Marianne L.,Tiangco Beatrice J.,Domingo Mario R.,Albia Jason R.
Abstract
AbstractIdentifying disease-associated susceptibility loci is one of the most pressing and crucial challenges in modeling complex diseases. Existing approaches to biomarker discovery are subject to several limitations including underpowered detection, neglect for variant interactions, and restrictive dependence on prior biological knowledge. Addressing these challenges necessitates more ingenious ways of approaching the “missing heritability” problem. This study aims to discover disease-associated susceptibility loci by augmenting previous genome-wide association study (GWAS) using the integration of random forest and cluster analysis. The proposed integrated framework is applied to a hepatitis B virus surface antigen (HBsAg) seroclearance GWAS data. Multiple cluster analyses were performed on (1) single nucleotide polymorphisms (SNPs) considered significant by GWAS and (2) SNPs with the highest feature importance scores obtained using random forest. The resulting SNP-sets from the cluster analyses were subsequently tested for trait-association. Three susceptibility loci possibly associated with HBsAg seroclearance were identified: (1) SNP rs2399971, (2) gene LINC00578, and (3) locus 11p15. SNP rs2399971 is a biomarker reported in the literature to be significantly associated with HBsAg seroclearance in patients who had received antiviral treatment. The latter two loci are linked with diseases influenced by the presence of hepatitis B virus infection. These findings demonstrate the potential of the proposed integrated framework in identifying disease-associated susceptibility loci. With further validation, results herein could aid in better understanding complex disease etiologies and provide inputs for a more advanced disease risk assessment for patients.
Publisher
Springer Science and Business Media LLC
Reference59 articles.
1. Lvovs, D., Favorova, O. O. & Favorov, A. V. A polygenic approach to the study of polygenic diseases. Acta Naturae. 4(3), 59–71 (2012).
2. Schork, N. J. Genetics of complex disease: Approaches, problems, and solutions. Am. J. Respir. Care Med. 156(4), S103–S109. https://doi.org/10.1164/ajrccm.156.4.12-tac-5 (1997).
3. Visscher, P. M. et al. 10 years of GWAS discovery: Biology, function, and translation. Am. J. Hum. Genet. 101(1), 5–22. https://doi.org/10.1016/j.ajhg.2017.06.005 (2017).
4. Torkamani, A., Wineinger, N. E. & Topol, E. J. The personal and clinical utility of polygenic risk scores. Nat. Rev. Genet. 19(9), 581–590. https://doi.org/10.1038/s41576-018-0018-x (2018).
5. Norrgard K. Genetic variation and disease: GWAS. In: Nat Educ. https://www.nature.com/scitable/topicpage/genetic-variation-and-disease-gwas-682/#. Accessed 8 Mar 2022.
Cited by
8 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献