A Semi-Supervised Ensemble Approach to Rank Potential Causal Variants and Their Target Genes in Microglia for Alzheimer’s Disease-Reference-Cited by-同舟云学术

A Semi-Supervised Ensemble Approach to Rank Potential Causal Variants and Their Target Genes in Microglia for Alzheimer’s Disease

Published:2022-11-03 Issue: Volume: Page:
ISSN:
Container-title:
language:
Short-container-title:

Author:

Khaire Archita^ORCID,Wen Jia,Yang Xiaoyu^ORCID,Shen Yin,Li Yun

Abstract

AbstractAlzheimer’s disease (AD) is the leading cause of death among individuals over 65. Despite many AD genetic variants detected by large genome-wide association studies (GWAS) a limited number of causal genes have been confirmed. Conventional machine learning techniques integrate functional annotation data and GWAS signals to assign variants functional relevance probabilities. Yet, a large proportion of genetic variation lies in the non-coding genome, where unsupervised and semi-supervised techniques have demonstrated a greater advantage. Furthermore, cell-type specific approaches are needed to better understand disease etiology. Studying AD from a microglia-specific lens is more likely to reveal causal variants involved in immune pathways. Therefore, in this study, we developed a semi-supervised ensemble approach using microglia-specific data to prioritize non-coding variants and their target genes that play roles in immune-related AD mechanisms. We designed a transductive positive-unlabeled and negative-unlabeled learning model that employs a bagging technique to learn from unlabeled variants, generating multiple predicted probabilities of variant risk. Using a combined homogeneous-heterogeneous ensemble framework, we aggregated the predictions. We applied our model to AD variant data, identifying 11 risk variants acting in well-known AD genes, such asTSPAN14, INPP5D, andMS4A2. These results validated our model’s performance and demonstrated a need to study these genes in the context of microglial pathways. We also proposed further experimental study for 37 potential causal variants associated with less-known genes. Our work has utility in predicting AD relevant genes and variants functioning in microglia and can be generalized for application to other complex diseases.

Publisher

Cold Spring Harbor Laboratory

Reference57 articles.

1. Inferring the Molecular Mechanisms of Noncoding Alzheimer’s Disease-Associated Genetic Variants;Journal of Alzheimer’s disease : JAD,2019

2. From GWAS to Function: Using Functional Genomics to Identify the Mechanisms Underlying Complex Diseases;Frontiers in genetics,2020

3. A Robust Ensemble Approach to Learn From Positive and Unlabeled Data Using SVM Base Models;Neurocomputing,2015

4. Functional regulatory variants implicate distinct transcriptional networks in dementia;Science (New York, N.Y,2022

5. P1-465: Role of collagen VI in Alzheimer’s disease: Potential mechanisms of protection;Alzheimer’s & Dementia,2008