Direct causal variable discovery leveraging the invariance principle: application in biomedical studies

Author:

Yin Liangying,Liu Menghui,Shi Yujia,Qiu Jinghong,So Hon-cheong

Abstract

AbstractAccurate identification of direct causal(parental) variables for a target is of primary interest in many applications, especially in biomedicine. It could promote our understanding of the underlying pathophysiological mechanism and facilitate the discovery of new biomarkers and therapeutic targets for studied clinical outcomes. However, many researchers are inclined to resort to association-based machine learning methods to identify outcome-associated variables. And many of the identified variables may prove to be irrelevant. On the other hand, there is a lack of an efficient method for reliable parental set identification, especially in high-dimensional settings (e.g., biomedicine).Here, we proposed a novel and efficient two-stage approach (I-GCM) to discover the direct causal variables (including genetic and clinical variables) for various outcomes. Variable selection was first performed by the PC-simple algorithm. Then it exploited the invariance of causal relations in different (experimental) settings, which was represented by generalized covariance measure calculated from gradient-boosted trees, for efficient and reliable causal variable discovery.We first verified the proposed method through extensive simulations. This approach constantly yielded high precision (a.k.a., positive predictive value) and specificity while maintaining satisfactory sensitivity in general, and consistently outperformed a standard Notably, the precision was larger than 90% in our simulated scenarios, even in high-dimensional settings. We then applied the proposed method to 4 clinical traits to uncover the corresponding direct causal variables. Encouragingly, many identified clinical variables, genes and pathways were supported by the literature. Our proposed method constantly achieved superior performance in identifying actual direct causal variables, making it particularly useful in selecting what (genetic/clinical) risk factors to follow up. Importantly, our work represents one of the first applications of the invariance principle for causal inference in biomedical or clinical studies, and suggests a new avenue for causal discovery in these settings.

Publisher

Cold Spring Harbor Laboratory

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3