Scalable Causal Structure Learning: Traditional and Deep Learning Algorithms and New Opportunities in Biomedicine (Preprint)

Author:

Upadhyaya PulakeshORCID,Zhang KaiORCID,Li CanORCID,Jiang XiaoqianORCID,Kim YejinORCID

Abstract

BACKGROUND

Causal structure learning refers to a problem of identifying causal structure from observational data and can have multiple applications in the field of biomedicine and healthcare.

OBJECTIVE

The paper provides a practical review and tutorial on scalable causal structure learning models with examples of real-world data to help healthcare audiences understand and apply them.

METHODS

We review traditional (combinatorial and score-based) methods for causal structure discovery as well as machine-learning-based schemes. Various traditional approaches have been studied to tackle this problem, the most important among these being the PC algorithm. This was followed by literature on score-based methods, which are computationally faster. Because of the continuous constraint on acyclicity, there are new deep learning approaches to the problem in addition to traditional and score-based methods. Such methods can also offer scalability, especially when there is a large amount of data involving multiple variables. Utilizing our own evaluation metrics and experiments on linear, nonlinear, and benchmark Sachs’s data, we aim to highlight the various advantages and disadvantages associated with these methods for the healthcare community. We also highlight recent developments in biomedicine, where causal structure learning can be applied to discover structures such as gene networks, brain connectivity networks and in cancer epidemiology.

RESULTS

We also compare the performance of traditional and machine learning-based algorithms for causal discovery over some benchmark datasets.

CONCLUSIONS

Machine learning-based approaches, including deep learning, have many advantages over traditional approaches, such as scalability, including a greater number of variables, and potentially being applied in a wide range of biomedical applications like genetics if sufficient data is available. Furthermore, these models are more flexible than traditional models and are poised to positively affect many applications in the future.

Publisher

JMIR Publications Inc.

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3