Deep neural networks with knockoff features identify nonlinear causal relations and estimate effect sizes in complex biological systems

Author:

Fan Zhenjiang1ORCID,Kernan Kate F2ORCID,Sriram Aditya3ORCID,Benos Panayiotis V4ORCID,Canna Scott W5ORCID,Carcillo Joseph A2,Kim Soyeon67ORCID,Park Hyun Jung3ORCID

Affiliation:

1. Department of Computer Science, University of Pittsburgh , Pittsburgh, PA 15213 , USA

2. Division of Pediatric Critical Care Medicine, Department of Critical Care Medicine, Children's Hospital of Pittsburgh, Center for Critical Care Nephrology and Clinical Research Investigation and Systems Modeling of Acute Illness Center, University of Pittsburgh , Pittsburgh, PA 15260, USA

3. Department of Human Genetics, University of Pittsburgh , Pittsburgh, PA 15213 , USA

4. Department of Epidemiology, University of Florida , Gainesville, FL 32610 , USA

5. Pediatric Rheumatology, The Children's Hospital of Philadelphia , Philadelphia, PA 19104 , USA

6. Division of Pediatric Pulmonary Medicine, Children's Hospital of Pittsburgh , Pittsburgh, PA 15224 , USA

7. Department of Pediatrics, School of Medicine, University of Pittsburgh , Pittsburgh, PA 15224 , USA

Abstract

AbstractBackgroundLearning the causal structure helps identify risk factors, disease mechanisms, and candidate therapeutics for complex diseases. However, although complex biological systems are characterized by nonlinear associations, existing bioinformatic methods of causal inference cannot identify the nonlinear relationships and estimate their effect size.ResultsTo overcome these limitations, we developed the first computational method that explicitly learns nonlinear causal relations and estimates the effect size using a deep neural network approach coupled with the knockoff framework, named causal directed acyclic graphs using deep learning variable selection (DAG-deepVASE). Using simulation data of diverse scenarios and identifying known and novel causal relations in molecular and clinical data of various diseases, we demonstrated that DAG-deepVASE consistently outperforms existing methods in identifying true and known causal relations. In the analyses, we also illustrate how identifying nonlinear causal relations and estimating their effect size help understand the complex disease pathobiology, which is not possible using other methods.ConclusionsWith these advantages, the application of DAG-deepVASE can help identify driver genes and therapeutic agents in biomedical studies and clinical trials.

Funder

University of Pittsburgh

National Institutes of Health

Eunice Kennedy Shriver National Institutes of Child Health and Human Development

National Heart, Lung, and Blood Institute

UPMC Hillman Cancer Center Biostatistics Shared Resource

Publisher

Oxford University Press (OUP)

Subject

Computer Science Applications,Health Informatics

Reference122 articles.

1. Collective effects of long-range DNA methylations predict gene expressions and estimate phenotypes in cancer;Kim;Sci Rep,2020

2. The microRNA target site landscape is a novel molecular feature associating alternative polyadenylation with immune evasion activity in breast cancer;Kim;Brief Bioinform,2021

3. 3′-UTR shortening contributes to subtype-specific cancer growth by breaking stable ceRNA crosstalk of housekeeping genes;Fan;Front Bioeng Biotechnol,2020

4. Mixed graphical models for causal analysis of multi-modal variables;Sedgewick;CoRR,2017

5. High-dimensional learning of linear causal networks via inverse covariance estimation;Loh;J Mach Learn Res,2014

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3