Causes of Outcome Learning: a causal inference-inspired machine learning approach to disentangling common combinations of potential causes of a health outcome

Author:

Rieckmann Andreas1ORCID,Dworzynski Piotr2ORCID,Arras Leila3,Lapuschkin Sebastian3ORCID,Samek Wojciech34ORCID,Arah Onyebuchi Aniweta56ORCID,Rod Naja Hulvej1ORCID,Ekstrøm Claus Thorn7ORCID

Affiliation:

1. Section of Epidemiology, Department of Public Health, University of Copenhagen , Copenhagen, Denmark

2. Novo Nordisk Foundation Center for Basic Metabolic Research, University of Copenhagen , Copenhagen, Denmark

3. Department of Artificial Intelligence, Fraunhofer Heinrich Hertz Institute , Berlin, Germany

4. BIFOLD—Berlin Institute for the Foundations of Learning and Data , Berlin, Germany

5. Department of Epidemiology, Fielding School of Public Health, University of California, Los Angeles, CA , USA

6. Department of Statistics, UCLA College of Letters and Science , Los Angeles, CA, USA

7. Section of Biostatistics, Department of Public Health, University of Copenhagen , Copenhagen, Denmark

Abstract

Abstract Nearly all diseases are caused by different combinations of exposures. Yet, most epidemiological studies focus on estimating the effect of a single exposure on a health outcome. We present the Causes of Outcome Learning approach (CoOL), which seeks to discover combinations of exposures that lead to an increased risk of a specific outcome in parts of the population. The approach allows for exposures acting alone and in synergy with others. The road map of CoOL involves (i) a pre-computational phase used to define a causal model; (ii) a computational phase with three steps, namely (a) fitting a non-negative model on an additive scale, (b) decomposing risk contributions and (c) clustering individuals based on the risk contributions into subgroups; and (iii) a post-computational phase on hypothesis development, validation and triangulation using new data before eventually updating the causal model. The computational phase uses a tailored neural network for the non-negative model on an additive scale and layer-wise relevance propagation for the risk decomposition through this model. We demonstrate the approach on simulated and real-life data using the R package ‘CoOL’. The presentation focuses on binary exposures and outcomes but can also be extended to other measurement types. This approach encourages and enables researchers to identify combinations of exposures as potential causes of the health outcome of interest. Expanding our ability to discover complex causes could eventually result in more effective, targeted and informed interventions prioritized for their public health impact.

Funder

Independent Research Fund Denmark

Danish Diabetes Academy funded by the Novo Nordisk Foundation

German Ministry for Education and Research as BIFOLD

TraMeExCo

National Institute of Biomedical Imaging and Bioengineering

National Center for Advancing Translational Sciences

National Institutes of Health

NIH

Publisher

Oxford University Press (OUP)

Subject

General Medicine,Epidemiology

Reference74 articles.

Cited by 5 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3