Educational Note: Paradoxical collider effect in the analysis of non-communicable disease epidemiological data: a reproducible illustration and web application

Author:

Luque-Fernandez Miguel Angel12345,Schomaker Michael6,Redondo-Sanchez Daniel15,Jose Sanchez Perez Maria15,Vaidya Anand7,Schnitzer Mireille E89

Affiliation:

1. Biomedical Research Institute, Non-Communicable and Cancer Epidemiology Group (ibs.Granada), Andalusian School of Public Health, University of Granada, Granada, Spain

2. Department of Non-Communicable Disease Epidemiology, London School of Hygiene and Tropical Medicine, London, UK

3. Centre de Recherche en Epidemiologie, Biostatistique et Recherche Clinique Ecole de Sante Publique, Universite Libre de Bruxelles, Brussels, Belgium

4. Department of Epidemiology, Harvard School of Public Health, Harvard University, Boston, MA, USA

5. Biomedical Network Research Centers of Epidemiology and Public Health (CIBERESP), ISCIII, Madrid, Spain

6. Centre of Infectious Disease Epidemiology and Research, University of Cape Town, Cape Town, South Africa

7. Brigham and Women's Hospital, Harvard Medical School, Harvard University, Boston, MA, USA

8. Faculty of Pharmacy and Department of Social and Preventive Medicine, University of Montreal, Montreal, QC, Canada

9. Department of Epidemiology, Biostatistics and Occupational Health, McGill University, Montreal, QC, Canada

Abstract

Abstract Classical epidemiology has focused on the control of confounding, but it is only recently that epidemiologists have started to focus on the bias produced by colliders. A collider for a certain pair of variables (e.g. an outcome Y and an exposure A) is a third variable (C) that is caused by both. In a directed acyclic graph (DAG), a collider is the variable in the middle of an inverted fork (i.e. the variable C in A → C ← Y). Controlling for, or conditioning an analysis on a collider (i.e. through stratification or regression) can introduce a spurious association between its causes. This potentially explains many paradoxical findings in the medical literature, where established risk factors for a particular outcome appear protective. We use an example from non-communicable disease epidemiology to contextualize and explain the effect of conditioning on a collider. We generate a dataset with 1000 observations, and run Monte-Carlo simulations to estimate the effect of 24-h dietary sodium intake on systolic blood pressure, controlling for age, which acts as a confounder, and 24-h urinary protein excretion, which acts as a collider. We illustrate how adding a collider to a regression model introduces bias. Thus, to prevent paradoxical associations, epidemiologists estimating causal effects should be wary of conditioning on colliders. We provide R code in easy-to-read boxes throughout the manuscript, and a GitHub repository [https://github.com/migariane/ColliderApp] for the reader to reproduce our example. We also provide an educational web application allowing real-time interaction to visualize the paradoxical effect of conditioning on a collider [http://watzilei.com/shiny/collider/].

Funder

Spanish National Institute of Health, Carlos III Miguel Servet I Investigator Award

Andalusian Department of Health Research, Development and Innovation Office

National Institutes of Health

Doris Duke Charitable Foundation

New Investigator Salary Award from the Canadian Institutes of Health Research

Publisher

Oxford University Press (OUP)

Subject

General Medicine,Epidemiology

Reference27 articles.

Cited by 27 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Causal Inference Is Not Just a Statistics Problem;Journal of Statistics and Data Science Education;2023-12

2. Exploring Causality Aware Data Synthesis;The Third International Conference on Artificial Intelligence and Machine Learning Systems;2023-10-25

3. Reevaluating the protective effect of smoking on preeclampsia risk through the lens of bias;Journal of Human Hypertension;2023-04-11

4. Rationale and Design of the Diet, CKD, and Apolipoprotein L1 Study in Low-Income and Middle-Income Countries;Kidney International Reports;2023-04

5. Model selection in occupancy models: Inference versus prediction;Ecology;2023-01-18

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3