Affiliation:
1. Leeds Institute for Data Analytics, University of Leeds, Leeds, UK
2. School of Medicine, University of Leeds, Leeds, UK
3. The Alan Turing Institute, London, UK
Abstract
Abstract
Background
Compositional data comprise the parts of some whole, for which all parts sum to that whole. They are prevalent in many epidemiological contexts. Although many of the challenges associated with analysing compositional data have been discussed previously, we do so within a formal causal framework by utilizing directed acyclic graphs (DAGs).
Methods
We depict compositional data using DAGs and identify two distinct effect estimands in the generic case: (i) the total effect, and (ii) the relative effect. We consider each in the context of three specific example scenarios involving compositional data: (1) the relationship between the economically active population and area-level gross domestic product; (2) the relationship between fat consumption and body weight; and (3) the relationship between time spent sedentary and body weight. For each, we consider the distinct interpretation of each effect, and the resulting implications for related analyses.
Results
For scenarios (1) and (2), both the total and relative effects may be identifiable and causally meaningful, depending upon the specific question of interest. For scenario (3), only the relative effect is identifiable. In all scenarios, the relative effect represents a joint effect, and thus requires careful interpretation.
Conclusions
DAGs are useful for considering causal effects for compositional data. In all analyses involving compositional data, researchers should explicitly consider and declare which causal effect is sought and how it should be interpreted.
Funder
Economic and Social Research Council
Medical Research Council
The Alan Turing Institute
Publisher
Oxford University Press (OUP)
Subject
General Medicine,Epidemiology
Reference17 articles.
1. The statistical analysis of compositional data;Aitchison;J R Stat Soc Ser B,1982
2. Principles of compositional data analysis
3. Understanding Simpson's paradox;Pearl;Am Stat,2013
Cited by
34 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献