Causal Discovery from Multiple Data Sets with Non-Identical Variable Sets-Reference-Cited by-同舟云学术

Causal Discovery from Multiple Data Sets with Non-Identical Variable Sets

Published:2020-04-03 Issue:06 Volume:34 Page:10153-10161
ISSN:2374-3468
Container-title:Proceedings of the AAAI Conference on Artificial Intelligence
language:
Short-container-title:AAAI

Author:

Huang Biwei,Zhang Kun,Gong Mingming,Glymour Clark

Abstract

A number of approaches to causal discovery assume that there are no hidden confounders and are designed to learn a fixed causal model from a single data set. Over the last decade, with closer cooperation across laboratories, we are able to accumulate more variables and data for analysis, while each lab may only measure a subset of them, due to technical constraints or to save time and cost. This raises a question of how to handle causal discovery from multiple data sets with non-identical variable sets, and at the same time, it would be interesting to see how more recorded variables can help to mitigate the confounding problem. In this paper, we propose a principled method to uniquely identify causal relationships over the integrated set of variables from multiple data sets, in linear, non-Gaussian cases. The proposed method also allows distribution shifts across data sets. Theoretically, we show that the causal structure over the integrated set of variables is identifiable under testable conditions. Furthermore, we present two types of approaches to parameter estimation: one is based on maximum likelihood, and the other is likelihood free and leverages generative adversarial nets to improve scalability of the estimation procedure. Experimental results on various synthetic and real-world data sets are presented to demonstrate the efficacy of our methods.

Publisher

Association for the Advancement of Artificial Intelligence (AAAI)

Subject

General Medicine

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Causal Dataset Discovery with Large Language Models;Proceedings of the 2024 Workshop on Human-In-the-Loop Data Analytics;2024-06-14

2. Call for Papers: Special Issue on Learning from Multiple Data Sources for Decision Making in Health Care;Journal of Biomedical Informatics;2024-05

3. Towards Privacy-Aware Causal Structure Learning in Federated Setting;IEEE Transactions on Big Data;2023-12

4. Discovering Invariant and Changing Mechanisms from Data;Proceedings of the 28th ACM SIGKDD Conference on Knowledge Discovery and Data Mining;2022-08-14

5. Causal inference in AI education: A primer;Journal of Causal Inference;2022-01-01