Affiliation:
1. Zhejiang University
2. Emory University
3. The University of Melbourne
4. Alibaba Group
5. Zhejiang University, Shanghai Institute for Advanced Study of Zhejiang University, ShanghaiAI Laboratory
6. Zhejiang University, Key Laboratory for Corneal Diseases Research of Zhejiang Province
Abstract
Domain generalization (DG) aims to learn from multiple source domains a model that can generalize well on unseen target domains. Existing DG methods mainly learn the representations with invariant marginal distribution of the input features, however, the invariance of the conditional distribution of the labels given the input features is more essential for unknown domain prediction. Meanwhile, the existing of unobserved confounders which affect the input features and labels simultaneously cause spurious correlation and hinder the learning of the invariant relationship contained in the conditional distribution. Interestingly, with a causal view on the data generating process, we find that the input features of one domain are valid instrumental variables for other domains. Inspired by this finding, we propose an instrumental variable-driven DG method (IV-DG) by removing the bias of the unobserved confounders with two-stage learning. In the first stage, it learns the conditional distribution of the input features of one domain given input features of another domain. In the second stage, it estimates the relationship by predicting labels with the learned conditional distribution. Theoretical analyses and simulation experiments show that it accurately captures the invariant relationship. Extensive experiments on real-world datasets demonstrate that IV-DG method yields state-of-the-art results.
Funder
National Key Research and Development
National Natural Science Foundation of China
Young Elite Scientists Sponsorship Program by CAST
Zhejiang Provincial Natural Science Foundation of China
Major Technological Innovation Project of Hangzhou
Zhejiang Province Natural Science Foundation
Project by Shanghai AI Laboratory
Program of Zhejiang Province Science and Technology
StarryNight Science Fund of Zhejiang University Shanghai Institute for Advanced Study
Fundamental Research Funds for the Central Universities
Publisher
Association for Computing Machinery (ACM)
Reference86 articles.
1. Mostly Harmless Econometrics
2. Martin Arjovsky Léon Bottou Ishaan Gulrajani and David Lopez-Paz. 2019. Invariant risk minimization. arXiv:1907.02893. Retrieved from https://arxiv.org/abs/1907.02893.
3. Yogesh Balaji, Swami Sankaranarayanan, and Rama Chellappa. 2018. Metareg: Towards domain generalization using meta-regularization. In Proceedings of the 32nd International Conference on Neural Information Processing Systems. 998–1008.
4. A theory of learning from different domains
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Domaindiff: Boost out-of-Distribution Generalization with Synthetic Data;ICASSP 2024 - 2024 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP);2024-04-14