Affiliation:
1. School of Computer Science, Hangzhou Dianzi University, Hangzhou, China and Key Laboratory of Brain Machine Collaborative Intelligence of Zhejiang Province, Hangzhou, China
Abstract
Domain generalization primarily mitigates domain shift among multiple source domains, generalizing the trained model to an unseen target domain. However, the spurious correlation usually caused by context prior (e.g., background) makes it challenging to get rid of the domain shift. Therefore, it is critical to model the intrinsic causal mechanism. The existing domain generalization methods only attend to disentangle the semantic and context-related features by modeling the causation between input and labels, which totally ignores the unidentifiable but important confounders. In this article, a Causal Disentangled Intervention Model (CDIM) is proposed for the first time, to the best of our knowledge, to construct confounders via causal intervention. Specifically, a generative model is employed to disentangle the semantic and context-related features. The contextual information of each domain from generative model can be considered as a confounder layer, and the center of all context-related features is utilized for fine-grained hierarchical modeling of confounders. Then the semantic and confounding features from each layer are combined to train an unbiased classifier, which exhibits both transferability and robustness across an unknown distribution domain. CDIM is evaluated on three widely recognized benchmark datasets, namely, Digit-DG, PACS, and NICO, through extensive ablation studies. The experimental results clearly demonstrate that the proposed model achieves state-of-the-art performance.
Funder
National Natural Science Foundation of China
Key Research and Development Project of Zhejiang Province
Key Laboratory of Brain Machine Collaborative Intelligence of Zhejiang Province
Publisher
Association for Computing Machinery (ACM)
Reference61 articles.
1. Kartik Ahuja, Divyat Mahajan, Yixin Wang, and Yoshua Bengio. 2023. Interventional causal representation learning. In International Conference on Machine Learning. PMLR, 372–407.
2. Kartik Ahuja, Karthikeyan Shanmugam, Kush Varshney, and Amit Dhurandhar. 2020. Invariant risk minimization games. In International Conference on Machine Learning. PMLR, 145–155.
3. Invariant risk minimization;Arjovsky Martin;arXiv:1907.02893,2019
4. DecAug: Out-of-Distribution Generalization via Decomposed Feature Representation and Semantic Augmentation
5. Metareg: Towards domain generalization using meta-regularization;Balaji Yogesh;Advances in Neural Information Processing Systems,2018