Causal Factor Disentanglement for Few-Shot Domain Adaptation in Video Prediction

Published: 2023-11-17 Issue: 11 Volume: 25 Page: 1554
ISSN: 1099-4300
Container-title: Entropy
Language: en
Short-container-title: Entropy

Author:

Cornille Nathan¹, Laenen Katrien¹, Sun Jingyuan¹, Moens Marie-Francine¹

Affiliation:

1. Language Intelligence and Information Retrieval (LIIR) Lab, Department of Computer Science, KU Leuven, 3001 Leuven, Belgium

Abstract

An important challenge in machine learning is performing with accuracy when few training samples are available from the target distribution. If a large number of training samples from a related distribution are available, transfer learning can be used to improve the performance. This paper investigates how to do transfer learning more effectively if the source and target distributions are related through a Sparse Mechanism Shift for the application of next-frame prediction. We create Sparse Mechanism Shift-TempoRal Intervened Sequences (SMS-TRIS), a benchmark to evaluate transfer learning for next-frame prediction derived from the TRIS datasets. We then propose to exploit the Sparse Mechanism Shift property of the distribution shift by disentangling the model parameters with regard to the true causal mechanisms underlying the data. We use the Causal Identifiability from TempoRal Intervened Sequences (CITRIS) model to achieve this disentanglement via causal representation learning. We show that encouraging disentanglement with the CITRIS extensions can improve performance, but their effectiveness varies depending on the dataset and backbone used. We find that it is effective only when encouraging disentanglement actually succeeds in increasing disentanglement. We also show that an alternative method designed for domain adaptation does not help, indicating the challenging nature of the SMS-TRIS benchmark.

Funder

European Research Council

Research Foundation—Flanders

Publisher

MDPI AG

Subject

General Physics and Astronomy

Link

https://www.mdpi.com/1099-4300/25/11/1554/pdf
