Causal hybrid modeling with double machine learning—applications in carbon flux modeling-Reference-Cited by-同舟云学术

Causal hybrid modeling with double machine learning—applications in carbon flux modeling

Published:2024-07-19 Issue:3 Volume:5 Page:035021
ISSN:2632-2153
Container-title:Machine Learning: Science and Technology
language:
Short-container-title:Mach. Learn.: Sci. Technol.

Author:

Cohrs Kai-Hendrik^ORCID,Varando Gherardo^ORCID,Carvalhais Nuno^ORCID,Reichstein Markus^ORCID,Camps-Valls Gustau^ORCID

Abstract

Abstract Hybrid modeling integrates machine learning with scientific knowledge to enhance interpretability, generalization, and adherence to natural laws. Nevertheless, equifinality and regularization biases pose challenges in hybrid modeling to achieve these purposes. This paper introduces a novel approach to estimating hybrid models via a causal inference framework, specifically employing double machine learning (DML) to estimate causal effects. We showcase its use for the Earth sciences on two problems related to carbon dioxide fluxes. In the Q 10 model, we demonstrate that DML-based hybrid modeling is superior in estimating causal parameters over end-to-end deep neural network approaches, proving efficiency, robustness to bias from regularization methods, and circumventing equifinality. Our approach, applied to carbon flux partitioning, exhibits flexibility in accommodating heterogeneous causal effects. The study emphasizes the necessity of explicitly defining causal graphs and relationships, advocating for this as a general best practice. We encourage the continued exploration of causality in hybrid models for more interpretable and trustworthy results in knowledge-guided machine learning.

Funder

European Research Council

Publisher

IOP Publishing

Link

https://iopscience.iop.org/article/10.1088/2632-2153/ad5a60/pdf

Reference96 articles.

1. Segment anything;Kirillov,2023

2. Language models are few-shot learners;Brown,2020

3. Pushing the limits of semi-supervised learning for automatic speech recognition;Zhang,2022

4. The unreasonable effectiveness of data;Halevy;IEEE Intell. Syst.,2009

5. The mythos of model interpretability;Zachary;Queue,2018