Affiliation:
1. Aix‐Marseille University AMSE and CNRS Marseille France
2. Aix‐Marseille Graduate School of Management ‐ IAE Marseille France
Abstract
AbstractDespite their high predictive performance, random forest and gradient boosting are often considered as black boxes which has raised concerns from practitioners and regulators. As an alternative, we suggest using partial linear models that are inherently interpretable. Specifically, we propose to combine parametric and non‐parametric functions to accurately capture linearities and non‐linearities prevailing between dependent and explanatory variables, and a variable selection procedure to control for overfitting issues. Estimation relies on a two‐step procedure building upon the double residual method. We illustrate the predictive performance and interpretability of our approach on a regression problem.
Funder
Agence Nationale de la Recherche
Subject
Statistics, Probability and Uncertainty,Economics and Econometrics,Social Sciences (miscellaneous),Statistics and Probability