Affiliation:
1. Department of Computer Science, Stanford University, 353 Serra Mall, Stanford, California 94305, U.S.A.
2. Stanford Graduate School of Business, 655 Knight Way, Stanford, California 94305, U.S.A.
Abstract
Summary
Flexible estimation of heterogeneous treatment effects lies at the heart of many statistical applications, such as personalized medicine and optimal resource allocation. In this article we develop a general class of two-step algorithms for heterogeneous treatment effect estimation in observational studies. First, we estimate marginal effects and treatment propensities to form an objective function that isolates the causal component of the signal. Then, we optimize this data-adaptive objective function. The proposed approach has several advantages over existing methods. From a practical perspective, our method is flexible and easy to use: in both steps, any loss-minimization method can be employed, such as penalized regression, deep neural networks, or boosting; moreover, these methods can be fine-tuned by cross-validation. Meanwhile, in the case of penalized kernel regression, we show that our method has a quasi-oracle property. Even when the pilot estimates for marginal effects and treatment propensities are not particularly accurate, we achieve the same error bounds as an oracle with prior knowledge of these two nuisance components. We implement variants of our approach based on penalized regression, kernel ridge regression, and boosting in a variety of simulation set-ups, and observe promising performance relative to existing baselines.
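The two-step procedure described in the summary can be illustrated with a small sketch. The summary does not spell out the objective function, so the code assumes a residual-on-residual squared loss: the outcome residual Y - m(X) is regressed on the treatment residual W - e(X) multiplied by a candidate effect function tau(X). Several other choices are also assumptions made for illustration and are not taken from the text: the pilot estimates are cross-fitted, tau is linear in the covariates, both steps use ridge-penalized fits written in plain NumPy, and the data are simulated. All function and variable names are hypothetical.

```python
import numpy as np

def add_intercept(X):
    return np.column_stack([np.ones(len(X)), X])

def ridge_penalty(d, lam):
    # Ridge penalty matrix that leaves the intercept unpenalized.
    P = lam * np.eye(d)
    P[0, 0] = 0.0
    return P

def ridge_fit(X, y, lam=1.0):
    # Closed-form ridge regression (pilot estimate of m(x) = E[Y | X = x]).
    Xb = add_intercept(X)
    return np.linalg.solve(Xb.T @ Xb + ridge_penalty(Xb.shape[1], lam), Xb.T @ y)

def logistic_fit(X, w, lam=1.0, iters=25):
    # Ridge-penalized logistic regression via Newton steps
    # (pilot estimate of the propensity e(x) = P[W = 1 | X = x]).
    Xb = add_intercept(X)
    P = ridge_penalty(Xb.shape[1], lam)
    beta = np.zeros(Xb.shape[1])
    for _ in range(iters):
        p = 1.0 / (1.0 + np.exp(-(Xb @ beta)))
        grad = Xb.T @ (p - w) + P @ beta
        hess = (Xb * (p * (1.0 - p))[:, None]).T @ Xb + P + 1e-8 * np.eye(len(beta))
        beta = beta - np.linalg.solve(hess, grad)
    return beta

def two_step_effects(X, y, w, n_folds=5, lam=1.0, seed=0):
    n = len(y)
    folds = np.random.default_rng(seed).permutation(n) % n_folds
    m_hat, e_hat = np.empty(n), np.empty(n)
    # Step 1: cross-fitted pilot estimates of the two nuisance components.
    for k in range(n_folds):
        tr, te = folds != k, folds == k
        Xte = add_intercept(X[te])
        m_hat[te] = Xte @ ridge_fit(X[tr], y[tr], lam)
        e_hat[te] = 1.0 / (1.0 + np.exp(-(Xte @ logistic_fit(X[tr], w[tr], lam))))
    e_hat = np.clip(e_hat, 0.01, 0.99)
    y_res, w_res = y - m_hat, w - e_hat
    # Step 2: minimize sum_i (y_res_i - w_res_i * tau(x_i))^2 + penalty,
    # with tau(x) = [1, x] @ theta. This is a ridge regression of y_res
    # on the covariates scaled row-wise by w_res.
    Z = add_intercept(X) * w_res[:, None]
    return np.linalg.solve(Z.T @ Z + ridge_penalty(Z.shape[1], lam), Z.T @ y_res)

# Simulated observational data (an assumption, for illustration only).
rng = np.random.default_rng(1)
n, d = 4000, 3
X = rng.normal(size=(n, d))
e_true = 1.0 / (1.0 + np.exp(-0.5 * X[:, 0]))      # treatment depends on X
w = (rng.uniform(size=n) < e_true).astype(float)
tau_true = 1.0 + 0.5 * X[:, 1]                     # heterogeneous effect
y = X[:, 0] + X[:, 2] + w * tau_true + rng.normal(size=n)

theta_hat = two_step_effects(X, y, w)
print("estimated [intercept, x0, x1, x2] coefficients of tau:", theta_hat)
```

In this simulation the true effect is tau(x) = 1 + 0.5 x1, so the fitted coefficients should land near (1, 0, 0.5, 0). The ridge fits in either step could be replaced by any other loss-minimization method, such as boosting or a neural network, with tuning parameters chosen by cross-validation.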
Publisher
Oxford University Press (OUP)
Subject
Applied Mathematics; Statistics, Probability and Uncertainty; General Agricultural and Biological Sciences; Agricultural and Biological Sciences (miscellaneous); General Mathematics; Statistics and Probability
Cited by
195 articles.