Model-Free Approximate Bayesian Learning for Large-Scale Conversion Funnel Optimization-Reference-Cited by-同舟云学术

Model-Free Approximate Bayesian Learning for Large-Scale Conversion Funnel Optimization

Published:2024-03 Issue:3 Volume:33 Page:775-794
ISSN:1059-1478
Container-title:Production and Operations Management
language:en
Short-container-title:Production and Operations Management

Author:

Iyengar Garud¹,Singal Raghav²^ORCID

Affiliation:

1. IEOR, Columbia University, New York, NY, USA

2. Tuck School of Business, Dartmouth College, Hanover, NH, USA

Abstract

The flexibility of choosing the ad action as a function of the consumer state is critical for modern-day marketing campaigns. We study the problem of identifying the optimal sequential personalized interventions that maximize the adoption probability for a new product. We model consumer behavior by a conversion funnel that captures the state of each consumer (e.g., interaction history with the firm) and allows the consumer behavior to vary as a function of both her state and firm’s sequential interventions. We show our model captures consumer behavior with very high accuracy (out-of-sample area under the curve of over 0.95) in a real-world email marketing dataset. However, it results in a very large-scale learning problem, where the firm must learn the state-specific effects of various interventions from consumer interactions. We propose a novel attribution-based decision-making algorithm for this problem that we call model-free approximate Bayesian learning. Our algorithm inherits the interpretability and scalability of Thompson sampling for bandits and maintains an approximate belief over the value of each state-specific intervention. The belief is updated as the algorithm interacts with the consumers. Despite being an approximation to the Bayes update, we prove the asymptotic optimality of our algorithm and analyze its convergence rate. We show that our algorithm significantly outperforms traditional approaches on extensive simulations calibrated to a real-world email marketing dataset.

Publisher

SAGE Publications

Link

https://journals.sagepub.com/doi/pdf/10.1177/10591478241231857

Reference41 articles.

1. Abe N, Pednault E, Wang H, et al. (2002) Empirical comparison of various reinforcement learning strategies for sequential targeted marketing. In: IEEE international conference on data mining (eds V Kumar, S Tsurnoto, N Zhong, et al.), Maebashi City, Japan, 9–12 December, pp.3–10. New York, NY: IEEE.

2. Agrawal S, Goyal N (2012) Analysis of Thompson sampling for the multi-armed bandit problem. In: Conference on learning theory, Vol. 23, (eds S Mannor, N Srebro and RC Williamson), Edinburgh, Scotland, 25–27 June, pp.39.1–39.26. PMLR.

3. Agrawal S, Goyal N (2013) Thompson sampling for contextual bandits with linear payoffs. In: International conference on machine learning (eds S Dasgupta and D McAllester), Atlanta, USA, 16–21 June, pp.127–135. PMLR.

4. Learning the Minimal Representation of a Dynamic System from Transition Data

5. Beyond the Last Touch: Attribution in Online Advertising