Affiliation:
1. Department of Biostatistics, University of North Carolina at Chapel Hill, Chapel Hill , NC 27599 , USA
2. Department of Biostatistics, School of Public Health, University of Michigan , Ann Arbor, MI 48109 , USA
Abstract
Abstract
Sequential multiple assignment randomized trials (SMARTs) are the gold standard for estimating optimal dynamic treatment regimes (DTRs), but are costly and require a large sample size. We introduce the multi-stage augmented Q-learning estimator (MAQE) to improve efficiency of estimation of optimal DTRs by augmenting SMART data with observational data. Our motivating example comes from the Back Pain Consortium, where one of the overarching aims is to learn how to tailor treatments for chronic low back pain to individual patient phenotypes, knowledge which is lacking clinically. The Consortium-wide collaborative SMART and observational studies within the Consortium collect data on the same participant phenotypes, treatments, and outcomes at multiple time points, which can easily be integrated. Previously published single-stage augmentation methods for integration of trial and observational study (OS) data were adapted to estimate optimal DTRs from SMARTs using Q-learning. Simulation studies show the MAQE, which integrates phenotype, treatment, and outcome information from multiple studies over multiple time points, more accurately estimates the optimal DTR, and has a higher average value than a comparable Q-learning estimator without augmentation. We demonstrate this improvement is robust to a wide range of trial and OS sample sizes, addition of noise variables, and effect sizes.
Funder
National Institutes of Health
Publisher
Oxford University Press (OUP)
Reference23 articles.
1. Introduction to SMART designs for the development of adaptive interventions: with application to weight loss research;Almirall;Translational Behavioral Medicine,2014
2. The Back Pain Consortium (BACPAC) research program data harmonization: Rationale for data elements and standards;Batorsky;Pain Medicine,2023
3. A markovian decision process;Bellman;Journal of Mathematics and Mechanics,1957
4. Dynamic treatment regimes;Chakraborty;Annual Review of Statistics and Its Application,2014
5. Statistical learning methods for optimizing dynamic treatment regimes in subgroup identification;Chen,2020