An Approximation Approach for Response-Adaptive Clinical Trial Design-Reference-Cited by-同舟云学术

An Approximation Approach for Response-Adaptive Clinical Trial Design

Published:2020-05-28 Issue: Volume: Page:
ISSN:1091-9856
Container-title:INFORMS Journal on Computing
language:en
Short-container-title:INFORMS Journal on Computing

Author:

Ahuja Vishal¹^ORCID,Birge John R.²

Affiliation:

1. Cox School of Business, Southern Methodist University, Dallas, Texas 75275;

2. Booth School of Business, University of Chicago, Chicago, Illinois 60637

Abstract

Multiarmed bandit (MAB) problems, typically modeled as Markov decision processes (MDPs), exemplify the learning versus earning trade-off. An area that has motivated theoretical research in MAB designs is the study of clinical trials, where the application of such designs has the potential to significantly improve patient outcomes. However, for many practical problems of interest, the state space is intractably large, rendering exact approaches to solving MDPs impractical. In particular, settings that require multiple simultaneous allocations lead to an expanded state and action-outcome space, necessitating the use of approximation approaches. We propose a novel approximation approach that combines the strengths of multiple methods: grid-based state discretization, value function approximation methods, and techniques for a computationally efficient implementation. The hallmark of our approach is the accurate approximation of the value function that combines linear interpolation with bounds on interpolated value and the addition of a learning component to the objective function. Computational analysis on relevant datasets shows that our approach outperforms existing heuristics (e.g., greedy and upper confidence bound family of algorithms) and a popular Lagrangian-based approximation method, where we find that the average regret improves by up to 58.3%. A retrospective implementation on a recently conducted phase 3 clinical trial shows that our design could have reduced the number of failures by 17% relative to the randomized control design used in that trial. Our proposed approach makes it practically feasible for trial administrators and regulators to implement Bayesian response-adaptive designs on large clinical trials with potential significant gains.

Publisher

Institute for Operations Research and the Management Sciences (INFORMS)

Subject

General Engineering

Reference56 articles.

1. Relaxations of Weakly Coupled Stochastic Dynamic Programs

2. Agrawal S, Goyal N (2012) Analysis of Thompson sampling for the multi-armed bandit problem. Mannor S, Srebro N, Williamson RC, eds. Proc. 25th Annual Conf. Learn. Theory, vol. 23 (PMLR), 39.1–39.26.

3. Response-adaptive designs for clinical trials: Simultaneous learning from multiple patients

4. A Partially Observed Markov Decision Process for Dynamic Pricing

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Information-Directed Policy Sampling for Episodic Bayesian Markov Decision Processes;IISE Transactions;2024-08-19

2. A simulation-based approximate dynamic programming approach to dynamic and stochastic resource-constrained multi-project scheduling problem;European Journal of Operational Research;2023-11

3. Data-driven adaptive testing resource allocation strategies for real-time monitoring of infectious diseases;IISE Transactions;2023-10-04

4. Generalisations of a Bayesian decision-theoretic randomisation procedure and the impact of delayed responses;Computational Statistics & Data Analysis;2021-12

5. An Analytics‐Driven Approach for Optimal Individualized Diabetes Screening;Production and Operations Management;2021-06-11