A simulation study comparing the power of nine tests of the treatment effect in randomized controlled trials with a time-to-event outcome-Reference-Cited by-同舟云学术

A simulation study comparing the power of nine tests of the treatment effect in randomized controlled trials with a time-to-event outcome

Published:2020-04-06 Issue:1 Volume:21 Page:
ISSN:1745-6215
Container-title:Trials
language:en
Short-container-title:Trials

Author:

Royston Patrick^ORCID,B. Parmar Mahesh K.

Abstract

Abstract Background The logrank test is routinely applied to design and analyse randomized controlled trials (RCTs) with time-to-event outcomes. Sample size and power calculations assume the treatment effect follows proportional hazards (PH). If the PH assumption is false, power is reduced and interpretation of the hazard ratio (HR) as the estimated treatment effect is compromised. Using statistical simulation, we investigated the type 1 error and power of the logrank (LR)test and eight alternatives. We aimed to identify test(s) that improve power with three types of non-proportional hazards (non-PH): early, late or near-PH treatment effects. Methods We investigated weighted logrank tests (early, LRE; late, LRL), the supremum logrank test (SupLR) and composite tests (joint, J; combined, C; weighted combined, WC; versatile and modified versatile weighted logrank, VWLR, VWLR2) with two or more components. Weighted logrank tests are intended to be sensitive to particular non-PH patterns. Composite tests attempt to improve power across a wider range of non-PH patterns. Using extensive simulations based on real trials, we studied test size and power under PH and under simple departures from PH comprising pointwise constant HRs with a single change point at various follow-up times. We systematically investigated the influence of high or low control-arm event rates on power. Results With no preconceived type of treatment effect, the preferred test is VWLR2. Expecting an early effect, tests with acceptable power are SupLR, C, VWLR2, J, LRE and WC. Expecting a late effect, acceptable tests are LRL, VWLR, VWLR2, WC and J. Under near-PH, acceptable tests are LR, LRE, VWLR, C, VWLR2 and SupLR. Type 1 error was well controlled for all tests, showing only minor deviations from the nominal 5%. The location of the HR change point relative to the cumulative proportion of control-arm events considerably affected power. Conclusions Assuming ignorance of the likely treatment effect, the best choice is VWLR2. Several non-standard tests performed well when the correct type of treatment effect was assumed. A low control-arm event rate reduced the power of weighted logrank tests targeting early effects. Test size was generally well controlled. Further investigation of test characteristics with different types of non-proportional hazards of the treatment effect is warranted.

Publisher

Springer Science and Business Media LLC

Subject

Pharmacology (medical),Medicine (miscellaneous)

Link

http://link.springer.com/content/pdf/10.1186/s13063-020-4153-2.pdf

Reference26 articles.

1. Trinquart L, Jacot J, Conner SC, Porcher R. Comparison of treatment effects measured by the hazard ratio and by the ratio of restricted mean survival times in oncology randomized controlled trials. J Clin Oncol. 2016; 34:1813–9. https://doi.org/10.1200/JCO.2015.64.2488.

2. Royston P, Choodari-Oskooei B, Parmar MKB, Rogers JK. Combined test versus logrank/Cox test in 50 randomised trials. Trials. 2019; 20:172. https://doi.org/10.1186/s13063-019-3251-5.

3. StataCorp. Stata Statistical Software: Release 15. College Station, TX: StataCorp LLC; 2017.

4. Fleming TR, Harrington DP. Counting processes and survival analysis. New York: Wiley; 1991.

5. Peto R, Peto J. Asymptotically efficient rank invariant test procedures. J Royal Stat Soc, Ser A. 1972; 135:185–207.

Cited by 24 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Sample Size Reestimation in Stochastic Curtailment Tests With Time‐to‐Events Outcome in the Case of Nonproportional Hazards Utilizing Two Weibull Distributions With Unknown Shape Parameters;Pharmaceutical Statistics;2024-08-18

2. Group sequential methods based on supremum logrank statistics under proportional and nonproportional hazards;Statistical Methods in Medical Research;2024-06-05

3. Visualizing hypothesis tests in survival analysis under anticipated delayed effects;Pharmaceutical Statistics;2024-05-06

4. Methods for non-proportional hazards in clinical trials: A systematic review;Statistical Methods in Medical Research;2024-04-09

5. Simultaneous inference procedures for the comparison of multiple characteristics of two survival functions;Statistical Methods in Medical Research;2024-03-11