Reverse Bayesian Implications of p-Values Reported in Critical Care Randomized Trials-Reference-Cited by-同舟云学术

Reverse Bayesian Implications of p-Values Reported in Critical Care Randomized Trials

Published:2021-11-29 Issue: Volume: Page:088506662110537
ISSN:0885-0666
Container-title:Journal of Intensive Care Medicine
language:en
Short-container-title:J Intensive Care Med

Author:

Nostedt Sarah¹²,Joffe Ari R.¹²^ORCID

Affiliation:

1. University of Alberta, Edmonton, Alberta, Canada

2. Stollery Children's Hospital, Edmonton, Alberta, Canada

Abstract

Background Misinterpretations of the p-value in null-hypothesis statistical testing are common. We aimed to determine the implications of observed p-values in critical care randomized controlled trials (RCTs). Methods We included three cohorts of published RCTs: Adult-RCTs reporting a mortality outcome, Pediatric-RCTs reporting a mortality outcome, and recent Consecutive-RCTs reporting p-value ≤.10 in six higher-impact journals. We recorded descriptive information from RCTs. Reverse Bayesian implications of obtained p-values were calculated, reported as percentages with inter-quartile ranges. Results Obtained p-value was ≤.005 in 11/216 (5.1%) Adult-RCTs, 2/120 (1.7%) Pediatric-RCTs, and 37/90 (41.1%) Consecutive-RCTs. An obtained p-value .05–.0051 had high False Positive Rates; in Adult-RCTs, minimum (assuming prior probability of the alternative hypothesis was 50%) and realistic (assuming prior probability of the alternative hypothesis was 10%) False Positive Rates were 16.7% [11.2, 21.8] and 64.3% [53.2, 71.4]. An obtained p-value ≤.005 had lower False Positive Rates; in Adult-RCTs the realistic False Positive Rate was 7.7% [7.7, 16.0]. The realistic probability of the alternative hypothesis for obtained p-value .05–.0051 (ie, Positive Predictive Value) was 28.0% [24.1, 34.8], 30.6% [27.7, 48.5], 29.3% [24.3, 41.0], and 32.7% [24.1, 43.5] for Adult-RCTs, Pediatric-RCTs, Consecutive-RCTs primary and secondary outcome, respectively. The maximum Positive Predictive Value for p-value category .05–.0051 was median 77.8%, 79.8%, 78.8%, and 81.4% respectively. To have maximum or realistic Positive Predictive Value >90% or >80%, RCTs needed to have obtained p-value ≤.005. The credibility of p-value .05–.0051 findings were easy to challenge, and the credibility to rule-out an effect with p-value >.05 to .10 was low. The probability that a replication study would obtain p-value ≤.05 did not approach 90% unless the obtained p-value was ≤.005. Conclusions Unless the obtained p-value was ≤.005, the False Positive Rate was high, and the Positive Predictive Value and probability of replication of “statistically significant” findings were low.

Funder

University of Alberta, Department of Pediatrics

Publisher

SAGE Publications

Subject

Critical Care and Intensive Care Medicine

Link

http://journals.sagepub.com/doi/pdf/10.1177/08850666211053793

Reference45 articles.

1. Why Most Published Research Findings Are False

2. Toward Evidence-Based Medical Statistics. 1: The P Value Fallacy

3. Statistical tests, P values, confidence intervals, and power: a guide to misinterpretations

4. Moving to a World Beyond “p < 0.05”

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Heterogeneity, Bayesian thinking, and phenotyping in critical care: A primer;American Journal of Health-System Pharmacy;2024-05-14

2. A complicated Glenn procedure: risk factors and association with adverse long-term neurodevelopmental and functional outcomes;Cardiology in the Young;2022-08-24

3. Critical Care Randomized Trials Demonstrate Power Failure: A Low Positive Predictive Value of Findings in the Critical Care Research Field;Journal of Intensive Care Medicine;2022-02-18