Statistical significance and its critics: practicing damaging science, or damaging scientific practice?-Reference-Cited by-同舟云学术

Statistical significance and its critics: practicing damaging science, or damaging scientific practice?

Published:2022-05-12 Issue:3 Volume:200 Page:
ISSN:1573-0964
Container-title:Synthese
language:en
Short-container-title:Synthese

Author:

Mayo Deborah G.^ORCID,Hand David

Abstract

AbstractWhile the common procedure of statistical significance testing and its accompanying concept of p-values have long been surrounded by controversy, renewed concern has been triggered by the replication crisis in science. Many blame statistical significance tests themselves, and some regard them as sufficiently damaging to scientific practice as to warrant being abandoned. We take a contrary position, arguing that the central criticisms arise from misunderstanding and misusing the statistical tools, and that in fact the purported remedies themselves risk damaging science. We argue that banning the use of p-value thresholds in interpreting data does not diminish but rather exacerbates data-dredging and biasing selection effects. If an account cannot specify outcomes that will not be allowed to count as evidence for a claim—if all thresholds are abandoned—then there is no test of that claim. The contributions of this paper are: To explain the rival statistical philosophies underlying the ongoing controversy; To elucidate and reinterpret statistical significance tests, and explain how this reinterpretation ameliorates common misuses and misinterpretations; To argue why recent recommendations to replace, abandon, or retire statistical significance undermine a central function of statistics in science: to test whether observed patterns in the data are genuine or due to background variability.

Publisher

Springer Science and Business Media LLC

Subject

General Social Sciences,Philosophy

Link

https://link.springer.com/content/pdf/10.1007/s11229-022-03692-0.pdf

Reference102 articles.

1. Altman, D., & Bland, J. (1995). Absence of evidence is not evidence of absence. BMJ, 311(7003), 485. https://doi.org/10.1136/bmj.311.7003.485

2. Amrhein, V., Greenland, S., & McShane, B. (2019). Comment: Scientists rise up against statistical significance. Nature, 567, 305–307. https://doi.org/10.1038/d41586-019-00857-9

3. Barnard, G. (1972). The logic of statistical inference (Review of “The Logic of Statistical Inference” by Ian Hacking). British Journal for the Philosophy of Science, 23(2), 123–132. https://doi.org/10.1093/bjps/23.2.123

4. Bayarri, M., & Berger, J. (2004). the interplay of Bayesian and frequentist analysis. Statistical Science, 19(1), 58–80. https://doi.org/10.1214/088342304000000116

5. Benjamin, D., Berger, J., Johannesson, M., et al. (2018). Redefine statistical significance. Nature Human Behaviour, 2, 6–10. https://doi.org/10.1038/s41562-017-0189-z

Cited by 13 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. A guide to interpreting systematic reviews and meta-analyses in neurosurgery and surgery;Acta Neurochirurgica;2024-06-04

2. How and why alpha should depend on sample size: A Bayesian-frequentist compromise for significance testing;Strategic Organization;2024-01-06

3. Use and misuse of corrections for multiple testing;Methods in Psychology;2023-11

4. Integrating Artificial Intelligence and Machine Learning Into Cancer Clinical Trials;Seminars in Radiation Oncology;2023-10

5. Forecasting Future Crime Rates;Journal of Contemporary Criminal Justice;2023-08-06