Is There Evidence of P-Hacking in Imaging Research?

Author:

Rooprai Paul1,Islam Nayaar2ORCID,Salameh Jean-Paul3,Ebrahimzadeh Sanam3,Kazi Abrar4,Frank Robert5,Ramsay Tim6,Mathur Maya B.7,Absi Marissa3,Khalil Ahmed3,Kazi Sakib3,Dawit Haben5,Lam Eric5,Fabiano Nicholas1,McInnes Matthew D. F.1

Affiliation:

1. Faculty of Medicine, University of Ottawa, Ottawa, ON, Canada

2. School of Epidemiology and Public Health, University of Ottawa, Ottawa, ON, Canada

3. Department of Radiology, Faculty of Medicine, University of Ottawa, Ottawa, ON, Canada

4. Carleton University, Ottawa, ON, Canada

5. Department of Radiology, Faculty of Medicine, Ottawa Hospital, Ottawa, ON, Canada

6. Clinical Epidemiology Program, Ottawa Hospital Research Institute, Ottawa, ON, Canada

7. Quantitative Sciences Unit and Department of Pediatrics, Stanford University, Ottawa, ON, Canada

Abstract

Background P-hacking, the tendency to run selective analyses until they become significant, is prevalent in many scientific disciplines. Purpose This study aims to assess if p-hacking exists in imaging research. Methods Protocol, data, and code available here https://osf.io/xz9ku/?view_only=a9f7c2d841684cb7a3616f567db273fa . We searched imaging journals Ovid MEDLINE from 1972 to 2021. Text mining using Python script was used to collect metadata: journal, publication year, title, abstract, and P-values from abstracts. One P-value was randomly sampled per abstract. We assessed for evidence of p-hacking using a p-curve, by evaluating for a concentration of P-values just below .05. We conducted a one-tailed binomial test (α = .05 level of significance) to assess whether there were more P-values falling in the upper range (e.g., .045 < P < .05) than in the lower range (e.g., .04 < P < .045). To assess variation in results introduced by our random sampling of a single P-value per abstract, we repeated the random sampling process 1000 times and pooled results across the samples. Analysis was done (divided into 10-year periods) to determine if p-hacking practices evolved over time. Results Our search of 136 journals identified 967,981 abstracts. Text mining identified 293,687 P-values, and a total of 4105 randomly sampled P-values were included in the p-hacking analysis. The number of journals and abstracts that were included in the analysis as a fraction and percentage of the total number was, respectively, 108/136 (80%) and 4105/967,981 (.4%). P-values did not concentrate just under .05; in fact, there were more P-values falling in the lower range (e.g., .04 < P < .045) than falling just below .05 (e.g., .045 < P < .05), indicating lack of evidence for p-hacking. Time trend analysis did not identify p-hacking in any of the five 10-year periods. Conclusion We did not identify evidence of p-hacking in abstracts published in over 100 imaging journals since 1972. These analyses cannot detect all forms of p-hacking, and other forms of bias may exist in imaging research such as publication bias and selective outcome reporting.

Funder

University of Ottawa Department of Radiology Research Stipend Program

Publisher

SAGE Publications

Subject

Radiology, Nuclear Medicine and imaging,General Medicine

Cited by 3 articles. 订阅此论文施引文献 订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献

1. Canadian radiology: 2024 update;Diagnostic and Interventional Imaging;2024-06

2. Best Practice for MRI Diagnostic Accuracy Research With Lessons and Examples from the LI‐RADS Individual Participant Data Group;Journal of Magnetic Resonance Imaging;2023-10-11

3. Editor’s Corner: August 2023;Canadian Association of Radiologists Journal;2023-02-28

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3