Analytical code sharing practices in biomedical research

Author:

Sharma Nitesh Kumar1,Ayyala Ram2,Deshpande Dhrithi1,Patel Yesha1,Munteanu Viorel3ORCID,Ciorba Dumitru3ORCID,Bostan Viorel3,Fiscutean Andrada4,Vahed Mohammad1,Sarkar Aditya5,Guo Ruiwei6,Moore Andrew7,Darci-Maher Nicholas8,Nogoy Nicole9,Abedalthagafi Malak1011,Mangul Serghei12

Affiliation:

1. Titus Family Department of Clinical Pharmacy, University of Southern California, Los Angeles, California, United States

2. Quantitative and Computational Biology Department, University of Southern California, Los Angeles, California, United States

3. Department of Computers, Informatics and Microelectronics, Technical University of Moldova, Chisinau, Moldova

4. Faculty of Journalism and Communication Studies, University of Bucharest, Bucharest, Romania

5. School of Computing and Electrical Engineering, Indian Institute of Technology Mandi, Kamand, Himachal Pradesh, India

6. Department of Pharmacology and Pharmaceutical Sciences, University of Southern California, Los Angeles, California, United States

7. Daniel J. Epstein Department of Industrial and Systems Engineering, University of Southern California, Los Angeles, California, United States

8. Computational and Systems Biology, University of California, Los Angeles, Los Angeles, California, United States

9. GigaScience Press, Shek Mun, Hong Kong

10. Department of Pathology & Laboratory Medicine, Emory University Hospital, Atlanta, Georgia, United States

11. King Salman Center for Disability Research, Riyadh, Saudi Arabia

Abstract

Data-driven computational analysis is becoming increasingly important in biomedical research, as the amount of data being generated continues to grow. However, the lack of practices of sharing research outputs, such as data, source code and methods, affects transparency and reproducibility of studies, which are critical to the advancement of science. Many published studies are not reproducible due to insufficient documentation, code, and data being shared. We conducted a comprehensive analysis of 453 manuscripts published between 2016–2021 and found that 50.1% of them fail to share the analytical code. Even among those that did disclose their code, a vast majority failed to offer additional research outputs, such as data. Furthermore, only one in ten articles organized their code in a structured and reproducible manner. We discovered a significant association between the presence of code availability statements and increased code availability. Additionally, a greater proportion of studies conducting secondary analyses were inclined to share their code compared to those conducting primary analyses. In light of our findings, we propose raising awareness of code sharing practices and taking immediate steps to enhance code availability to improve reproducibility in biomedical research. By increasing transparency and reproducibility, we can promote scientific rigor, encourage collaboration, and accelerate scientific discoveries. We must prioritize open science practices, including sharing code, data, and other research products, to ensure that biomedical research can be replicated and built upon by others in the scientific community.

Funder

National Science Foundation

National Institutes of Health

Publisher

PeerJ

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3