Facilitating bioinformatics reproducibility with QIIME 2 Provenance Replay
-
Published:2023-11-27
Issue:11
Volume:19
Page:e1011676
-
ISSN:1553-7358
-
Container-title:PLOS Computational Biology
-
language:en
-
Short-container-title:PLoS Comput Biol
Author:
Keefe Christopher R.ORCID,
Dillon Matthew R.,
Gehret Elizabeth,
Herman Chloe,
Jewell Mary,
Wood Colin V.,
Bolyen Evan,
Caporaso J. GregoryORCID
Abstract
Study reproducibility is essential to corroborate, build on, and learn from the results of scientific research but is notoriously challenging in bioinformatics, which often involves large data sets and complex analytic workflows involving many different tools. Additionally, many biologists are not trained in how to effectively record their bioinformatics analysis steps to ensure reproducibility, so critical information is often missing. Software tools used in bioinformatics can automate provenance tracking of the results they generate, removing most barriers to bioinformatics reproducibility. Here we present an implementation of that idea, Provenance Replay, a tool for generating new executable code from results generated with the QIIME 2 bioinformatics platform, and discuss considerations for bioinformatics developers who wish to implement similar functionality in their software.
Funder
National Cancer Institute
Publisher
Public Library of Science (PLoS)
Subject
Computational Theory and Mathematics,Cellular and Molecular Neuroscience,Genetics,Molecular Biology,Ecology,Modeling and Simulation,Ecology, Evolution, Behavior and Systematics
Reference31 articles.
1. Social, behavioral, and economic sciences perspectives on robust and reliable science;JT Cacioppo;Report of the Subcommittee on Replicability in Science Advisory Committee to the National Science Foundation Directorate for Social, Behavioral, and Economic Sciences.,2015
2. Peer review: still king in the digital age.;D Nicholas;Learn Publ,2015
3. Estimating the reproducibility of psychological science.;Open Science Collaboration;Science,2015