Affiliation:
1. Institute of Technology, University of Washington, Tacoma, WA, USA
2. Department of Clinical Investigation, Madigan Army Medical Center, Tacoma, WA, USA
Abstract
Objective
Bioinformatics publications typically include complex software workflows that are difficult to describe in a manuscript. We describe and demonstrate the use of interactive software notebooks to document and distribute bioinformatics research. We provide a user-friendly tool, BiocImageBuilder, that allows users to easily distribute their bioinformatics protocols through interactive notebooks uploaded to either a GitHub repository or a private server.
Materials and methods
We present four interactive Jupyter notebooks that use R and Bioconductor workflows to infer differential gene expression, analyze cross-platform datasets, and process RNA-seq and KinomeScan data. These notebooks are available on GitHub. The analytical results can be viewed in a browser and, most importantly, the software contents can be executed and modified. This is accomplished using Binder, which runs each notebook inside a software container, thus avoiding the need to install any software and ensuring reproducibility. All the notebooks were produced using custom files generated by BiocImageBuilder.
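As a minimal illustration of how a notebook hosted on GitHub can be opened through Binder, the sketch below builds a launch link. It assumes the URL scheme of the public mybinder.org service (`/v2/gh/<owner>/<repo>/<ref>` with an optional `filepath` query parameter), which may differ from the scheme in use when this work was published. The function name, repository name, and notebook path are hypothetical placeholders, not the authors' actual repositories.

```python
from urllib.parse import quote

# Base URL of the public Binder service for GitHub repositories (assumed scheme).
BINDER_BASE = "https://mybinder.org/v2/gh"


def binder_launch_url(owner, repo, ref="HEAD", notebook=None):
    """Build a Binder launch link for a GitHub repository.

    owner, repo : GitHub account and repository name
    ref         : branch, tag, or commit to build the container from
    notebook    : optional path of the notebook to open after launch
    """
    # Encode each path segment fully; a "/" in a branch name becomes %2F.
    parts = [quote(part, safe="") for part in (owner, repo, ref)]
    url = BINDER_BASE + "/" + "/".join(parts)
    if notebook:
        # Keep "/" readable inside the notebook path.
        url += "?filepath=" + quote(notebook, safe="/")
    return url


if __name__ == "__main__":
    # Hypothetical repository and notebook, for illustration only.
    print(binder_launch_url("example-lab", "rnaseq-notebook",
                            ref="master",
                            notebook="notebooks/analysis.ipynb"))
```

Pinning `ref` to a tag or commit rather than a moving branch is one way to keep the launched environment tied to the version of the workflow that was published.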
Results
BiocImageBuilder facilitates the publication of workflows with a point-and-click user interface. We demonstrate that interactive notebooks can be used to disseminate a wide range of bioinformatics analyses. The use of software containers to mirror the original software environment ensures reproducibility of results. Parameters and code can be dynamically modified, allowing for robust verification of published results and encouraging rapid adoption of new methods.
Conclusion
Given the increasing complexity of bioinformatics workflows, we anticipate that these interactive software notebooks will become as necessary and as ubiquitous for documenting software methods as traditional laboratory notebooks have been for documenting bench protocols.
Funder
National Institutes of Health
Publisher
Oxford University Press (OUP)
Cited by
24 articles.