<i>Microbench:</i> automated metadata management for systems biology benchmarking and reproducibility in Python-Reference-Cited by-同舟云学术

Microbench: automated metadata management for systems biology benchmarking and reproducibility in Python

Published:2022-08-24 Issue:20 Volume:38 Page:4823-4825
ISSN:1367-4803
Container-title:Bioinformatics
language:en
Short-container-title:

Author:

Lubbock Alexander L R¹²^ORCID,Lopez Carlos F¹²³^ORCID

Affiliation:

1. Department of Biochemistry, Vanderbilt University , Nashville, TN 37232, USA

2. Vanderbilt-Ingram Cancer Center, Vanderbilt University , Nashville, TN 37232, USA

3. Department of Biomedical Informatics, Vanderbilt University Medical Center , Nashville, TN 37203, USA

Abstract

Abstract Motivation Computational systems biology analyses typically make use of multiple software and their dependencies, which are often run across heterogeneous compute environments. This can introduce differences in performance and reproducibility. Capturing metadata (e.g. package versions, GPU model) currently requires repetitious code and is difficult to store centrally for analysis. Even where virtual environments and containers are used, updates over time mean that versioning metadata should still be captured within analysis pipelines to guarantee reproducibility. Results Microbench is a simple and extensible Python package to automate metadata capture to a file or Redis database. Captured metadata can include execution time, software package versions, environment variables, hardware information, Python version and more, with plugins. We present three case studies demonstrating Microbench usage to benchmark code execution and examine environment metadata for reproducibility purposes. Availability and implementation Install from the Python Package Index using pip install microbench. Source code is available from https://github.com/alubbock/microbench. Supplementary information Supplementary data are available at Bioinformatics online.

Funder

National Science Foundation

National Cancer Institute

Publisher

Oxford University Press (OUP)

Subject

Computational Mathematics,Computational Theory and Mathematics,Computer Science Applications,Molecular Biology,Biochemistry,Statistics and Probability

Link

https://academic.oup.com/bioinformatics/advance-article-pdf/doi/10.1093/bioinformatics/btac580/45722165/btac580.pdf

Reference7 articles.

1. An introduction to Docker for reproducible research;Boettiger;SIGOPS Oper. Syst. Rev,2015

2. Tellurium: a Python based modeling and reproducibility platform for systems biology;Choi,2016

3. The role of metadata in reproducible computational research;Leipzig;Patterns,2021

4. Programming biological models in Python using PySB;Lopez;Mol. Syst. Biol,2013

5. Continuous integration and its tools;Meyer;IEEE Softw,2014

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Modeling of Financial Risk Control Imbalance Dataset Based on Benchmarking Management Optimization Algorithm;Lecture Notes in Electrical Engineering;2024