Ensuring scientific reproducibility in bio-macromolecular modeling via extensive, automated benchmarks

Author:

Koehler Leman JuliaORCID,Lyskov SergeyORCID,Lewis Steven M.,Adolf-Bryfogle JaredORCID,Alford Rebecca F.,Barlow KyleORCID,Ben-Aharon ZivORCID,Farrell DanielORCID,Fell JasonORCID,Hansen William A.,Harmalkar AmeyaORCID,Jeliazkov JeliazkoORCID,Kuenze Georg,Krys Justyna D.ORCID,Ljubetič AjasjaORCID,Loshbaugh Amanda L.,Maguire Jack,Moretti Rocco,Mulligan Vikram Khipple,Nance Morgan L.ORCID,Nguyen Phuong T.,Ó Conchúir Shane,Roy Burman Shourya S.ORCID,Samanta Rituparna,Smith Shannon T.ORCID,Teets Frank,Tiemann Johanna K. S.ORCID,Watkins Andrew,Woods HopeORCID,Yachnin Brahm J.ORCID,Bahl Christopher D.,Bailey-Kellogg Chris,Baker DavidORCID,Das RhijuORCID,DiMaio Frank,Khare Sagar D.,Kortemme Tanja,Labonte Jason W.,Lindorff-Larsen KrestenORCID,Meiler JensORCID,Schief WilliamORCID,Schueler-Furman OraORCID,Siegel Justin B.,Stein AmelieORCID,Yarov-Yarovoy VladimirORCID,Kuhlman BrianORCID,Leaver-Fay AndrewORCID,Gront Dominik,Gray Jeffrey J.ORCID,Bonneau RichardORCID

Abstract

AbstractEach year vast international resources are wasted on irreproducible research. The scientific community has been slow to adopt standard software engineering practices, despite the increases in high-dimensional data, complexities of workflows, and computational environments. Here we show how scientific software applications can be created in a reproducible manner when simple design goals for reproducibility are met. We describe the implementation of a test server framework and 40 scientific benchmarks, covering numerous applications in Rosetta bio-macromolecular modeling. High performance computing cluster integration allows these benchmarks to run continuously and automatically. Detailed protocol captures are useful for developers and users of Rosetta and other macromolecular modeling tools. The framework and design concepts presented here are valuable for developers and users of any type of scientific software and for the scientific community to create reproducible methods. Specific examples highlight the utility of this framework, and the comprehensive documentation illustrates the ease of adding new tests in a matter of hours.

Funder

Simons Foundation

Publisher

Springer Science and Business Media LLC

Subject

General Physics and Astronomy,General Biochemistry, Genetics and Molecular Biology,General Chemistry

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3