Author:
Hunt Martin,Clark Steven,Mejia Daniel,Desai Saaketh,Strachan Alejandro
Abstract
Just like the scientific data they generate, simulation workflows for research should be findable, accessible, interoperable, and reusable (FAIR). However, while significant progress has been made towards FAIR data, the majority of science and engineering workflows used in research remain poorly documented and often unavailable, involving ad hoc scripts and manual steps, hindering reproducibility and stifling progress. We introduce Sim2Ls (pronounced simtools) and the Sim2L Python library that allow developers to create and share end-to-end computational workflows with well-defined and verified inputs and outputs. The Sim2L library makes Sim2Ls, their requirements, and their services discoverable, verifies inputs and outputs, and automatically stores results in a globally-accessible simulation cache and results database. This simulation ecosystem is available in nanoHUB, an open platform that also provides publication services for Sim2Ls, a computational environment for developers and users, and the hardware to execute runs and store results at no cost. We exemplify the use of Sim2Ls using two applications and discuss best practices towards FAIR simulation workflows and associated data.
Funder
National Science Foundation
National Nuclear Security Administration
Publisher
Public Library of Science (PLoS)
Reference38 articles.
1. Reproducibility crisis;Monya Baker;Nature,2016
2. What does research reproducibility mean?;Steven N Goodman;Science translational medicine,2016
3. Machine learning for molecular and materials science;Keith T Butler;Nature,2018
4. Data-driven materials science: status, challenges, and perspectives;Lauri Himanen;Advanced Science,2019
5. The fair guiding principles for scientific data management and stewardship;Mark D Wilkinson;Scientific data,2016
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献