Abstract
AbstractMachine learning (ML) models are widely used in life sciences and medicine; however, they are scattered across various platforms and there are several challenges that hinder their accessibility, reproducibility and reuse. In this manuscript, we present the formalisation and pilot implementation of community protocol to enable FAIReR (Findable, Accessible, Interoperable, Reusable, and Reproducible) sharing of ML models. The protocol consists of eight steps, including sharing model training code, dataset information, reproduced figures, model evaluation metrics, trained models, Dockerfiles, model metadata, and FAIR dissemination. Applying these measures we aim to build and share a comprehensive public collection of FAIR ML models in the BioModels repository through incentivized community curation. In a pilot implementation, we curated diverse ML models to demonstrate the feasibility of our approach and we discussed the current challenges. Building a FAIReR collection of ML models will directly enhance the reproducibility and reusability of ML models, minimising the effort needed to reimplement models, maximising the impact on the application and significantly accelerating the advancement in the field of life science and medicine.
Publisher
Cold Spring Harbor Laboratory