Author:
Giustizia Joseph,Hodgson Wilson,Andress Cameron,Bilkhu Satpal,Macklin James A,Kess Tony
Abstract
AbstractAs sequencing technologies have matured, bioinformatics tasks have become more complex, computationally demanding, and data intensive. Workflow management software has been developed to aid in simplifying the replicable chaining of complex bioinformatics jobs, and cloud computing has emerged as a potential solution to the computational demands of this work. However, the capacity to effectively deploy these resources is limited by the expertise required to implement these solutions. Here, we develop Maloja, an easily deployed cloud workflow orchestrator. This tool interprets existing scientific workflows written in Snakemake and deploys them in appropriately scaled AWS cloud resources. We test the utility of this new toolset using previously published and custom built Snakemake workflows for ecological genomics tasks, revealing how this tool can facilitate the use of cloud resources without prior cloud architecture expertise.
Publisher
Cold Spring Harbor Laboratory