Abstract
SummaryMaking reproducible, auditable and scalable data-processing analysis workflows is an important challenge in the field of bioinformatics. Recently, software containers and cloud computing introduced a novel solution to address these challenges. They simplify software installation, management and reproducibility by packaging tools and their dependencies. In this work we implemented a cloud provider agnostic and scalable container orchestration setup for the popular Galaxy workflow environment. This solution enables Galaxy to run on and offload jobs to most cloud providers (e.g. Amazon Web Services, Google Cloud or OpenStack, among others) through the Kubernetes container orchestrator.AvailabilityAll code has been contributed to the Galaxy Project and is available (since Galaxy 17.05) at https://github.com/galaxyproject/ in the galaxy and galaxy-kubernetes repositories. https://public.phenomenal-h2020.eu/ is an example deployment.Suppl. InformationSupplementary Files are available online.Contactpmoreno@ebi.ac.uk, European Molecular Biology Laboratory, EMBL-EBI, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK, Tel: +44-1223-494267, Fax: +44-1223-484696.
Publisher
Cold Spring Harbor Laboratory
Reference10 articles.
1. CloudMan as a platform for tool, data, and analysis distribution
2. The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2018 update
3. Agapito, G. et al. (2017) Parallel and Cloud-Based Analysis of Omics Data: Modelling and Simulation in Medicine. In, 2017 25th Euromicro International Conference on Parallel, Distributed and Network-based Processing (PDP)., pp. 519–526.
4. Grüning, B. , Chilton, J. , et al. (2018) bgruening/docker-galaxy-stable: Galaxy Docker Image 18.05.
5. Bioconda: sustainable and comprehensive software distribution for the life sciences;Nat. Methods,2018
Cited by
11 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献