Analysis of workflow schedulers in simulated distributed environments-Reference-Cited by-同舟云学术

Analysis of workflow schedulers in simulated distributed environments

Published:2022-04-14 Issue:13 Volume:78 Page:15154-15180
ISSN:0920-8542
Container-title:The Journal of Supercomputing
language:en
Short-container-title:J Supercomput

Author:

Beránek Jakub^ORCID,Böhm Stanislav,Cima Vojtěch

Abstract

AbstractTask graphs provide a simple way to describe scientific workflows (sets of tasks with dependencies) that can be executed on both HPC clusters and in the cloud. An important aspect of executing such graphs is the used scheduling algorithm. Many scheduling heuristics have been proposed in existing works; nevertheless, they are often tested in oversimplified environments. We provide an extensible simulation environment designed for prototyping and benchmarking task schedulers, which contains implementations of various scheduling algorithms and is open-sourced, in order to be fully reproducible. We use this environment to perform a comprehensive analysis of workflow scheduling algorithms with a focus on quantifying the effect of scheduling challenges that have so far been mostly neglected, such as delays between scheduler invocations or partially unknown task durations. Our results indicate that network models used by many previous works might produce results that are off by an order of magnitude in comparison to a more realistic model. Additionally, we show that certain implementation details of scheduling algorithms which are often neglected can have a large effect on the scheduler’s performance, and they should thus be described in great detail to enable proper evaluation.

Publisher

Springer Science and Business Media LLC

Subject

Hardware and Architecture,Information Systems,Theoretical Computer Science,Software

Link

https://link.springer.com/content/pdf/10.1007/s11227-022-04438-y.pdf

Reference47 articles.

1. Adam TL, Chandy KM, Dickson JR (1974) A comparison of list schedules for parallel processing systems. Commun ACM 17(12):685–690. https://doi.org/10.1145/361604.361619

2. Adhikari M, Amgoth T, Srirama SN (2019) A survey on scheduling strategies for workflows in cloud environment and emerging trends. ACM Comput Surv 52(4):5097. https://doi.org/10.1145/3325097

3. Amstutz P, Crusoe MR, Tijanić N et al (2016) Common workflow language, v1.0. https://doi.org/10.6084/m9.figshare.3115156.v2

4. Babuji Y, Woodard A, Li Z, et al (2019) Parsl: pervasive parallel programming in python. In: Proceedings of the 28th International Symposium on High-Performance Parallel and Distributed Computing. Association for Computing Machinery, New York, NY, USA, HPDC’19, pp 25–36. https://doi.org/10.1145/3307681.3325400

5. Bauer M, Garland M (2019) Legate numpy: accelerated and distributed array computing. In: Proceedings of the International Conference for High Performance Computing, Networking, Storage and Analysis. Association for Computing Machinery, New York, NY, USA, SC’19. https://doi.org/10.1145/3295500.3356175

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. EasyDock: customizable and scalable docking tool;Journal of Cheminformatics;2023-11-01