Proactive Task Offloading for Load Balancing in Iterative Applications

Published:2023 Page:263-275
ISSN:0302-9743
Container-title:Parallel Processing and Applied Mathematics

Author:

Chung Minh Thanh, Weidendorfer Josef, Fürlinger Karl, Kranzlmüller Dieter

Abstract

Load imbalance is often a challenge for applications in parallel systems. Static cost models and pre-partitioning algorithms distribute the load at the beginning. Nevertheless, dynamic changes during execution or inaccurate cost indicators may lead to imbalance at runtime. Reactive work-stealing strategies can help monitor the execution and perform task migration to balance the load. However, the benefits depend on migration overhead and assumptions about future execution. Our proactive approach further improves existing solutions by applying machine learning to online load prediction. Following that, we propose a fully distributed algorithm for adapting the prediction result to guide task offloading. The experiments are performed with an artificial test case and a realistic application named Sam(oa)$^2$ on three systems with different communication overhead. Our results confirm improvements for important use cases compared to previous solutions. Furthermore, this approach can support co-scheduling tasks across multiple applications.

Publisher

Springer International Publishing

Link

https://link.springer.com/content/pdf/10.1007/978-3-031-30442-2_20

Reference: 28 articles.

1. Amiri, M., et al.: Survey on prediction models of applications for resources provisioning in cloud. J. Netw. Comput. Appl. 82, 93–113 (2017). https://doi.org/10.1016/j.jnca.2017.01.016

2. Blumofe, R.D., Joerg, C.F., et al.: Cilk: an efficient multithreaded runtime system. SIGPLAN Not. 30(8), 207–216 (1995). https://doi.org/10.1145/209937.209958

3. Carrington, L.C., Laurenzano, M., et al.: How well can simple metrics represent the performance of HPC applications? In: Proceedings of the ACM/IEEE Conference on Supercomputing (2005). https://doi.org/10.1109/SC.2005.33

4. Catalyurek, U.V., Boman, E.G., et al.: Hypergraph-based dynamic load balancing for adaptive scientific computations. In: International Parallel and Distributed Processing Symposium, pp. 1–11 (2007). https://doi.org/10.1109/IPDPS.2007.370258

5. Chow, Y.C., et al.: Models for dynamic load balancing in a heterogeneous multiple processor system. IEEE Trans. Comput. C-28(5), 354–361 (1979)

Cited by 1 article.

1. From reactive to proactive load balancing for task‐based parallel applications in distributed memory machines;Concurrency and Computation: Practice and Experience;2023-06-26