Performance Analysis of Work Stealing in Large-scale Multithreaded Computing-Reference-Cited by-同舟云学术

Performance Analysis of Work Stealing in Large-scale Multithreaded Computing

Published:2021-06-30 Issue:2 Volume:6 Page:1-28
ISSN:2376-3639
Container-title:ACM Transactions on Modeling and Performance Evaluation of Computing Systems
language:en
Short-container-title:ACM Trans. Model. Perform. Eval. Comput. Syst.

Author:

Sonenberg Nikki¹^ORCID,Kielanski Grzegorz²,Van Houdt Benny²

Affiliation:

1. The Alan Turing Institute, United Kingdom

2. University of Antwerp, Antwerpen, Belgium

Abstract

Randomized work stealing is used in distributed systems to increase performance and improve resource utilization. In this article, we consider randomized work stealing in a large system of homogeneous processors where parent jobs spawn child jobs that can feasibly be executed in parallel with the parent job. We analyse the performance of two work stealing strategies: one where only child jobs can be transferred across servers and the other where parent jobs are transferred. We define a mean-field model to derive the response time distribution in a large-scale system with Poisson arrivals and exponential parent and child job durations. We prove that the model has a unique fixed point that corresponds to the steady state of a structured Markov chain, allowing us to use matrix analytic methods to compute the unique fixed point. The accuracy of the mean-field model is validated using simulation. Using numerical examples, we illustrate the effect of different probe rates, load, and different child job size distributions on performance with respect to the two stealing strategies, individually, and compared to each other.

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications,Hardware and Architecture,Safety, Risk, Reliability and Quality,Media Technology,Information Systems,Software,Computer Science (miscellaneous)

Link

https://dl.acm.org/doi/pdf/10.1145/3470887

Reference27 articles.

1. M. Bladt and B. F. Nielsen. 2017. Matrix-exponential Distributions in Applied Probability. Vol. 81. Springer. M. Bladt and B. F. Nielsen. 2017. Matrix-exponential Distributions in Applied Probability. Vol. 81. Springer.

2. Cilk: An efficient multithreaded runtime system;Blumofe R.;J. Parallel Distrib. Comput.,1996

3. Scheduling multithreaded computations by work stealing;Blumofe R.;J. ACM,1999

4. Asymptotic independence of queues under randomized load balancing. Queue;Bramson M.;Syst.,2012

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Dynamic load balancing in energy packet networks;Performance Evaluation;2024-08

2. A Product-form Network for Systems with Job Stealing Policies;ACM Transactions on Modeling and Performance Evaluation of Computing Systems;2024-03-18

3. Performance Analysis of Work Stealing Strategies in Large-Scale Multithreaded Computing;ACM Transactions on Modeling and Computer Simulation;2023-10-26

4. A Blind Load-Balancing Algorithm (BLBA) for Distributing Tasks in Fog Nodes;Wireless Communications and Mobile Computing;2022-08-11

5. Performance Analysis of Work Stealing Strategies in Large Scale Multi-threaded Computing;Quantitative Evaluation of Systems;2021