Lock-contention-aware scheduler

Published:2013-01 Issue:4 Volume:9 Page:1-25
ISSN:1544-3566
Container-title:ACM Transactions on Architecture and Code Optimization
language:en
Short-container-title:ACM Trans. Archit. Code Optim.

Author:

Cui Yan¹, Wang Yingxin¹, Chen Yu¹, Shi Yuanchun¹

Affiliation:

1. Tsinghua University, Beijing, China

Abstract

In response to the increasing ubiquity of multicore processors, there has been widespread development of multithreaded applications that strive to realize their full potential. Unfortunately, lock contention within operating systems can limit the scalability of multicore systems so severely that an increase in the number of cores can actually lead to reduced performance (i.e., scalability collapse). Existing efforts to solve scalability collapse mainly focus on making critical sections of kernel code fine-grained or designing new synchronization primitives. However, these methods have disadvantages in scalability or energy efficiency. In this article, we observe that the percentage of lock-waiting time over the total execution time for a lock-intensive task has a significant correlation with the occurrence of scalability collapse. Based on this observation, a lock-contention-aware scheduler is proposed. Specifically, each task in the scheduler continuously monitors its percentage of lock-waiting time. If the percentage exceeds a predefined threshold, the task is considered lock intensive and is migrated to a Special Set of Cores (SSC). In this way, the number of concurrently running lock-intensive tasks is limited to the number of cores in the SSC, and therefore the degree of lock contention is controlled. A central challenge of this scheme is deciding how many cores should be allocated to the SSC to handle lock-intensive tasks. In our scheduler, the optimal number of cores is determined online by a model-driven search. The proposed scheduler is implemented in a recent Linux kernel and evaluated using micro- and macrobenchmarks on AMD and Intel 32-core systems. Experimental results suggest that our proposal is able to remove scalability collapse completely and sustain the maximal throughput of the spin-lock-based system for most applications. Furthermore, the percentage of lock-waiting time can be reduced by up to 84%. When compared with scalability collapse reduction methods such as the requester-based locking scheme and sleeping-based synchronization primitives, our scheme exhibits significant advantages in scalability, power consumption, and energy efficiency.
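
The placement rule described in the abstract can be illustrated with a small user-space sketch. This is not the authors' kernel implementation: the names `Task`, `partition_cores`, and `place_tasks`, the 30% threshold, and the round-robin assignment are all assumptions made for illustration, and the SSC size is passed in as a fixed number rather than found by the model-driven search the abstract mentions.

```python
from dataclasses import dataclass

# Hypothetical value: the abstract only says "a predefined threshold".
LOCK_WAIT_THRESHOLD = 0.30


@dataclass
class Task:
    name: str
    lock_wait_time: float  # time spent waiting on locks in a sampling window
    total_time: float      # total execution time in the same window

    def lock_wait_ratio(self) -> float:
        """Fraction of execution time spent waiting on locks."""
        if self.total_time <= 0:
            return 0.0
        return self.lock_wait_time / self.total_time


def partition_cores(num_cores: int, ssc_size: int):
    """Split core IDs into (SSC cores, ordinary cores)."""
    if not 0 < ssc_size < num_cores:
        raise ValueError("ssc_size must be between 1 and num_cores - 1")
    cores = list(range(num_cores))
    return cores[:ssc_size], cores[ssc_size:]


def place_tasks(tasks, num_cores, ssc_size, threshold=LOCK_WAIT_THRESHOLD):
    """Map each task name to a core ID.

    Tasks whose lock-wait ratio exceeds the threshold are confined to the
    SSC, so at most ssc_size of them can run at the same time. Round-robin
    within each core set is an assumption of this sketch.
    """
    ssc, ordinary = partition_cores(num_cores, ssc_size)
    placement = {}
    ssc_next = ordinary_next = 0
    for task in tasks:
        if task.lock_wait_ratio() > threshold:
            placement[task.name] = ssc[ssc_next % len(ssc)]
            ssc_next += 1
        else:
            placement[task.name] = ordinary[ordinary_next % len(ordinary)]
            ordinary_next += 1
    return placement


if __name__ == "__main__":
    demo = [Task("a", 8, 10), Task("b", 1, 10), Task("c", 5, 10), Task("d", 6, 10)]
    print(place_tasks(demo, num_cores=8, ssc_size=2))
```

With 8 cores and an SSC of 2, the three lock-intensive tasks in the demo share cores 0 and 1 while the remaining task runs on an ordinary core, which is the sense in which the degree of lock contention is bounded by the SSC size.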

Funder

National Natural Science Foundation of China

Publisher

Association for Computing Machinery (ACM)

Subject

Hardware and Architecture, Information Systems, Software

Link

https://dl.acm.org/doi/pdf/10.1145/2400682.2400703

Reference 27 articles.

1. 380801 Power Analyzer. 2012. http://extech.com/instruments/product.asp?catid=14&prodid=205.

2. Adaptive Spinning Mutexes. 2009. http://lkml.org/lkml/2009/1/14/393.

3. The kill rule for multicore

4. Experience distributing objects in an SMMP OS

Cited by 11 articles.

1. nOS-V: Co-Executing HPC Applications Using System-Wide Task Scheduling;2024 IEEE International Parallel and Distributed Processing Symposium (IPDPS);2024-05-27

2. GiantVM: A Novel Distributed Hypervisor for Resource Aggregation with DSM-aware Optimizations;ACM Transactions on Architecture and Code Optimization;2022-03-07

3. Toggle: Contention-Aware Task Scheduler for Concurrent Hierarchical Operations;Lecture Notes in Computer Science;2019

4. tScale: A Contention-Aware Multithreaded Framework for Multicore Multiprocessor Systems;2017 IEEE 23rd International Conference on Parallel and Distributed Systems (ICPADS);2017-12

5. A Distributed Process Management Model for Better Scalability on Multicore Platform;Chinese Journal of Electronics;2017-03