Affiliation:
1. The Pennsylvania State University, University Park, PA, USA
2. Oak Ridge National Laboratory, Oak Ridge, TN, USA
Abstract
Scheduling multiple jobs onto a platform enhances system utilization by sharing resources. The benefits from higher resource utilization include reduced cost to construct, operate, and maintain a system, which often include energy consumption. Maximizing these benefits, while satisfying performance limits, comes at a price -- resource contention among jobs increases job completion time. In this paper, we analyze slow-downs of jobs due to contention for multiple resources in a system; referred to as
dilation factor
. We observe that multiple-resource contention creates non-linear dilation factors of jobs. From this observation, we establish a general quantitative model for dilation factors of jobs in multi-resource systems. A job is characterized by a vector-valued loading statistics and dilation factors of a job set are given by a quadratic function of their loading vectors. We demonstrate how to systematically characterize a job, maintain the data structure to calculate the dilation factor (loading matrix), and calculate the dilation factor of each job. We validated the accuracy of the model with multiple processes running on a native Linux server, virtualized servers, and with multiple MapReduce workloads co-scheduled in a cluster. Evaluation with measured data shows that the D-factor model has an error margin of less than 16%. We also show that the model can be integrated with an existing on-line scheduler to minimize the makespan of workloads.
Publisher
Association for Computing Machinery (ACM)
Subject
Computer Networks and Communications,Hardware and Architecture,Software
Reference48 articles.
1. Apache Hadoop. http://hadoop.apache.org. Apache Hadoop. http://hadoop.apache.org.
2. FileBench. http://www.solarisinternals.com/wiki/index.php/FileBench. FileBench. http://www.solarisinternals.com/wiki/index.php/FileBench.
3. A view of cloud computing
4. Xen and the art of virtualization
5. Coordinated management of multiple interacting resources in chip multiprocessors: A machine learning approach
Cited by
24 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献