The Limit of Horizontal Scaling in Public Clouds-Reference-Cited by-同舟云学术

The Limit of Horizontal Scaling in Public Clouds

Published:2020-02-07 Issue:1 Volume:5 Page:1-22
ISSN:2376-3639
Container-title:ACM Transactions on Modeling and Performance Evaluation of Computing Systems
language:en
Short-container-title:ACM Trans. Model. Perform. Eval. Comput. Syst.

Author:

Jiang Qingye¹,Lee Young Choon²,Zomaya Albert Y.³

Affiliation:

1. The University of Sydney, Australia

2. Macquarie University, Australia

3. The University of Sydney

Abstract

Public cloud users are educated to practice horizontal scaling at the application level, with the assumption that more processing capacity can be achieved by adding nodes into the server fleet. In reality, however, applications—even those specifically designed to be horizontally scalable—often face unpredictable scalability issues when running at scale. In this article, we study the limit of horizontal scaling in public clouds by identifying sources of such limitations and quantitatively measuring their impact on processing capacity. To this end, we develop ScaleBench as a distributed and parallel cloud-scale testing framework and propose a capacity degradation index (CDI) to describe the level of capacity degradation observed in our benchmark studies. We have conducted extensive experiments in four real public clouds to identify possible bottlenecks in compute, block storage, networking, and object storage. Further, we carry out large-scale experiments with a real-life video transcoding application on worker fleets with up to 3200 vCPU cores. Our experimental results provide the quantitative evidence on the limit of horizontal scaling in public clouds. This helps cloud users make better design decisions on horizontally scalable applications.

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications,Hardware and Architecture,Safety, Risk, Reliability and Quality,Media Technology,Information Systems,Software,Computer Science (miscellaneous)

Link

https://dl.acm.org/doi/pdf/10.1145/3373356

Reference48 articles.

1. HPC Benchmarks on Amazon EC2

2. Validity of the single processor approach to achieving large scale computing capabilities

3. Empirical evaluation of latency-sensitive application performance in the cloud

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Challenges and Specificities of Adopting Continuous Integration within Scalable Cloud Environments;2023 IEEE 18th International Conference on Computer Science and Information Technologies (CSIT);2023-10-19

2. AI/ML for Service-Level Objectives;Edge Intelligence;2022-11-28

3. SENTRY: Selective Entropy Optimization via Committee Consistency for Unsupervised Domain Adaptation;2021 IEEE/CVF International Conference on Computer Vision (ICCV);2021-10

4. SLO Script: A Novel Language for Implementing Complex Cloud-Native Elasticity-Driven SLOs;2021 IEEE International Conference on Web Services (ICWS);2021-09

5. A Novel Middleware for Efficiently Implementing Complex Cloud-Native SLOs;2021 IEEE 14th International Conference on Cloud Computing (CLOUD);2021-09