Affiliation:
1. East China Normal University and Alibaba Group
2. Alibaba Group
3. Aalborg University
4. East China Normal University
Abstract
Predictive autoscaling is a key enabler for optimizing cloud resource allocation in Alibaba Cloud's computing platforms, which dynamically adjust the
Elastic Compute Service
(ECS) instances based on predicted user demands to ensure Quality of Service (QoS). However, user demands in the cloud are often highly complex, with high
uncertainty
and
scale-sensitive
temporal dependencies, thus posing great challenges for accurate prediction of future demands. These in turn make autoscaling challenging---autoscaling needs to properly account for demand uncertainty while maintaining a reasonable trade-off between two contradictory factors, i.e., low instance running costs vs. low QoS violation risks.
To address the above challenges, we propose a novel predictive autoscaling framework
MagicScaler
, consisting of a Multi-scale attentive Gaussian process based predictor and an uncertainty-aware scaler. First, the predictor carefully bridges the best of two successful prediction methodologies---multi-scale attention mechanisms, which are good at capturing complex, multi-scale features, and stochastic process regression, which can quantify prediction uncertainty, thus achieving accurate demand prediction with quantified uncertainty. Second, the scaler takes the quantified future demand uncertainty into a judiciously designed loss function with stochastic constraints, enabling flexible trade-off between running costs and QoS violation risks. Extensive experiments on three clusters of Alibaba Cloud in different Chinese cities demonstrate the effectiveness and efficiency of
MagicScaler
, which outperforms other commonly adopted scalers, thus justifying our design choices.
Publisher
Association for Computing Machinery (ACM)
Subject
General Earth and Planetary Sciences,Water Science and Technology,Geography, Planning and Development
Reference50 articles.
1. Burst-Aware Predictive Autoscaling for Containerized Microservices
2. Muhammad Abdullah , Waheed Iqbal , Abdelkarim Erradi , and Faisal Bukhari . 2019 . Learning Predictive Autoscaling Policies for Cloud-Hosted Microservices Using Trace-Driven Modeling. In 2019 IEEE International Conference on Cloud Computing Technology and Science (CloudCom) , Sydney, Australia, December 11--13 , 2019. IEEE, 119--126. Muhammad Abdullah, Waheed Iqbal, Abdelkarim Erradi, and Faisal Bukhari. 2019. Learning Predictive Autoscaling Policies for Cloud-Hosted Microservices Using Trace-Driven Modeling. In 2019 IEEE International Conference on Cloud Computing Technology and Science (CloudCom), Sydney, Australia, December 11--13, 2019. IEEE, 119--126.
3. Theory and applications of HVAC control systems – A review of model predictive control (MPC)
4. Elasticity in Cloud Computing: State of the Art and Research Challenges
5. The views, measurements and challenges of elasticity in the cloud: A review
Cited by
5 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献