Author:
Liu Peng,Zhao Weisen,Zhang Baoliang,Wang Jing
Abstract
Abstract
To harness the advantages of both proactive and responsive scaling, adapting to various workload scenarios, this paper introduces a container hybrid scaling strategy called HyPredRL, rooted in load prediction and reinforcement learning. Within the proactive scaling module RL-PM, a load prediction model, MSC-LSTM, predict workloads and, in conjunction with current workload states, leverages reinforcement learning agents for intelligent scaling decisions. The responsive scaling strategy, SLA-HPA, enhances Kubernetes’ native scaling strategy, which primarily considers resource utilization, by incorporating response time metrics. Ultimately, a hybrid scaling controller is designed, applying the principles of “rapid scaling out” and “balanced conflicts” to coordinate proactive and responsive scaling. Experimental results demonstrate that HyPredRL outperforms existing methods in SLA violation rate, resource utilization, and request response time, effectively improving application performance and scalability.
Reference16 articles.
1. Cloud Computing: Architecture and Key Technologies[J];Luo;Journal of Communications,2011
2. Elasticity in cloud computing: state of the art and research challenges[J];Al-Dhuraibi;IEEE Transactions on Services Computing,2017
3. Reactive resource provisioning heuristics for dynamic dataflows on cloud infrastructure[J];Kumbhare;IEEE Transactions on Cloud Computing,2015
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献