A Time Series-Based Approach to Elastic Kubernetes Scaling-Reference-Cited by-同舟云学术

A Time Series-Based Approach to Elastic Kubernetes Scaling

Published:2024-01-08 Issue:2 Volume:13 Page:285
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Yuan Haibin¹,Liao Shengchen¹

Affiliation:

1. School of Automation Science and Electrical Engineering, Beihang University, Beijing 100191, China

Abstract

With the increasing popularity of cloud-native architectures and containerized applications, Kubernetes has become a critical platform for managing these applications. However, Kubernetes still faces challenges when it comes to resource management. Specifically, the platform cannot achieve timely scaling of the resources of applications when their workloads fluctuate, leading to insufficient resource allocation and potential service disruptions. To address this challenge, this study proposes a predictive auto-scaling Kubernetes Operator based on time series forecasting algorithms, aiming to dynamically adjust the number of running instances in the cluster to optimize resource management. In this study, the Holt–Winter forecasting method and the Gated Recurrent Unit (GRU) neural network, two robust time series forecasting algorithms, are employed and dynamically managed. To evaluate the effectiveness, we collected workload metrics from a deployed RESTful HTTP application, implemented predictive auto-scaling, and assessed the differences in service quality before and after the implementation. The experimental results demonstrate that the predictive auto-scaling component can accurately predict the future trend of the metrics and intelligently scale resources based on the prediction results, with a Mean Squared Error (MSE) of 0.00166. Compared to the deployment using a single algorithm, the cold start time is reduced by 1 h and 41 min, and the fluctuation in service quality is reduced by 83.3%. This process effectively enhances the quality of service and offers a novel solution for resource management in Kubernetes clusters.

Publisher

MDPI AG

Link

https://www.mdpi.com/2079-9292/13/2/285/pdf

Reference27 articles.

1. Using Xen and KVM as real-time hypervisors;Abeni;J. Syst. Archit.,2020

2. Malviya, A., and Dwivedi, R.K. (2022, January 23–25). A comparative analysis of container orchestration tools in cloud computing. Proceedings of the 2022 9th International Conference on Computing for Sustainable Global Development (INDIACom), New Delhi, India.

3. Docker [software engineering];Anderson;IEEE Softw.,2015

4. Cloud container technologies: A state-of-the-art review;Pahl;IEEE Trans. Cloud Comput.,2017

5. Shah, J., and Dubaria, D. (2019, January 7–9). Building modern clouds: Using docker, kubernetes and Google cloud platform. Proceedings of the 2019 IEEE 9th Annual Computing and Communication Workshop and Conference (CCWC), Las Vegas, NV, USA.

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Optimizing multi-time series forecasting for enhanced cloud resource utilization based on machine learning;Knowledge-Based Systems;2024-11

2. Optimizing resource allocation using proactive scaling with predictive models and custom resources;Computers and Electrical Engineering;2024-09

3. Predictive Energy Management for Docker Containers in Cloud Computing: A Time Series Analysis Approach;IEEE Access;2024