SLA-Adaptive Threshold Adjustment for a Kubernetes Horizontal Pod Autoscaler-Reference-Cited by-同舟云学术

SLA-Adaptive Threshold Adjustment for a Kubernetes Horizontal Pod Autoscaler

Published:2024-03-27 Issue:7 Volume:13 Page:1242
ISSN:2079-9292
Container-title:Electronics
language:en
Short-container-title:Electronics

Author:

Pozdniakova Olesia¹^ORCID,Mažeika Dalius¹^ORCID,Cholomskis Aurimas¹

Affiliation:

1. Department of Information Systems, Faculty of Fundamental Sciences, Vilnius Gediminas Technical University, Saulėtekio al. 11, 10223 Vilnius, Lithuania

Abstract

Kubernetes is an open-source container orchestration system that provides a built-in module for dynamic resource provisioning named the Horizontal Pod Autoscaler (HPA). The HPA identifies the number of resources to be provisioned by calculating the ratio between the current and target utilisation metrics. The target utilisation metric, or threshold, directly impacts how many and how quickly resources will be provisioned. However, the determination of the threshold that would allow satisfying performance-based Service Level Objectives (SLOs) is a long, error-prone, manual process because it is based on the static threshold principle and requires manual configuration. This might result in underprovisioning or overprovisioning, leading to the inadequate allocation of computing resources or SLO violations. Numerous autoscaling solutions have been introduced as alternatives to the HPA to simplify the process. However, the HPA is still the most widely used solution due to its ease of setup, operation, and seamless integration with other Kubernetes functionalities. The present study proposes a method that utilises exploratory data analysis techniques along with moving average smoothing to identify the target utilisation threshold for the HPA. The objective is to ensure that the system functions without exceeding the maximum number of events that result in a violation of the response time defined in the SLO. A prototype was created to adjust the threshold values dynamically, utilising the proposed method. This prototype enables the evaluation and comparison of the proposed method with the HPA, which has the highest threshold set that meets the performance-based SLOs. The results of the experiments proved that the suggested method adjusts the thresholds to the desired service level with a 1–2% accuracy rate and only 4–10% resource overprovisioning, depending on the type of workload.

Publisher

MDPI AG

Link

https://www.mdpi.com/2079-9292/13/7/1242/pdf

Reference51 articles.

1. The Kubernetes Authors (2023, September 30). Kubernetes. Available online: https://kubernetes.io/.

2. (2023, September 30). Horizontal Pod Autoscaling. Available online: https://kubernetes.io/docs/tasks/run-application/horizontal-pod-autoscale/.

3. Shafi, N., Abdullah, M., Iqbal, W., Erradi, A., and Bukhari, F. (2024). Cluster Computing, Springer.

4. Best practices for running cost-optimized Kubernetes applications on GKE|Cloud Architecture Center|Google Cloud.

5. A Survey on SLA Management for Cloud Computing and Cloud-Hosted Big Data Analytic Applications;Sahal;Int. J. Database Theory Appl.,2016