Affiliation:
1. Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences Shenzhen China
2. Alibaba Group Hangzhou China
3. State Key Lab of IOTSC, Department of Computer Science University of Macau Macau SAR China
Abstract
SummaryCloud‐native architecture is becoming increasingly crucial for today's cloud computing environments due to the need for speed and flexibility in developing applications. It utilizes microservice technology to break down traditional monolithic applications into light‐weight and self‐contained microservice components. However, as microservices grow in scale and have dynamic inter‐dependencies, they also pose new challenges in resource provisioning that cannot be fully addressed by traditional resource scheduling approaches. The various microservices with different resource demands and latency requirements can create complex calling chains, making it difficult to provide fine‐grained and accurate resource allocation to each component while maintaining the overall quality of service in the chain. Alibaba Cloud has fully embraced cloud‐native and microservice technologies to drive its key business and scenarios, including Double 11 Shopping Festival. In this work, we aim to address the research problem on how to efficiently provision resources for the growing scale of microservice platform and ensure the performance of latency‐critical microservices. To address the problem, we present in‐depth analyses of Alibaba's microservice cluster and propose optimized resource provisioning algorithms to enhance resource utilization while ensuring the latency requirement. First, we analyze the distinct features of microservices in Alibaba's cluster compared to traditional applications. Then we present Alibaba's resource capacity provisioning workflow and framework to address challenges in resource provisioning for large‐scale and latency‐critical microservice clusters. Finally, we propose enhanced resource provisioning algorithms over Alibaba's current practice by making both proactive and reactive scheduling decisions based on different workloads patterns, which can improve resource usage by 10%–15% in Alibaba's clusters, while maintaining the necessary latency for microservices.
Funder
National Key Research and Development Program of China
National Natural Science Foundation of China
Cited by
4 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
1. Containerized Microservices: A Survey of Resource Management Frameworks;IEEE Transactions on Network and Service Management;2024-08
2. Software compliance in various industries using CI/CD, dynamic microservices, and containers;Open Computer Science;2024-01-01
3. Optimizing Cloud Performance: A Microservice Scheduling Strategy for Enhanced Fault-Tolerance, Reduced Network Traffic, and Lower Latency;IEEE Access;2024
4. An Interference-aware Approach for Co-located Container Orchestration with Novel Metric;2023 IEEE International Conferences on Internet of Things (iThings) and IEEE Green Computing & Communications (GreenCom) and IEEE Cyber, Physical & Social Computing (CPSCom) and IEEE Smart Data (SmartData) and IEEE Congress on Cybermatics (Cybermatics);2023-12-17