A Labeled Architecture for Low-Entropy Clouds: Theory, Practice, and Lessons

Authors:

Zhang Chuanqi¹·², Wang Sa¹·²·³, Yu Zihao¹·², Wang Huizhe¹·², Xu Yinan¹·², Cai Luoshan¹·², Tang Dan¹, Sun Ninghui¹·², Bao Yungang¹·²·⁴

Affiliation:

1. Institute of Computing Technology, Chinese Academy of Sciences, Beijing, China

2. University of Chinese Academy of Sciences, Beijing, China

3. Institute of Computing Technology (Nanjing), Chinese Academy of Sciences, Nanjing, China

4. Peng Cheng Laboratory, Shenzhen, China

Abstract

Resource efficiency and quality of service (QoS) have both been long-pursued goals for cloud providers over the past decade, yet hardly any cloud platform achieves both perfectly even today. Improving resource efficiency or utilization often causes complicated resource contention among colocated cloud applications across many resources, spanning from the underlying hardware to the software stack, leading to unexpected performance degradation. The low-entropy cloud proposes a new software-hardware codesigned technology stack to holistically curb performance interference from the bottom up and to obtain both high resource efficiency and high quality of application performance. In this paper, we introduce a new computer architecture for the low-entropy cloud stack, called labeled von Neumann architecture (LvNA), which incorporates a set of label-powered control mechanisms that enable shared on-chip components and resources to differentiate, isolate, and prioritize user-defined application requests when they compete for hardware resources. With these mechanisms, LvNA is able to protect the performance of certain applications, such as latency-critical applications, from disorderly resource contention while improving resource utilization. We further built and taped out Beihai, a 1.2-GHz 8-core RISC-V processor based on the LvNA architecture. The evaluation results show that Beihai drastically reduces the performance degradation caused by memory bandwidth contention, from 82.8% to 0.4%. When raising CPU utilization above 70%, Beihai reduces the 99th-percentile tail latency of Redis from 115 ms to 18.1 ms. Furthermore, Beihai can realize hardware virtualization: it boots two unmodified virtual machines concurrently without the intervention of any software hypervisor.
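To make the label-powered idea concrete, the sketch below simulates one plausible form of per-label bandwidth regulation: a token bucket per label that admits memory requests only while that label has budget, so a bandwidth-hungry batch tenant cannot starve a latency-critical one. This is an illustrative toy model, not the Beihai hardware; the class name `LabelRegulator`, the label names, and the per-cycle shares are all assumptions made for the example.

```python
class LabelRegulator:
    """Toy per-label token-bucket regulator for shared memory bandwidth.

    Each label receives a fixed budget of tokens (bytes) per cycle; a
    request is admitted only if its label still holds enough tokens.
    """

    def __init__(self, shares):
        # shares: label -> bytes of bandwidth budget replenished per cycle
        self.shares = dict(shares)
        self.tokens = {label: 0.0 for label in shares}

    def tick(self):
        # Replenish every label's budget once per cycle.
        for label, share in self.shares.items():
            self.tokens[label] += share

    def admit(self, label, size):
        # Admit a memory request only if the label has remaining budget.
        if self.tokens[label] >= size:
            self.tokens[label] -= size
            return True
        return False


# Two tenants: a latency-critical label with a 64 B/cycle share and a
# batch label capped at 16 B/cycle. Both greedily issue 64-byte
# cache-line requests every cycle for 100 cycles.
reg = LabelRegulator({"lc": 64, "batch": 16})
served = {"lc": 0, "batch": 0}
for _ in range(100):
    reg.tick()
    for label in ("lc", "batch"):
        while reg.admit(label, 64):
            served[label] += 64

print(served)  # the batch tenant is held to a quarter of the lc tenant's bytes
```

Under these assumed shares, the latency-critical label serves one cache line every cycle while the batch label serves one every four cycles, mirroring how a hardware regulator could prioritize one label's requests without any software hypervisor in the data path.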

Funder

Strategic Priority Research Program of the Chinese Academy of Sciences

Youth Innovation Promotion Association of the Chinese Academy of Sciences

National Natural Science Foundation of China

National Basic Research Program of China

Publisher

American Association for the Advancement of Science (AAAS)

