PBQ-Enhanced QUIC: QUIC with Deep Reinforcement Learning Congestion Control Mechanism
Author:
Zhang Zhifei123ORCID, Li Shuo123, Ge Yiyang123, Xiong Ge24, Zhang Yu5, Xiong Ke123ORCID
Affiliation:
1. Engineering Research Center of Network Management Technology for High Speed Railway of Ministry of Education, School of Computer and Information Technology, Beijing Jiaotong University, Beijing 100044, China 2. Collaborative Innovation Center of Railway Traffic Safety, Beijing Jiaotong University, Beijing 100044, China 3. National Engineering Research Center of Advanced Network Technologies, Beijing Jiaotong University, Beijing 100044, China 4. China Software and Technical Service Co., Ltd., Beijing 100081, China 5. Institute of Economics and Energy Supply and Demand, State Grid Energy Research Institute Co., Ltd., Beijing 102209, China
Abstract
Currently, the most widely used protocol for the transportation layer of computer networks for reliable transportation is the Transmission Control Protocol (TCP). However, TCP has some problems such as high handshake delay, head-of-line (HOL) blocking, and so on. To solve these problems, Google proposed the Quick User Datagram Protocol Internet Connection (QUIC) protocol, which supports 0-1 round-trip time (RTT) handshake, a congestion control algorithm configuration in user mode. So far, the QUIC protocol has been integrated with traditional congestion control algorithms, which are not efficient in numerous scenarios. To solve this problem, we propose an efficient congestion control mechanism on the basis of deep reinforcement learning (DRL), i.e., proximal bandwidth-delay quick optimization (PBQ) for QUIC, which combines traditional bottleneck bandwidth and round-trip propagation time (BBR) with proximal policy optimization (PPO). In PBQ, the PPO agent outputs the congestion window (CWnd) and improves itself according to network state, and the BBR specifies the pacing rate of the client. Then, we apply the presented PBQ to QUIC and form a new version of QUIC, i.e., PBQ-enhanced QUIC. The experimental results show that the proposed PBQ-enhanced QUIC achieves much better performance in both throughput and RTT than existing popular versions of QUIC, such as QUIC with Cubic and QUIC with BBR.
Funder
CAAI-Huawei MindSpore Open Fund Fundamental Research Funds for the Central Universities
Subject
General Physics and Astronomy
Reference29 articles.
1. Langley, A., Riddoch, A., Wilk, A., Vicente, A., Krasic, C., Zhang, D., Yang, F., Kouranov, F., Swett, I., and Iyengar, J. (2017, January 21–25). The QUIC Transport Protocol: Design and Internet-Scale Deployment. Proceedings of the ACM SIGCOMM, Los Angeles, CA, USA. 2. Congestion avoidance and control;Jacobson;ACM SIGCOMM Comput. Commun. Rev.,1988 3. Sander, C., Rüth, J., Hohlfeld, O., and Wehrle, K. (2019, January 23). Deepcci: Deep learning-based passive congestion control identification. Proceedings of the 2019 Workshop on Network Meets AI & ML, Beijing, China. 4. Chen, X., Xu, S., Chen, X., Cao, S., Zhang, S., and Sun, Y. (2019, January 24–28). Passive TCP identification for wired and wireless networks: A long-short term memory approach. Proceedings of the 2019 15th International Wireless Communications & Mobile Computing Conference (IWCMC), Tangier, Morocco. 5. Safe and nested subgame solving for imperfect-information games;Brown;Adv. Neural Inf. Process. Syst.,2017
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|