Affiliation:
1. College of Mechatronics Engineering, Xi’an University of Architecture and Technology, Xi’an 710055, China
Abstract
Network congestion control is an important means to improve network throughput and reduce data transmission delay. To further optimize the network data transmission capability, this research suggests a proximal policy optimization-based intelligent TCP congestion management method, creates a proxy that can communicate with the real-time network environment, and abstracts the TCP congestion control mechanism into a partially observable Markov decision process. Changes in the real-time state of the network are fed back to the agent, and the agent makes action commands to control the size of the congestion window, which will produce a new network state, and the agent will immediately receive a feedback reward value. To guarantee that the actions taken are optimum, the agent’s goal is to obtain the highest feedback reward value. The state space of network characteristics should be designed so that agents can observe enough information to make appropriate decisions. The reward function is designed through a weighted algorithm that enables the agent to balance and optimize throughput and latency. The model parameters of the agent are updated by the proximal policy optimization algorithm, and the truncation function keeps the parameters within a certain range, reducing the possibility of oscillation during gradient descent and ensuring that the training process can converge quickly. Compared to the traditional CUBIC control method, the results show that the TCP-PPO2 policy reduces latency by 11.7–87.5%.
Funder
Xi’an Key Laboratory of Clean Energy
Key R&D Program of Shaanxi Province
Subject
Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science
Reference21 articles.
1. Henderson, T., Floyd, S., and Gurtov, A. (2023, April 12). The NewReno Modification to TCP’s Fast Recovery Algorithm. Available online: https://www.rfc-editor.org/rfc/rfc6582.html.
2. CUBIC: A new TCP-friendly high-speed TCP variant;Ha;ACM SIGOPS Oper. Syst. Rev.,2008
3. Mascolo, S., Casetti, C., Gerla, M., Sanadidi, M.Y., and Wang, R. (2001, January 16). TCP Westwood: Bandwidth estimation for enhanced transport over wireless links. Proceedings of the 7th Annual International Conference on Mobile Computing and Networking, New York, NY, USA.
4. Van Der Hooft, J., Petrangeli, S., Claeys, M., Famaey, J., and Turck, F. (2015, January 11–15). A learning-based algorithm for improved bandwidth-awareness of adaptive streaming clients. Proceedings of the 2015 IFIP/IEEE International Symposium on Integrated Network Management (IM), Ottawa, ON, Canada.
5. Improving the congestion control performance for mobile networks in high-speed railway via deep reinforcement learning;Cui;IEEE Trans. Veh. Technol.,2020
Cited by
6 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献