Affiliation:
1. Chinese Academy of Sciences
Abstract
For the congestion problems with multi-user existing in high-speed networks, a pricing scheme based Nash Q-learning flow controller is proposed. It considers a network with a single service provider, and some non-cooperative users. The pricing scheme is introduced to the design of the reward function in the learning process of Q-learning. Because of the uncertainties and highly time-varying, it is not easy to accurately obtain the complete information for high-speed networks. The Nash Q-learning, which is independent of mathematic model, shows particular superiority. It obtains the Nash Q-values through trial-and-error and interaction with the environment to improve its behavior policy. By means of learning process, the proposed controller can learn to take the best actions to regulate source flow with the features of high quality of service. Simulation results show that the proposed controller can promote the performance of the networks and avoid the occurrence of congestion effectively.
Publisher
Trans Tech Publications, Ltd.
Subject
Mechanical Engineering,Mechanics of Materials,General Materials Science
Reference8 articles.
1. R. G. Cheng, C. J. Chang and L. F. Lin: A QoS-provisioning Neural Fuzzy Connection Admission Controller for Multimedia High-speed Networks. IEEE/ACM Transactions on Networking, vol. 7, no. 1 (1999), pp.111-121.
2. M. Lestas, A. Pitsillides, P. Ioannou, and G. Hadjipollas: Adaptive Congestion Protocol: a Congestion Control Protocol with Learning Capability. Computer Networks, vol. 51, no. 13 (2007), pp.3773-3798.
3. R. S. Sutton and A. G. Barto: Reinforcement Learning an Introduction. (MIT Press, Cambridge, 1998).
4. A. Chatovich, S. Okug, and G. Dundar: Hierarchical Neuro-fuzzy Call Admission Controller for ATM Networks. Computer Communications, vol. 24, no. 11 (2001), pp.1031-1044.
5. M. C. Hsiao, S. W. Tan, K. S. Hwang, and C. S. Wu: A Reinforcement Learning Approach to Congestion Control of High-speed Multimedia Networks. Cybernetics and Systems, vol. 36, no. 2 (2003), pp.181-202.