Use of Logarithmic Rates in Multi-Armed Bandit-Based Transmission Rate Control Embracing Frame Aggregations in Wireless Networks-Reference-Cited by-同舟云学术

Use of Logarithmic Rates in Multi-Armed Bandit-Based Transmission Rate Control Embracing Frame Aggregations in Wireless Networks

Published:2023-07-22 Issue:14 Volume:13 Page:8485
ISSN:2076-3417
Container-title:Applied Sciences
language:en
Short-container-title:Applied Sciences

Author:

Cho Soohyun¹^ORCID

Affiliation:

1. Department of General Studies, Hongik University, Seoul 04066, Republic of Korea

Abstract

Herein, we propose the use of the logarithmic values of data transmission rates for multi-armed bandit (MAB) algorithms that adjust the modulation and coding scheme (MCS) levels of data packets in carrier-sensing multiple access/collision avoidance (CSMA/CA) wireless networks. We argue that the utilities of the data transmission rates of the MCS levels may not be proportional to their nominal values and suggest using their logarithmic values instead of directly using their data transmission rates when MAB algorithms compute the expected throughputs of the MCS levels. To demonstrate the effectiveness of the proposal, we introduce two MAB algorithms that adopt the logarithmic rates of the transmission rates. The proposed MAB algorithms also support frame aggregations available in wireless network standards that aim for a high throughput. In addition, the proposed MAB algorithms use a sliding window over time to adapt to rapidly changing wireless channel environments. To evaluate the performance of the proposed MAB algorithms, we used the event-driven network simulator, ns-3. We evaluated their performance using various scenarios of stationary and non-stationary wireless network environments including multiple spatial streams and frame aggregations. The experiment results show that the proposed MAB algorithms outperform the MAB algorithms that do not adopt the logarithmic transmission rates in both the stationary and non-stationary scenarios.

Funder

2021 Hongik University Research Fund

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes,Computer Science Applications,Process Chemistry and Technology,General Engineering,Instrumentation,General Materials Science

Link

https://www.mdpi.com/2076-3417/13/14/8485/pdf

Reference40 articles.

1. Sutton, R.S., and Barto, A.G. (2018). Reinforcement Learning: An Introduction, Bradford Book. [2nd ed.].

2. Using Confidence Bounds for Exploitation-Exploration Trade-offs;Auer;J. Mach. Learn. Res.,2002

3. The non-stochastic multi-armed bandit problem;Auer;SIAM J. Comput.,2002

4. Garivier, A., and Cappe, O. (2011, January 24). The KL-UCB Algorithm for Bounded Stochastic Bandits and Beyond. Proceedings of the 24th Annual Conference on Learning Theory, Budapest, Hungary.

5. On the Likelihood that One Unknown Probability Exceeds Another in View of the Evidence of Two Samples;Thompson;Biometrika,1933

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. New Technologies and Applications of Edge/Fog Computing Based on Artificial Intelligence and Machine Learning;Applied Sciences;2024-06-27

2. Reinforcement Learning Approach for Adaptive C-V2X Resource Management;Future Internet;2023-10-15