Distributed Data-Driven Learning-Based Optimal Dynamic Resource Allocation for Multi-RIS-Assisted Multi-User Ad-Hoc Network-Reference-Cited by-同舟云学术

Distributed Data-Driven Learning-Based Optimal Dynamic Resource Allocation for Multi-RIS-Assisted Multi-User Ad-Hoc Network

Published:2024-01-19 Issue:1 Volume:17 Page:45
ISSN:1999-4893
Container-title:Algorithms
language:en
Short-container-title:Algorithms

Author:

Zhang Yuzhu¹,Xu Hao¹

Affiliation:

1. Department of Electrical and Biomedical Engineering, University of Nevada, Reno, NV 89557, USA

Abstract

This study investigates the problem of decentralized dynamic resource allocation optimization for ad-hoc network communication with the support of reconfigurable intelligent surfaces (RIS), leveraging a reinforcement learning framework. In the present context of cellular networks, device-to-device (D2D) communication stands out as a promising technique to enhance the spectrum efficiency. Simultaneously, RIS have gained considerable attention due to their ability to enhance the quality of dynamic wireless networks by maximizing the spectrum efficiency without increasing the power consumption. However, prevalent centralized D2D transmission schemes require global information, leading to a significant signaling overhead. Conversely, existing distributed schemes, while avoiding the need for global information, often demand frequent information exchange among D2D users, falling short of achieving global optimization. This paper introduces a framework comprising an outer loop and inner loop. In the outer loop, decentralized dynamic resource allocation optimization has been developed for self-organizing network communication aided by RIS. This is accomplished through the application of a multi-player multi-armed bandit approach, completing strategies for RIS and resource block selection. Notably, these strategies operate without requiring signal interaction during execution. Meanwhile, in the inner loop, the Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm has been adopted for cooperative learning with neural networks (NNs) to obtain optimal transmit power control and RIS phase shift control for multiple users, with a specified RIS and resource block selection policy from the outer loop. Through the utilization of optimization theory, distributed optimal resource allocation can be attained as the outer and inner reinforcement learning algorithms converge over time. Finally, a series of numerical simulations are presented to validate and illustrate the effectiveness of the proposed scheme.

Funder

National Science Foundation

Publisher

MDPI AG

Link

https://www.mdpi.com/1999-4893/17/1/45/pdf

Reference27 articles.

1. A survey on beyond 5G network with the advent of 6G: Architecture and emerging technologies;Dogra;IEEE Access,2020

2. Rekkas, V.P., Sotiroudis, S., Sarigiannidis, P., Wan, S., Karagiannidis, G.K., and Goudos, S.K. (2021). Machine learning in beyond 5G/6G networks—State-of-the-art and future trends. Electronics, 10.

3. Internet of Things (IoT): A literature review;Madakam;J. Comput. Commun.,2015

4. A review and state of art of Internet of Things (IoT);Laghari;Arch. Comput. Methods Eng.,2021

5. Channel characteristics of visible light communications within dynamic indoor environment;Chvojka;J. Light. Technol.,2015

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. The feasibility analysis of load based resource optimization algorithm for cooperative communication in 5G wireless ad-hoc networks;Alexandria Engineering Journal;2024-10

2. Reconfigurable-Intelligent-Surface-Enhanced Dynamic Resource Allocation for the Social Internet of Electric Vehicle Charging Networks with Causal-Structure-Based Reinforcement Learning;Future Internet;2024-05-11