Optimizing Relay Selection in D2DCommunication for Next-Generation WirelessNetworks Using Multi-Agent ReinforcementLearning: A Novel Approach

Author:

SIRMA Muharrem1,KAVAK Adnan1,INNER A. Burak1

Affiliation:

1. Wireless Communication and Infromation Systems Research Center, Artificial Intelligence and Simulation Systems Research Lab

Abstract

Abstract

Device-to-device (D2D) communication offering a direct communication channel between devices has been introduced as an alternative communication technique for next generation wireless networks, which aims to alleviate the workload traditionally shouldered by base stations (BS) in cellular systems. However, D2D pairs encountering connectivity issues or requiring extended communication ranges will necessitate the involvement of a relay node (RN) to facilitate communication. For the cases wherein an efficient communication with the target device is only possible over a relay device, finding the candidate relay among source-relay-destination devices with the best link availability and optimized end-to-end throughput are challenges to be considered in relay assisted D2D communication. Despite the plethora of studies on relay selection in D2D communication, there persists a need for a method that systematically integrates multiple disruptive factors inherent in wireless channels. While Reinforcement Learning (RL) has primarily found application in resource management tasks like power control, resource block (RB) availability, and spectrum allocation, its utilization for relay selection in D2D communication within dynamic wireless environments remains largely unexplored. In this paper, we propose to use multi-agent reinforcement learning based relay selection (MARS) in which the source device and/or pairing devices act as learning agents. The resource selection agent (RSA), the link agent (LA), and the transmission agent (TA) are involved cooperatively in the MARS method to find the optimum relay. Source nodes in D2D pairs iteratively update their strategies through interactions with the wireless environment and other devices in order to maximize its cost function to select the most convenient relay in the multi hop D2D communication scenario. The MARS method takes into account the combined effect of SINR, link reliability, and throughput values for the estimation of cost function order to select the optimum relay node. We have performed extensive simulations for different device density scenarios in wireless environment and compared the performance of our MARS method with SINR based relay selection approach. The findings show that our relay selection approach for D2D communication outperforms SINR-based methods in terms of end-to-end link reliability and throughput performance.

Publisher

Springer Science and Business Media LLC

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3