A Hybrid Multi-Agent Reinforcement Learning Approach for Spectrum Sharing in Vehicular Networks-Reference-Cited by-同舟云学术

A Hybrid Multi-Agent Reinforcement Learning Approach for Spectrum Sharing in Vehicular Networks

Published:2024-04-28 Issue:5 Volume:16 Page:152
ISSN:1999-5903
Container-title:Future Internet
language:en
Short-container-title:Future Internet

Author:

Jamal Mansoor¹,Ullah Zaib²^ORCID,Naeem Muddasar²^ORCID,Abbas Musarat¹,Coronato Antonio²^ORCID

Affiliation:

1. Department of Electronics, Quaid-i-Azam University, Islamabad 44000, Pakistan

2. Artificial Intelligence and Robotics Lab, Università Telematica Giustino Fortunato, 82100 Benevento, Italy

Abstract

Efficient spectrum sharing is essential for maximizing data communication performance in Vehicular Networks (VNs). In this article, we propose a novel hybrid framework that leverages Multi-Agent Reinforcement Learning (MARL), thereby combining both centralized and decentralized learning approaches. This framework addresses scenarios where multiple vehicle-to-vehicle (V2V) links reuse the frequency spectrum preoccupied by vehicle-to-infrastructure (V2I) links. We introduce the QMIX technique with the Deep Q Networks (DQNs) algorithm to facilitate collaborative learning and efficient spectrum management. The DQN technique uses a neural network to approximate the Q value function in high-dimensional state spaces, thus mapping input states to (action, Q value) tables that facilitate self-learning across diverse scenarios. Similarly, the QMIX is a value-based technique for multi-agent environments. In the proposed model, each V2V agent having its own DQN observes the environment, receives observation, and obtains a common reward. The QMIX network receives Q values from all agents considering individual benefits and collective objectives. This mechanism leads to collective learning while V2V agents dynamically adapt to real-time conditions, thus improving VNs performance. Our research finding highlights the potential of hybrid MARL models for dynamic spectrum sharing in VNs and paves the way for advanced cooperative learning strategies in vehicular communication environments. Furthermore, we conducted an in-depth exploration of the simulation environment and performance evaluation criteria, thus concluding in a comprehensive comparative analysis with cutting-edge solutions in the field. Simulation results show that the proposed framework efficiently performs against the benchmark architecture in terms of V2V transmission probability and V2I peak data transfer.

Publisher

MDPI AG

Link

https://www.mdpi.com/1999-5903/16/5/152/pdf

Reference29 articles.

1. Vehicle to vehicle “V2V” communication: Scope, importance, challenges, research directions and future;Yasser;Open Transp. J.,2020

2. Vehicular Communications: A Physical Layer Perspective;Liang;IEEE Trans. Veh. Technol.,2017

3. A near optimal scheduling algorithm for efficient radio resource management in multi-user MIMO systems;Naeem;Wirel. Pers. Commun.,2019

4. Revolutionizing Intelligent Transportation Systems with Cellular Vehicle-to-Everything (C-V2X) technology: Current trends, use cases, emerging technologies, standardization bodies, industry analytics and future directions;Rammohan;Veh. Commun.,2023

5. Naeem, M., Coronato, A., Ullah, Z., Bashir, S., and Paragliola, G. (2022). Optimal user scheduling in multi antenna system using multi agent reinforcement learning. Sensors, 22.

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Defining a Metric-Driven Approach for Learning Hazardous Situations;Technologies;2024-07-04