A Deep Reinforcement Learning Scheme for Spectrum Sensing and Resource Allocation in ITS

Author:

Wei Huang1,Peng Yuyang1,Yue Ming1,Long Jiale2,AL-Hazemi Fawaz3ORCID,Mirza Mohammad Meraj4ORCID

Affiliation:

1. The School of Computer Science and Engineering, Macau University of Science and Technology, Macau 999078, China

2. Faculty of Intelligent Manufacturing, Wuyi University, Jiangmen 529020, China

3. Department of Computer and Network Engineering, University of Jeddah, Jeddah 21959, Saudi Arabia

4. Department of Computer Science, College of Computers and Information Technology, Taif University, P.O. Box 11099, Taif 21944, Saudi Arabia

Abstract

In recent years, the Internet of Vehicles (IoV) has been found to be of huge potential value in the promotion of the development of intelligent transportation systems (ITSs) and smart cities. However, the traditional scheme in IoV has difficulty in dealing with an uncertain environment, while reinforcement learning has the advantage of being able to deal with an uncertain environment. Spectrum resource allocation in IoV faces the uncertain environment in most cases. Therefore, this paper investigates the spectrum resource allocation problem by deep reinforcement learning after using spectrum sensing technology in the ITS, including the vehicle-to-infrastructure (V2I) link and the vehicle-to-vehicle (V2V) link. The spectrum resource allocation is modeled as a reinforcement learning-based multi-agent problem which is solved by using the soft actor critic (SAC) algorithm. Considered an agent, each V2V link interacts with the vehicle environment and makes a joint action. After that, each agent receives different observations as well as the same reward, and updates networks through the experiences from the memory. Therefore, during a certain time, each V2V link can optimize its spectrum allocation scheme to maximize the V2I capacity as well as increase the V2V payload delivery transmission rate. However, the number of SAC networks increases linearly as the number of V2V links increases, which means that the networks may have a problem in terms of convergence when there are an excessive number of V2V links. Consequently, a new algorithm, namely parameter sharing soft actor critic (PSSAC), is proposed to reduce the complexity for which the model is easier to converge. The simulation results show that both SAC and PSSAC can improve the V2I capacity and increase the V2V payload transmission success probability within a certain time. Specifically, these novel schemes have a 10 percent performance improvement compared with the existing scheme in the vehicular environment. Additionally, PSSAC has a lower complexity.

Funder

The Science and Technology Development Fund, Macau SAR

Wuyi University-Hong Kong-Macau joint Research and Development Fund

Publisher

MDPI AG

Subject

General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)

Reference22 articles.

1. Deep reinforcement learning based resource allocation algorithm in cellular networks;Liao;J. Commun.,2019

2. Research of dynamic channel allocation algorithm for multi-radio multi-channel VANET;Min;Appl. Res. Comput.,2014

3. Cognitive Spectrum Allocation Mechanism in Internet of Vehicles Based on Clustering Structure;Xue;Comput. Sci.,2019

4. Radio resource management for D2D-based V2V communication;Sun;IEEE Trans. Veh. Technol.,2016

5. Cluster-based radio resource management for D2D-supported safety-critical V2X communications;Sun;IEEE Trans. Wirel. Commun.,2016

同舟云学术

1.学者识别学者识别

2.学术分析学术分析

3.人才评估人才评估

"同舟云学术"是以全球学者为主线,采集、加工和组织学术论文而形成的新型学术文献查询和分析系统,可以对全球学者进行文献检索和人才价值评估。用户可以通过关注某些学科领域的顶尖人物而持续追踪该领域的学科进展和研究前沿。经过近期的数据扩容,当前同舟云学术共收录了国内外主流学术期刊6万余种,收集的期刊论文及会议论文总量共计约1.5亿篇,并以每天添加12000余篇中外论文的速度递增。我们也可以为用户提供个性化、定制化的学者数据。欢迎来电咨询!咨询电话:010-8811{复制后删除}0370

www.globalauthorid.com

TOP

Copyright © 2019-2024 北京同舟云网络信息技术有限公司
京公网安备11010802033243号  京ICP备18003416号-3