Multi-Agent Reinforcement Learning for Joint Cooperative Spectrum Sensing and Channel Access in Cognitive UAV Networks-Reference-Cited by-同舟云学术

Multi-Agent Reinforcement Learning for Joint Cooperative Spectrum Sensing and Channel Access in Cognitive UAV Networks

Published:2022-02-20 Issue:4 Volume:22 Page:1651
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Jiang Weiheng^ORCID,Yu Wanxin^ORCID,Wang Wenbo^ORCID,Huang Tiancong

Abstract

This paper studies the problem of distributed spectrum/channel access for cognitive radio-enabled unmanned aerial vehicles (CUAVs) that overlay upon primary channels. Under the framework of cooperative spectrum sensing and opportunistic transmission, a one-shot optimization problem for channel allocation, aiming to maximize the expected cumulative weighted reward of multiple CUAVs, is formulated. To handle the uncertainty due to the lack of prior knowledge about the primary user activities as well as the lack of the channel-access coordinator, the original problem is cast into a competition and cooperation hybrid multi-agent reinforcement learning (CCH-MARL) problem in the framework of Markov game (MG). Then, a value-iteration-based RL algorithm, which features upper confidence bound-Hoeffding (UCB-H) strategy searching, is proposed by treating each CUAV as an independent learner (IL). To address the curse of dimensionality, the UCB-H strategy is further extended with a double deep Q-network (DDQN). Numerical simulations show that the proposed algorithms are able to efficiently converge to stable strategies, and significantly improve the network performance when compared with the benchmark algorithms such as the vanilla Q-learning and DDQN algorithms.

Funder

National Natural Science Foundation of China

Pre-research Fund Project

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/22/4/1651/pdf

Reference36 articles.

1. Cellular UAV-to-Device Communications: Trajectory Design and Mode Selection by Multi-Agent Deep Reinforcement Learning

2. Impact of UAV Rotation on MIMO Channel Characterization for Air-to-Ground Communication Systems

3. Reinforcement Learning-Based Multislot Double-Threshold Spectrum Sensing With Bayesian Fusion for Industrial Big Spectrum Data

Cited by 6 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Experimental testing and analysis of regression algorithms for spectrum sensing in cognitive radio networks;Wireless Networks;2024-05-14

2. Dynamic Spectrum Sharing Based on Deep Reinforcement Learning in Mobile Communication Systems;Sensors;2023-02-27

3. Toward Autonomous Multi-UAV Wireless Network: A Survey of Reinforcement Learning-Based Approaches;IEEE Communications Surveys & Tutorials;2023

4. A Novel Method for Few-Shot Specific Emitter Identification in Non-Cooperative Scenarios;IEEE Access;2023

5. A Multi-AUV Maritime Target Search Method for Moving and Invisible Objects Based on Multi-Agent Deep Reinforcement Learning;Sensors;2022-11-07