Joint Beamforming, Power Allocation, and Splitting Control for SWIPT-Enabled IoT Networks with Deep Reinforcement Learning and Game Theory

Published: 2022-03-17 Issue: 6 Volume: 22 Page: 2328
ISSN: 1424-8220
Container-title: Sensors
language: en
Short-container-title: Sensors

Author:

Liu JainShing, Lin Chun-Hung Richard, Hu Yu-Chen, Donta Praveen Kumar

Abstract

Future wireless networks promise immense increases in data rate and energy efficiency while, through simultaneous wireless information and power transfer (SWIPT), overcoming the difficulty of charging wireless stations or devices in the Internet of Things (IoT). For such networks, jointly optimizing beamforming, power control, and energy harvesting to enhance the communication performance from the base stations (BSs) (or access points (APs)) to the mobile nodes (MNs) they serve is a real challenge. In this work, we formulate the joint optimization as a mixed integer nonlinear programming (MINLP) problem, which can also be viewed as a complex multiple resource allocation (MRA) optimization problem subject to different allocation constraints. Using deep reinforcement learning to estimate the future rewards of actions from the information reported by the users served by the networks, we introduce single-layer MRA algorithms based on deep Q-learning (DQN) and deep deterministic policy gradient (DDPG), respectively, as the basis for the downlink wireless transmissions. Moreover, by combining the data-driven DQN technique with a noncooperative game theory model, we propose a two-layer iterative approach to resolve the NP-hard MRA problem, which can further improve the communication performance in terms of data rate, energy harvesting, and power consumption. For the two-layer approach, we also introduce a pricing strategy with which BSs or APs determine their power costs on the basis of social utility maximization to control the transmit power. Finally, in a simulated environment based on realistic wireless networks, our numerical results show that the proposed two-layer MRA algorithm can achieve utility values up to 2.3 times higher than those of its single-layer counterparts, which represent data-driven deep reinforcement learning-based algorithms extended to resolve the problem. The utilities are designed to reflect the trade-off among the performance metrics considered.
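
The trade-off between data rate and energy harvesting mentioned in the abstract can be illustrated with a textbook power-splitting receiver. The sketch below is not the paper's system model: the function name, the splitting ratio `rho`, the conversion efficiency `eta`, and the choice to add noise after the splitter are all assumptions made for illustration.

```python
import math


def split_received_power(p_rx_watts, rho, noise_watts,
                         interference_watts=0.0, eta=0.5):
    """Illustrative power-splitting SWIPT receiver (assumed model).

    A fraction rho of the received power goes to information decoding
    and the remaining (1 - rho) goes to the energy harvester.
    Returns (rate in bit/s/Hz, harvested power in watts).
    """
    if not 0.0 <= rho <= 1.0:
        raise ValueError("rho must lie in [0, 1]")
    # Assumption: noise is added after the splitter, so it is not scaled by rho.
    sinr = rho * p_rx_watts / (rho * interference_watts + noise_watts)
    rate = math.log2(1.0 + sinr)
    # Assumption: a linear harvester with constant efficiency eta that also
    # collects energy from interference.
    harvested = eta * (1.0 - rho) * (p_rx_watts + interference_watts)
    return rate, harvested


if __name__ == "__main__":
    # Sweep the splitting ratio: the rate rises while harvested power falls.
    for rho in (0.1, 0.5, 0.9):
        rate, harvested = split_received_power(1e-6, rho, 1e-9)
        print(f"rho={rho:.1f} rate={rate:.2f} bit/s/Hz harvested={harvested:.2e} W")
```

Under these assumptions, raising `rho` increases the rate and lowers the harvested power, which is the kind of tension a combined utility has to balance.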

Funder

Ministry of Science and Technology, Republic of China

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering; Biochemistry; Instrumentation; Atomic and Molecular Physics, and Optics; Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/22/6/2328/pdf

Reference: 66 articles.

1. Energy-Efficient Offloading for Mobile Edge Computing in 5G Heterogeneous Networks

2. Energy Efficient SWIPT Based Mobile Edge Computing Framework for WSN-Assisted IoT

3. Simultaneous Wireless Information and Power Transfer for Internet of Things Sensor Networks

4. Energy-Efficient Optimal Power Allocation for SWIPT Based IoT-Enabled Smart Meter

Cited by 15 articles.

1. Recent Developments of Game Theory and Reinforcement Learning Approaches: A Systematic Review;IEEE Access;2024

2. Optimizing Resource Swap Functionality in IoE-Based Grids Using Approximate Reasoning Reward-Based Adjustable Deep Double Q-Learning;IEEE Transactions on Consumer Electronics;2023-08

3. DRL at the Physical Layer;Deep Reinforcement Learning for Wireless Communications and Networking;2023-06-30

4. Machine Learning-Driven Ubiquitous Mobile Edge Computing as a Solution to Network Challenges in Next-Generation IoT;Systems;2023-06-16

5. Online Learning-Based Beamforming for Rate-Splitting Multiple Access: A Constrained Bandit Approach;ICC 2023 - IEEE International Conference on Communications;2023-05-28