Mobility-Aware Resource Allocation in IoRT Network for Post-Disaster Communications with Parameterized Reinforcement Learning
Author:
Kabir Homayun1, Tham Mau-Luen1ORCID, Chang Yoong Choon1, Chow Chee-Onn2ORCID, Owada Yasunori3
Affiliation:
1. Department of Electrical and Electronic Engineering, Lee Kong Chian Faculty of Engineering and Science, Universiti Tunku Abdul Rahman, Sungai Long Campus, Kajang 43000, Malaysia 2. Department of Electrical Engineering, Faculty of Engineering, Universiti Malaya, Lembah Pantai, Kuala Lumpur 50603, Malaysia 3. Resilient ICT Research Center, Network Research Institute, National Institute of Information and Communications Technology (NICT), Tokyo 184-8795, Japan
Abstract
Natural disasters, including earthquakes, floods, landslides, tsunamis, wildfires, and hurricanes, have become more common in recent years due to rapid climate change. For Post-Disaster Management (PDM), authorities deploy various types of user equipment (UE) for the search and rescue operation, for example, search and rescue robots, drones, medical robots, smartphones, etc., via the Internet of Robotic Things (IoRT) supported by cellular 4G/LTE/5G and beyond or other wireless technologies. For uninterrupted communication services, movable and deployable resource units (MDRUs) have been utilized where the base stations are damaged due to the disaster. In addition, power optimization of the networks by satisfying the quality of service (QoS) of each UE is a crucial challenge because of the electricity crisis after the disaster. In order to optimize the energy efficiency, UE throughput, and serving cell (SC) throughput by considering the stationary as well as movable UE without knowing the environmental priori knowledge in MDRUs aided two-tier heterogeneous networks (HetsNets) of IoRT, the optimization problem has been formulated based on emitting power allocation and user association combinedly in this article. This optimization problem is nonconvex and NP-hard where parameterized (discrete: user association and continuous: power allocation) action space is deployed. The new model-free hybrid action space-based algorithm called multi-pass deep Q network (MP-DQN) is developed to optimize this complex problem. Simulations results demonstrate that the proposed MP-DQN outperforms the parameterized deep Q network (P-DQN) approach, which is well known for solving parameterized action space, DQN, as well as traditional algorithms in terms of reward, average energy efficiency, UE throughput, and SC throughput for motionless as well as moveable UE.
Funder
Universiti Tunku Abdul Rahman ASEAN IVO
Subject
Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry
Reference59 articles.
1. Measuring the true human cost of natural disasters;Lawry;Disaster Med. Public Health Prep.,2008 2. Shimada, G. (2022). The impact of climate-change-related disasters on Africa’s economic growth, agriculture, and conflicts: Can hu-manitarian aid and food assistance offset the damage?. Int. J. Environ. Res. Public Health, 19. 3. Development of a separable search-and-rescue robot composed of a mobile robot and a snake robot;Kamegawa;Adv. Robot.,2020 4. Vera-Ortega, P., Vázquez-Martín, R., Fernandez-Lozano, J.J., García-Cerezo, A., and Mandow, A. (2023). Enabling Remote Responder Bio-Signal Monitoring in a Cooperative Human–Robot Architecture for Search and Rescue. Sensors, 23. 5. Paravisi, M., Santos, D.H., Jorge, V., Heck, G., Gonçalves, L.M., and Amory, A. (2019). Unmanned Surface Vehicle Simulator with Realistic Environmental Disturbances. Sensors, 19.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献
|
|