An Adaptive Multi-Level Quantization-Based Reinforcement Learning Model for Enhancing UAV Landing on Moving Targets

Published: 2022-07-19
Volume: 14
Issue: 14
Page: 8825
ISSN: 2071-1050
Container-title: Sustainability
Language: en

Author:

Abo Mosali Najmaddin, Shamsudin Syariful Syafiq, Mostafa Salama A., Alfandi Omar, Omar Rosli, Al-Fadhali Najib, Mohammed Mazin Abed, Malik R. Q., Jaber Mustafa Musa, Saif Abdu

Abstract

The autonomous landing of an unmanned aerial vehicle (UAV) on a moving platform is an essential capability in many UAV-based applications. It can be added to a teleoperated UAV system or form part of an autonomous UAV control system. Various robust and predictive control systems based on traditional control theory are used to operate UAVs. Recently, some attempts have been made to land a UAV on a moving target using reinforcement learning (RL), with vision as the typical means of sensing and detecting the target. Most of the related works deploy a deep neural network (DNN) for RL, which takes the image as input and outputs the optimal navigation action. However, the delay introduced by the multi-layer topology of the DNN impairs the real-time performance of such control. This paper proposes an adaptive multi-level quantization-based reinforcement learning (AMLQ) model. The AMLQ model quantizes the continuous actions and states so that simple Q-learning can be applied directly, which resolves the delay issue. This makes training faster and enables a simple knowledge representation without a DNN. In the evaluation, the AMLQ model was compared with state-of-the-art approaches and was found to be superior in terms of root mean square error (RMSE): it achieved an RMSE of 8.7052, compared with 10.0592 for the proportional–integral–derivative (PID) controller.
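
The general idea named in the abstract, quantizing a continuous state so that a plain Q-table can replace a DNN, can be sketched as follows. This is only an illustration under stated assumptions and not the authors' AMLQ model: it uses uniform, single-level binning rather than adaptive multi-level quantization, and the function names, the 1-D offset range of ±2.0, the bin count, and the learning parameters are all hypothetical.

```python
import numpy as np


def quantize(value, low, high, n_bins):
    """Map a continuous value to a bin index in 0..n_bins-1 (uniform bins).

    Values outside [low, high] are clipped to the nearest edge bin.
    """
    clipped = min(max(value, low), high)
    idx = int((clipped - low) / (high - low) * n_bins)
    return min(idx, n_bins - 1)  # value == high would otherwise give n_bins


def q_update(q_table, state, action, reward, next_state, alpha=0.1, gamma=0.9):
    """One tabular Q-learning step (alpha and gamma are assumed values).

    Q(s, a) <- Q(s, a) + alpha * (r + gamma * max_a' Q(s', a') - Q(s, a))
    """
    td_target = reward + gamma * np.max(q_table[next_state])
    q_table[state, action] += alpha * (td_target - q_table[state, action])
    return q_table


def rmse(errors):
    """Root mean square error of a sequence of tracking errors."""
    return float(np.sqrt(np.mean(np.square(errors))))


# Hypothetical 1-D setup: horizontal offset to the target quantized into
# 5 states, with 3 discrete actions (move left, hover, move right).
N_STATES, N_ACTIONS = 5, 3
q_table = np.zeros((N_STATES, N_ACTIONS))

state = quantize(0.0, -2.0, 2.0, N_STATES)       # UAV roughly above target
next_state = quantize(0.9, -2.0, 2.0, N_STATES)  # target drifted to one side
q_table = q_update(q_table, state, action=1, reward=1.0, next_state=next_state)
```

Because a table lookup and a single arithmetic update replace a forward pass through a multi-layer network, each control step is cheap, which is the kind of delay reduction the abstract refers to. The granularity of the bins trades table size against control precision.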

Funder

Zayed University cluster award

Publisher

MDPI AG

Subject

Management, Monitoring, Policy and Law; Renewable Energy, Sustainability and the Environment; Geography, Planning and Development; Building and Construction

Link

https://www.mdpi.com/2071-1050/14/14/8825/pdf

Cited by 10 articles.

1. Using Reinforcement Learning and Error Models for Drone Precise Landing; ACM Transactions on Internet Technology; 2024-07-15

2. A Survey of Offline- and Online-Learning-Based Algorithms for Multirotor UAVs; Drones; 2024-03-22

3. Machine learning for enhancing transportation security: A comprehensive analysis of electric and flying vehicle systems; Engineering Applications of Artificial Intelligence; 2024-03

4. Drone Landing and Reinforcement Learning: State-of-Art, Challenges and Opportunities; IEEE Open Journal of Intelligent Transportation Systems; 2024

5. Handling Imbalanced Data for Improved Classification Performance: Methods and Challenges; 2023 3rd International Conference on Emerging Smart Technologies and Applications (eSmarTA); 2023-10-10