A novel Q-learning algorithm based on improved whale optimization algorithm for path planning-Reference-Cited by-同舟云学术

A novel Q-learning algorithm based on improved whale optimization algorithm for path planning

Published:2022-12-27 Issue:12 Volume:17 Page:e0279438
ISSN:1932-6203
Container-title:PLOS ONE
language:en
Short-container-title:PLoS ONE

Author:

Li Ying,Wang Hanyu^ORCID,Fan Jiahao^ORCID,Geng Yanyu

Abstract

Q-learning is a classical reinforcement learning algorithm and one of the most important methods of mobile robot path planning without a prior environmental model. Nevertheless, Q-learning is too simple when initializing Q-table and wastes too much time in the exploration process, causing a slow convergence speed. This paper proposes a new Q-learning algorithm called the Paired Whale Optimization Q-learning Algorithm (PWOQLA) which includes four improvements. Firstly, to accelerate the convergence speed of Q-learning, a whale optimization algorithm is used to initialize the values of a Q-table. Before the exploration process, a Q-table which contains previous experience is learned to improve algorithm efficiency. Secondly, to improve the local exploitation capability of the whale optimization algorithm, a paired whale optimization algorithm is proposed in combination with a pairing strategy to speed up the search for prey. Thirdly, to improve the exploration efficiency of Q-learning and reduce the number of useless explorations, a new selective exploration strategy is introduced which considers the relationship between current position and target position. Fourthly, in order to balance the exploration and exploitation capabilities of Q-learning so that it focuses on exploration in the early stage and on exploitation in the later stage, a nonlinear function is designed which changes the value of ε in ε-greedy Q-learning dynamically based on the number of iterations. Comparing the performance of PWOQLA with other path planning algorithms, experimental results demonstrate that PWOQLA achieves a higher level of accuracy and a faster convergence speed than existing counterparts in mobile robot path planning. The code will be released at https://github.com/wanghanyu0526/improveQL.git.

Publisher

Public Library of Science (PLoS)

Subject

Multidisciplinary

Reference55 articles.

1. Motion planning in a plane using generalized Voronoi diagrams;O Takahashi;IEEE Transactions on Robotics and Automation,1989

2. An improved PSO algorithm for smooth path planning of mobile robots using continuous high-degree Bezier curve;BY Song;Applied Soft Computing,2021

3. A Path-Planning Algorithm Using Vector Potential Functions in Triangular Regions;AK Pamosoaji;IEEE Transactions on Systems, Man, and Cybernetics: Systems,2013

4. Addressing disasters in smart cities through UAVs path planning and 5G communications: A systematic review;Z Qadir;Computer Communications,2021

5. Survey of Deep Reinforcement Learning for Motion Planning of Autonomous Vehicles;S. Aradi;IEEE Transactions on Intelligent Transportation Systems,2022

Cited by 8 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Expected-mean gamma-incremental reinforcement learning algorithm for robot path planning;Expert Systems with Applications;2024-09

2. Optimal truss design with MOHO: A multi-objective optimization perspective;PLOS ONE;2024-08-19

3. A Multi-layer Guidance Approach for Submerged Sensor Networks Integrating Acoustic and Optical Technologies;Tehnicki vjesnik - Technical Gazette;2024-06-15

4. Enhancing Mobile Robot Path Planning Through Advanced Deep Reinforcement Learning;Smart Innovation, Systems and Technologies;2024

5. Design of anti‐jamming decision‐making for cognitive radar;IET Radar, Sonar & Navigation;2023-10-24