A review of motion planning algorithms for intelligent robots

Published:2021-11-25 Issue:2 Volume:33 Page:387-424
ISSN:0956-5515
Container-title:Journal of Intelligent Manufacturing
language:en
Short-container-title:J Intell Manuf

Author:

Zhou Chengmin, Huang Bingding, Fränti Pasi

Abstract

Principles of typical motion planning algorithms are investigated and analyzed in this paper. These algorithms include traditional planning algorithms, classical machine learning algorithms, optimal value reinforcement learning, and policy gradient reinforcement learning. Traditional planning algorithms investigated include graph search algorithms, sampling-based algorithms, interpolating curve algorithms, and reaction-based algorithms. Classical machine learning algorithms include multiclass support vector machine, long short-term memory, Monte-Carlo tree search, and convolutional neural network. Optimal value reinforcement learning algorithms include Q-learning, deep Q-learning network, double deep Q-learning network, and dueling deep Q-learning network. Policy gradient algorithms include policy gradient method, actor-critic algorithm, asynchronous advantage actor-critic, advantage actor-critic, deterministic policy gradient, deep deterministic policy gradient, trust region policy optimization, and proximal policy optimization. New general criteria are also introduced to evaluate the performance and application of motion planning algorithms by analytical comparisons. The convergence speed and stability of optimal value and policy gradient algorithms are specifically analyzed. Future directions are presented analytically according to the principles and analytical comparisons of motion planning algorithms. This paper provides researchers with a clear and comprehensive understanding of the advantages, disadvantages, relationships, and future of motion planning algorithms in robots, and paves the way for better motion planning algorithms in academia, engineering, and manufacturing.

Funder

University of Eastern Finland (UEF) including Kuopio University Hospital

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence, Industrial and Manufacturing Engineering, Software

Link

https://link.springer.com/content/pdf/10.1007/s10845-021-01867-z.pdf

References: 106 articles.

1. Arkin, R. C., Riseman, E. M., & Hanson, A. (1987). AuRA: an architecture for vision-based robot navigation. Proceedings of the DARPA Image Understanding Workshop, Los Angeles, CA, February 1987, pp. 417–431.

2. Babaeizadeh, M., Frosio, I., Tyree, S., Clemons, J., & Kautz, J. (2016). Reinforcement learning through asynchronous advantage actor-critic on a GPU. arXiv:1611.06256 [cs.LG].

3. Bae, H., Kim, G., Kim, J., Qian, D., & Lee, S. (2019). Multi-robot path planning method using reinforcement learning. Applied Sciences, 9, 3057.

4. Bai, H., Cai, S., Ye, N., Hsu, D., & Lee, W. S. (2015). Intention-aware online POMDP planning for autonomous driving in a crowd. 2015 IEEE International Conference on Robotics and Automation (ICRA), Seattle, WA, pp. 454–460.

5. Bautista, G. D., Perez, J., Milanés, V., & Nashashibi, F. (2015). A review of motion planning techniques for automated vehicles. IEEE Transactions on Intelligent Transportation Systems, 17(4), 1–11.

Cited by 54 articles.

1. A Collision-free path planning approach based on rule guided lazy-PRM with repulsion field for gantry welding robots; Robotics and Autonomous Systems; 2024-04

2. Research on Time Series-Based Pipeline Ground Penetrating Radar Calibration Angle Prediction Algorithm; Sensors; 2024-01-08

3. Reactive optimal motion planning for a class of holonomic planar agents using reinforcement learning with provable guarantees; Frontiers in Robotics and AI; 2024-01-03

4. Self-Learning Robot Autonomous Navigation with Deep Reinforcement Learning Techniques; Applied Sciences; 2023-12-30

5. A Comparison Study between Traditional and Deep-Reinforcement-Learning-Based Algorithms for Indoor Autonomous Navigation in Dynamic Scenarios; Sensors; 2023-12-07