Model-Based Predictive Control and Reinforcement Learning for Planning Vehicle-Parking Trajectories for Vertical Parking Spaces-Reference-Cited by-同舟云学术

Model-Based Predictive Control and Reinforcement Learning for Planning Vehicle-Parking Trajectories for Vertical Parking Spaces

Published:2023-08-11 Issue:16 Volume:23 Page:7124
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Shi Junren¹^ORCID,Li Kexin¹^ORCID,Piao Changhao¹,Gao Jun²,Chen Lizhi¹

Affiliation:

1. School of Automation, Chongqing University of Posts and Telecommunications, Chongqing 400065, China

2. School of Computer Science and Technology, Chongqing University of Posts and Telecommunications, Chongqing 400065, China

Abstract

This paper proposes a vehicle-parking trajectory planning method that addresses the issues of a long trajectory planning time and difficult training convergence during automatic parking. The process involves two stages: finding a parking space and parking planning. The first stage uses model predictive control (MPC) for trajectory tracking from the initial position of the vehicle to the starting point of the parking operation. The second stage employs the proximal policy optimization (PPO) algorithm to transform the parking behavior into a reinforcement learning process. A four-dimensional reward function is set to evaluate the strategy based on a formal reward, guiding the adjustment of neural network parameters and reducing the exploration of invalid actions. Finally, a simulation environment is built for the parking scene, and a network framework is designed. The proposed method is compared with the deep deterministic policy gradient and double-delay deep deterministic policy gradient algorithms in the same scene. Results confirm that the MPC controller accurately performs trajectory-tracking control with minimal steering wheel angle changes and smooth, continuous movement. The PPO-based reinforcement learning method achieves shorter learning times, totaling only 30% and 37.5% of the deep deterministic policy gradient (DDPG) and twin-delayed deep deterministic policy gradient (TD3), and the number of iterations to reach convergence for the PPO algorithm with the introduction of the four-dimensional evaluation metrics is 75% and 68% shorter compared to the DDPG and TD3 algorithms, respectively. This study demonstrates the effectiveness of the proposed method in addressing a slow convergence and long training times in parking trajectory planning, improving parking timeliness.

Funder

The National Key Research and Development Program of China

Chongqing Postdoctoral Research Special Funding Project

school-level research projects

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/23/16/7124/pdf

Reference36 articles.

1. Decentralized optimal control of connected automated vehicles at signal-free intersections including comfort-constrained turns and safety guarantees;Yue;Automatica,2019

2. A survey of autonomous driving: Common practices and emerging technologies;Yurtsever;IEEE Access,2020

3. Pendleton, S.D., Andersen, H., Dux, X., Shen, X., Meghjani, M., Eng, Y.H., Rus, D., and Ang, M.H. (2017). Perception, planning, control, and coordination for autonomous vehicles. Machines, 5.

4. A review of motion planning for highway autonomous driving;Claussmann;IEEE Trans. Intell. Transp. Syst.,2019

5. Planning and decision-making for autonomous vehicles;Schwarting;Annu. Rev. Control Robot. Auton. Syst.,2018

Cited by 5 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Advanced Machine Learning-Driven Automated Multi-Level Parking System;2024 IEEE Symposium on Wireless Technology & Applications (ISWTA);2024-07-20

2. Fuzzy PID Control Design of Mining Electric Locomotive Based on Permanent Magnet Synchronous Motor;Electronics;2024-05-10

3. A study of model predictive control and reinforcement learning control system;Fourth International Conference on Signal Processing and Machine Learning (CONF-SPML 2024);2024-04-01

4. Fast Nonlinear Predictive Control Using Classical and Parallel Wiener Models: A Comparison for a Neutralization Reactor Process;Sensors;2023-11-30

5. GPU Rasterization-Based 3D LiDAR Simulation for Deep Learning;Sensors;2023-09-28