Policy-Gradient and Actor-Critic Based State Representation Learning for Safe Driving of Autonomous Vehicles-Reference-Cited by-同舟云学术

Policy-Gradient and Actor-Critic Based State Representation Learning for Safe Driving of Autonomous Vehicles

Published:2020-10-22 Issue:21 Volume:20 Page:5991
ISSN:1424-8220
Container-title:Sensors
language:en
Short-container-title:Sensors

Author:

Gupta Abhishek^ORCID,Khwaja Ahmed Shaharyar,Anpalagan Alagan,Guan Ling,Venkatesh Bala

Abstract

In this paper, we propose an environment perception framework for autonomous driving using state representation learning (SRL). Unlike existing Q-learning based methods for efficient environment perception and object detection, our proposed method takes the learning loss into account under deterministic as well as stochastic policy gradient. Through a combination of variational autoencoder (VAE), deep deterministic policy gradient (DDPG), and soft actor-critic (SAC), we focus on uninterrupted and reasonably safe autonomous driving without steering off the track for a considerable driving distance. Our proposed technique exhibits learning in autonomous vehicles under complex interactions with the environment, without being explicitly trained on driving datasets. To ensure the effectiveness of the scheme over a sustained period of time, we employ a reward-penalty based system where a negative reward is associated with an unfavourable action and a positive reward is awarded for favourable actions. The results obtained through simulations on DonKey simulator show the effectiveness of our proposed method by examining the variations in policy loss, value loss, reward function, and cumulative reward for ‘VAE+DDPG’ and ‘VAE+SAC’ over the learning process.

Publisher

MDPI AG

Subject

Electrical and Electronic Engineering,Biochemistry,Instrumentation,Atomic and Molecular Physics, and Optics,Analytical Chemistry

Link

https://www.mdpi.com/1424-8220/20/21/5991/pdf

Reference45 articles.

1. Deep Reinforcement Learning: A Brief Survey

2. Policy search in continuous action domains: An overview

3. Human-like autonomous car-following model with deep reinforcement learning

4. Towards data-driven car-following models

5. Learning to Drive in a Day;Kendall;arXiv,2018

Cited by 8 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Deep deterministic policy gradient algorithm: A systematic review;Heliyon;2024-05

2. Dimensionality Reduction Methods Using VAE for Deep Reinforcement Learning of Autonomous Driving;2023 Eleventh International Symposium on Computing and Networking Workshops (CANDARW);2023-11-27

3. Investigating gas furnace control practices with reinforcement learning;International Journal of Heat and Mass Transfer;2023-08

4. New Reward-Clipping Mechanism in Deep -Learning Enabled Internet of Things in 6G to Improve Intelligent Transmission Scheduling;2023 IEEE 13th Annual Computing and Communication Workshop and Conference (CCWC);2023-03-08

5. A Novel Variational Autoencoder with Multi-position Latent Self-attention and Actor-Critic for Recommendation;Advanced Data Mining and Applications;2023