Deep Ensemble Reinforcement Learning with Multiple Deep Deterministic Policy Gradient Algorithm

Published: 2020-01-22
Volume: 2020
Pages: 1-12
ISSN: 1024-123X
Container-title: Mathematical Problems in Engineering
Language: en
Short-container-title: Mathematical Problems in Engineering

Author:

Wu Junta¹, Li Huiyun¹

Affiliation:

1. Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Shenzhen 518071, China

Abstract

The deep deterministic policy gradient (DDPG) algorithm, which operates over a continuous action space, has attracted great attention in reinforcement learning. However, exploration through dynamic programming within the Bayesian belief state space is rather inefficient even for simple systems. Another problem is that training data from autonomous vehicles are sequential and iterative and are subject to the law of causality, which violates the i.i.d. (independent and identically distributed) assumption on the training samples. This usually causes the standard bootstrap to fail when learning an optimal policy. In this paper, we propose a framework of m-out-of-n bootstrapped and aggregated multiple deep deterministic policy gradient to accelerate training and improve performance. Experimental results on the 2D robot arm game show that the reward gained by the aggregated policy is 10%–50% better than the rewards gained by the subpolicies. Experimental results on The Open Racing Car Simulator (TORCS) demonstrate that the new algorithm learns successful control policies with 56.7% less training time. A convergence analysis is also given from the perspective of probability and statistics. These results verify that the proposed method outperforms the existing algorithms in both efficiency and performance.
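
The two ideas named in the abstract, m-out-of-n bootstrap sampling and aggregation of several subpolicies, can be illustrated with a minimal sketch. This is not the authors' implementation: the function names, the choice to sample with replacement, and the use of a plain average as the aggregation rule are all assumptions made for illustration, and the "subpolicies" are hypothetical stand-ins for trained DDPG actors.

```python
import random
from typing import Callable, List, Sequence

Action = List[float]
Policy = Callable[[Sequence[float]], Action]


def m_out_of_n_bootstrap(buffer: Sequence, m: int, rng: random.Random) -> list:
    """Draw m items with replacement from a buffer of n items (assumes m <= n)."""
    if not 0 < m <= len(buffer):
        raise ValueError("m must satisfy 0 < m <= n")
    return [buffer[rng.randrange(len(buffer))] for _ in range(m)]


def aggregate_action(policies: Sequence[Policy], state: Sequence[float]) -> Action:
    """Average the actions proposed by each subpolicy (assumed aggregation rule)."""
    actions = [policy(state) for policy in policies]
    dim = len(actions[0])
    return [sum(a[i] for a in actions) / len(actions) for i in range(dim)]


# Hypothetical stand-ins for trained DDPG actors: each maps a state to an action.
subpolicies = [lambda s, k=k: [k * s[0]] for k in (0.5, 1.0, 1.5)]

rng = random.Random(0)
replay_buffer = list(range(100))  # placeholder transitions, n = 100
minibatch = m_out_of_n_bootstrap(replay_buffer, m=32, rng=rng)
action = aggregate_action(subpolicies, state=[2.0])
```

In this sketch each subpolicy would be trained on its own resampled minibatches, while the averaged action is what gets executed in the environment; how the paper actually combines the actors should be checked against the full text.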

Funder

National Natural Science Foundation of China

Publisher

Hindawi Limited

Subject

General Engineering,General Mathematics

Link

http://downloads.hindawi.com/journals/mpe/2020/4275623.pdf

References: 13 articles.

1. Deep Reinforcement Learning: A Brief Survey

2. Deep learning

3. Human-level control through deep reinforcement learning

Cited by 12 articles.

1. I Know How: Combining Prior Policies to Solve New Tasks;2024 IEEE Conference on Games (CoG);2024-08-05

2. A Pilot Study of Observation Poisoning on Selective Reincarnation in Multi-Agent Reinforcement Learning;Neural Processing Letters;2024-05-02

3. Enhancing the landing guidance of a reusable launch vehicle by improving genetic algorithm-based deep reinforcement learning using Hybrid Deterministic-Stochastic algorithm;PLOS ONE;2024-02-29

4. Development of a Dynamic Semi-empirical Model for Simulation of Copper Electrowinning Processes;JOM;2024-01-24

5. Ensemble reinforcement learning: A survey;Applied Soft Computing;2023-12