Ensemble Reinforcement Learning in Continuous Spaces -- A Hierarchical Multi-Step Approach for Policy Training-Reference-Cited by-同舟云学术

Ensemble Reinforcement Learning in Continuous Spaces -- A Hierarchical Multi-Step Approach for Policy Training

Published:2023-08 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the Thirty-Second International Joint Conference on Artificial Intelligence
language:
Short-container-title:

Author:

Chen Gang¹,Huang Victoria²

Affiliation:

1. Victoria University of Wellington

2. National Institute of Water and Atmospheric Research

Abstract

Actor-critic deep reinforcement learning (DRL) algorithms have recently achieved prominent success in tackling various challenging reinforcement learning (RL) problems, particularly complex control tasks with high-dimensional continuous state and action spaces. Nevertheless, existing research showed that actor-critic DRL algorithms often failed to explore their learning environments effectively, resulting in limited learning stability and performance. To address this limitation, several ensemble DRL algorithms have been proposed lately to boost exploration and stabilize the learning process. However, most of existing ensemble algorithms do not explicitly train all base learners towards jointly optimizing the performance of the ensemble. In this paper, we propose a new technique to train an ensemble of base learners based on an innovative multi-step integration method. This training technique enables us to develop a new hierarchical learning algorithm for ensemble DRL that effectively promotes inter-learner collaboration through stable inter-learner parameter sharing. The design of our new algorithm is verified theoretically. The algorithm is also shown empirically to outperform several state-of-the-art DRL algorithms on multiple benchmark RL problems.

Publisher

International Joint Conferences on Artificial Intelligence Organization

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Deep Reinforcement Learning for Autonomous Driving in Amazon Web Services DeepRacer;Information;2024-02-15

2. Evolving Epidemic Management Rules Using Deep Neuroevolution: A Novel Approach to Inspection Scheduling and Outbreak Minimization;Lecture Notes in Computer Science;2023-11-27