Synthetic Experiences for Accelerating DQN Performance in Discrete Non-Deterministic Environments-Reference-Cited by-同舟云学术

Synthetic Experiences for Accelerating DQN Performance in Discrete Non-Deterministic Environments

Published:2021-07-27 Issue:8 Volume:14 Page:226
ISSN:1999-4893
Container-title:Algorithms
language:en
Short-container-title:Algorithms

Author:

Pilar von Pilchau Wenzel^ORCID,Stein Anthony^ORCID,Hähner Jörg

Abstract

State-of-the-art Deep Reinforcement Learning Algorithms such as DQN and DDPG use the concept of a replay buffer called Experience Replay. The default usage contains only the experiences that have been gathered over the runtime. We propose a method called Interpolated Experience Replay that uses stored (real) transitions to create synthetic ones to assist the learner. In this first approach to this field, we limit ourselves to discrete and non-deterministic environments and use a simple equally weighted average of the reward in combination with observed follow-up states. We could demonstrate a significantly improved overall mean average in comparison to a DQN network with vanilla Experience Replay on the discrete and non-deterministic FrozenLake8x8-v0 environment.

Publisher

MDPI AG

Subject

Computational Mathematics,Computational Theory and Mathematics,Numerical Analysis,Theoretical Computer Science

Link

https://www.mdpi.com/1999-4893/14/8/226/pdf

Cited by 3 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Deep Q-Network Updates for the Full Action-Space Utilizing Synthetic Experiences;2023 International Joint Conference on Neural Networks (IJCNN);2023-06-18

2. Automatic focal EEG identification based on deep reinforcement learning;Biomedical Signal Processing and Control;2023-05

3. Semi-model-Based Reinforcement Learning in Organic Computing Systems;Architecture of Computing Systems;2022