On the effect of the sampling ratio of past trajectories in the combination of evolutionary algorithm and deep reinforcement learning-Reference-Cited by-同舟云学术

On the effect of the sampling ratio of past trajectories in the combination of evolutionary algorithm and deep reinforcement learning

Published:2022-07-09 Issue: Volume: Page:
ISSN:
Container-title:Proceedings of the Genetic and Evolutionary Computation Conference Companion
language:
Short-container-title:

Author:

Wang Yasen¹,Akimoto Youhei¹

Affiliation:

1. University of Tsukuba, Tsukuba, Ibaraki, Japan

Funder

JSPS KAKENHI

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3520304.3533971

Reference17 articles.

1. Natural Gradient Works Efficiently in Learning

2. Deep Reinforcement Learning: A Brief Survey

3. Tim De Bruin Jens Kober Karl Tuyls and Robert Babuška. 2015. The importance of experience replay database composition in deep reinforcement learning. In Deep reinforcement learning workshop NIPS. Tim De Bruin Jens Kober Karl Tuyls and Robert Babuška. 2015. The importance of experience replay database composition in deep reinforcement learning. In Deep reinforcement learning workshop NIPS.

4. Lih-Yuan Deng. 2006. The cross-entropy method: a unified approach to combinatorial optimization Monte-Carlo simulation and machine learning. Lih-Yuan Deng. 2006. The cross-entropy method: a unified approach to combinatorial optimization Monte-Carlo simulation and machine learning.

5. Scott Fujimoto , Herke Hoof , and David Meger . 2018 . Addressing function approximation error in actor-critic methods . In International conference on machine learning. PMLR, 1587--1596 . Scott Fujimoto, Herke Hoof, and David Meger. 2018. Addressing function approximation error in actor-critic methods. In International conference on machine learning. PMLR, 1587--1596.