Regularly updated deterministic policy gradient algorithm

Published: 2021-02 Volume: 214 Page: 106736
ISSN: 0950-7051
Container-title: Knowledge-Based Systems
Language: en
Short-container-title: Knowledge-Based Systems

Author:

Han Shuai, Zhou Wenbo, Lü Shuai, Yu Jiayu

Funder

National Key Research and Development Program of China

National Natural Science Foundation of China

Publisher

Elsevier BV

Subject

Artificial Intelligence, Information Systems and Management, Management Information Systems, Software

References: 65 articles.

1. Mnih, Human-level control through deep reinforcement learning, Nature, 2015.

2. M. Hessel, J. Modayil, H. van Hasselt, T. Schaul, G. Ostrovski, W. Dabney, D. Horgan, B. Piot, M.G. Azar, D. Silver, Rainbow: Combining improvements in deep reinforcement learning, in: Thirty-Second AAAI Conference on Artificial Intelligence, 2018, pp. 3215–3222.

3. M.G. Bellemare, W. Dabney, R. Munos, A distributional perspective on reinforcement learning, in: 34th International Conference on Machine Learning, 2017, pp. 449–458.

4. W. Dabney, M. Rowland, M.G. Bellemare, R. Munos, Distributional reinforcement learning with quantile regression, in: Thirty-Second AAAI Conference on Artificial Intelligence, 2018, pp. 2892–2901.

5. Kulkarni, Hierarchical deep reinforcement learning: Integrating temporal abstraction and intrinsic motivation, 2016.

Cited by 17 articles.

1. A multi-step on-policy deep reinforcement learning method assisted by off-policy policy evaluation;Applied Intelligence;2024-09-09

2. An efficient and lightweight off-policy actor–critic reinforcement learning framework;Applied Soft Computing;2024-09

3. End-to-End Autonomous Driving Decision Method Based on Improved TD3 Algorithm in Complex Scenarios;Sensors;2024-07-31

4. APSN: adaptive prediction sample network in Deep Q learning;Third International Conference on Algorithms, Microchips, and Network Applications (AMNA 2024);2024-06-08

5. Explorer-Actor-Critic: Better actors for deep reinforcement learning;Information Sciences;2024-03