An Overview of the Action Space for Deep Reinforcement Learning-Reference-Cited by-同舟云学术

An Overview of the Action Space for Deep Reinforcement Learning

Published:2021-12-22 Issue: Volume: Page:
ISSN:
Container-title:2021 4th International Conference on Algorithms, Computing and Artificial Intelligence
language:
Short-container-title:

Author:

Zhu Jie¹,Wu Fengge¹,Zhao Junsuo¹

Affiliation:

1. Institute of Software Chinese Academy of Sciences, China and University of Chinese Academy of Sciences, China

Publisher

ACM

Link

https://dl.acm.org/doi/pdf/10.1145/3508546.3508598

Reference44 articles.

1. Abbas Abdolmaleki Jost Tobias Springenberg Yuval Tassa Remi Munos Nicolas Heess and Martin Riedmiller. 2018. Maximum a posteriori policy optimisation. arXiv preprint arXiv:1806.06920(2018). Abbas Abdolmaleki Jost Tobias Springenberg Yuval Tassa Remi Munos Nicolas Heess and Martin Riedmiller. 2018. Maximum a posteriori policy optimisation. arXiv preprint arXiv:1806.06920(2018).

2. Gabriel Barth-Maron Matthew W Hoffman David Budden Will Dabney Dan Horgan Dhruva Tb Alistair Muldal Nicolas Heess and Timothy Lillicrap. 2018. Distributed distributional deterministic policy gradients. arXiv preprint arXiv:1804.08617(2018). Gabriel Barth-Maron Matthew W Hoffman David Budden Will Dabney Dan Horgan Dhruva Tb Alistair Muldal Nicolas Heess and Timothy Lillicrap. 2018. Distributed distributional deterministic policy gradients. arXiv preprint arXiv:1804.08617(2018).

3. Marc G Bellemare , Will Dabney , and Rémi Munos . 2017 . A distributional perspective on reinforcement learning . In International Conference on Machine Learning. PMLR, 449–458 . Marc G Bellemare, Will Dabney, and Rémi Munos. 2017. A distributional perspective on reinforcement learning. In International Conference on Machine Learning. PMLR, 449–458.

4. Craig J Bester Steven D James and George D Konidaris. 2019. Multi-pass q-networks for deep reinforcement learning with parameterised action spaces. arXiv preprint arXiv:1905.04388(2019). Craig J Bester Steven D James and George D Konidaris. 2019. Multi-pass q-networks for deep reinforcement learning with parameterised action spaces. arXiv preprint arXiv:1905.04388(2019).

Cited by 17 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Development of algorithms for augmenting and replacing conventional process control using reinforcement learning;Computers & Chemical Engineering;2024-11

2. Cooperative MARL-PPO Approach for Automated Highway Platoon Merging;Electronics;2024-08-05

3. AK-MADDPG-Based Antijamming Strategy Design Method for Frequency Agile Radar;Sensors;2024-05-27

4. Enhancement of power quality in three-phase GC solar photovoltaics;Electrical Engineering;2024-03-07

5. Deep Reinforcement Learning-based scheduling for optimizing system load and response time in edge and fog computing environments;Future Generation Computer Systems;2024-03