Understanding Failures of Deterministic Actor-Critic with Continuous Action Spaces and Sparse Rewards-Reference-Cited by-同舟云学术

Understanding Failures of Deterministic Actor-Critic with Continuous Action Spaces and Sparse Rewards

Published:2020 Issue: Volume: Page:308-320
ISSN:0302-9743
Container-title:Artificial Neural Networks and Machine Learning – ICANN 2020
language:
Short-container-title:

Author:

Matheron Guillaume^ORCID,Perrin Nicolas^ORCID,Sigaud Olivier^ORCID

Publisher

Springer International Publishing

Link

https://link.springer.com/content/pdf/10.1007/978-3-030-61616-8_25

Reference17 articles.

1. Achiam, J., Knight, E., Abbeel, P.: Towards characterizing divergence in deep q-learning. arXiv:1903.08894 (2019)

2. Ahmed, Z., Roux, N.L., Norouzi, M., Schuurmans, D.: Understanding the impact of entropy on policy optimization. arXiv:1811.11214 (2019)

3. Baird, L.C., Klopf, A.H.: Technical Report WL-TR-93-1147. Wright-Patterson AIr Force Base, Ohio, Wright Laboratory (1993)

4. Boyan, J.A., Moore, A.W.: Generalization in reinforcement learning: safely approximating the value function. In: Advances in Neural Information Processing Systems, pp. 369–376 (1995)

5. Colas, C., Sigaud, O., Oudeyer, P.Y.: GEP-PG: Decoupling Exploration and Exploitation in Deep Reinforcement Learning Algorithms. arXiv:1802.05054 (2018)

Cited by 12 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Learning continuous multi-UAV controls with directed explorations for flood area coverage;Robotics and Autonomous Systems;2024-10

2. Cognitive mapping and episodic memory emerge from simple associative learning rules;Neurocomputing;2024-08

3. Reliability assessment of off-policy deep reinforcement learning: A benchmark for aerodynamics;Data-Centric Engineering;2024

4. Artificial Intelligence Algorithms in Flood Prediction: A General Overview;Geo-information for Disaster Monitoring and Management;2024

5. Learning Energy-Efficient Transmitter Configurations for Massive MIMO Beamforming;IEEE Transactions on Machine Learning in Communications and Networking;2024