1. Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor; Haarnoja; arXiv e-prints, 2018
2. Addressing Function Approximation Error in Actor-Critic Methods; Fujimoto; arXiv e-prints, 2018
3. Reinforcement Learning with Deep Energy-Based Policies; Haarnoja; arXiv e-prints, 2017
4. Learning to Walk via Deep Reinforcement Learning; Haarnoja; arXiv e-prints, 2018
5. Path Planning for Multi-Arm Manipulators Using Deep Reinforcement Learning: Soft Actor–Critic with Hindsight Experience Replay