Author:
Hu Jwu-Sheng,Zheng Li-Jing
Reference13 articles.
1. Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor;haarnoja,2018
2. Adam: A method for stochastic optimization;kingma,2014
3. Deep residual learning for image recognition;he;2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR),2016
4. Temporal Cycle-Consistency Learning
5. Multifingered Grasping Based on Multimodal Reinforcement Learning