1. Rajeswaran , A. , Kumar , V. , Gupta , A. , Vezzani , G. , Schulman , J. , Todorov , E. , and Levine , S . 2018. Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations. In Robotics: Science and Systems XIV , Carnegie Mellon University , Pittsburgh, Pennsylvania, USA , June 26-30, 2018 . Rajeswaran, A., Kumar, V., Gupta, A., Vezzani, G., Schulman, J., Todorov, E., and Levine, S. 2018. Learning Complex Dexterous Manipulation with Deep Reinforcement Learning and Demonstrations. In Robotics: Science and Systems XIV, Carnegie Mellon University, Pittsburgh, Pennsylvania, USA, June 26-30, 2018.
2. Sham Kakade . 2001 . A natural policy gradient . In Proceedings of the 14th International Conference on Neural Information Processing Systems: Natural and Synthetic (NIPS'01) . MIT Press, Cambridge, MA, USA, 1531–1538. Sham Kakade. 2001. A natural policy gradient. In Proceedings of the 14th International Conference on Neural Information Processing Systems: Natural and Synthetic (NIPS'01). MIT Press, Cambridge, MA, USA, 1531–1538.
3. Nachum , O. , Gu , S. , Lee , H. , and Levine , S . 2018. Data-efficient hierarchical reinforcement learning . In Proceedings of the 32nd International Conference on Neural Information Processing Systems (NIPS '18) . Curran Associates Inc., Red Hook, NY, USA, 3307–3317. Nachum, O., Gu, S., Lee, H., and Levine, S. 2018. Data-efficient hierarchical reinforcement learning. In Proceedings of the 32nd International Conference on Neural Information Processing Systems (NIPS '18). Curran Associates Inc., Red Hook, NY, USA, 3307–3317.
4. Dot-to-Dot: Explainable Hierarchical Reinforcement Learning for Robotic Manipulation
5. Zhang , J. , Yu , H. , and Xu , W . 2021. Hierarchical Reinforcement Learning by Discovering Intrinsic Options . In International Conference on Learning Representations. Zhang, J., Yu, H., and Xu, W. 2021. Hierarchical Reinforcement Learning by Discovering Intrinsic Options. In International Conference on Learning Representations.