1. A. Agarwal, et al. On the theory of policy gradient methods: Optimality, approximation, and distribution shift. JMLR, 2021.
2. Canonical piecewise-linear approximations
3. Y. Lu, M.S. Squillante, C.W. Wu. Markov Decision Process Framework for Control-Based Reinforcement Learning. Preprint, May 2023.
4. A. Nagabandi, et al. Neural network dynamics for model-based deep reinforcement learning with model-free fine-tuning. arXiv:1708.02596v2, 2017.
5. Nonlinear optimal feedback control for lunar module soft landing