1. Variable Stability In-Flight Simulation System Based on Existing Autopilot Hardware
2. Trust region policy optimization;schulman;International Conference on Machine Learning,2015
3. Continuous control with deep reinforcement learning;lillicrap,2015
4. Soft actor-critic: Off-policy maximum entropy deep reinforcement learning with a stochastic actor;haarnoja;International Conference on Machine Learning,2018
5. Openai gym;brockman,2016