1. Pieter Abbeel , Adam Coates , Morgan Quigley , and Andrew Ng. 2006. An application of reinforcement learning to aerobatic helicopter flight. Advances in neural information processing systems , Vol. 19 ( 2006 ). Pieter Abbeel, Adam Coates, Morgan Quigley, and Andrew Ng. 2006. An application of reinforcement learning to aerobatic helicopter flight. Advances in neural information processing systems, Vol. 19 (2006).
2. State-of-the-art in artificial neural network applications: A survey
3. Learning Robust Control Policies for End-to-End Autonomous Driving From Data-Driven Simulation
4. Marcin Andrychowicz , Filip Wolski , Alex Ray , Jonas Schneider , Rachel Fong , Peter Welinder , Bob McGrew , Josh Tobin , Open AI Pieter Abbeel , and Wojciech Zaremba . 2017 . Hindsight Experience Replay . In Advances in Neural Information Processing Systems , Vol. 30 . Curran Associates, Inc. Marcin Andrychowicz, Filip Wolski, Alex Ray, Jonas Schneider, Rachel Fong, Peter Welinder, Bob McGrew, Josh Tobin, OpenAI Pieter Abbeel, and Wojciech Zaremba. 2017. Hindsight Experience Replay. In Advances in Neural Information Processing Systems, Vol. 30. Curran Associates, Inc.
5. Greg Brockman , Vicki Cheung , Ludwig Pettersson , Jonas Schneider , John Schulman , Jie Tang , and Wojciech Zaremba . 2016. Openai gym. arXiv preprint arXiv:1606.01540 ( 2016 ). Greg Brockman, Vicki Cheung, Ludwig Pettersson, Jonas Schneider, John Schulman, Jie Tang, and Wojciech Zaremba. 2016. Openai gym. arXiv preprint arXiv:1606.01540 (2016).