1. A survey of pomdp applications;cassandra;Working Notes of AAAI 1998 Fall Symposium on Planning with Partially Observable Markov Decision Processes,1998
2. Learning to utilize shaping rewards: A new approach of reward shaping;hu;Advances in neural information processing systems,2020
3. Unity: A general platform for intelligent agents;juliani,2018
4. Q-Learning: Theory and Applications
5. Empirical evaluation of gated recurrent neural networks on sequence modeling;chung,2014