1. Dynamic programming and stochastic control processes;Bellman;Information and Control,1958
2. Value and policy iterations in optimal control and adaptive dynamic programming;Bertsekas;IEEE Transactions on Neural Networks and Learning Systems,2017
3. Bonanno, D., Roberts, M., Smith, L., & Aha, D. W. (2016). Selecting subgoals using deep learning in minecraft: A preliminary report. In IJCAI workshop on deep learning for artificial intelligence.
4. Samap: An user-oriented adaptive system for planning tourist visits;Castillo;Expert Systems with Applications,2008
5. Learning to plan from raw data in grid-based games;Dittadi,2018