1. Approximability of constant-horizon constrained POMDP;Khonji,2019
2. E.J. Sondik, The optimal control of partially observable Markov decision processes, PhD thesis, Stanford University.
3. Planning and acting in partially observable stochastic domains;Kaelbling;Artif. Intell.,1998
4. Reinforcement learning: a survey;Kaelbling;J. Artif. Intell. Res.,1996
5. A point-based POMDP algorithm for robot planning;Spaan,2004