1. Abadi, M., et al. (2016). Tensorflow: A system for large-scale machine learning. In 12th symposium on operating systems design and implementation (16) (pp. 265–283).
2. Araya, M., Buffet, O., Thomas, V., & Charpillet, F. (2010). A POMDP extension with belief-dependent rewards. In Advances in neural information processing systems (pp. 64–72).
3. An introduction to cybernetics;Ashby,1961
4. Optimal control of Markov processes with incomplete state information;Åström;Journal of Mathematical Analysis and Applications,1965
5. Revisiting active perception;Bajcsy;Autonomous Robots,2018