Author:
Lu Xiaonong, Peng Zhanglin, Zhang Qiang, Yang Shanlin
Funder
Natural Science Foundation of Anhui Province
National Natural Science Foundation of China
Humanities and Social Science Foundation of Ministry of Education in China
Publisher
Springer Science and Business Media LLC
Reference
35 articles.
1. Puterman, M.: Markov Decision Processes: Discrete Stochastic Dynamic Programming. Wiley, New York (1994)
2. Aldhaheri, R., Khalil, H.: Aggregation of the policy iteration method for nearly completely decomposable Markov chains. IEEE Trans. Autom. Control 36(2), 178–187 (1991)
3. Ren, Z., Krogh, B.: State aggregation in Markov decision processes. In: Conference on Decision and Control. IEEE, pp. 3819–3824. Pittsburgh, USA (2002)
4. Cao, X., Ren, Z., Bhatnagar, S., et al.: A time aggregation approach to Markov decision processes. Automatica 38(6), 929–943 (2002)
5. Sun, T., Zhao, Q., Luh, P.: Incremental value iteration for time-aggregated Markov-decision processes. IEEE Trans. Autom. Control 52(11), 2177–2182 (2007)
Cited by
4 articles.