Recursive adaptive control of Markov decision processes with the average reward criterion-Reference-Cited by-同舟云学术

Recursive adaptive control of Markov decision processes with the average reward criterion

Published:1991-01 Issue:1 Volume:23 Page:193-207
ISSN:0095-4616
Container-title:Applied Mathematics & Optimization
language:en
Short-container-title:Appl Math Optim

Author:

Cavazos-Cadena Rolando,Hern�ndez-Lerma On�simo

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics,Control and Optimization

Link

http://link.springer.com/content/pdf/10.1007/BF01442397.pdf

Reference18 articles.

1. Acosta-Abreu, R. S., and O. Hernández-Lerma (1985). Iterative adaptive control of denumerable stage average-cost Markov systems. Control and Cybernetics, 14, 313?322.

2. Ash, R. B. (1972). Real Analysis and Probability. Academic Press, New York

3. Cavazos-Cadena, R. (1987). Finite state approximations and adaptive control of discounted Markov decision processes with unbounded rewards. Control and Cybernetics, 16, 31?58.

4. Cavazos-Cadena, R. (1990). Nonparametric adaptive control of discounted stochastic systems with compact state space. Journal of Optimization Theory and Applications, 65(2), 191?207.

5. Cavazos-Cadena, R., and O. Hernández-Lerma (1989). Recursive Adaptive Control of Markov Decision Processes. Report No. 28, Departamento de Matemáticas, CINVESTAV-IPN, México, D.F.

Cited by 2 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Limiting Discounted-Cost Control of Partially Observable Stochastic Systems;SIAM Journal on Control and Optimization;2001-01

2. Recurrence conditions for Markov decision processes with Borel state space: A survey;Annals of Operations Research;1991-12