Authors:
Bhatnagar S., Prasad H., Prashanth L.
References (30 articles)
1. Abdulla, M.S., Bhatnagar, S.: Reinforcement learning based algorithms for average cost Markov decision processes. Discrete Event Dynamic Systems 17(1), 23–52 (2007)
2. Bertsekas, D.P., Tsitsiklis, J.N.: Neuro-Dynamic Programming. Athena Scientific, Belmont (1996)
3. Bhatnagar, S.: Adaptive multivariate three-timescale stochastic approximation algorithms for simulation based optimization. ACM Transactions on Modeling and Computer Simulation 15(1), 74–107 (2005)
4. Bhatnagar, S.: Adaptive Newton-based smoothed functional algorithms for simulation optimization. ACM Transactions on Modeling and Computer Simulation 18(1), 2:1–2:35 (2007)
5. Bhatnagar, S.: Simultaneous perturbation and finite difference methods. Wiley Encyclopedia of Operations Research and Management Science 7, 4969–4991 (2011)