Subject
Information Systems and Management, Management Science and Operations Research, Modelling and Simulation, General Computer Science, Industrial and Manufacturing Engineering
Reference
24 articles.
1. Stochastic optimization;Aleksandrov;Engineering Cybernetics,1968
2. Gradient descent for general reinforcement learning;Baird;Advances in Neural Information Processing Systems,1998
3. Infinite-horizon policy-gradient estimation;Baxter;Journal of Artificial Intelligence Research,2001
4. Semi-Markov decision problems and performance sensitivity analysis;Cao;IEEE Transactions on Automatic Control,2003
Cited by
10 articles.