A basic formula for performance gradient estimation of semi-Markov decision processes-Reference-Cited by-同舟云学术

A basic formula for performance gradient estimation of semi-Markov decision processes

Published:2013-01 Issue:2 Volume:224 Page:333-339
ISSN:0377-2217
Container-title:European Journal of Operational Research
language:en
Short-container-title:European Journal of Operational Research

Author:

Li Yanjie,Cao Fang

Publisher

Elsevier BV

Subject

Information Systems and Management,Management Science and Operations Research,Modelling and Simulation,General Computer Science,Industrial and Manufacturing Engineering

Reference24 articles.

1. Stochastic optimaization;Aleksandrov;Engineering Cybernetics,1968

2. Gradient descent for general reinforcement learning;Baird;Advances in Neural Information Processing Systems,1998

3. Infinite-horizon policy-gradient estimation;Baxter;Journal of Artificial Intelligence Research,2001

4. Semi-markov decision problems and performance sensitivity analysis;Cao;IEEE Transactions on Automatic Control,2003

Cited by 10 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Deterministic policy gradient algorithms for semi‐Markov decision processes;International Journal of Intelligent Systems;2021-10-13

2. Data-driven control of micro-climate in buildings: An event-triggered reinforcement learning approach;Applied Energy;2020-11

3. Evolutionary reinforcement learning of dynamical large deviations;The Journal of Chemical Physics;2020-07-28

4. Event-based optimization approach for solving stochastic decision problems with probabilistic constraint;Optimization Letters;2019-02-18

5. The risk probability criterion for discounted continuous-time Markov decision processes;Discrete Event Dynamic Systems;2017-08-10