Simulation-Based Optimization Algorithms for Finite-Horizon Markov Decision Processes-Reference-Cited by-同舟云学术

Simulation-Based Optimization Algorithms for Finite-Horizon Markov Decision Processes

Published:2008-12 Issue:12 Volume:84 Page:577-600
ISSN:0037-5497
Container-title:SIMULATION
language:en
Short-container-title:SIMULATION

Author:

Bhatnagar Shalabh¹,Abdulla Mohammed Shahid²

Affiliation:

1. Department of Computer Science and Automation Indian Institute of Science Bangalore 560 012, India

2. General Motors India Science Lab Bangalore

Abstract

We develop four simulation-based algorithms for finite-horizon Markov decision processes. Two of these algorithms are developed for finite state and compact action spaces while the other two are for finite state and finite action spaces. Of the former two, one algorithm uses a linear parameterization for the policy, resulting in reduced memory complexity. Convergence analysis is briefly sketched and illustrative numerical experiments with the four algorithms are shown for a problem of flow control in communication networks.

Publisher

SAGE Publications

Subject

Computer Graphics and Computer-Aided Design,Modelling and Simulation,Software

Link

http://journals.sagepub.com/doi/pdf/10.1177/0037549708098120

Reference40 articles.

1. Markov Decision Processes

Cited by 11 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Learning-based optimal control of linear time-varying systems over large time intervals;Systems & Control Letters;2024-03

2. A Policy Gradient Approach for Finite Horizon Constrained Markov Decision Processes;2023 62nd IEEE Conference on Decision and Control (CDC);2023-12-13

3. Optimal Management of the Flow of Parts for Gas Turbines Maintenance by Reinforcement Learning and Artificial Neural Networks;International Series in Operations Research & Management Science;2021-12-09

4. Optimal decisions for continuous time Markov decision processes over finite planning horizons;Computers & Operations Research;2017-01

5. Near-Optimal Tracking Control of Mobile Robots Via Receding-Horizon Dual Heuristic Programming;IEEE Transactions on Cybernetics;2016-11