Bayesian Learning of Noisy Markov Decision Processes

Published:2013-01 Issue:1 Volume:23 Page:1-25
ISSN:1049-3301
Container-title:ACM Transactions on Modeling and Computer Simulation
language:en
Short-container-title:ACM Trans. Model. Comput. Simul.

Author:

Singh Sumeetpal S.¹, Chopin Nicolas², Whiteley Nick³

Affiliation:

1. University of Cambridge

2. CREST-ENSAE and HEC Paris

3. University of Bristol

Abstract

We consider the inverse reinforcement learning problem, that is, the problem of learning from, and then predicting or mimicking, a controller based on state/action data. We propose a statistical model for such data, derived from the structure of a Markov decision process. Adopting a Bayesian approach to inference, we show how latent variables of the model can be estimated, and how predictions about actions can be made, in a unified framework. A new Markov chain Monte Carlo (MCMC) sampler is devised for simulation from the posterior distribution. The sampler includes a parameter expansion step, which is shown to be essential for good convergence properties of the MCMC sampler. As an illustration, the method is applied to learning a human controller.

Funder

Agence Nationale de la Recherche

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Science Applications, Modeling and Simulation

Link

https://dl.acm.org/doi/pdf/10.1145/2414416.2414420

References: 33 articles.

1. Apprenticeship learning via inverse reinforcement learning

2. Swapping the Nested Fixed Point Algorithm: A Class of Estimators for Discrete Markov Decision Models

3. Bayesian analysis of binary and polychotomous response data;Albert J.;J. Amer. Statis. Assn.,1993

Cited by 2 articles.

1. A Non‐parametric Estimation of Productivity with Idiosyncratic and Aggregate Shocks: The Role of Research and Development (R&D) and Corporate Tax;Oxford Bulletin of Economics and Statistics;2024-01-12

2. Trace-class Gaussian priors for Bayesian learning of neural networks with MCMC;Journal of the Royal Statistical Society Series B: Statistical Methodology;2023-01-31