Approximate planning for Bayesian hierarchical reinforcement learning

Published: 2014-07-20 Issue: 3 Volume: 41 Page: 808-819
ISSN: 0924-669X
Container-title: Applied Intelligence
Language: en
Short-container-title: Appl Intell

Author:

Vien Ngo Anh, Ngo Hung, Lee Sungyoung, Chung TaeChoong

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence

Link

http://link.springer.com/content/pdf/10.1007/s10489-014-0565-6.pdf

References: 62 articles.

1. Abbeel P, Coates A, Quigley M, Ng AY (2006) An application of reinforcement learning to aerobatic helicopter flight. In: Advances in neural information processing systems (NIPS), pp 1–8

2. Abdoos M, Mozayani N, Bazzan ALC (2014) Hierarchical control of traffic signals using Q-learning with tile coding. Appl Intell 40(2):201–213

3. Asmuth J, Littman ML (2011) Learning is planning: near Bayes-optimal reinforcement learning via Monte-Carlo tree search. In: UAI, pp 19–26

4. Atkeson CG (1997) Nonparametric model-based reinforcement learning. In: Advances in neural information processing systems (NIPS)

5. Bai H, Hsu D, Lee WS, Vien NA (2010) Monte Carlo value iteration for continuous-state POMDPs. In: Algorithmic foundations of robotics IX, pp 175–191

Cited by 6 articles.

1. High-efficiency online planning using composite bounds search under partial observation;Applied Intelligence;2022-07-30

2. A Partially Observable Markov Decision Process-Based Blackboard Architecture for Cognitive Agents in Partially Observable Environments;IEEE Transactions on Cognitive and Developmental Systems;2020

3. Single Trajectory Learning: Exploration Versus Exploitation;International Journal of Pattern Recognition and Artificial Intelligence;2018-02-21

4. A Deep Hierarchical Reinforcement Learning Algorithm in Partially Observable Markov Decision Processes;IEEE Access;2018

5. Continuous-Observation Partially Observable Semi-Markov Decision Processes for Machine Maintenance;IEEE Transactions on Reliability;2017-03