Hessian matrix distribution for Bayesian policy gradient reinforcement learning

Published: 2011-05
Volume: 181
Issue: 9
Pages: 1671-1685
ISSN: 0020-0255
Journal: Information Sciences
Language: en

Author:

Vien Ngo Anh, Yu Hwanjo, Chung TaeChoong

Publisher

Elsevier BV

Subject

Artificial Intelligence, Information Systems and Management, Computer Science Applications, Theoretical Computer Science, Control and Systems Engineering, Software

References: 29 articles.

1. J.A. Bagnell, J.G. Schneider, Covariant policy search, in: IJCAI, 2003, pp. 1019–1024.

2. Baxter, Infinite-horizon policy-gradient estimation, Journal of Artificial Intelligence Research (JAIR), 2001.

3. Bellman, Dynamic Programming, 1957.

4. D.P. Bertsekas, J.N. Tsitsiklis, Neuro-dynamic Programming, Athena Scientific, Belmont, Mass, 1996.

5. Bhatnagar, Natural actor-critic algorithms, Automatica, 2009.

Cited by 21 articles.

1. Deep reinforcement learning-based air combat maneuver decision-making: literature review, implementation tutorial and future direction;Artificial Intelligence Review;2023-12-28

2. A covariance matrix adaptation evolution strategy in reproducing kernel Hilbert space;Genetic Programming and Evolvable Machines;2019-06-19

3. Bayes-adaptive hierarchical MDPs;Applied Intelligence;2016-01-29

4. Convex Optimization: Algorithms and Complexity;Foundations and Trends® in Machine Learning;2015

5. Using IDS fitted Q to develop a real-time adaptive controller for dynamic resource provisioning in Cloud's virtualized environment;Applied Soft Computing;2015-01