Residual Sarsa algorithm with function approximation-Reference-Cited by-同舟云学术

Residual Sarsa algorithm with function approximation

Published:2017-11-10 Issue:S1 Volume:22 Page:795-807
ISSN:1386-7857
Container-title:Cluster Computing
language:en
Short-container-title:Cluster Comput

Author:

Qiming Fu,Wen Hu,Quan Liu,Heng Luo,Lingyao Hu,Jianping Chen

Funder

National Natural Science Foundation of China

Natural Science Foundation of Jiangsu

High School Natural Foundation of Jiangsu

Fundation of Ministry of Housing and Urban-Rural Development of the People’s Republic of China

Key Laboratory of Symbolic Computation and Knowledge Engineering of Ministry of Education, Jilin University

Suzhou Industrial application of basic research program part

Publisher

Springer Science and Business Media LLC

Subject

Computer Networks and Communications,Software

Link

http://link.springer.com/content/pdf/10.1007/s10586-017-1303-8.pdf

Reference28 articles.

1. Sutton, R.S., Barto, A.G.: Reinforcement learning: an introduction. MIT press, Cambridge (1998)

2. Liu, Q., Fu, Q.M., Gong, S.R., Fu, Y.C., Cui, Z.M.: Reinforcement learning algorithm based on minimum state method and average reward. J. Commun. 32(1), 66–71 (2011)

3. Sutton, R.S.: Learning to predict by the method of temporal differences. Mach. Learn. 3, 9–44 (1988)

4. Go, C.K., Lao, B., Yoshimoto J., et al.: A reinforcement learning approach to the shepherding task using Sarsa. In: Proceedings of International Joint Conference on Neural Networks (IJCNN), Kuala Lumpur, Malaysia (2016)

5. Chettibi, S., Chikhi, S.: Dynamic fuzzy logic and reinforcement learning for adaptive energy efficient routing in mobile ad-hoc networks. Appl. Soft Comput. 38, 321–328 (2016)

Cited by 1 articles. 订阅此论文施引文献订阅此论文施引文献，注册后可以免费订阅5篇论文的施引文献，订阅后可以查看论文全部施引文献

1. Upper confident bound advantage function proximal policy optimization;Cluster Computing;2022-09-14