Learning and planning in environments with delayed feedback

Published:2008-07-04 Issue:1 Volume:18 Page:83-105
ISSN:1387-2532
Container-title:Autonomous Agents and Multi-Agent Systems
language:en
Short-container-title:Auton Agent Multi-Agent Syst

Author:

Walsh Thomas J.,Nouri Ali,Li Lihong,Littman Michael L.

Publisher

Springer Science and Business Media LLC

Subject

Artificial Intelligence

Link

http://link.springer.com/content/pdf/10.1007/s10458-008-9056-7.pdf

References: 27 articles.

1. Altman, E., & Nain, P. Closed-loop control with delayed information. In Proceedings of the ACM SIGMETRICS and Performance 1–5, pp. 193–204.

2. Atkeson, C. G., Moore, A. W., & Schaal, S. (1997). Locally weighted learning for control. Artificial Intelligence Review, 11(1–5), 75–113.

3. Bander, J. L., & White, C. C., III (1999). Markov decision processes with noise-corrupted and delayed state observations. Journal of the Operational Research Society, 50, 660–668.

4. Bertsekas, D. P. (2001). Dynamic programming and optimal control (2nd ed., Vol. 1/2). Athena Scientific.

5. Boyan, J. A., & Moore, A. W. (1995). Generalization in reinforcement learning: Safely approximating the value function. In Advances in neural information processing systems: Proceedings of the 1994 conference (pp. 369–376). Cambridge, MA: MIT Press.

Cited by 32 articles.

1. Solving time-delay issues in reinforcement learning via transformers;Applied Intelligence;2024-09-10

2. Investigating the Effectiveness of Reinforcement Learning in Closed-Loop Systems with Time Delays;2024 American Control Conference (ACC);2024-07-10

3. Delayed MDPs with Feature Mapping;2024 International Joint Conference on Neural Networks (IJCNN);2024-06-30

4. Designing Long-term Group Fair Policies in Dynamical Systems;The 2024 ACM Conference on Fairness, Accountability, and Transparency;2024-06-03

5. Deep Reinforcement Learning-Driven Scheduling in Multijob Serial Lines: A Case Study in Automotive Parts Assembly;IEEE Transactions on Industrial Informatics;2024-02