On the Existence of Fixed Points for Approximate Value Iteration and Temporal-Difference Learning

Published: 2000-06 Issue: 3 Volume: 105 Page: 589-608
ISSN: 0022-3239
Container-title: Journal of Optimization Theory and Applications
Language: en
Short-container-title: Journal of Optimization Theory and Applications

Author:

De Farias D. P., Van Roy B.

Publisher

Springer Science and Business Media LLC

Subject

Applied Mathematics, Management Science and Operations Research, Control and Optimization

Link

http://link.springer.com/content/pdf/10.1023/A:1004641123405.pdf

References: 11 articles.

1. BELLMAN, R., and DREYFUS, S., Functional Approximations and Dynamic Programming, Mathematical Tables and Other Aids to Computation, Vol. 13, pp. 247-251, 1959.

2. SUTTON, R. S., Learning to Predict by the Method of Temporal Differences, Machine Learning, Vol. 3, pp. 9-44, 1988.

3. GURVITS, L., LIN, L. J., and HANSON, S. J., Incremental Learning of Evaluation Functions for Absorbing Markov Chains: New Methods and Theorems, Preprint, 1994.

4. PINEDA, F., Mean-Field Analysis for Batched TD(λ), Neural Computation, Vol. 9, pp. 1403-1419, 1997.

5. TSITSIKLIS, J. N., and VAN ROY, B., An Analysis of Temporal-Difference Learning with Function Approximation, IEEE Transactions on Automatic Control, Vol. 42, pp. 674-690, 1997.

Cited by 29 articles.

1. Nonparametric Approximate Dynamic Programming via the Kernel Method; Stochastic Systems; 2023-09

2. Research of Multi-agent Deep Reinforcement Learning based on Value Factorization; Highlights in Science, Engineering and Technology; 2023-04-01

3. Dynamic Inventory Repositioning in On-Demand Rental Networks; Management Science; 2022-11

4. Forward ADP I: The Value of a Policy; Reinforcement Learning and Stochastic Optimization; 2022-04-02

5. Analyzing Approximate Value Iteration Algorithms; Mathematics of Operations Research; 2021-12-30