Author:
Bradtke Steven J.,Barto Andrew G.
Publisher
Springer Science and Business Media LLC
Subject
Artificial Intelligence,Software
Reference24 articles.
1. Technical Report 87-509.3;C. W. Anderson,1988
2. Barto, A. G., Sutton, R. S. & Anderson, C. W. (1983) Neuronlike elements that can solve difficult learning control problems.IEEE Transactions on Systems, Man, and Cybernetics, 13: 835?846.
3. Bradtke, S. J., (1994).Incremental Dynamic Programming for On-Line Adaptive Optimal Control. PhD thesis, University of Massachusetts, Computer Science Dept. Technical Report 94-62.
4. Darken, C. Chang, J. & Moody, J., (1992) Learning rate schedules for faster stochastic gradient search. InNeural Networks for Signal Processing 2 ? Proceedings of the 1992 IEEE Workshop. IEEE Press.
5. Dayan, P., (1992). The convergence of TD(?) for general ?.Machine Learning, 8: 341?362.
Cited by
220 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献