Author:
Wang Na,Yu Jing,Wang Lei,Hao Xiaomei
Reference16 articles.
1. Cheng, F., G. Zhong., Y. Li., and Z. Xu. 1996. Fuzzy control of a double-inverted pendulum. Fuzzy Sets and Systems 79 (3): 315–321.
2. Bradtke, S.J., and A.G. Barto. 1996. Linear least-squares algorithms for temporal difference learning. Machine Learning 22 (1–3): 33–57.
3. Peng, J., and R.J. Williams. 1996. Incremental multi-step Q-learning. Machine Learning 22 (1–3): 283–290.
4. Lin, L.J. 1992. Self-improving reactive agents based on reinforcement learning, planning and teaching. Machine Learning 8 (3–4): 293–321.
5. Dayan, P., and C.J.C.H. Watkins. 1992. Q-learning. Machine Learning 8 (3): 279–292.
Cited by
1 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献