Author:
Kalyanakrishnan Shivaram, Stone Peter
Publisher
Springer Science and Business Media LLC
Subject
Artificial Intelligence, Software
References: 128 articles.
Cited by
16 articles.