Author:
Arruda E.F.,Fragoso M.D.,do Val J.B.R.
Subject
Information Systems and Management,Management Science and Operations Research,Modelling and Simulation,General Computer Science,Industrial and Manufacturing Engineering
Reference34 articles.
1. Arruda, E.F., do Val, J.B.R., 2006. Approximate dynamic programming based on expansive projections. In: Proceedings of the 45th IEEE International Conference on Decision and Control. San Diego, pp. 5537–5542.
2. Arruda, E.F., Fragoso, M.D., do Val, J.B.R., 2008. An application of convex optimization concepts to approximate dynamic programming. In: Proceedings of the 2008 American Control Conference. Seattle, pp. 4238–4243.
3. Baird, L.C., 1995. Residual algorithms: Reinforcement learning with function approximation. In: Proceedings of the 12th International Conference on Machine Learning. Tahoe City CA, pp. 30–37.
4. Gradient descent for general reinforcement learning;Baird;Advances in Neural Information Processing Systems,1999
5. Nonlinear Programming: Theory and Algorithms;Bazaraa,1993
Cited by
8 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献