1. Real-time learning and control using asynchronous dynamic programming;Barto,1991
2. Learning to act using real-time dynamic programming;Barto;Artificial Intelligence,1995
3. Learning and sequential decision making;Barto,1989
4. Planning with incomplete information as heuristic search in belief space;Bonet,2000
5. Planning as heuristic search;Bonet;Artificial Intelligence,2001