1. Learning and sequential decision making;Barto,1991
2. Dynamic Programming;Bellman,1957
3. Dynamic programming: deterministic and stochastic models;Bertsekas,1987
4. Planning, learning and coordination in multiagent decision processes;Boutilier,1996
5. Convergence problems of general-sum multiagent reinforcement learning;Bowling,2000