Author:
Tavares Anderson Rocha,Bazzan Ana LC
Publisher
Springer Science and Business Media LLC
Reference23 articles.
1. Bazzan ALC, Junges R: Congestion tolls as utility alignment between agent and system optimum. In Proceedings of the fifth international joint conference on autonomous agents and multiagent systems. Edited by: Nakashima H, Wellman MP, Weiss G, Stone P. New York: ACM; 2006:126–128.
2. Sutton R, Barto A: Reinforcement learning: an introduction. Cambridge, MA: MIT Press; 1998.
3. Watkins CJCH, Dayan P: Q-learning. Mach Learn 1992, 8(3):279–292.
4. Claus C, Boutilier C: The dynamics of reinforcement learning in cooperative multiagent systems. In Proceedings of the fifteenth national conference on artificial intelligence. New York: ACM; 1998:746–752.
5. Buşoniu L, Babuska R, De Schutter B: A comprehensive survey of multiagent reinforcement learning. Syst Man Cybernet Part C: Appl Rev IEEE Trans 2008, 38(2):156–172.
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献