Publisher
Springer International Publishing
Reference22 articles.
1. Auer, P., Cesa-Bianchi, N., Fischer, P.: Finite-time analysis of the multiarmed bandit problem. Mach. Learn. 47(2–3), 235–256 (2002). https://doi.org/10.1023/A:1013689704352
2. Balcan, M.F., Blum, A., Haghtalab, N., Procaccia, A.D.: Commitment without regrets: online learning in stackelberg security games. In: Proceedings of the Sixteenth ACM Conference on Economics and Computation, EC 2015, pp. 61–78. ACM, New York (2015). https://doi.org/10.1145/2764468.2764478
3. Bard, N., Johanson, M., Burch, N., Bowling, M.: Online implicit agent modelling. In: Proceedings of the 2013 International Conference on Autonomous Agents and Multi-agent Systems, AAMAS 2013, pp. 255–262. International Foundation for Autonomous Agents and Multiagent Systems, Richland (2013). http://dl.acm.org/citation.cfm?id=2484920.2484963
4. Bellman, R.: A markovian decision process. Indiana Univ. Math. J. 6, 679–684 (1957)
5. Brown, G., Carlyle, M., Salmerón, J., Wood, K.: Defending critical infrastructure. Interfaces 36(6), 530–544 (2006). https://doi.org/10.1287/inte.1060.0252
Cited by
2 articles.
订阅此论文施引文献
订阅此论文施引文献,注册后可以免费订阅5篇论文的施引文献,订阅后可以查看论文全部施引文献