1. Afshar, R.R., Zhang, Y., Firat, M., Kaymak, U.: A decision support method to increase the revenue of ad publishers in waterfall strategy. In: 2019 IEEE Conference on Computational Intelligence for Financial Engineering Economics (CIFEr), pp. 1–8, May 2019.
https://doi.org/10.1109/CIFEr.2019.8759106
2. Afshar., R.R., Zhang., Y., Firat., M., Kaymak., U.: A reinforcement learning method to select ad networks in waterfall strategy. In: Proceedings of the 11th International Conference on Agents and Artificial Intelligence, ICAART, vol. 2, pp. 256–265. INSTICC, SciTePress (2019).
https://doi.org/10.5220/0007395502560265
3. Agrawal, S., Goyal, N.: Analysis of Thompson sampling for the multi-armed bandit problem. In: Mannor, S., Srebro, N., Williamson, R.C. (eds.) Proceedings of the 25th Annual Conference on Learning Theory. Proceedings of Machine Learning Research, vol. 23, pp. 39.1–39.26. PMLR, Edinburgh, 25–27 June 2012.
http://proceedings.mlr.press/v23/agrawal12.html
4. Agrawal, S., Goyal, N.: Further optimal regret bounds for Thompson sampling. In: Carvalho, C.M., Ravikumar, P. (eds.) Proceedings of the Sixteenth International Conference on Artificial Intelligence and Statistics. Proceedings of Machine Learning Research, vol. 31, pp. 99–107. PMLR, Scottsdale, 29 Apr–01 May 2013.
http://proceedings.mlr.press/v31/agrawal13a.html
5. Amin, K., Rostamizadeh, A., Syed, U.: Learning prices for repeated auctions with strategic buyers. In: Proceedings of the 26th International Conference on Neural Information Processing Systems, NIPS 2013, vol. 1, pp. 1169–1177. Curran Associates Inc., USA (2013).
http://dl.acm.org/citation.cfm?id=2999611.2999742