1. Agarwal, A., Bartlett, P., Dama, M.: Optimal allocation strategies for the dark pool problem. In: Proceedings of the International Conference on Artificial Intelligence and Statistics, pp. 9–16 (2010)
2. Agarwal, A., Kakade, S., Yang, L.F.: Model-based reinforcement learning with a generative model is minimax optimal. In: Conference on Learning Theory, pp. 67–83 (2020)
3. Agarwal, A., Kakade, S.M., Lee, J.D., Mahajan, G.: On the theory of policy gradient methods: Optimality, approximation, and distribution shift. J. Mach. Learn. Res. 22(98), 1–76 (2021)
4. Osband, I., Ghavamzadeh, M., Munos, R.: Minimax regret bounds for reinforcement learning. In: International Conference on Machine Learning (2017)
5. Abernethy, J., Kale, S.: Adaptive market making via online learning. In: Advances in Neural Information Processing Systems 26 (NIPS 2013) (2013)