1. Model-based reinforcement learning with a generative model is minimax optimal;A Agarwal;Conference on Learning Theory,2020
2. Minimax regret bounds for reinforcement learning;M G Azar;International Conference on Machine Learning,2017
3. Regret bounds for risk-sensitive reinforcement learning;O Bastani;Advances in Neural Information Processing Systems,2022
4. More risk-sensitive markov decision processes;N B�uerle;Mathematics of Operations Research,2014