1. Risk-averse multi-armed bandit problems under mean-variance measure;Vakili;IEEE Journal of Selected Topics in Signal Processing,2016
2. Zimin, A. , Ibsen-Jensen, R. , & Chatterjee, K. (2014) Generalized risk-aversion in stochastic multi-armed bandits. Preprint, arXiv:1405.0833.
3. What mean impacts miss: Distributional effects of welfare reform experiments;Bitler;American Economic Review,2006
4. Ma, X. , Zhang, Q. , Xia, L. , Zhou, Z. , Yang, J. , & Zhao, Q. (2020) Distributional soft actor critic for risk sensitive learning. Preprint, arXiv:2004.14547.
5. Kock, A.B. & Thyrsgaard, M. (2018) Optimal sequential treatment allocation. Preprint, arXiv:1705.09952.