1. Andrychowicz M, Wolski F, Ray A, Schneider J, Fong R, Welinder P, McGrew B, Tobin J, Abbeel P, Wojcieh Z (2018) Hindsight experience replay. https://arxiv.org/pdf/1707.01495.pdf
2. Arulkumaran K, Deisenroth MP, Brundage M, Bharath AA (2017) A brief survey of deep reinforcement learning. IEEE Signal Process Magazine. Special Issue on Deep Learning for Image Understanding (Arxiv Extended Version). https://arxiv.org/pdf/1708.05866.pdf
3. Barlow J, Sembi S, Parsons H, Kim S, Petrou S, Harnett P, Dawe S (2018 Nov 3) A randomized controlled trial and economic evaluation of the parents under pressure program for parents in substance abuse treatment. Drug Alcohol Depend 194:184–194. https://doi.org/10.1016/j.drugalcdep.2018.08.044
4. Barrett S (2013) Climate treaties and approaching catastrophes. J Environ Econ Manag 66:235–250. https://doi.org/10.1016/j.jeem.2012.12.004i
5. Bernkamp F, Turchetta M, Schoellig AP, Krause A (2017) Safe model-based reinforcement learning with stability guarantees. In: 31st Conference on Neural Information Processing Systems (NIPS 2017), Long Beach, CA, USA. https://papers.nips.cc/paper/6692-safe-model-based-reinforcement-learning-with-stability-guarantees.pdf