1. Deep reinforcement learning at the edge of the statistical precipice;Agarwal;Advances in Neural Information Processing Systems,2021
2. Constrained Markov decision processes;Altman,1999
3. Concrete problems in AI safety;Amodei,2016
4. Design for a brain;Ashby,1952
5. Layer normalization;Ba,2016