1. Constrained policy optimization;Achiam,2017
2. Machine learning for combinatorial optimization: a methodological tour d’horizon;Bengio,2018
3. Curriculum learning;Bengio,2009
4. Neuro-Dynamic Programming;Bertsekas,1996
5. Evolution strategies - A comprehensive introduction;Beyer;Natl. Comput.,2002