1. Proximal policy optimization algorithms;schulman,0
2. Weighted importance sampling for off-policy learning with linear function approximation;mahmood;Advances in neural information processing systems,2014
3. Transfer learning for reinforcement learning domains: A survey;taylor;Journal of Machine Learning Research,2009
4. Prioritized experience replay;schaul,2015
5. Fault tree analysis: A survey of the state-of-the-art in modeling, analysis and tools;hausken;Reliability Engineering & System Safety,2018