1. Andrychowicz, M., Wolski, F., Ray, A., Schneider, J., Fong, R., Welinder, P., McGrew, B., Tobin, J., Abbeel, P., Zaremba, W. (2017). Hindsight experience replay. In: Advances in neural information processing systems.
2. Bain, M., & Sammut, C. (1995). A framework for behavioural cloning. Machine Intelligence, 15, 103–129.
3. Chane-Sane, E., Schmid, C., Laptev, I. (2021). Goal-conditioned reinforcement learning with imagined subgoals. In: International conference on machine learning.
4. Charlesworth, H., & Montana, G. (2020). Plangan: Model-based planning with sparse rewards and multiple goals. Advances in Neural Information Processing Systems, 33, 8532–8542.
5. Chebotar, Y., Hausman, K., Lu, Y., Xiao, T., Kalashnikov, D., Varley, J., Irpan, A., Eysenbach, B., Julian, R., Finn, C., Levine, S. (2021). Actionable models: Unsupervised offline reinforcement learning of robotic skills. In: International conference on machine learning.