1. Bacon, P., Harb, J., Precup, D., 2017. The option-critic architecture, in: Singh, S.P., Markovitch, S. (Eds.), Proceedings of the Thirty-First AAAI Conference on Artificial Intelligence, February 4–9, 2017, San Francisco, California, USA, AAAI Press. pp. 1726–1734. URL: http://aaai.org/ocs/index.php/AAAI/AAAI17/paper/view/14858.
2. Bagaria, A., Konidaris, G., 2020. Option discovery using deep skill chaining, in: 8th International Conference on Learning Representations, ICLR 2020, Addis Ababa, Ethiopia, April 26-30, 2020, OpenReview.net. URL: https://openreview.net/forum?id=B1gqipNYwH.
3. Recent advances in hierarchical reinforcement learning;Barto;Discrete Event Dyn. Syst.,2003
4. Actor-critic algorithms for hierarchical markov decision processes;Bhatnagar;Automatica,2006
5. Language models are few-shot learners;Brown;Advances in neural information processing systems,2020