1. Ahluwalia, 2021. Policy-based branch-and-bound for infinite-horizon multi-model Markov decision processes. Comput. Oper. Res.
2. Aslanpour, et al., 2018. Resource provisioning for cloud applications: a 3-D, provident and flexible approach. J. Supercomputing 74, 6470–6501. URL: https://doi.org/10.1007/s11227-017-2156-x.
3. Brockman, G., et al., 2016. OpenAI Gym. CoRR abs/1606.01540. arXiv:1606.01540. URL: http://arxiv.org/abs/1606.01540.
4. Burda, Y., et al., 2019. Large-scale study of curiosity-driven learning, in: 7th International Conference on Learning Representations, ICLR 2019, New Orleans, LA, USA. URL: https://openreview.net/forum?id=rJNwDjAqYX.
5. Colas, C., et al., 2020. Language as a cognitive tool to imagine goals in curiosity driven exploration, in: Advances in Neural Information Processing Systems 33, Annual Conference on Neural Information Processing Systems 2020, NeurIPS 2020, December 6–12, 2020, virtual. URL: https://proceedings.neurips.cc/paper/2021/hash/286674e3082feb7e5afb92777e48821f-Abstract.html.