1. Abel, D., Umbanhowar, N., Khetarpal, K., Arumugam, D., Precup, D., Littman, M.: Value preserving state-action abstractions. In: International Conference on Artificial Intelligence and Statistics, pp. 1639–1650. PMLR (2020)
2. Angelotti, G., Drougard, N., Chanel, C.P.C.: Offline learning for planning: a summary. In: Proceedings of the 1st Workshop on Bridging the Gap Between AI Planning and Reinforcement Learning at the 30th International Conference on Automated Planning and Scheduling, pp. 153–161 (2020)
3. Angelotti, G., Drougard, N., Chanel, C.P.C.: Expert-guided symmetry detection in markov decision processes. In: Proceedings of the 14th International Conference on Agents and Artificial Intelligence, vol. 2: ICAART, pp. 88–98. INSTICC, SciTePress (2022). https://doi.org/10.5220/0010783400003116
4. Angelotti, G., Drougard, N., Chanel, C.P.C.: Data augmentation through expert-guided symmetry detection to improve performance in offline reinforcement learning. In: Proceedings of the 15th International Conference on Agents and Artificial Intelligence, vol. 2: ICAART, pp. 115–124. INSTICC, SciTePress (2023). https://doi.org/10.5220/0011633400003393
5. Bertsekas, D.P., Tsitsiklis, J.N.: Neuro-dynamic programming: an overview. In: Proceedings of 1995 34th IEEE Conference on Decision and Control, vol. 1, pp. 560–564. IEEE (1995)