1. Abels, Axel. et al. 2019. Dynamic weights in multi-objective deep reinforcement learning. In International Conference on Machine Learning, 11–20. PMLR.
2. Adnan, Md Akhtaruzzaman, et al. 2013. Bio-mimic optimization strategies in wireless sensor networks: A survey. Sensors 14 (1): 299–345.
3. Alegre, Lucas Nunes, et al. 2022. Optimistic linear support and successor features as a basis for optimal policy transfer. In International Conference on Machine Learning, 394–413. PMLR.
4. Altman, Eitan. 2021. Constrained Markov Decision Processes. Milton Park: Routledge.
5. Barrett, Leon, and Srini Narayanan. 2008. Learning all optimal policies with multiple criteria. In Proceedings of the 25th International Conference on Machine Learning, 41–47.