1. Sutton, R.S., Barto, A.G.: Reinforcement Learning: An Introduction, 2nd edn. MIT Press, Cambridge (2018)
2. Xu, Q., Li, J., Koenig, S., Ma, H.: Multi-goal multi-agent pickup and delivery. In: 2022 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS), pp. 9964–9971 (2022)
3. Liu, M., Ma, H., Li, J., Koenig, S.: Task and path planning for multi-agent pickup and delivery. In: Proceedings of the 18th International Conference on Autonomous Agents and MultiAgent Systems, pp. 1152–1160. International Foundation for Autonomous Agents and Multiagent Systems (2019)
4. Hu, H., Yang, X., Xiao, S., Wang, F.: Anti-conflict AGV path planning in automated container terminals based on multi-agent reinforcement learning. Int. J. Prod. Res. 61(1), 65–80 (2023)
5. Watkins, C.J.C.H., Dayan, P.: Q-learning. Mach. Learn. 8, 279–292 (1992)