1. Adrian Agogino and Kagan Tumer. 2004. Efficient Evaluation Functions for Multi-rover Systems. In Genetic and Evolutionary Computation-GECCO 2004. Springer, 1--11.
2. Stuart Russell Andrew Y. Ng, Daishi Harada. 1999. Policy invariance under reward transformations: Theory and application to reward shaping. Proceedings of the 16th International Conference on Machine Learning (1999), 278--287.
3. Shepherding algorithm for heterogeneous flock with model-based discrimination
4. Jacopo Castellini, Sam Devlin, Frans A Oliehoek, and Rahul Savani. 2022. Difference rewards policy gradients. Neural Computing and Applications (2022), 1--24.
5. Guiding a Robot Flock via Informed Robots