Team formation through an assessor: choosing MARL agents in pursuit–evasion games

Published:2024-02-10 Issue:3 Volume:10 Page:3473-3492
ISSN:2199-4536
Container-title:Complex & Intelligent Systems
language:en
Short-container-title:Complex Intell. Syst.

Author:

Zhao Yue, Ju Lushan, Hernández-Orallo José

Abstract

Team formation in multi-agent systems usually assumes that the capabilities of each team member are known and that the best formation can be derived from that information. As AI agents become more sophisticated, this characterisation is becoming more elusive and less predictive of a team's performance in cooperative or competitive situations. In this paper, we introduce a general and flexible way of anticipating the outcome of a game for any lineup (the agents, sociality regimes and any other hyperparameters for the team). For this purpose, we simply train an assessor using an appropriate team representation and standard machine learning techniques. We illustrate how we can interrogate the assessor to find the best formations in a pursuit–evasion game for several scenarios: offline team formation, where teams have to be decided before the game and cannot be changed afterwards, and online team formation, where teams can see the lineups of the other teams and can be changed at any time.

Funder

National Natural Science Foundation of China

Machine Teaching for Explainable AI

the Future of Life Institute, FLI

the EU (FEDER) and Spanish grant

EU’s Horizon 2020 research and innovation programme under grant agreement

Spanish grant

Publisher

Springer Science and Business Media LLC

Link

https://link.springer.com/content/pdf/10.1007/s40747-023-01336-5.pdf

References (51 articles)

1. Juárez J, Santos C, Brizuela CA (2021) A comprehensive review and a taxonomy proposal of team formation problems. ACM Comput Surv (CSUR) 54(7):1–33

2. Kwa HL, Babineau V, Philippot J, Bouffanais R (2023) Adapting the exploration-exploitation balance in heterogeneous swarms: tracking evasive targets. Artif Life 29(1):21–36

3. Shishika D, Paulos J, Dorothy MR, Hsieh MA, Kumar V (2019) Team composition for perimeter defense with patrollers and defenders. In: 2019 IEEE 58th conference on decision and control (CDC). IEEE, pp 7325–7332

4. Jeong Y-S, Pan Y, Rathore S, Kim B, Park JH (2019) A parallel team formation approach using crowd intelligence from social network. Comput Hum Behav 101:429–434

5. Chen L, Ye Y, Zheng A, Xie F, Zheng Z, Lyu MR (2020) Incorporating geographical location for team formation in social coding sites. World Wide Web 23:153–174