Affiliation:
1. Department of Computer Science and Engineering, Waseda University, Tokyo 169-8555, Japan
Abstract
Decentralized execution is a widely used framework in multi-agent reinforcement learning. However, it has a well-known but neglected shortcoming, redundant computation, that is, the same/similar computation is performed redundantly in different agents owing to their overlapping observations. This study proposes a novel method, the locally centralized team transformer (LCTT), to address this problem. This method first proposes a locally centralized execution framework that autonomously determines some agents as leaders that generate instructions and other agents as workers to act according to the received instructions without running their policy networks. For the LCTT, we subsequently propose the team-transformer (T-Trans) structure, which enables leaders to generate targeted instructions for each worker, and the leadership shift, which enables agents to determine those that should instruct or be instructed by others. The experimental results demonstrated that the proposed method significantly reduces redundant computations without decreasing rewards and achieves faster learning convergence.
Reference39 articles.
1. Distributed multiagent coordinated learning for autonomous driving in highways based on dynamic coordination graphs;Yu;IEEE Trans. Intell. Transp. Syst.,2019
2. Wachi, A. (2019). Failure-scenario maker for rule-based agent using multi-agent adversarial reinforcement learning and its application to autonomous driving. arXiv.
3. Bhalla, S., Ganapathi Subramanian, S., and Crowley, M. (2020, January 13–15). Deep multi agent reinforcement learning for autonomous driving. Proceedings of the Canadian Conference on Artificial Intelligence, Ottawa, ON, Canada.
4. Palanisamy, P. (2020, January 19–24). Multi-agent connected autonomous driving using deep reinforcement learning. Proceedings of the 2020 International Joint Conference on Neural Networks (IJCNN), Glasgow, UK.
5. Shalev-Shwartz, S., Shammah, S., and Shashua, A. (2016). Safe, multi-agent, reinforcement learning for autonomous driving. arXiv.