Affiliation:
1. Control Methods and Robotics Lab , Technical University of Darmstadt , Darmstadt , Germany
Abstract
Abstract
This work provides methodological approaches to solve convex optimization problems arising in multi-agent systems which can be reformulated in terms of a so called N-cluster game. We consider different settings of information available to each agent in the system. First, we present a centralized algorithm, which requires a central coordinator having full access to information about agents’ actions and gradients of their cost functions, to demonstrate how the standard gradient descent method can be applied to achieve an optimal output in N-cluster games. After that we relax the full information setting and assume that only partial information is available to each agent. Focus lies on the following two cases. In the first case, the agents have access to their gradient functions and are allowed to exchange information with their local neighbors over a communication graph that connects the whole system. In the second case, the agents do not know the functional form of their objectives/gradients and can only access the current values of their objective functions at some query point. Moreover, the agents are allowed to communicate only with their local neighbors within the cluster to which they belong. For both settings we present the convergent optimization procedures and analyse their efficiency in simulations.
Funder
Deutsche Forschungsgemeinschaft
Subject
Electrical and Electronic Engineering,Computer Science Applications,Control and Systems Engineering
Reference26 articles.
1. Bach, F. R. and V. Perchet. 2016. Highly-smooth zero-th order online optimization. In: Conference on Learning Theory (COLT), pp. 257–283.
2. Belgioioso, G., W. Ananduta, S. Grammatico and C. Ocampo-Martinez. 2020. Energy management and peer-to-peer trading in future smart grids: A distributed game-theoretic approach. In: 2020 European Control Conference (ECC), pp. 1324–1329.
3. Berahas, A. S., L. Cao, K. Choromanski and K. Scheinberg. 2021. A theoretical and empirical comparison of gradient approximations in derivative-free optimization. Foundations of Computational Mathematics. https://doi.org/10.1007/s10208-021-09513-z.
4. Bravo, M., D. Leslie and P. Mertikopoulos. 2018. Bandit learning in concave n-person games. In: Proceedings of the 32nd International Conference on Neural Information Processing Systems, NIPS’18. Red Hook, NY, USA, Curran Associates Inc., pp. 5666–5676.
5. Chung, K. L. Sep 1954. On a stochastic approximation method. Ann. Math. Statist. 25(3): 463–483.