Federated Bandit: A Gossiping Approach-Reference-Cited by-同舟云学术

Federated Bandit: A Gossiping Approach

Published:2021-06-22 Issue:1 Volume:49 Page:3-4
ISSN:0163-5999
Container-title:ACM SIGMETRICS Performance Evaluation Review
language:en
Short-container-title:SIGMETRICS Perform. Eval. Rev.

Author:

Zhu Zhaowei¹,Zhu Jingxuan²,Liu Ji²,Liu Yang¹

Affiliation:

1. University of California, Santa Cruz, Santa Cruz, CA, USA

2. Stony Brook University, Stony Brook, NY, USA

Abstract

We study Federated Bandit, a decentralized Multi-Armed Bandit (MAB) problem with a set of N agents, who can only communicate their local data with neighbors described by a connected graph G. Each agent makes a sequence of decisions on selecting an arm from M candidates, yet they only have access to local and potentially biased feedback/evaluation of the true reward for each action taken. Learning only locally will lead agents to sub-optimal actions while converging to a no-regret strategy requires a collection of distributed data. Motivated by the proposal of federated learning, we aim for a solution with which agents will never share their local observations with a central entity, and will be allowed to only share a private copy of his/her own information with their neighbors. We first propose a decentralized bandit algorithm GossipUCB, which is a coupling of variants of both the classical gossiping algorithm and the celebrated Upper Confidence Bound (UCB) bandit algorithm. We show that GossipUCB successfully adapts local bandit learning into a global gossiping process for sharing information among connected agents, and achieves guaranteed regret at the order of O(max(poly(N,M) log T, poly(N,M) logλ2-1 N)) for all N agents, where λ2∈(0,1) is the second largest eigenvalue of the expected gossip matrix, which is a function of G. We then propose FedUCB, a differentially private version of GossipUCB, in which the agents preserve ε-differential privacy of their local data while achieving O(max poly(N,M)/ε log2.5 T, poly(N,M) (logλ2-1 N + log T)) regret.

Funder

NSF

Office of Naval Research

Publisher

Association for Computing Machinery (ACM)

Subject

Computer Networks and Communications,Hardware and Architecture,Software

Link

https://dl.acm.org/doi/pdf/10.1145/3543516.3453919

Reference5 articles.

1. Randomized gossip algorithms

2. Calibrating Noise to Sensitivity in Private Data Analysis

3. An Online Learning Approach to Improving the Quality of Crowd-Sourcing

4. Federated Machine Learning

5. Z. Zhu , J. Zhu , J. Liu , and Y. Liu . 2021 . Federated Bandit: A Gossiping Approach. Proceedings of the ACM on Measurement and Analysis of Computing Systems , Vol. 5 , 1 ( 2021 ), Article 2. Z. Zhu, J. Zhu, J. Liu, and Y. Liu. 2021. Federated Bandit: A Gossiping Approach. Proceedings of the ACM on Measurement and Analysis of Computing Systems, Vol. 5, 1 (2021), Article 2.