A Semi-Federated Active Learning Framework for Unlabeled Online Network Data-Reference-Cited by-同舟云学术

A Semi-Federated Active Learning Framework for Unlabeled Online Network Data

Published:2023-04-21 Issue:8 Volume:11 Page:1972
ISSN:2227-7390
Container-title:Mathematics
language:en
Short-container-title:Mathematics

Author:

Zhou Yuwen¹²,Hu Yuhan²,Sun Jing¹,He Rui¹,Kang Wenjie³^ORCID

Affiliation:

1. College of Intelligence and Computing, Tianjin University, Tianjin 300350, China

2. Science and Technology on Information Systems Engineering Laboratory, Changsha 410073, China

3. Hunan Provincial Key Laboratory of Network Investigational Technology, Hunan Police Academy, Changsha 410125, China

Abstract

Federated Learning (FL) is a newly emerged federated optimization technique for distributed data in a federated network. The participants in FL that train the model locally are classified into client nodes. The server node assumes the responsibility to aggregate local models from client nodes without data moving. In this regard, FL is an ideal solution to protect data privacy at each node of the network. However, the raw data generated on each node are unlabeled, making it impossible for FL to apply these data directly to train a model. The large volume of data annotating work prevents FL from being widely applied in the real world, especially for online scenarios, where the data are generated continuously. Meanwhile, the data generated on different nodes tend to be differently distributed. It has been proved theoretically and experimentally that non-independent and identically distributed (non-IID) data harm the performance of FL. In this article, we design a semi-federated active learning (semi-FAL) framework to tackle the annotation and non-IID problems jointly. More specifically, the server node can provide (i) a pre-trained model to help each client node annotate the local data uniformly and (ii) an estimation of the global gradient to help correct the local gradient. The evaluation results demonstrate our semi-FAL framework can efficiently handle unlabeled online network data and achieves high accuracy and fast convergence.

Funder

Excellent Youth funding of the Hunan Provincial Education Department

Hunan Province Legal Youth Research Project

Publisher

MDPI AG

Subject

General Mathematics,Engineering (miscellaneous),Computer Science (miscellaneous)

Link

https://www.mdpi.com/2227-7390/11/8/1972/pdf

Reference35 articles.

1. Federated learning;Yang;Synth. Lect. Artif. Intell. Mach. Learn.,2019

2. McMahan, B., Moore, E., Ramage, D., Hampson, S., and Arcas, B.A. (2017, January 20–22). Communication-efficient learning of deep networks from decentralized data. Proceedings of the Artificial Intelligence and Statistics, PMLR, Ft. Lauderdale, FL, USA.

3. Network anomaly detection based on federated learning;Zhao;J. Beijing Univ. Chem. Technol. Nat. Sci.,2021

4. Mun, H., and Lee, Y. (2020). Internet traffic classification with federated learning. Electronics, 10.

5. Machine learning: Algorithms, real-world applications and research directions;Sarker;SN Comput. Sci.,2021