Distributed Online Multi-Label Learning with Privacy Protection in Internet of Things

Authors:

Huang Fan 1, Yang Nan 1, Chen Huaming 1, Bao Wei 1, Yuan Dong 1

Affiliation:

1. School of Electrical and Information Engineering, The University of Sydney, Sydney, NSW 2008, Australia

Abstract

With the widespread use of end devices, online multi-label learning has become popular because the data generated by users of Internet of Things devices are large in volume and updated rapidly. In many scenarios, however, user data are generated in a geographically distributed manner, which makes centralizing them for training machine learning models inefficient and difficult. At the same time, current mainstream distributed learning algorithms require a centralized server to aggregate data from distributed nodes, which inevitably puts user privacy at risk. To overcome this issue, we propose a distributed approach for multi-label classification that trains models on distributed computing nodes without sharing the source data of any node. In the proposed method, each node trains its model on its local online data while also learning from its neighbour nodes, without transferring the training data. As a result, the proposed method achieves online distributed multi-label classification without losing performance relative to existing centralized algorithms. Experiments show that our algorithm outperforms the centralized online multi-label classification algorithm in F1 score: on average, it is 0.0776 higher in macro F1 score and 0.1471 higher in micro F1 score. For Hamming loss, each algorithm beats the other on some datasets, and on average our proposed algorithm is worse than the centralized approach by 0.005, a negligible difference. Furthermore, the size of the network and its degree of connectivity do not affect the performance of this distributed online multi-label learning algorithm.
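The idea of training locally while learning from neighbours can be illustrated with a minimal sketch. It is not the authors' algorithm: it assumes a linear model with one logistic output per label, a plain online gradient step, and uniform averaging of parameters among neighbours. The names `Node`, `local_step`, and `mix_with_neighbours` are hypothetical and chosen only for illustration. The point it shows is that only model weights cross the network, while each sample stays on the node that observed it.

```python
import math


def sigmoid(z):
    # Numerically safe logistic function.
    if z >= 0:
        return 1.0 / (1.0 + math.exp(-z))
    e = math.exp(z)
    return e / (1.0 + e)


class Node:
    """One device with a linear multi-label model (one weight row per label).

    Assumption: a logistic model per label; the paper's model may differ.
    """

    def __init__(self, n_features, n_labels, lr=0.1):
        self.w = [[0.0] * n_features for _ in range(n_labels)]
        self.lr = lr

    def predict(self, x):
        # Predict label l as 1 when its logistic score reaches 0.5.
        scores = [sum(wi * xi for wi, xi in zip(row, x)) for row in self.w]
        return [1 if sigmoid(s) >= 0.5 else 0 for s in scores]

    def local_step(self, x, y):
        # Online gradient step on a single private sample (x, y).
        for row, target in zip(self.w, y):
            p = sigmoid(sum(wi * xi for wi, xi in zip(row, x)))
            for j, xj in enumerate(x):
                row[j] -= self.lr * (p - target) * xj


def mix_with_neighbours(nodes, neighbours):
    # Each node replaces its weights with the uniform average of its own
    # and its neighbours' weights. Only parameters are exchanged, never data.
    snapshot = [[row[:] for row in node.w] for node in nodes]
    for i, node in enumerate(nodes):
        group = [i] + list(neighbours[i])
        for l in range(len(node.w)):
            for j in range(len(node.w[l])):
                total = sum(snapshot[k][l][j] for k in group)
                node.w[l][j] = total / len(group)


def run_round(nodes, neighbours, samples):
    # One round: every node learns from its own new sample, then all mix.
    for node, (x, y) in zip(nodes, samples):
        node.local_step(x, y)
    mix_with_neighbours(nodes, neighbours)
```

In this sketch, `neighbours` is a dictionary mapping each node index to the indices of the nodes it is connected to, so a ring, a star, or a fully connected network is expressed by changing that dictionary alone.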

Publisher

MDPI AG

Subject

Fluid Flow and Transfer Processes, Computer Science Applications, Process Chemistry and Technology, General Engineering, Instrumentation, General Materials Science

