Toward efficient resource utilization at edge nodes in federated learning
-
Published:2024-06
Issue:2
Volume:13
Page:101-117
-
ISSN:2192-6352
-
Container-title:Progress in Artificial Intelligence
-
language:en
-
Short-container-title:Prog Artif Intell
Author:
Alawadi SadiORCID, Ait-Mlouk Addi, Toor Salman, Hellander Andreas
Abstract
AbstractFederated learning (FL) enables edge nodes to collaboratively contribute to constructing a global model without sharing their data. This is accomplished by devices computing local, private model updates that are then aggregated by a server. However, computational resource constraints and network communication can become a severe bottleneck for larger model sizes typical for deep learning (DL) applications. Edge nodes tend to have limited hardware resources (RAM, CPU), and the network bandwidth and reliability at the edge is a concern for scaling federated fleet applications. In this paper, we propose and evaluate a FL strategy inspired by transfer learning in order to reduce resource utilization on devices, as well as the load on the server and network in each global training round. For each local model update, we randomly select layers to train, freezing the remaining part of the model. In doing so, we can reduce both server load and communication costs per round by excluding all untrained layer weights from being transferred to the server. The goal of this study is to empirically explore the potential trade-off between resource utilization on devices and global model convergence under the proposed strategy. We implement the approach using the FL framework FEDn. A number of experiments were carried out over different datasets (CIFAR-10, CASA, and IMDB), performing different tasks using different DL model architectures. Our results show that training the model partially can accelerate the training process, efficiently utilizes resources on-device, and reduce the data transmission by around 75% and 53% when we train 25%, and 50% of the model layers, respectively, without harming the resulting global model accuracy. Furthermore, our results demonstrate a negative correlation between the number of participating clients in the training process and the number of layers that need to be trained on each client’s side. As the number of clients increases, there is a decrease in the required number of layers. This observation highlights the potential of the approach, particularly in cross-device use cases.
Funder
Blekinge Institute of Technology
Publisher
Springer Science and Business Media LLC
Reference48 articles.
1. McMahan, B., Moore, E., Ramage, D., Hampson, S., Arcas, B.A.: Communication-efficient learning of deep networks from decentralized data. In: Artificial Intelligence and Statistics, pp. 1273–1282 (2017) 2. Konečnỳ, J., McMahan, H.B., Ramage, D., Richtárik, P.: Federated optimization: Distributed machine learning for on-device intelligence (2016). arXiv:1610.02527 3. Alawadi, S., Kebande, V.R., Dong, Y., Bugeja, J., Persson, J.A., Olsson, C.M.: A federated interactive learning iot-based health monitoring platform. In: European Conference on Advances in Databases and Information Systems, pp. 235–246. Springer (2021) 4. Ait-Mlouk, A., Alawadi, S.A., Toor, S., Hellander, A.: Fedqas: privacy-aware machine reading comprehension with federated learning. Appl. Sci. 12(6), 3130 (2022) 5. Chen, C., Xu, H., Wang, W., Li, B., Li, B., Chen, L., Zhang, G.: Communication-efficient federated learning with adaptive parameter freezing. In: 2021 IEEE 41st International Conference on Distributed Computing Systems (ICDCS), pp. 1–11. IEEE (2021)
|
|